Hermes Agent is NousResearch's flagship open-source agentic framework built around a three-file persistent identity system: SOUL.md (agent persona and values), MEMORY.md (factual knowledge accumulated across sessions), and USER.md (evolving model of who you are). Combined with a GEPA self-improvement loop (ICLR 2026 Oral), Hermes becomes progressively more capable and aligned with your working style over time. Runs as a single persistent process across Telegram, Discord, Slack, WhatsApp, Signal, and CLI simultaneously. At ★144,352 (+2,229 on 2026-05-11), it sits at GitHub trending #1 — the open-source counter-formation to the closed-lab consolidation narrative (Anthropic-at-$1T week).
Hermes Agent is NousResearch’s answer to the core frustration of agentic AI: every session starting from scratch. The framework pairs persistent SQLite-backed memory with GEPA (Generative Evolutionary Prompt Architecture), a peer-reviewed self-improvement loop that reads execution traces and automatically refines skills and prompts. Rather than relying on the agent’s own self-assessment, GEPA evaluates objective metrics — token efficiency, task completion rates, failure modes — and updates skills accordingly.
Nous Research built its credibility on the Hermes fine-tuned model series, which became workhorses for the open-source LLM community. Hermes Agent moves that same design philosophy up the stack: take what works, make it persistent, make it improve.
Self-improvement loop: After each task, hermes-agent automatically writes a reusable Markdown Skill file into SQLite. If a better approach emerges, the skill is revised. The companion hermes-agent-self-evolution framework (ICLR 2026 Oral) provides the evaluation backbone — reading execution traces to understand why things fail, not just that they failed.
Cross-session memory: All sessions are stored in SQLite with FTS5 full-text search, enabling recall of conversations from weeks ago. The Honcho user modeling layer builds a persistent understanding of user preferences, goals, and communication style in separate memory.md and user.md files.
Multi-platform gateway: A single hermes-agent process handles Telegram, Discord, Slack, WhatsApp, Signal, and CLI simultaneously. No separate deployment per platform.
Hermes Agent is strongest for workflows where context accumulates over time: complex software projects, ongoing research tasks, personal automation with specific preferences, and any scenario where “the agent already knows this” matters. The multi-platform gateway makes it practical as a personal AI assistant that follows you across communication channels.
For coding tasks specifically, hermes-agent’s 40+ built-in tools cover the full development loop. Developers running it on a local machine or cheap VPS get a persistent coding partner that remembers project context and improves its approaches over iterations.
The self-improvement loop works best on tasks with clear success criteria — code execution, API calls, file operations. For ambiguous tasks (research synthesis, creative work), the GEPA loop has less objective signal to work with. The framework requires a model with at least 64,000 tokens of context; most 7B local models don’t qualify, so you’ll need a hosted API or a larger local model.
Hermes Agent is built for developers and power users who want a persistent AI that gets better over time. The MIT license and minimal infrastructure requirements ($5 VPS) make it accessible to individual developers. Teams building on top of the framework benefit from the reusable skill library that compounds across agents. Anyone frustrated by AI tools that forget everything between sessions will find hermes-agent’s design philosophy immediately valuable.
Persistent memory layer for AI coding agents — benchmark-backed (95.2% on LongMemEval-S), 92% fewer tokens per session vs full-context pasting, zero manual memory.add() calls.
Open-source AI pair programming tool that works in your terminal to edit code across your entire repository.
Open-source AI coding harness builder that makes AI coding workflows deterministic and repeatable via YAML-defined DAG workflows.