The state of Hermes Agent — May 2026
Six weeks later: what changed since the April report.
TL;DR
Six weeks ago we published the first State of Hermes report. At the time, Hermes Agent had just hit 57,200 GitHub stars in its first six weeks — already the fastest-growing open-source agent of 2026 — and we ended that report with five "watch for Q2" calls.
Six weeks later, here's where things stand:
- 165,033 stars on the core repo (+108K, ~2.9x in 43 days)
- 27,160 forks (+19.6K, ~3.6x)
- Six new releases — v0.9 through v0.14 — averaging a major drop every 7 days
- 22 messaging platforms (was 14): iMessage, WeChat, QQBot, Yuanbao, Teams, Google Chat, LINE, SimpleX all landed
- 123 quality-filtered ecosystem projects (was 80) with 389,920 combined stars (was 90,750, ~4.3x) — the ecosystem outgrew the core repo
- Memory & Context is the breakout category — 6 → 21 providers tracked, ~148,000 combined stars — with mem0 raising $24M and Letta raising $10M during the window
If the April report was about what Hermes is, this one is about what Hermes became. The shorthand: it grew up. The "Curator → Tenacity → Foundation" release trilogy in late April / mid May turned Hermes from a fast-moving hobbyist favorite into something you can actually deploy, install, and trust to finish a job.
This is a companion to the April report — we won't re-explain Hermes from scratch. If you're new, start there, then come back.
1. Six weeks of delta
| Metric | Apr 11 | May 24 | Δ |
|---|---|---|---|
| GitHub stars (core repo) | 57,200 | 165,033 | +107,833 (2.9x) |
| Forks (core repo) | 7,572 | 27,160 | +19,588 (3.6x) |
| Releases shipped | 8 (v0.1 → v0.8) | 14 (v0.1 → v0.14) | +6 in window |
| Messaging platforms | 14 | 22 | +8 |
| Atlas-tracked ecosystem repos | 80 | 123 | +43 |
| Total ecosystem stars | 90,750 | 389,920 | +299,170 (4.3x) |
| Memory & Context (repos / stars) | 6 / ~10k | 21 / 148,450 | ~15x in stars |
| Workspaces & GUIs (repos / stars) | 4 | 11 / 28,727 | +7 repos |
| Skills & Registries | 17 | 20 | +3 |
Apr 11 numbers are as published in the April report; May 24 numbers were polled live from the GitHub API on publication day.
Two things jump off that table. First: the ecosystem outgrew the core repo. Total ecosystem stars went 4.3x while the core went 2.9x — people aren't just starring Hermes, they're building things other people star. Second: Memory & Context went from a ~10K-star category to 148K stars in 43 days. That single category is now bigger than the entire tracked ecosystem was in April. We come back to that in §4.
2. The release trilogy: Curator → Tenacity → Foundation
Six releases shipped in the window. Three of them define the arc, and the other three (v0.9 "Everywhere," v0.10 "Tool Gateway," v0.11 "Interface") were necessary setup. The trilogy:
v0.12 "Curator" (April 30) — when the self-improvement loop got real
The headline claim of Hermes Agent has always been "the agent that grows with you." Until v0.12, that growth was agentic but reactive — when a session produced something worth keeping, the agent would save a skill, and the library would slowly accumulate.
v0.12 shipped hermes curator: a background agent that runs on the gateway's cron ticker (7-day cycle by default), grades your skill library by usage, consolidates related skills, prunes dead ones, and writes a per-run report to logs/curator/REPORT.md. The defense-in-depth gates protect bundled and hub-installed skills from mutation. Pinning blocks curator writes. Archived skills get classified as consolidated vs. pruned by a model + heuristic combination so you can audit later. You can pick the curator's own model in hermes model under auxiliary.curator.
At the same time, the background review fork (the core of Hermes' self-improvement: after each turn it decides what to save or update) got rewritten:
- Class-first grading — rubric-based, not free-form "should we save this"
- Active-update bias — prefers updating the skill the agent just loaded over creating new ones
- Parent runtime inheritance — the fork now actually uses the live provider, model, and credentials of the parent session (previously aggregator users were silently routed to a cheap provider default)
- Toolset restriction — locked to memory + skills so it can't sprawl into doing real work in the background
This is the release where "the agent that grows with you" stopped being a tagline and became a deterministic property of the system. Compounding becomes the default, not the result of careful prompting.
v0.13 "Tenacity" (May 7) — the agent finishes what it starts
If v0.12 was about self-maintenance, v0.13 was about durability. Headlines:
- Durable multi-agent Kanban — spin up a board, drop tasks on it, multiple Hermes workers pick them up, hand off, and close them out. Heartbeats, reclaim, zombie detection, retry budgets, hallucination gate. One install, many kanbans.
/goal— lock the agent onto a target and it stays on task across turns. The Ralph loop as a first-class primitive. The next release (v0.14) added/subgoalfor layering criteria on mid-run.- Checkpoints v2 — state persistence rewritten with real pruning, disk guardrails, no more orphan shadow repos.
- Sessions survive restarts — gateway bounces mid-agent,
/updaterestarts, source-file reloads — conversations auto-resume when the gateway comes back. - 8 P0 security closures in one release. Secret redaction is on by default now. Discord role-allowlists became guild-scoped (closing a CVSS 8.1 cross-guild DM bypass). WhatsApp rejects strangers by default. TOCTOU windows closed across
auth.jsonand MCP OAuth. - Providers are now plugins.
ProviderProfileABC +plugins/model-providers/. Drop in a third-party provider without touching core. - 7 i18n locales (Chinese, Japanese, German, Spanish, French, Ukrainian, Turkish) plus a Chinese docs site.
- Google Chat became platform #20.
Tenacity is the release where Hermes stopped being a single-session companion and became infrastructure. You can leave it running over a long-horizon project, walk away, restart the box, and pick up where it was.
v0.14 "Foundation" (May 16) — Hermes installs anywhere
The biggest release of the window. 808 commits, 633 merged PRs, 215 contributors, 545 issues closed (12 P0, 50 P1). The headlines:
pip install hermes-agent && hermes— Hermes is now a real PyPI package. No more cloning the repo or running shell installers.- Native Windows support (early beta) — first-class native Windows path across CLI / gateway / TUI / tools. PowerShell installer handles MinGit auto-install, Microsoft Store python stub detection, and the foreground Ctrl+C dance. ~40 Windows-specific fixes already landed in-window alongside the beta stamp.
- Cold-start performance wave — ~19 seconds shaved off
hermeslaunch. Thehermes toolsAll-Platforms screen alone dropped from 14s to under 1.5s. - 180x faster
browser_consoleevaluations — routed through a persistent CDP WebSocket instead of spinning up a new DevTools session per call. - Debloating wave — heavy backends (Slack, Matrix, Feishu, DingTalk adapters, image-gen SDKs, voice/TTS providers) now lazy-install on first use. The
[all]extras drop everything covered by lazy-deps. A supply-chain advisory checker scans every install for unsafe versions. Smaller disk footprint, fewer transitive vulnerabilities. - Cross-session 1h Claude prompt cache — when you use Claude through Anthropic, OpenRouter, or Nous Portal, the prompt prefix (system prompt, skills, memory) now caches for an hour across sessions. Start a
/newsession and the first response comes back faster and cheaper. - xAI Grok via SuperGrok OAuth — and
grok-4.3jumps to a 1M context window. No API key, no separate billing. If you pay for SuperGrok, you can use Grok inside Hermes by signing in. Drop a whole codebase into a single prompt. x_search— first-class X (Twitter) search tool with OAuth-or-API-key auth.- Microsoft Teams end-to-end — Graph auth + webhook listener + pipeline runtime + outbound delivery. Wire up the bot once, then chat to your agent from any Teams channel, DM, or group.
- LINE + SimpleX Chat — two more messaging platforms. Total: 22.
- 9 new optional skills including Hyperliquid (perp/spot trading), Yahoo Finance, OSINT investigation, unified EVM multi-chain, watchers (RSS/HTTP/GitHub polling), pinggy-tunnel, and a full Notion overhaul.
Foundation is the release where Hermes stopped being a project you clone and started being software you install. The boring stuff — packaging, cold start, lazy deps, real Windows support — is what turns critical mass into a default.
3. The subscription unification play
The biggest unwritten story in the ecosystem right now is what happened between v0.10 and v0.14 if you connect the dots.
- April 16, v0.10: the Nous Tool Gateway. Paid Nous Portal subscribers get automatic access to web search (Firecrawl), image generation (FAL / FLUX 2 Pro), TTS (OpenAI), and browser automation (Browser Use) through their existing subscription. No separate API keys.
- May 16, v0.14: the OpenAI-compatible local proxy. Run
hermes proxyand you get ahttp://localhost:<port>endpoint that speaks the OpenAI API but is backed by whichever OAuth provider you're signed into — Claude Pro, ChatGPT Pro, SuperGrok. Now any tool that expects an OpenAI-compatible endpoint (Codex CLI, Aider, Cline, Continue) just works with your existing subscription, no API key required. - Same release: xAI Grok lands as a SuperGrok OAuth provider.
grok-4.3bumps to a 1M context window inside Hermes.
Connect those: Hermes is quietly becoming the subscription bus for the AI stack. You sign in to your Anthropic, OpenAI, xAI, and Nous subscriptions once. Hermes handles the auth, the model routing, the format translation. Anything OpenAI-compatible — your IDE assistant, your CLI tool, your custom script — gets to use your existing seats.
This is a more important strategic move than any single feature in this window. It changes what Hermes is. It's not just an agent harness anymore — it's the multiplexer between you and the closed-model providers you're already paying.
OpenRouter does the API-key version of this and charges a margin. Hermes does the OAuth version and doesn't. For developers running their own stack — especially the long tail who pay for Claude Pro plus ChatGPT Pro and want to use both from one CLI — this is the killer-app feature, and it's mostly being slept on by the broader AI press.
4. Memory, six weeks later
The Memory & Context category went from 6 entries to 21 in six weeks — and from roughly 10K combined stars to 148,450. By star-count, it's the biggest standalone category on the atlas now, larger than every other ecosystem category combined. mem0 raised $24M (currently 56,567 stars). Letta raised $10M. Honcho, ByteRover, Supermemory, OpenViking, RetainDB, GBrain, Mnemosyne, yantrikdb, Ladybug, PLUR, FlowState-QMD, and several others all shipped meaningful versions or first releases in the window.
Why memory specifically? Two reasons.
First: Hermes treats memory as first-class plug-in infrastructure, not a feature. The MemoryProvider ABC means anyone with an opinion about how agent memory should work can ship a provider that drops into Hermes cleanly, exposes its own tools to the agent, and competes head-to-head with the official 8. Eight providers ship in the box; community provides another wave on top.
Second: memory is where the actual differentiation between agent harnesses is going to be. Tool calling is solved. Context windows keep growing. What's not solved is "the agent should know what happened last Tuesday, surface my preferences without being asked, and not re-ask the same questions for the rest of my life." That's a memory problem, and the architectural choices matter — flat fact extraction vs. dialectic user modeling vs. four-network retrieval vs. tiered cognitive architectures all make different bets about what memory should actually do.
If you want the full breakdown of the 8 official providers, the strongest community plug-ins (GBrain and Mnemosyne stand out), and how to actually choose, I wrote a deep-dive: "The Hermes Agent Memory Guidebook".
The short version: most users don't need to change anything — Layer 1 (the bundled MEMORY.md + USER.md + SQLite session DB with FTS5) is enough for the median case. When you outgrow it, Honcho models how you think (the Discord favorite), Mem0 is the 30-second-setup default, Hindsight is the benchmark king (first memory system to cross 90 on LongMemEval), Holographic runs fully air-gapped, and Supermemory is the latency leader at scale. The new Mem0 v3 algorithm (94.8 LongMemEval / 91.6 LoCoMo) leapfrogged the previous benchmark holder in April.
The one thing to clear up, because every other write-up gets it wrong: the v0.12 Curator does not manage memory. It manages the skill library at ~/.hermes/skills/. MEMORY.md and USER.md are still managed by the agent itself, with a hard cap and an 80%-fill prompt-instruction (not code). If that distinction matters to you, the memory guidebook walks through the actual code paths.
5. The ecosystem at 123 curated projects
Updated category breakdown as of today:
| Category | Apr 11 | May 24 | Category stars | Top project (live stars) |
|---|---|---|---|---|
| Core & Official | 6 | 6 | 173,476 | hermes-agent (165,033) |
| Memory & Context | 6 | 21 | 148,450 | mem0 (56,567) |
| Workspaces & GUIs | 4 | 11 | 28,727 | hermes-webui (8,467) |
| Skills & Skill Registries | 17 | 20 | 15,609 | Anthropic-Cybersecurity-Skills (7,829) |
| Guides & Docs | 5 | 9 | 8,869 | awesome-hermes-agent (3,343) |
| Multi-Agent & Orchestration | 9 | 7 | 5,759 | mission-control (4,951) |
| Developer Tools | 9 | 8 | 3,854 | tokscale (3,157) |
| Plugins & Extensions | 6 | 12 | 2,105 | — |
| Deployment & Infra | 7 | 8 | 1,409 | llm-agents.nix (1,291) |
| Integrations & Bridges | 5 | 10 | 844 | — |
| Domain Applications | 8 | 8 | 421 | — |
| Forks & Derivatives | 3 | 3 | 397 | — |
The Memory & Context explosion is the headline. But Workspaces & GUIs nearly tripling (4 → 11 entries, 28,727 combined stars) is the quieter signal: it means the next layer of the stack — the one most non-developer users will interact with — is starting to fill in. nesquena/hermes-webui leads the pack at 8.5K stars. outsourc-e/hermes-workspace (4.8K, our current "featured this week" pick) is the cleanest of the new entries: a zero-fork browser command center that talks to a vanilla Hermes install via its existing API. No fork to maintain.
Other notable arrivals (or breakouts) in the window:
- mem0 (56,567 stars) — finally landed on the atlas after a long contributor revision arc
- garrytan/gbrain (18,604) — Garry Tan's 8-layer knowledge engine, MIT-licensed, ships as a skillpack + MCP server rather than a MemoryProvider
- vectorize-io/hindsight (14,369) — the benchmark king (first memory system to publicly cross 90 on LongMemEval)
- plastic-labs/honcho (4,140) — the dialectic user-modeling memory provider, named most often in the Hermes Discord
- AMAP-ML/SkillClaw (1,452) — high-star community skill registry
- EfficientContext/ContextPilot — context engineering toolkit
- aivrar/portable-hermes-agent — pre-baked workspace runtime
The full live, security-reviewed, sortable map is at hermesatlas.com.
6. Atlas itself — what shipped on the meta layer
In the spirit of disclosure: Hermes Atlas (this site, this report) shipped meaningful infrastructure in the same window.
- Newsletter went live (April 24) and has 1300+ subscribers as of today after its first two releases.
hermes-atlas-mcp(npm + MCP Registry) — five tools, four resources, installer subcommand. Anyone running Hermes can query the atlas live (npm install -g hermes-atlas-mcp→ register the MCP server → ask your agent about Hermes-compatible projects in chat).- "Featured this week" — rotating community-pick module on the homepage to highlight one standout project.
- GEO arc — three PRs landed April 24 targeting LLM citation surface:
/guide/install/,/guide/vs-claude-code/,/guide/. Early indexation looks positive. - Auto-merging release-notes pipeline — the daily release-monitor workflow watches for new Hermes releases, opens a PR with structured notes, auto-merges, and rebuilds the RAG chunks. Most of the release content powering this report flowed through that pipeline.
- Repo coverage went from 80 → 123 with weekly dead-repo auditing now running automatically to catch archived/deleted upstream projects before they break the build.
If you find this useful, the easiest way to keep the lights on is the same call to action as before, slightly evolved — see the close.
7. Q1 predictions — scored honestly
The April report ended with five things to watch in Q2. We're nearly two-thirds through the quarter; here's how those calls held up:
1. The skill marketplace network effect. Predicted: Hermes could pass OpenClaw on curated skill quality (if not volume) by mid-Q2.
Verdict: trending right, jury still out. Skills & Registries grew slowly (17 → 20 atlas entries), but huggingface/skills became a trusted default tap in v0.14, and the v0.12 Curator changes the dynamics — it consolidates skill drift instead of accumulating it. Watch for whether the Curator's effects show up as visible quality improvements in user-shared skill libraries over the next 6 weeks.
2. The multi-instance enterprise story. Predicted: per-tenant memory isolation and audit logging in v0.9 or v0.10.
Verdict: partially shipped, faster than expected. v0.13 added X-Hermes-Session-Key for stable session identifiers on the API server (the substrate for memory isolation). v0.13 also closed 8 P0s including the cross-guild DM bypass and TOCTOU windows on auth.json and MCP OAuth — exactly the audit/isolation hardening enterprise needs. The dashboard plugin/profile management pages landed in v0.13. Full per-tenant audit logging isn't in yet, but the building blocks all are.
3. Self-improvement becoming measurable. Predicted: real numbers showing measurable improvement on repeated task families.
Verdict: foundation laid, no benchmarks yet. The v0.12 Curator's per-run REPORT.md is the substrate for this — it logs exactly which skills got consolidated, pruned, or archived. The class-first rubric in the background review fork is the substrate too. No public benchmark suite yet. This is the slowest of the five predictions, and the most important one.
4. The Nous Portal subscription economics. Predicted: free-tier promotions had to convert to a sustainable model.
Verdict: pivoting — different shape than expected. Instead of pushing harder on standalone Portal subscriptions, Nous shipped the Tool Gateway (v0.10) and the OAuth proxy (v0.14). The bet appears to be: make Hermes the routing layer for all your AI subscriptions, and Nous Portal becomes the most natural place to add tools that other providers don't bundle (managed search, image-gen, browser, etc.). It's a smarter long-term play than convincing users to migrate their primary model spend.
5. The "first deletion" moment. Predicted: at some point, things had to move out of core into optional plugins.
Verdict: this happened. It was the v0.14 debloating wave. Heavyweight messaging adapters, image-gen SDKs, voice/TTS providers all moved to lazy-install on first use. The [all] extras dropped everything covered by lazy-deps. The bundled disk-cleanup plugin is opt-in by default as a reference for the new pattern. Plus providers themselves became plugins in v0.13. Nous handled the transition cleanly — the community didn't revolt — and the resulting installs are smaller and safer.
Three out of five clearly directionally right. One (benchmarks) is the slow one. One (subscription economics) was more wrong than right, but the alternative move was a better strategic bet than the one we predicted.
8. What to watch through end of Q2
Three fresh calls for the rest of the quarter:
1. The Foundation release tail. v0.14 was big. Big releases produce long tails of cleanup PRs. The first follow-up release will tell us whether the debloating wave held (no broken installs from lazy-deps), whether native Windows actually works in the wild (the "early beta" stamp was real — ~40 Windows-only fixes already landed in-window), and whether the OAuth proxy survives contact with edge cases.
2. Multi-agent durability getting tested. The v0.13 durable Kanban is the most ambitious multi-agent system in any open-source agent harness. Until people run real long-horizon work on it, we don't know whether heartbeat / reclaim / zombie detection / hallucination gates hold up. Expect either a stunning case study or a long bug list — both are interesting.
3. The first credible "I replaced X with Hermes" piece from a recognizable company. Right now Hermes adoption is dominated by individual developers. The infrastructure (durable Kanban, per-session API keys, dashboard plugin/profile management, OAuth providers, native Windows) is now sufficient for the first small-team or single-department case study. Whoever ships that piece publicly gets a lot of attention — both for being credible adopters and for legitimizing Hermes as enterprise-shippable.
About this report
Methodology: Synthesized from the v0.9 – v0.14 release notes (six releases, ~5,200 commits, ~2,800 merged PRs, ~800 closed issues, ~600 unique contributors across the window), 45+ research files including the Hermes Agent docs site, contemporaneous X/Twitter coverage, the Hermes Agent Discord, the Hermes Agent Memory Guidebook, and the live atlas at hermesatlas.com.
Data freshness: Every star count, fork count, and ecosystem total in this report — including category aggregates and individual repo numbers — was polled live from the GitHub GraphQL API on publication day (2026-05-24). April figures are quoted as published in the prior report. The 123 tracked repos are independently security-reviewed and refreshed automatically on a daily schedule.
Companion to: The State of Hermes Agent — April 2026. This report assumes you read that one and skips re-explaining what Hermes is or how it compares to Claude Code / OpenClaw.
Not affiliated with: Nous Research. Hermes Atlas is an independent community project celebrating their work.
Corrections welcome: Open an issue.
Stay current
The next State of Hermes report will land late June or early July. In between, we ship a roughly monthly newsletter — release deep-dives, community spotlights, the projects we're tracking, the ones we're not, and the patterns we're seeing.
The newsletter is the primary way we share ecosystem signal between reports. It's also how we hear back from you about what to feature next. If this report was useful, that's the call to action.
— Kevin · x.com/ksimback