sheawinkler/hermes-agent-ultra
Hermes Agent Ultra (built from hermes-agent-rs core by lumioresearch, @oxterrybit). Hermes-agent feature parity at commit level. Rust Performance
Hermes Agent Ultra is a Rust-native autonomous agent runtime designed to provide functional parity with the original Hermes Agent while adding advanced reliability and operator controls. It replaces Python-based execution paths with deterministic Rust implementations to improve performance and security across local and gateway sessions. The system features a comprehensive tool runtime with policy enforcement, session time-travel for debugging, and a multi-provider router supporting backends like Ollama and vLLM. Users can manage complex agent workflows through a TUI or CLI with built-in support for memory fusion and automated diagnostics.
- Rust-native core for deterministic execution and high-performance agent loops
- Advanced session branching and time-travel for checkpointing and replaying traces
- Multi-backend support including Ollama, llama.cpp, vLLM, and Apple ANE
full readme from github
Hermes Agent Ultra
██ ██ ███████ ██████ ███ ███ ███████ ███████
██ ██ ██ ██ ██ ████ ████ ██ ██
███████ █████ ██████ ██ ████ ██ █████ ███████
██ ██ ██ ██ ██ ██ ██ ██ ██ ██
██ ██ ███████ ██ ██ ██ ██ ███████ ███████
A G E N T U L T R A
Rust-first autonomous agent runtime with functional parity goals against NousResearch/hermes-agent, plus an Ultra reliability, security, and operator-control layer.
What You Get
- Fully Rust-native core runtime (agent loop, tools, gateway, skills, CLI/TUI)
- Multi-provider inference routing and OAuth-capable provider flows
- First-class local/self-host backends: Ollama, llama.cpp, vLLM, MLX, Apple ANE endpoint, SGLang, TGI
- Tool runtime with policy enforcement, MCP integration, cron, and memory backends
- Parity upkeep system for upstream drift triage and controlled roll-forward
- Production operations surface (
doctor, replay traces, sync gates, parity artifacts)
Why Ultra Exists
NousResearch/hermes-agent is the canonical upstream product surface.
Hermes Agent Ultra keeps that surface in scope while focusing on:
- deterministic Rust execution paths
- explicit safety and policy controls
- better observability and incident debugging
- easier operator workflows for long-running local and gateway sessions
Differentiation vs Upstream
Ultra keeps parity work separate from intentional extensions.
Runtime policy engine: enforce/audit/simulate tool policy decisions at runtimeSession branching + time-travel: checkpoint/rollback/replay navigation from the TUITool-call simulator: preview policy allow/deny outcomes before running risky tool invocationsAdaptive repo-review budget controls: tune discovery-loop trimming live (balanced/aggressive/relaxed/off)Semantic repo graph: inspect dependency hubs/edges with inline Mermaid previewProvider QoS router controls: inspect route learning/health and apply autotune from chatLive session eval harness: score real saved sessions and gate quality trends from actual usageRTK raw-mode controls: inspect unwrapped tool payloads when debugging integrationsMemory fusion: ContextLattice + external memory providers with scoring/fusion logicAdvanced sync gates: differential parity checks, red-team/adversarial gating, elite sync gateOperational tooling: deep doctor snapshots, replay traces, queue-based upstream webhook syncRust-only implementation strategy: parity in Rust first; no direct Python runtime vendoring
Install
One-line installer
curl -fsSL https://raw.githubusercontent.com/sheawinkler/hermes-agent-ultra/main/scripts/install.sh | bash
Custom install path:
curl -fsSL https://raw.githubusercontent.com/sheawinkler/hermes-agent-ultra/main/scripts/install.sh | sudo INSTALL_DIR=/usr/local/bin bash
From source
cargo install --git https://github.com/sheawinkler/hermes-agent-ultra hermes-cli --locked --bin hermes-agent-ultra --bin hermes-ultra
Quick Start
Need a shorter path? See README_QUICKSTART.md.
Setup:
hermes-ultra setup
Interactive session:
hermes-ultra
Interactive mode is single-instance per Hermes home by default (prevents accidental parallel TUI sessions sharing the same state).
If you intentionally want parallel interactive sessions, run:
HERMES_ALLOW_PARALLEL_INTERACTIVE=1 hermes-ultra
One-shot query:
hermes-ultra chat --query "summarize this repository"
Gateway mode:
hermes-ultra gateway --live
Skip API-key collection With Nous Portal
Hermes Agent Ultra still supports direct provider and per-tool keys. If you prefer one managed subscription for model access plus hosted tool backends, Nous Portal can cover:
- 300+ models, selectable with
/model <name>. - Tool Gateway routing for web search, image generation, text-to-speech, and cloud browser backends.
Fresh install path:
hermes-ultra setup --portal
That starts Nous OAuth setup, sets Nous as the provider, and enables Tool Gateway routing. Inspect the current state with:
hermes-ultra portal status
You can still bring your own keys for individual tools; gateway routing is per backend, not all-or-nothing.
Deep diagnostics bundle:
hermes-ultra doctor --deep --snapshot --bundle
Optional Sentrux MCP profile:
hermes-ultra mcp sentrux
hermes-ultra mcp sentrux-status
Key operator commands:
# Capability diagnostics for current or target model
/model explain
/model why-not --cap tools,reasoning --min-context 200000
/swarm status
/swarm plan graph
/swarm run 4 sequential
# Deterministic trace controls
/raw trace status
/raw trace verify
/raw trace export 200
# Runtime policy packs
/policy list
/policy strict
/policy standard
/policy dev
# Adaptive intelligence-performance autopilot
/ops autopilot status
/ops autopilot run
/ops autopilot recommend
/ops autopilot apply
# OpenHuman-derived P0/P1 operator control-plane
/commands search boot
/boot quick
/boot profile prod
/walkthrough start quick
/walkthrough insights
/integrations status
/integrations repair
/integrations snapshot
/triage eval webhook "secret leak panic outage"
/triage feedback webhook critical "secret leak panic outage"
/subconscious status
/subconscious profile strict
/subconscious run 2 --dry-run
/compress rules recommend
/compress rules autotune apply user
# Session time-travel + simulation
/timetravel list
/timetravel goto <snapshot>
/simulate terminal {"cmd":"ls -la"}
# QoS + eval runtime surfaces
/qos status
/qos health
/ops budget balanced
/ops eval run
Local Backends
hermes-ultra setup now includes local/self-host provider options with no mandatory API key:
ollama-local(defaulthttp://127.0.0.1:11434/v1)llama-cpp(defaulthttp://127.0.0.1:8080/v1)vllm(defaulthttp://127.0.0.1:8000/v1)mlx(defaulthttp://127.0.0.1:8080/v1)apple-ane(defaulthttp://127.0.0.1:8081/v1)sglang(defaulthttp://127.0.0.1:30000/v1)tgi(defaulthttp://127.0.0.1:8082/v1)
Override endpoint URLs via env vars:
OLLAMA_BASE_URLLLAMA_CPP_BASE_URLVLLM_BASE_URLMLX_BASE_URLAPPLE_ANE_BASE_URLSGLANG_BASE_URLTGI_BASE_URL
Detailed guide: docs/local-backends.md
Built-In Context + Memory Behavior
Ultra auto-loads high-value project and persona context:
SOUL.mdAGENTS.mdDESIGN.md.hermes.md/HERMES.mdMEMORY.md/USER.md
Subdirectory discovery is enabled so context follows the code path being edited.
Skills and Registry Surface
Skills commands support multi-registry search/install and local tap flows.
- Registry-aware installs include:
official/...skills.sh/...github/...lobehub/...clawhub/...claude-marketplace/...
- Mandatory skill security scanning runs before install and before use.
OpenHuman runbooks and matrices:
docs/implementation/openhuman-p0-p1-runbook.mddocs/implementation/openhuman-p0-p1-surface-matrix.mddocs/implementation/openhuman-p2a-p2b-runbook.mddocs/implementation/openhuman-p2a-p2b-surface-matrix.mddocs/implementation/openhuman-p3-swarms-runbook.mddocs/implementation/openhuman-p3-swarms-surface-matrix.md
Security Posture
- Skill content security scanning blocks dangerous patterns and restricted URL targets
- Skill guard modes:
strict(default),relaxed(only blocks destructivermops),off - Policy-controlled tool execution modes:
off,audit,simulate,enforce - Tool policy presets:
strict,balanced,dev,relaxed - Sensitive field redaction in traces/log surfaces
- Guardrails for path traversal, unsafe file ops, and runtime boundary violations
Operator runtime overrides (env):
HERMES_SKILL_GUARD_MODE=relaxedHERMES_TOOL_POLICY_PRESET=relaxedHERMES_MAX_TURNS_UNLIMITED=1(or setmax_turns: 0in config/profile)HERMES_FORCE_RUNTIME_AUTH_REFRESH=1HERMES_AUTH_REFRESH_MAX_RETRIES=6
Upstream Sync and Parity Upkeep
Ultra uses controlled sync workflows, not blind merges.
- Upstream source of truth:
NousResearch/hermes-agent - Fetch/sync tooling:
scripts/sync-upstream.shscripts/upstream_webhook_sync.py
- Parity artifacts:
docs/parity/.sync-reports/
Live Upstream Sync Status (auto-generated)
- Generated at:
20260504-053352 - Source report:
upstream-sync-20260504-053352.txt - Sync timestamp (
timestamp_utc):20260504-053352 origin/mainat sync:1861c5dcfb8cad8dcddb5f15c1a5a8c34c7f1ce2upstream/mainat sync:95f395027f72c69f06bddcecb08da53cfd10c440- Pending commits captured in report:
1512 - Queue summary (
docs/parity/upstream-missing-queue.json): pending0, ported63, superseded1387 - Parity gates (
docs/parity/global-parity-proof.json): releasepass, cipass - Workstream snapshot (
docs/parity/workstream-status.json):upstream/main@8163d371922768c32f43eb6036d7d36e56775605(generated2026-05-04T01:23:44-06:00)
Note: this repository intentionally tracks parity via queue/gate workflows because upstream and ultra history can diverge materially.
Contributing
Interested in helping? Start with CONTRIBUTING.md for setup, PR expectations, parity rules, and the no-stub completeness gate.
Official References and Attribution
Canonical/official upstream references:
- Upstream (official): https://github.com/NousResearch/hermes-agent
- Ultra (this repository): https://github.com/sheawinkler/hermes-agent-ultra
- Ultra fork archive (historical): https://github.com/sheawinkler/hermes-agent-rs-fork
Integrated ecosystem references used in Ultra workflows:
- OpenAI skills repository: https://github.com/openai/skills
- Anthropic skills repository: https://github.com/anthropics/skills
- VoltAgent skills aggregation: https://github.com/VoltAgent/awesome-agent-skills
- Ratatui (TUI foundation): https://github.com/ratatui/ratatui
- tui-textarea (composer/editor behavior): https://github.com/rhysd/tui-textarea
Additional ownership, provenance, and credit notes are maintained in UPSTREAM_ATTRIBUTION.md.
Architecture Map
Primary Rust workspace crates:
crates/hermes-agent: agent loop, memory orchestration, provider controlcrates/hermes-tools: tool registry and execution backendscrates/hermes-cli: CLI/TUI, setup, model/personality switching, operator commandscrates/hermes-gateway: gateway adapters and live runtime pathscrates/hermes-skills: skill storage, guardrails, hub and registry pathwayscrates/hermes-mcp: MCP transport/client/server supportcrates/hermes-config: config model and runtime loadingcrates/hermes-telemetry: tracing and metrics surfaces
License
Distributed under this repository's license and notices.
See LICENSE, NOTICE, and UPSTREAM_ATTRIBUTION.md.