What just happened
Hermes Agent — Nous Research's orchestrator — shipped v0.8 in late April, and the design choice it codifies matters more than the version bump. Hermes is no longer a single-model wrapper; it is a persistent coordinator that maintains long-term project memory and routes tasks to specialist workers. Two of those workers are Claude Code (the reasoning-heavy 'Lead Engineer') and Codex (the rapid-scaffolding 'Autonomous Worker'). The pattern Hermes is operationalizing is the same one the CCA-F D1 blueprint calls hub-and-spoke with subagent isolation.
5 things that matter for the exam
- Put the orchestrator in charge of context, not coding. Hermes owns long-term memory and task hand-offs. The workers don't share context with each other, only with the coordinator. That is the canonical D1 isolation rule.
- Split work by benchmark strength. Claude Opus 4.7 hits 87.6% on SWE-bench Verified — best for refactors, security review, architecture. Codex GPT-5.5 hits 82.7% on Terminal-Bench 2.0 — best for scaffolding, tests, lint, terminal-heavy chores.
- Lock task boundaries to prevent context bleed. Hermes ships a 'Kanban' feature that pins a task to one agent. Without it, both workers touch everything and context leakage tanks reliability. Same lesson as the subagent isolation rule.
- Watch the token-drain refactor loop. Claude Code can consume up to 4× more tokens than Codex on the same task ($155 vs $15 in a reported case). Use Codex first for scaffolding; use Claude only for final architectural review.
- Run the night-shift in Docker, with a PTY for risky commands. Autonomous overnight runs are now common. Use Docker isolation and
hermes start --ptyso Claude Code pauses for human confirmation onrm,git push, and other destructive commands.
Three orchestration anti-patterns
Skill overwriting on the self-improving loop
Hermes' self-improvement can clobber manually tuned skills. Mark them immutable: true in the skill markdown frontmatter.
Letting Claude Code do the cheap work
If you route lint + scaffolding through Opus 4.7, you are paying premium prices for cheap output. The classic 'bigger model' distractor in CCA-F D1 questions.
Skipping version pinning
Hermes shipped v0.1 → v0.8 in months. Pin your version in CI/CD; treat protocol drift as a real failure mode.
Sources
Where this lands in the exam-prep map
Each blog post bridges into the evergreen pillars. These are the most relevant follow-ups for this story.
Concept
Subagents
Hermes is the textbook coordinator; the workers are subagents with isolated context.
Open ↗Scenario
Multi-agent research
Same hub-and-spoke pattern at production research scale.
Open ↗Knowledge
Claude Code 101
Foundation course on the daily-driver workflow Hermes is composing.
Open ↗Exam Guide
Day-of distractor patterns
The 'pick the bigger model' distractor is exactly what Specialist Routing solves.
Open ↗3 questions answered
What is Specialist Routing?
Why does Claude Code beat Codex on SWE-bench but lose on Terminal-Bench?
Does this map to a CCA-F domain?
Synthesized from research output on 2026-05-02. LinkedIn cross-post pending.
Last reviewed 2026-05-06.
