Hermes Agent: Self-Review Instead of One-Shot Output

A short visual guide to the Hermes self-improvement loop: run, review, rewrite, and test the skill again.

May 11, 2026 · Dominic Hückmann

Short Answer

Hermes gets interesting when the agent does more than produce output and also reviews its own run: execute, measure, critique, rewrite the skill, and test again. The loop pays off mainly for repeatable workflows.

Hermes is not magic autopilot. The useful part is the review loop: the agent inspects its own run and rebuilds the workflow.

  • 3x faster in one community example after two review rounds
  • 80% lower API cost in the same example
  • 2 review rounds are usually enough

Self-improvement loop

structured cycle, not prompt magic

  1. Run task
  2. Inspect logs + output
  3. Score speed / cost / quality
  4. Rewrite skill
  5. Test same task again
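
The cycle above can be sketched as a small driver function. This is a minimal illustration, not Hermes' actual API: `RunResult`, `improvement_loop`, and the `run` and `rewrite` callables are hypothetical names, and the rule "keep the old skill if quality drops" is an added assumption.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class RunResult:
    """Scores for one run (hypothetical structure, not a Hermes type)."""
    seconds: float   # wall-clock time of the run
    cost_eur: float  # API cost of the run
    quality: float   # 0.0 to 1.0, from whatever check you trust


def improvement_loop(
    skill: str,
    run: Callable[[str], RunResult],
    rewrite: Callable[[str, RunResult], str],
    max_rounds: int = 2,
) -> tuple[str, list[RunResult]]:
    """Run, score, rewrite, and re-test a skill for a fixed number of rounds."""
    history = [run(skill)]  # step 1: baseline run on the task
    for _ in range(max_rounds):
        # steps 2-4: the reviewer inspects the last result and rewrites the skill
        candidate = rewrite(skill, history[-1])
        # step 5: run the same task again with the rewritten skill
        result = run(candidate)
        if result.quality < history[-1].quality:
            break  # assumption: discard a rewrite that lowers quality
        skill = candidate
        history.append(result)
    return skill, history
```

In practice `run` would wrap however you execute the skill and collect timing and cost, and `rewrite` would wrap the review prompt. The default of two rounds mirrors the "two review rounds are usually enough" rule of thumb.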

What actually changes

  • Hermes optimizes the strategy, not just the wording.
  • Slow steps, redundant searches, and expensive token paths become visible.
  • The improved skill is worth it only if you reuse it.
  • Humans still approve risky decisions.

The 5-minute loop

  1. Pick a skill you actually use.
  2. Run it on a real task.
  3. Ask Hermes to review speed, cost, and output quality.
  4. Let it rewrite the skill.
  5. Run the same task again and compare.
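
For step 5, "compare" is easiest when both runs are reduced to the same few numbers. The sketch below assumes you have recorded duration and cost for each run yourself. `RunMetrics` and `compare` are hypothetical helpers, and the sample values are illustrative inputs, not measurements.

```python
from dataclasses import dataclass


@dataclass
class RunMetrics:
    seconds: float   # how long the run took
    cost_eur: float  # what the run cost in API usage


def compare(before: RunMetrics, after: RunMetrics) -> dict[str, float]:
    """Return the speedup factor and the cost change in percent (1 decimal)."""
    speedup = before.seconds / after.seconds
    cost_change = (after.cost_eur - before.cost_eur) / before.cost_eur * 100
    return {
        "speedup": round(speedup, 1),
        "cost_change_pct": round(cost_change, 1),
    }


# Illustrative values only: a 90 s / EUR 0.10 run versus a 30 s / EUR 0.02 run.
report = compare(RunMetrics(90.0, 0.10), RunMetrics(30.0, 0.02))
print(report)
```

A negative `cost_change_pct` means the rewritten skill is cheaper. Output quality still needs a separate check, since neither number captures it.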

Before / after

One-shot agent

  • Task in, answer out, done.
  • Same mistakes next time.
  • Token cost stays hidden.
  • Prompt gets slightly nicer.

Review loop

  • Task → answer → critique → better skill.
  • Mistakes become rules or steps.
  • Expensive steps get cut.
  • Workflow, tools, and order can change.

Review prompt

Review the skill you just ran.
Find:
1. slowest steps
2. most expensive token usage
3. weak parts of the output

Rewrite the skill so it becomes faster,
cheaper, and more reliable.

When to use it

Use when

  • ✓ repetitive research or coding workflows
  • ✓ tasks with measurable success: faster, cheaper, more accurate
  • ✓ skills with multiple tool calls or repeated searches

Avoid when

  • × one-off tasks
  • × purely subjective creative work
  • × risky actions without human approval

Token math

Self-review also spends tokens. Simple rule:

If the workflow will not repeat, do not optimize it.

If a run costs €0.10 and the review costs €0.30, the rewrite pays off only once the per-run savings across future runs add up to more than that €0.30. Daily skill: yes. One-off experiment: no.
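
The break-even point can be worked out directly. The sketch below uses the €0.10 and €0.30 figures from the text. The 80% saving is borrowed from the community example as an assumption, and your own rewrite may save more or less. `break_even_runs` is a hypothetical helper name.

```python
import math


def break_even_runs(run_cost: float, review_cost: float, saving_fraction: float) -> int:
    """Number of future runs needed before the review cost is recovered."""
    if run_cost <= 0 or review_cost < 0:
        raise ValueError("run_cost must be positive and review_cost non-negative")
    if not 0 < saving_fraction <= 1:
        raise ValueError("saving_fraction must be in (0, 1]")
    saving_per_run = run_cost * saving_fraction
    # Round first so float noise (e.g. 3.0000000001) does not add a spurious run.
    return math.ceil(round(review_cost / saving_per_run, 9))


# EUR 0.10 per run, EUR 0.30 for the review, assumed 80% saving per run.
print(break_even_runs(0.10, 0.30, 0.8))  # prints 4
```

With these numbers each future run saves €0.08, so the review is paid back on the fourth run. At a 20% saving the same review would need 15 runs.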

Community patterns

Practical rules

  • One skill = one narrow scope. Do not mix research, coding, and communication.
  • Write a tiny contract first: input, output, success criteria.
  • Stop after two review rounds.
  • Use a cheaper model for review when possible.

FAQ

What does self-improvement mean for Hermes Agent?

Hermes can review completed tasks, spot inefficient steps, and improve skills or prompts for the next run.

Do I need special configuration?

No. Use the skill system: run the task, request a review, rewrite the skill, and test it again.

When is it worth it?

When the workflow repeats. For one-off tasks, the review can cost more than it saves.
