Blog

huecki

Practical writing about agents, automation, and software architecture.

Detected Focus

Agent Harnesses

Several recent posts point to the same engineering problem: agents do not only need better prompts. They need a runtime contract around the prompt.

The minimal model

A useful harness defines the agent’s phase, allowed actions, required evidence, exit condition, and blocker rules. That is the difference between an AI that tries to help and an agent that can be operated.

Read the full 2-minute summary

An agent harness is the control layer around an AI agent. It defines allowed actions, state transitions, required evidence, and stop conditions.

Without a harness, the model decides too much from chat context. It may edit before inspecting, mark work done without running checks, keep looping after the useful work is finished, or create fake review comments because the prompt asked it to criticize.

The practical fix is to move those rules out of vibes and into an explicit contract: phase state controls which actions are allowed; evidence gates control when the agent may claim progress; exit conditions control when the loop stops; blocker rules control when the agent must ask for help; review rules prevent pointless self-critique.

Agent harnesses should be specs, not hidden glue code

Natural-Language Agent Harnesses give a useful name to an important shift: the agent policy should be an inspectable document that a runtime executes, not invisible glue hidden inside controller code.

Give Your Agent Seatbelts, Not a Longer Prompt

When an agent keeps jumping from planning to editing to testing at the wrong time, the fix is not usually another paragraph of system prompt. Put the workflow into explicit states, give each state a tiny tool policy, and make phase changes visible.

AI Agents Need Evidence Before They Click

When an agent clicks, sends, pays, deletes, or extracts data, the critical truth cannot live only in model prose. Put a small evidence gate before risky tool calls: predicate, evidence type, source, decision.

Agents Don’t Need ‘Keep Going’. They Need Exit Conditions.

The useful lesson behind Claude Code /goal is not that agents can run forever. It is that long-running agent work needs an explicit, observable exit condition: what proves done, what stays in scope, and when to stop blocked.

Stop Asking AI to Critically Self-Check

Open-ended instructions like “critically self-check this” accidentally reward the model for producing criticism. The fix is not less review. It is calibrated review: explicit criteria, PASS_NO_CHANGE, evidence per finding, severity thresholds, and a tiny change budget.

July 17, 2026 · AI Agent Workflows

Your Agent Harness Needs a Behavior Map

Harness Handbook points at a practical bottleneck in agent engineering: the behavior you want to change is scattered across prompts, state managers, tool calls, policy code, and tests. Build a behavior map before editing the harness.

AI AgentsAgent HarnessCoding AgentsDeveloper WorkflowAI Engineering