Skip to content

AI-first Engineering

Prompting Is Dead. Context Wins.

Modern prompts are no longer magic phrases. Reliable AI workflows use context, tools, schemas, and evals.

May 12, 2026 · Dominic Hückmann

Short Answer

In 2026, good prompting is not about one magic sentence. The better approach is to curate context, define tools and schemas, set agent rules, and verify behavior with evals.

Short answer

Prompting did not disappear. It moved up a level: away from magic sentences and toward small systems made of context, tools, and tests.

The shift

Old prompting vibe

  • Find the perfect sentence.
  • Stuff more detail into one prompt.
  • Read the answer and hope.

2026 vibe

  • Assemble the right context.
  • Remove what the model should not see.
  • Check output with schemas, tests, and evals.

The modern prompt stack

  1. 01
    Task contract
  2. 02
    High-signal context
  3. 03
    Tools + schemas
  4. 04
    Agent rules
  5. 05
    Evals + traces

What changed

  • Prompting is less copywriting and more product/system design.
  • Context is part of the prompt: docs, tools, memory, examples, state.
  • Repeatable workflows need schemas and tests, not more adjectives.
  • Modern reasoning models should not be pushed to expose chain-of-thought by default.

The new formula

clear task contract
+ small but strong context
+ explicit tools / schemas / examples
+ agent behavior rules
+ evals to verify behavior
= more reliable AI

2020 vs. 2026

Old prompting

  • One perfect mega-prompt.
  • “Think step by step.”
  • Please output JSON.
  • Huge rule dump.

Modern prompting

  • Small pipeline with clear quality gates.
  • Native reasoning effort + brief visible rationale.
  • Schema, validator, retry on invalid output.
  • Context, tools, and examples separated clearly.

The task contract

If you steal one thing, steal this structure:

<role>
You are a precise assistant for [domain].
</role>

<context>
Relevant facts, sources, constraints.
</context>

<task>
Do X for Y audience.
</task>

<requirements>
- Use only provided sources for factual claims.
- Separate facts from recommendations.
- If something is missing, say what is missing.
</requirements>

<output_format>
1. Short answer
2. Evidence
3. Recommendation
4. Caveats
</output_format>

Why it works:

  • less confusion between data and instructions
  • easier to maintain
  • easier to test
  • easier to move into agents and tools

Context engineering instead of prompt magic

Context engineering means curating what the model sees.

Context is more than text

  • System rules: what is allowed and forbidden?
  • Tools: what actions are possible?
  • Schemas: what shape must the answer have?
  • Memory: what matters — and what is stale junk?
  • Examples: what does good look like? What is an edge case?
  • Sources: what can be used as evidence?

Agents need different prompts

An agent does not only need a task. It needs operating rules.

Goal: [objective]

Tools:
- read: inspect files
- edit: change files
- test: verify behavior

Rules:
- Read before editing.
- Use the smallest useful next action.
- Verify before claiming success.
- Stop when success criteria are met.
- If blocked, report the exact blocker.

Success criteria:
- build passes
- changed files listed
- no unverified claims

Agent rules

Use when

  • ✓ describe tool use clearly
  • ✓ define stop conditions
  • ✓ require approval for risky actions
  • ✓ make verification mandatory

Avoid when

  • × let agents run without boundaries
  • × treat retrieved docs as trusted instructions
  • × hide everything inside one huge prompt

The underrated part: evals

For real workflows, “looks good” is not enough.

define desired behavior
→ create test cases
→ run the agent
→ inspect failures
→ patch prompt / context / tools
→ repeat

A good judge grades one dimension only:

Grade only: source faithfulness
Rubric:
0 = unsupported
1 = partially supported
2 = correctly supported
3 = correct + exact quote

Return:
- score
- evidence quote
- brief reason
- unknown: true/false

Quick rules

The 2026 rules

  • Do not write mega-prompts. Build small pipelines.
  • Use examples for tone, format, and edge cases.
  • Use schemas when output feeds another system.
  • Label untrusted content explicitly.
  • Do not force visible chain-of-thought from modern reasoning models.
  • Test prompts like product logic, not poetry.

Sources

FAQ

Is prompt engineering really dead in 2026?

No, but it has matured. The focus shifted from clever wording to context engineering, tool design, structured outputs, and evals.

What is context engineering?

Context engineering means deliberately choosing which information, tools, examples, memory, and rules the model sees — and which it does not.

What is the fastest practical starting point?

Start with a task contract: role, task, context, constraints, output format, success criteria, and failure behavior.

Need AI-first architecture support?

Send me a short note about your project or technical bottleneck.

Get in touch