Published: March 18, 2026 | 8 min read

When Agents Break in Production: The Incident Response Mystery (Solved by Cryptographic Proof)

The Incident

It's 3am. Your on-call engineer gets paged. A customer's loan approval agent just rejected 47 legitimate applications with the same error: "Risk score unavailable."

Your team starts the investigation.

Hypothesis 1: Did the risk-scoring API go down?

Check API logs: No errors. API was up the whole time.

Hypothesis 2: Did the model change?

Check git logs: Model version is the same as 6 hours ago when everything worked.

Hypothesis 3: Did the prompt drift?

Check prompt history: Same prompt. No updates in 24 hours.

Hypothesis 4: Did the agent hallucinate the API response?

Check agent logs: Agent claims it called the API and got a response. But did it really? The log says it did, but logs are just claims. They're written by the same agent that might be hallucinating.

At 5am, still no root cause. Your customer is furious. Your compliance team is asking if this violated regulatory requirements.

At 7am, someone realizes: the API endpoint changed its schema yesterday. The agent called the API successfully, but the response format changed, and the agent hallucinated a confidence score when the field didn't exist. The agent didn't know it hallucinated—it confidently reported a score based on inference over missing data.

Total incident response time: 4 hours. All preventable.

The Problem: Logs Are Claims, Not Proof

When an agent says "I called the API," that's a claim. When you check the logs, you're reading claims. Neither is proof.

Agent: "I called GET /risk-score?customer_id=123"
Agent: "Response: {status: 200, score: 45.2, confidence: 0.98}"
Agent: "Decision: approve application"

(But the API actually changed its schema)
(Response was: {status: 200, score: 45.2} — no confidence field)
(Agent hallucinated the confidence.confidence field)

Your logs show:

Agent execution time ✓
Agent decision ✓
Agent claimed outputs ✓

Your logs do NOT show:

Did the API response actually match the claim? ✗
Did the agent hallucinate the response format? ✗
Which version of the API schema was active? ✗
Did the response timestamp match the agent execution time? ✗

Why Incident Response Is Slow Today

Average incident response time: 2–4 hours for multi-agent systems.

In regulated industries (fintech, healthcare), this becomes a compliance nightmare. Regulators ask: "Prove what happened." Your answer: "Here are logs (claims)." Their response: "That's not proof."

The Solution: Cryptographic Proof at Decision Time

Trust Layer records cryptographic proof at every decision point. Instead of claims, you get signatures.

Trust Layer captures:
- Timestamp of API call (cryptographically signed)
- API endpoint called (cryptographically signed)
- Request parameters (cryptographically signed)
- Response received (cryptographically signed)
- Response timestamp (cryptographically signed)
- Agent model version (cryptographically signed)
- Agent prompt hash (cryptographically signed)
- Agent context (cryptographically signed)
- Agent decision (cryptographically signed)

What Gets Verified

When you query Trust Layer for incident response, you get:

Execution proof: This exact agent, at this exact time, made this exact decision.
Input proof: The agent received these exact inputs (API responses, context, parameters).
Model proof: The agent used this exact model version.
Prompt proof: The agent used this exact prompt.
Output proof: The agent produced this exact output.
Causality proof: Given these inputs + this model + this prompt = this output (detects hallucinations).

Next Steps

If you're running agents in production, ask yourself:

Can you prove what your agents actually did? (Not what logs claim, but what cryptographic proofs verify?)
When an agent breaks, how long does incident response take? (Hours of guessing or minutes of verification?)
Can you prove compliance to regulators? (Logs or signatures?)

If the answers are "no," "hours," or "logs"—you need independent verification.

Trust Layer is the forensic witness your production agents need.

Not optional. Essential.

Back to ArkForge | All articles