Original agent spend benchmark

AI Agent Runaway Spend Benchmark

Autonomous agents do not need malicious intent to create expensive incidents. Loops, retries, delegated sub-agents, and MCP tool fanout can turn small unit costs into thousands of dollars before a dashboard catches up.

Benchmark method

This benchmark models common autonomous-agent failure modes using five variables: active agents, paid calls per minute, delegation fanout, cost per call, and detection delay.

Uncontrolled cost assumes the loop continues until a human, dashboard alert, or provider billing alarm catches it. Controlled cost assumes a request-path economic firewall stops new paid calls after five minutes through budget, per-tool cap, route policy, expiry, or revocation.

The point is not that every workload has these exact numbers. The point is the curve: once agents can act in parallel, cost grows with time and fanout faster than humans can approve individual requests.

Formula

cost = agents × calls/min × fanout × minutes × cost/call

Uncontrolled: minutes = detection delay

Controlled: minutes = five-minute enforcement window

Avoided: cost blocked before the next upstream API or MCP tool call

Benchmark scenarios

Representative agent failure modes, modeled with and without request-path budget enforcement.

See AI agent cost control
ScenarioAgentsCalls/minFanoutCost/callDetectionUncontrolledControlledAvoided
Single coding agent loop1181×$0.0645 min$49$590%
MCP tool retry storm1283×$0.1260 min$2,074$17392%
Support-agent swarm5064×$0.0490 min$4,320$24094%
Premium research workflow20105×$0.2530 min$7,500$1,25083%
Enterprise background agents20042×$0.03120 min$5,760$24096%

Findings

Detection delay dominates cost

A dashboard that notices spend after 30-120 minutes is too late. The expensive decision has already happened thousands of times.

Fanout multiplies every mistake

Sub-agents, MCP tools, retries, and background workers turn one bad loop into a parallel spend event.

Small unit costs still become material

A few cents per call looks harmless until agents generate thousands of paid requests before anyone sees the bill.

Inline enforcement changes the curve

Budget checks, per-tool caps, route policy, expiry, and revocation stop the next request instead of explaining the last one.

Observe

Route agent traffic through SatGate to attribute cost by agent, workflow, route, tool, tenant, and MCP server before enforcing hard limits.

Control

Enforce per-agent budgets, per-tool caps, route policy, revocation, expiry, and kill switches before upstream API calls execute.

Charge

When external agents become API customers, use SatGate Charge with L402 Lightning payments to collect before access is granted.

The fix is not a better bill. It is a pre-request decision.

SatGate is the economic control plane for AI agents: observe cost, control spend before execution, and charge robot customers when autonomous systems need paid API access.