
Function Calling Cost Calculator

Estimate agentic workflow cost with N tool calls per query.

Quick Answer

Each tool call re-sends the full conversation history. A 3-tool-call query with a 5K-token system + tool definitions costs 4-6x a single-shot query. Tool definitions typically run 200-500 tokens each. Prompt caching on stable prefixes recovers 60-90% of the input cost.

Agent overhead: input tokens are 11.8x the single-shot equivalent due to repeated context across 4 iterations.
| Model | Input tokens | Per query | Monthly |
|---|---|---|---|
| GPT-4o | 25,900 | $0.0712 | $711.50 |
| GPT-4o-mini | 25,900 | $0.0043 | $42.69 |
| Claude Sonnet 4.6 | 25,900 | $0.0873 | $873.00 |
| Claude Haiku 4.5 | 25,900 | $0.0291 | $291.00 |
| Gemini 2.5 Pro | 25,900 | $0.0356 | $355.75 |
| Gemini 2.5 Flash | 25,900 | $0.0094 | $93.70 |
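
A minimal sketch of how figures like these can be reproduced. The per-million-token rates below are illustrative assumptions, not published prices, and the 10,000-queries-per-month multiplier is inferred from the ratio of the Monthly and Per-query columns. The sketch counts input tokens only, so it slightly undershoots the table, which presumably also counts output tokens.

```python
# Sketch: reproduce a cost table from token counts and pricing.
# Rates are illustrative assumptions (USD per million input tokens),
# not authoritative prices -- check your provider's current price sheet.
PRICE_PER_M_INPUT = {
    "GPT-4o": 2.50,
    "GPT-4o-mini": 0.15,
    "Claude Sonnet 4.6": 3.00,
    "Claude Haiku 4.5": 1.00,
    "Gemini 2.5 Pro": 1.25,
    "Gemini 2.5 Flash": 0.30,
}

INPUT_TOKENS = 25_900          # from the calculator scenario above
QUERIES_PER_MONTH = 10_000     # assumption inferred from the Monthly column

for model, rate in PRICE_PER_M_INPUT.items():
    per_query = INPUT_TOKENS / 1_000_000 * rate
    print(f"{model:>20}: ${per_query:.4f}/query  "
          f"${per_query * QUERIES_PER_MONTH:,.2f}/month")
```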

About This Tool

The Function Calling Cost Calculator estimates the true cost of agentic LLM workflows where the model makes one or more tool calls before producing a final answer. Standard cost calculators understate this — they don't account for the conversation history accumulating across iterations.

Why agent costs balloon

Every tool call triggers a fresh model invocation. The full conversation — system prompt, tool definitions, user query, all prior tool calls, all prior tool results — gets re-sent on every step. A 3-tool-call query with a 5000-token system prompt and 3000-token tool catalog ends up sending 30K+ tokens to the model across iterations, even though the user's actual question was only 200 tokens.
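
To make the accumulation concrete, here's a toy simulation of that 3-tool-call query. The per-call and per-result token sizes are assumptions chosen for illustration, not measurements.

```python
# Toy simulation: input tokens re-sent across a 3-tool-call agent loop.
# All sizes are illustrative assumptions.
SYSTEM_PROMPT = 5_000   # static system prompt tokens
TOOL_CATALOG = 3_000    # static tool definition tokens
USER_QUERY = 200
TOOL_CALL = 150         # assumed tokens per tool-call JSON the model emits
TOOL_RESULT = 700       # assumed tokens per tool result fed back in

context = SYSTEM_PROMPT + TOOL_CATALOG + USER_QUERY
total_input = 0
for step in range(4):                     # 3 tool calls + 1 final answer
    total_input += context                # full history re-sent each step
    context += TOOL_CALL + TOOL_RESULT    # history grows before next step

print(total_input)  # 37,900 tokens for a 200-token question
```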

The math, step by step

Iteration 1: system + tools + query in, tool call JSON out. Iteration 2: same input plus tool result, tool call out. Iteration N+1: all prior context plus the last tool result, final answer out. Total input grows roughly quadratically, since each iteration re-sends everything before it. For a 5-tool-call query with a 5K-token static prefix, you can easily hit 50K total input tokens.
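
That growth has a simple closed form: with a static prefix P (system + tools + query) and a per-step delta d (one tool call plus one result), total input across n tool calls is (n+1)·P + d·n(n+1)/2. A quick sketch, where the ~1.3K-token delta is an assumption:

```python
def total_input_tokens(n_calls: int, prefix: int, delta: int) -> int:
    """Total input tokens across n_calls tool calls plus the final answer.

    Each of the n_calls + 1 invocations re-sends the static prefix, and
    invocation k additionally re-sends the k-1 deltas accumulated so far.
    """
    return (n_calls + 1) * prefix + delta * n_calls * (n_calls + 1) // 2

# 5 tool calls, 5K static prefix, ~1.3K tokens per call+result pair (assumed)
print(total_input_tokens(5, 5_000, 1_300))  # 49,500 -- right at the ~50K mark
```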

Cost-reduction levers

Prompt caching: cache the system prompt and tool definitions. Anthropic's 90% discount on cache reads turns a $0.015 input cost per iteration into $0.0015. Across 5 iterations that's roughly $0.07 saved per query.

Tool routing: don't carry all 50 tools on every call. Use a router model to pick the 3-5 tools relevant to the query, then run the agent with just those.

Parallel tools: most providers support multiple tool calls per turn. Use them when the calls are independent.
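
A sketch of the caching arithmetic, assuming Anthropic-style pricing where a cache write costs 1.25x the base input rate and a cache read costs 0.1x. Note the write premium on the first iteration trims the headline savings slightly.

```python
def agent_prefix_cost(prefix_tokens: int, iterations: int, rate_per_m: float,
                      cached: bool) -> float:
    """Input cost (USD) of the static prefix across an agent loop.

    Assumes Anthropic-style cache pricing: writes at 1.25x the base input
    rate, reads at 0.1x. Ignores the growing tool-call history, which is
    billed at the base rate either way.
    """
    base = prefix_tokens / 1_000_000 * rate_per_m
    if not cached:
        return base * iterations
    # first iteration writes the cache, the rest read it
    return base * 1.25 + base * 0.10 * (iterations - 1)

# 5K-token prefix, 5 iterations, $3/M input (illustrative Sonnet-class rate)
print(agent_prefix_cost(5_000, 5, 3.0, cached=False))  # $0.075
print(agent_prefix_cost(5_000, 5, 3.0, cached=True))   # ~$0.025
```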

Model selection for agents

Default to a fast cheap model (Haiku 4.5, GPT-4o-mini, Gemini Flash) for tool routing and execution. Escalate to Sonnet, GPT-4o, or Gemini Pro only when the query requires deeper reasoning. A typical agent stack runs 80% of work on Haiku and 20% on Sonnet, saving 70-80% versus running everything on the flagship.
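
The blended-cost arithmetic behind that claim, assuming GPT-4o-mini at $0.15/M and GPT-4o at $2.50/M input tokens (verify against the current price sheet):

```python
# Blended input cost of a two-tier agent stack vs. flagship-only.
# Rates assume GPT-4o-mini at $0.15/M and GPT-4o at $2.50/M input tokens.
CHEAP_RATE, FLAGSHIP_RATE = 0.15, 2.50
TOKENS_PER_QUERY = 25_900
CHEAP_SHARE = 0.80             # fraction of work the cheap tier handles

cheap = TOKENS_PER_QUERY / 1e6 * CHEAP_RATE
flagship = TOKENS_PER_QUERY / 1e6 * FLAGSHIP_RATE

blended = CHEAP_SHARE * cheap + (1 - CHEAP_SHARE) * flagship
print(f"${blended:.4f} blended vs ${flagship:.4f} flagship-only")
print(f"saved: {1 - blended / flagship:.0%}")   # ~75%
```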

Reliability vs cost

Cheaper models fail tool calls more often: wrong arguments, hallucinated tool names, malformed JSON. Each failure forces a retry, which compounds cost. A model that's 50% cheaper but fails twice as often costs about the same once retries are counted. Always evaluate quality-adjusted cost on real agent traces.
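
One way to put a number on that: divide per-attempt cost by the tool-call success rate to get the expected cost per successful call, assuming each failure is retried from scratch. A sketch with made-up rates chosen to illustrate the break-even:

```python
def cost_per_success(cost_per_attempt: float, success_rate: float) -> float:
    """Expected cost per successful tool call, assuming each failure is
    retried from scratch (geometric number of attempts)."""
    return cost_per_attempt / success_rate

# Made-up numbers illustrating the break-even above: a model that's 50%
# cheaper but needs twice the attempts costs the same per success.
flagship = cost_per_success(0.010, 0.98)   # $0.0102
cheap    = cost_per_success(0.005, 0.49)   # $0.0102 -- no savings at all
print(flagship, cheap)
```

In a real agent loop the penalty is worse than this sketch suggests, since every failed attempt also burns the full re-sent context.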

Pair with the prompt caching savings calculator, LLM cost comparison, LLM latency calculator, and Claude cost calculator. For total agent budget, see AI monthly budget calculator.

Frequently Asked Questions

Why do agentic workflows cost so much more than single calls?
Each tool call triggers a fresh model invocation. The full conversation history (initial prompt + each tool call + each tool result) gets re-sent on every step. A 5-tool-call query can cost 3-8x a single shot because input tokens compound across iterations.
How much does a tool definition cost in tokens?
Each function definition (name, description, JSON schema) typically runs 200-500 tokens. A 10-tool agent might carry 3-5K tokens of definitions on every call. They're sent on every iteration unless you trim the toolset based on context.
Can prompt caching help with agent workflows?
Yes — significantly. Cache the system prompt + tool definitions. Anthropic's cache cuts that prefix to 10% on reads. A 5K-token tool catalog cached across 10 iterations saves $0.05-$0.50 per query depending on model.
What's the cheapest way to run a multi-tool agent?
Haiku 4.5 or GPT-4o-mini for routing + tool execution, escalating to Sonnet/GPT-4o only when reasoning requires it. Add prompt caching on stable prefixes. Use parallel tool calls where possible — most providers support concurrent function calls in a single turn.
How do tool results affect output token cost?
Tool results count as input on the next iteration. Output cost is just the model's text response and tool call requests — usually small. The bigger cost driver is repeated input tokens accumulating across iterations.