Claude API Cost Calculator

Compare Opus 4.7, Sonnet 4.6, and Haiku 4.5 with prompt caching savings.

Quick Answer

Claude Opus 4.7 is $15 / $75 per 1M tokens (input / output). Sonnet 4.6 is $3 / $15. Haiku 4.5 is $1 / $5. Prompt caching adds a 1.25x premium on cache writes but drops cache reads to 10% of the base input rate, saving 60-90% on stable prefixes that repeat across requests.


Model               Pricing                 Per call   Monthly
Claude Opus 4.7     $15/M in · $75/M out    $0.0900    $900.00
Claude Sonnet 4.6   $3/M in · $15/M out     $0.0180    $180.00
Claude Haiku 4.5    $1/M in · $5/M out      $0.0060    $60.00

About This Tool

The Claude API Cost Calculator handles the three current Anthropic tiers — Opus 4.7, Sonnet 4.6, and Haiku 4.5 — with full support for prompt caching math. Enter input and output tokens per call, optional cached prefix size, the number of times that cache gets reused before refresh, and a monthly request volume. The calculator outputs per-call and monthly cost for each model.
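A minimal sketch of that per-call and monthly math in Python (the dict keys and function names are mine; the rates are the April 2026 figures quoted below):

```python
# Published per-million-token rates (April 2026); dict keys are my own labels.
PRICES = {
    "opus-4.7":   {"in": 15.0, "out": 75.0},
    "sonnet-4.6": {"in": 3.0,  "out": 15.0},
    "haiku-4.5":  {"in": 1.0,  "out": 5.0},
}

def per_call_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Uncached dollar cost of one API call."""
    p = PRICES[model]
    return (input_tokens * p["in"] + output_tokens * p["out"]) / 1_000_000

def monthly_cost(model: str, input_tokens: int, output_tokens: int,
                 calls: int) -> float:
    """Dollar cost of `calls` identical requests per month."""
    return per_call_cost(model, input_tokens, output_tokens) * calls
```

For example, a call with 3,000 input and 600 output tokens comes to $0.09 on Opus and $0.018 on Sonnet, reflecting the flat 5x price gap between the two tiers.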

Anthropic pricing (April 2026)

Opus 4.7: $15 input / $75 output per million tokens. Sonnet 4.6: $3 / $15. Haiku 4.5: $1 / $5. The 5x output multiplier is consistent across the lineup. Sonnet sits in a sweet spot: five times cheaper than Opus on input, while giving up little on most evaluation benchmarks for non-frontier tasks.

Prompt caching mechanics

Cache writes cost 1.25x the base input rate. Cache reads cost 0.1x, a flat 90% discount. If a 10,000-token system prompt is reused across 100 follow-up turns, you pay 1.25x once and 0.1x ninety-nine times. On Opus, the cost of that prefix collapses from $0.15 per call at full price to $0.015 per cache read, about $0.017 per call averaged across all 100 turns.
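The prefix arithmetic above can be written out directly (a sketch; the multipliers are the ones quoted in this section, the function name is mine):

```python
CACHE_WRITE_MULT = 1.25  # premium on the first, cache-creating call
CACHE_READ_MULT = 0.10   # rate paid on every subsequent cache hit

def cached_prefix_cost(prefix_tokens: int, base_in_per_m: float,
                       reads: int) -> float:
    """Total dollar cost of a cached prefix: one write plus `reads` cache hits."""
    per_pass = prefix_tokens * base_in_per_m / 1_000_000
    return per_pass * CACHE_WRITE_MULT + reads * per_pass * CACHE_READ_MULT
```

A 10,000-token prompt on Opus ($15/M input) with 99 cache reads costs about $1.67 in total, versus $15.00 for 100 uncached passes over the same prefix.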

Caching pays for itself after a single cache read: each read saves 0.9x of the base input rate, more than covering the 0.25x write premium, and every additional reuse compounds the savings. The default cache TTL is 5 minutes; an extended 1-hour cache is available for batch workloads. A cache hit requires a byte-identical prefix; even a single-character change invalidates the entry.
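One way to see the break-even point, under the same 1.25x / 0.1x multipliers: express the cached cost of a prefix as a fraction of its uncached cost.

```python
def cache_net_multiplier(reads: int, write_mult: float = 1.25,
                         read_mult: float = 0.10) -> float:
    """Cached cost divided by uncached cost, for one cache write plus `reads` hits."""
    return (write_mult + read_mult * reads) / (1 + reads)
```

`cache_net_multiplier(1)` is 0.675, so even a single reuse cuts the prefix bill by about a third; with no reuse at all the ratio is 1.25, a pure loss. At 99 reads the ratio falls to roughly 0.11, approaching the 90% discount floor.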

When to skip caching

Skip caching if your prompt changes on every call: you pay the 1.25x write premium and never collect a discounted read. Skip for low-volume workloads where reuse within the cache TTL is unlikely. And skip when prompts fall under the ~1,024-token caching minimum; shorter prefixes can't be cached, and near the minimum the absolute savings are small anyway.
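Those rules reduce to a simple net-savings check (a sketch; the 1,024-token minimum is the approximate figure from above, and the function name is mine):

```python
MIN_CACHEABLE_TOKENS = 1024  # approximate minimum cacheable prefix size

def cache_savings_usd(prefix_tokens: int, base_in_per_m: float,
                      reads: int) -> float:
    """Dollars saved vs. resending the prefix uncached; negative means caching loses."""
    if prefix_tokens < MIN_CACHEABLE_TOKENS:
        return 0.0  # prefix too small to cache at all
    per_pass = prefix_tokens * base_in_per_m / 1_000_000
    uncached = (1 + reads) * per_pass
    cached = (1.25 + 0.10 * reads) * per_pass
    return uncached - cached
```

With `reads=0` (the prompt changed before any reuse) the result is negative: the write premium was paid for nothing.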

Compare pricing against alternatives: GPT API cost calculator, Gemini API cost calculator, and the side-by-side LLM cost comparison. For deeper cache modeling, use the prompt caching savings calculator. To estimate raw token counts from text, use the token counter.

Frequently Asked Questions

How does Claude prompt caching change cost?
Cache writes cost 1.25x the base input price (a one-time premium). Cache reads cost 0.1x the base, a 90% discount. If a 10K-token system prompt is reused 100 times, you write once at 1.25x and read 99 times at 0.1x. That cuts the bill on that prompt by roughly 89% versus paying full price each call.
Which Claude model should I default to?
Sonnet 4.6 at $3 input / $15 output. It handles 90% of production workloads at a fraction of Opus pricing. Reach for Opus 4.7 ($15/$75) when you need the strongest reasoning or coding. Drop to Haiku 4.5 ($1/$5) for high-volume classification and extraction.
When does cached input pay off?
When the same prefix repeats across many requests. Common cases: stable system prompts, retrieved documents reused across follow-up turns, few-shot examples. A single cache read already saves more (0.9x) than the write premium costs (0.25x), and every reuse after that compounds the savings. Cache TTL defaults to 5 minutes; an extended 1-hour cache is also available.
How does Claude pricing compare to GPT-4o?
Claude Sonnet 4.6 ($3/$15) is more expensive than GPT-4o ($2.50/$10) on a per-token basis. Sonnet typically wins on coding, structured output, and long-context tasks. Haiku 4.5 ($1/$5) is more expensive than GPT-4o-mini ($0.15/$0.60), but tends to outperform on instruction following.
Are output tokens really 5x more than input?
Yes. Across all Claude tiers, output costs 5x input. Caching never touches output tokens, so the two levers are complementary: cache stable prefixes to cut input cost, and keep responses concise (for instance by setting a sensible max_tokens) to control the 5x-priced output side.