Prompt Caching Savings Calculator
Compare cached vs uncached input cost across Anthropic (90% off reads) and OpenAI (50% off).
Quick Answer
Anthropic prompt caching: writes cost 1.25x the base input rate, reads cost 0.1x (a 90% discount). The 25% write premium pays for itself on the very first cache read. OpenAI caches automatically and bills cached input tokens at 50% off. For a 5K-token prefix reused 20x, Anthropic saves ~85%; OpenAI saves ~47%.
Calculator inputs:
- Stable prefix: system prompt + tool defs + stable docs
- Dynamic per-request content: user query + dynamic context
- Cache reads per write: reuses before cache expiration
| Model | No cache | With cache | Saved | % |
|---|---|---|---|---|
| Claude Opus 4.7 (0.1x reads) | $4125.00 | $955.36 | $3169.64 | 77% |
| Claude Sonnet 4.6 (0.1x reads) | $825.00 | $191.07 | $633.93 | 77% |
| Claude Haiku 4.5 (0.1x reads) | $275.00 | $63.69 | $211.31 | 77% |
| GPT-4o (0.5x reads) | $687.50 | $389.88 | $297.62 | 43% |
| GPT-4o-mini (0.5x reads) | $41.25 | $23.39 | $17.86 | 43% |
The figures above assume a 5,000-token cached prefix, 500 dynamic tokens per request, 20 cache reads per write, and 50,000 requests per month. Output token cost is unaffected by caching and is excluded from this comparison. Anthropic requires explicit cache_control flags; OpenAI auto-caches identical prefixes of 1,024+ tokens. Cache TTL: 5 minutes by default on Anthropic, variable on OpenAI.
About This Tool
The Prompt Caching Savings Calculator quantifies what prompt caching saves on your specific workload. Enter the size of your stable prefix (system prompt, tool definitions, retrieved documents), the size of variable per-query content, the number of cache reads per write before the cache expires, and your monthly request volume. The tool computes input cost with and without caching across Anthropic and OpenAI models.
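As a rough sketch of the arithmetic the calculator runs (the function name, argument names, and example price below are illustrative assumptions, not the tool's actual source), the cached and uncached input cost for one write-plus-reads cycle can be computed like this:

```python
# Illustrative sketch of the calculator's core arithmetic (assumed, not the tool's source).
# Prices are USD per million input tokens and may change; check provider pricing pages.

def input_cost_per_cycle(prefix_tokens: int, dynamic_tokens: int, reads_per_write: int,
                         price_per_mtok: float, write_mult: float, read_mult: float) -> tuple[float, float]:
    """Return (uncached, cached) input cost for one cache write plus N cache reads."""
    requests = reads_per_write + 1
    per_request = prefix_tokens + dynamic_tokens
    uncached = requests * per_request * price_per_mtok / 1e6
    cached = ((write_mult * prefix_tokens + dynamic_tokens)                  # first request writes the cache
              + reads_per_write * (read_mult * prefix_tokens + dynamic_tokens)) * price_per_mtok / 1e6
    return uncached, cached

# Anthropic-style rates: 1.25x writes, 0.1x reads (example price: $3.00 per million input tokens).
uncached, cached = input_cost_per_cycle(5_000, 500, 20, price_per_mtok=3.00,
                                        write_mult=1.25, read_mult=0.10)
print(f"uncached ${uncached:.4f}  cached ${cached:.4f}  saved {1 - cached / uncached:.0%}")
```

Multiplying the per-cycle cost by the number of cycles in your monthly volume gives totals like those in the table above.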
Anthropic prompt caching mechanics
Cache writes cost 1.25x the base input rate (a 25% premium for storing the prefix); cache reads cost 0.1x (a 90% discount). The write premium pays for itself on the very first read: one write plus one read costs 1.35x, versus 2x for two uncached requests, and every additional reuse is pure savings. Cache TTL defaults to 5 minutes (refreshed with each hit); an extended 1-hour cache is available for batch workloads.
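A two-line sanity check of that break-even, using the multipliers above (variable names are just for illustration):

```python
# Per prefix token: an uncached request costs 1.0x; a cache write costs 1.25x, a cache read 0.1x.
base, write, read = 1.00, 1.25, 0.10
print(write + read, "vs", 2 * base)  # 1.35 vs 2.0 -> caching already wins on the first read
```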
To use Anthropic caching, mark cacheable blocks in your messages with cache_control: { type: "ephemeral" }. Up to 4 cache checkpoints are supported per request. Common patterns: cache the system prompt, cache tool definitions separately, cache retrieved documents that span multiple follow-up turns.
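A minimal sketch of that pattern with the Anthropic Python SDK; the model name, system prompt, and lookup_order tool below are placeholder assumptions:

```python
# Sketch: caching a long system prompt and tool definitions with the Anthropic SDK.
# Model name, prompt text, and the tool schema are placeholders.
import anthropic

client = anthropic.Anthropic()  # reads ANTHROPIC_API_KEY from the environment

LONG_SYSTEM_PROMPT = "..."      # the large, stable prefix you want cached

response = client.messages.create(
    model="claude-sonnet-4-5",
    max_tokens=1024,
    tools=[{
        "name": "lookup_order",
        "description": "Look up an order by id.",
        "input_schema": {"type": "object", "properties": {"order_id": {"type": "string"}}},
        "cache_control": {"type": "ephemeral"},   # checkpoint 1: tool definitions
    }],
    system=[{
        "type": "text",
        "text": LONG_SYSTEM_PROMPT,
        "cache_control": {"type": "ephemeral"},   # checkpoint 2: system prompt
    }],
    messages=[{"role": "user", "content": "Where is order 8812?"}],
)

# The usage block reports how many input tokens were written to / read from the cache.
print(response.usage.cache_creation_input_tokens, response.usage.cache_read_input_tokens)
```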
OpenAI prompt caching mechanics
OpenAI auto-caches identical prefixes of 1024+ tokens. Cache reads cost 0.5x base (a 50% discount) with no write premium, so the break-even is immediate. The trade-off: you don't control which prefixes get cached; requests are routed to servers that recently processed the same prefix, so cache hits are best-effort rather than guaranteed. Less control, more automatic.
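Because caching is automatic, the practical check is whether hits are actually happening. A small sketch with the OpenAI Python SDK (model and message content are placeholders) that reads the cached-token count reported in the response:

```python
# Sketch: verifying OpenAI's automatic prefix caching via the usage block.
# Model name and message content are placeholder assumptions.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

STABLE_PREFIX = "..."  # needs to be 1024+ tokens of stable instructions for caching to kick in

resp = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {"role": "system", "content": STABLE_PREFIX},                 # stable content first
        {"role": "user", "content": "Summarize ticket #4521."},       # dynamic content last
    ],
)

usage = resp.usage
cached = usage.prompt_tokens_details.cached_tokens  # 0 on a miss, >0 on a hit
print(f"prompt tokens: {usage.prompt_tokens}, cached: {cached}")
```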
Real-world savings examples
- Customer support bot: 10K-token system prompt + 50 tool definitions, reused across 50 turns per session. Anthropic saves ~87% on input cost; OpenAI saves ~47%.
- RAG application: 5K-token retrieved documents reused for 5 follow-up questions. Anthropic saves ~75%; OpenAI saves ~40%.
- Code agent: 8K-token codebase context reused 30 times. Anthropic saves ~85%; OpenAI saves ~48%.
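These figures follow from the same arithmetic as the earlier sketch; for instance, the support-bot case can be approximated with the assumed input_cost_per_cycle helper, ignoring per-turn dynamic tokens (which pull the numbers slightly toward the quoted ~87% and ~47%):

```python
# Approximate the support-bot example: 10K-token cached prefix, 50 reuses, prefix only.
for label, write_mult, read_mult in [("Anthropic", 1.25, 0.10), ("OpenAI", 1.00, 0.50)]:
    uncached, cached = input_cost_per_cycle(10_000, 0, 50, price_per_mtok=3.00,
                                            write_mult=write_mult, read_mult=read_mult)
    print(f"{label}: saves {1 - cached / uncached:.0%}")
# Anthropic: saves 88%   OpenAI: saves 49%
```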
How to maximize hit rate
Frontload stable content. Move system prompts and tool definitions to the very start of the message array. Put dynamic user input last. Avoid timestamp injection or per-request UUIDs in the cached portion — they invalidate the cache. Standardize formatting (whitespace, ordering) so byte-equivalent prefixes match consistently.
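One way to apply this is to render the cached portion from a fixed template with deterministic serialization and keep anything volatile in the final user turn; a sketch in which the template, helper name, and Acme policies are illustrative assumptions:

```python
# Sketch: keep the cached prefix byte-identical across requests.
# Template text, helper name, and policy contents are illustrative assumptions.
import json
from datetime import date

SYSTEM_TEMPLATE = (
    "You are a support agent for Acme.\n"
    "Policies:\n{policies}\n"
)

def build_messages(policies: dict, user_query: str) -> list[dict]:
    # Deterministic serialization (sorted keys, fixed indentation) keeps the rendered
    # prefix byte-identical for as long as the policies themselves don't change.
    stable_prefix = SYSTEM_TEMPLATE.format(
        policies=json.dumps(policies, sort_keys=True, indent=2)
    )
    return [
        {"role": "system", "content": stable_prefix},                     # cacheable: stable, first
        # Volatile details (dates, request ids) stay in the dynamic part, never in the prefix.
        {"role": "user", "content": f"[{date.today()}] {user_query}"},    # dynamic, last
    ]
```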
Pair with the Claude cost calculator, GPT cost calculator, function calling cost calculator, and LLM cost comparison. To shrink prompts before caching, use the prompt token optimizer; for text-to-token estimates, use the token counter.