Prompt Cache Savings Calculator
Model repeated prompt-cache scenarios and compare cached versus uncached request cost.
Also useful for: prompt caching calculator prompt cache savings cached token calculator llm cache cost calculator Prompt Cache Savings Calculator
Prompt Cache Savings Calculator
Runs in your browser. Inputs are not uploaded.Prompt Cache Savings Calculator Example Run
Compare repeated requests with and without cache reads.
Sample inputs Traffic Repeated requests: 1000 Input tokens per request: 12000 Cached input tokens: 10000 Output tokens per request: 400 Rates Base input per 1M: 3 Cache write per 1M: 3.75 Cache read per 1M: 0.30 Output per 1M: 15 Generated result Summary: Estimated savings: $26.9625 across 1000 requests. Stats - Requests: 1000 - Cached Tokens: 10000 - Savings: 26.9625 Issues - No issues found. Output Without cache: $42.0000 With cache: $15.0375 Estimated savings: $26.9625 Savings rate: 64.2%
What the Prompt Cache Savings Calculator Checks
-
Repeated request count
Savings are calculated across a batch of repeated calls, not a single isolated prompt.
-
Cached token boundary
Cached tokens are capped at the total input token count to avoid impossible scenarios.
-
Write versus read rates
The calculator separates first cache write cost from later cache read cost.
-
Output token cost
Generated output is included in both cached and uncached totals.
-
Scenario clarity
The result shows absolute savings and savings rate so tiny wins are easy to spot.
Prompt Cache Savings Calculator Questions
- Q: Does the prompt cache savings calculator upload my token assumptions? A: No. Inputs are processed in your browser, and pasted content is not sent to analytics or a server.
- Q: What should I paste into the prompt cache savings calculator? A: Enter request count, input tokens, cached tokens, output tokens, and the relevant per-million rates.
- Q: What can I copy or download? A: You can copy the scenario summary with cached, uncached, and savings totals.
- Q: What does the prompt cache savings calculator not verify? A: It does not confirm a provider cache hit, account for minimum cache lengths, or update prices automatically.
Related Tools
Updates
- v1.0.0 New 2026-05-17: Initial cached-token savings model and cost comparison output added.