Prompt Cache Savings Calculator

Model repeated prompt-cache scenarios and compare cached versus uncached request cost.

Also useful for: prompt caching calculator, prompt cache savings, cached token calculator, LLM cache cost calculator

Runs in your browser. Inputs are not uploaded.

Prompt Cache Savings Calculator Example Run

Compare repeated requests with and without cache reads.

Sample inputs
Traffic
Repeated requests: 1000
Input tokens per request: 12000
Cached input tokens: 10000
Output tokens per request: 400

Rates ($ per 1M tokens)
Base input per 1M: 3
Cache write per 1M: 3.75
Cache read per 1M: 0.30
Output per 1M: 15

Generated result
Summary: Estimated savings: $26.9625 across 1000 requests.

Stats
- Requests: 1000
- Cached Tokens: 10000
- Savings: 26.9625

Issues
- No issues found.

Output
Without cache: $42.0000
With cache: $15.0375
Estimated savings: $26.9625
Savings rate: 64.2%
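
The sample totals above can be reproduced with a short calculation. The sketch below is not the tool's source code. It assumes the cached prefix is billed once at the cache write rate and then at the cache read rate on each of the 1000 requests, which is the reading that matches the figures shown. The names `Scenario`, `cost`, and `estimate` are hypothetical and used only for illustration.

```python
from dataclasses import dataclass

TOKENS_PER_MILLION = 1_000_000


@dataclass
class Scenario:
    requests: int          # repeated requests in the batch
    input_tokens: int      # input tokens per request
    cached_tokens: int     # input tokens served from the cache
    output_tokens: int     # output tokens per request
    base_rate: float       # $ per 1M uncached input tokens
    write_rate: float      # $ per 1M tokens written to the cache
    read_rate: float       # $ per 1M tokens read from the cache
    output_rate: float     # $ per 1M output tokens


def cost(tokens: int, rate_per_million: float) -> float:
    """Dollar cost of `tokens` at a per-million-token rate."""
    return tokens * rate_per_million / TOKENS_PER_MILLION


def estimate(s: Scenario) -> dict:
    # Cap cached tokens at the input size, as the calculator describes.
    cached = min(s.cached_tokens, s.input_tokens)
    uncached = s.input_tokens - cached
    output = cost(s.output_tokens, s.output_rate)

    without_cache = s.requests * (cost(s.input_tokens, s.base_rate) + output)

    # Assumption: one cache write for the batch, then a cache read per request.
    per_request = cost(cached, s.read_rate) + cost(uncached, s.base_rate) + output
    with_cache = cost(cached, s.write_rate) + s.requests * per_request

    savings = without_cache - with_cache
    rate = savings / without_cache if without_cache else 0.0
    return {
        "without_cache": without_cache,
        "with_cache": with_cache,
        "savings": savings,
        "savings_rate": rate,
    }


sample = Scenario(1000, 12000, 10000, 400, 3, 3.75, 0.30, 15)
result = estimate(sample)
print(f"Without cache: ${result['without_cache']:.4f}")      # Without cache: $42.0000
print(f"With cache: ${result['with_cache']:.4f}")            # With cache: $15.0375
print(f"Estimated savings: ${result['savings']:.4f}")        # Estimated savings: $26.9625
print(f"Savings rate: {result['savings_rate']:.1%}")         # Savings rate: 64.2%
```

A provider that bills the first request at the write rate only, or that expires the cache between calls, would give slightly different totals, so treat the billing model here as one plausible interpretation.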

What the Prompt Cache Savings Calculator Checks

  • Repeated request count

    Savings are calculated across a batch of repeated calls, not a single isolated prompt.

  • Cached token boundary

    Cached tokens are capped at the total input token count to avoid impossible scenarios.

  • Write versus read rates

    The calculator prices the initial cache write separately from the subsequent cache reads.

  • Output token cost

    Generated output is included in both cached and uncached totals.

  • Scenario clarity

    The result shows both absolute savings and the savings rate (savings divided by the uncached total), so even small wins are easy to spot.
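
Two of the checks above are easy to see in a few lines of code: the cap on cached tokens, and why savings depend on the number of repeated requests. This is a rough sketch rather than the calculator's implementation. It assumes one cache write per batch followed by a cache read on every request, and the function names and default rates (taken from the sample inputs) are illustrative only. Output cost is omitted because it appears in both totals and cancels out of the absolute savings.

```python
TOKENS_PER_MILLION = 1_000_000


def cap_cached_tokens(cached_tokens: int, input_tokens: int) -> int:
    """Cached tokens can never exceed the request's total input tokens."""
    return min(cached_tokens, input_tokens)


def batch_savings(requests: int, input_tokens: int, cached_tokens: int,
                  base_rate: float = 3.0, write_rate: float = 3.75,
                  read_rate: float = 0.30) -> float:
    """Dollar savings for a batch under the assumed billing model.

    Each request saves (base_rate - read_rate) on the cached tokens,
    and the batch pays for a single cache write (an assumption).
    """
    cached = cap_cached_tokens(cached_tokens, input_tokens)
    saved_on_reads = requests * cached * (base_rate - read_rate) / TOKENS_PER_MILLION
    write_cost = cached * write_rate / TOKENS_PER_MILLION
    return saved_on_reads - write_cost


# A single request pays for the write without enough reads to recover it.
for n in (1, 2, 1000):
    print(f"{n:>4} requests: {batch_savings(n, 12000, 10000):+.4f}")
```

With the sample rates, one isolated request comes out slightly negative and the batch of 1000 matches the $26.9625 shown in the example run, which is why the calculator reports savings per batch rather than per prompt.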

Prompt Cache Savings Calculator Questions

  • Q: Does the prompt cache savings calculator upload my token assumptions? A: No. Inputs are processed in your browser, and pasted content is not sent to analytics or a server.
  • Q: What should I paste into the prompt cache savings calculator? A: Enter request count, input tokens, cached tokens, output tokens, and the relevant per-million rates.
  • Q: What can I copy or download? A: You can copy the scenario summary with cached, uncached, and savings totals.
  • Q: What does the prompt cache savings calculator not verify? A: It does not confirm a provider cache hit, account for minimum cache lengths, or update prices automatically.

Updates

  • v1.0.0 (2026-05-17): Initial cached-token savings model and cost comparison output added.

Use Prompt Cache Savings Calculator with an LLM

Copy the prompt or pass the llms.txt contract link to your assistant.

Prompt
Use the CleanUtils Prompt Cache Savings Calculator tool for this request.
Tool contract: https://cleanutils.com/developer-tools/prompt-cache-savings-calculator/llms.txt

Read the llms.txt contract, ask me for any missing required input, then follow the contract exactly. If the contract includes JavaScript, run `runCleanUtilsTool(userInput)` in a sandboxed JavaScript runtime and return the result with any warnings or errors.