Prompt Caching — Claude Cert Academy

Prompt caching reuses the processed prefix of a prompt across requests to cut latency and cost.

When a large, stable chunk such as a system prompt or document is reused, Claude can cache it so repeated calls skip re-processing it. This dramatically lowers cost for high-volume or long-context workloads. You mark cacheable spans explicitly in the API.

Continue to Claude Cert Academy