Home icon

Amazon Bedrock announces preview of prompt caching

News



Amazon Bedrock has announced a preview of prompt caching, a new feature designed to optimize generative AI model performance and reduce costs.

  • Reduces costs by up to 90% and latency by up to 85% for supported models
  • Caches frequently used prompts across multiple API calls
  • Avoids reprocessing repetitive context like system prompts and common examples
  • Currently available for Claude 3.5 Haiku, Claude 3.5 Sonnet v2, and Nova models
  • Initially limited to select customers in US West (Oregon) and US East (N. Virginia) regions

The feature is part of Amazon Bedrock's broader goal of providing secure, privacy-focused generative AI capabilities with improved performance and cost-efficiency.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Apr 7
2025
Amazon Bedrock announces general availability of prompt caching
Apr 7
2025
Effectively use prompt caching on Amazon Bedrock
Jan 26
2026
Amazon Bedrock now supports 1-hour duration for prompt caching
Jul 10
2024
Amazon Bedrock Prompt Management and Prompt Flows now available in preview

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.