Amazon Bedrock announces preview of prompt caching
News
Amazon Bedrock has announced a preview of prompt caching, a new feature designed to optimize generative AI model performance and reduce costs.
- Reduces costs by up to 90% and latency by up to 85% for supported models
- Caches frequently used prompts across multiple API calls
- Avoids reprocessing repetitive context like system prompts and common examples
- Currently available for Claude 3.5 Haiku, Claude 3.5 Sonnet v2, and Nova models
- Initially limited to select customers in US West (Oregon) and US East (N. Virginia) regions
The feature is part of Amazon Bedrock's broader goal of providing secure, privacy-focused generative AI capabilities with improved performance and cost-efficiency.
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.