Announcing latency-optimized inference for Amazon Nova Pro foundation model in Amazon Bedrock
News
Amazon has announced latency-optimized inference for the Nova Pro foundation model in Amazon Bedrock, focusing on improved performance for generative AI applications.
- Enables faster response times for latency-sensitive applications
- Requires no additional setup or model fine-tuning
- Allows immediate performance enhancement for existing applications
- Available via cross-region inference in three US regions:
- US West (Oregon)
- US East (Virginia)
- US East (Ohio)
This enhancement provides developers with more flexibility to optimize generative AI performance and improve end-user experience without complex configurations.
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
Dec 3
2024
2024
Introducing latency-optimized inference for foundation models in Amazon Bedrock
Dec 23
2024
2024
Amazon Bedrock Agents, Flows, and Knowledge Bases now supports Latency Optimized Models
Jan 28
2025
2025
Optimizing AI responsiveness: A practical guide to Amazon Bedrock latency-optimized inference
Dec 3
2024
2024
Announcing Amazon Nova foundation models available today in Amazon Bedrock
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.