Amazon Bedrock Agents, Flows, and Knowledge Bases now support Latency Optimized Models
Amazon Bedrock has introduced support for latency-optimized models in its Agents, Flows, and Knowledge Bases services, enhancing AI application performance.
- Supports Anthropic's Claude 3.5 Haiku and Meta's Llama 3.1 models (405B and 70B)
- Provides faster response times without compromising accuracy
- Ideal for real-time applications like customer service chatbots and coding assistants
- Leverages AWS Trainium2 and advanced software optimizations
- Can be integrated into existing applications without additional setup
- Available in the US East (Ohio) Region through the SDK, enabled with a runtime configuration setting
This update lets developers build more responsive AI applications by reducing model response latency.
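As a rough illustration of what "runtime configuration" could look like for an agent, the sketch below builds the request for an `invoke_agent` call with boto3. It assumes the latency setting is passed as `bedrockModelConfigurations.performanceConfig.latency`, which should be checked against the current SDK reference. The helper names, agent ID, alias ID, and session ID are hypothetical placeholders, not values from the announcement.

```python
from typing import Any

# US East (Ohio), the Region named in the announcement.
OHIO_REGION = "us-east-2"


def build_invoke_agent_request(
    agent_id: str,
    agent_alias_id: str,
    session_id: str,
    prompt: str,
    latency: str = "optimized",
) -> dict[str, Any]:
    """Build keyword arguments for an InvokeAgent call.

    Assumption: the latency mode is selected per request through
    bedrockModelConfigurations.performanceConfig.latency, with the
    values "standard" and "optimized".
    """
    if latency not in ("standard", "optimized"):
        raise ValueError(f"unsupported latency mode: {latency!r}")
    return {
        "agentId": agent_id,
        "agentAliasId": agent_alias_id,
        "sessionId": session_id,
        "inputText": prompt,
        "bedrockModelConfigurations": {
            "performanceConfig": {"latency": latency},
        },
    }


def invoke_agent_low_latency(
    agent_id: str, agent_alias_id: str, session_id: str, prompt: str
) -> str:
    """Call the agent with latency optimization and return the full reply.

    Requires boto3 and valid AWS credentials; imported lazily so the
    request builder above can be used without them.
    """
    import boto3

    client = boto3.client("bedrock-agent-runtime", region_name=OHIO_REGION)
    response = client.invoke_agent(
        **build_invoke_agent_request(agent_id, agent_alias_id, session_id, prompt)
    )
    # The reply arrives as an event stream; text is carried in "chunk" events.
    parts = []
    for event in response["completion"]:
        if "chunk" in event:
            parts.append(event["chunk"]["bytes"].decode("utf-8"))
    return "".join(parts)
```

Keeping the request builder separate from the network call makes it easy to switch between `"standard"` and `"optimized"` per request and to compare response times for the same prompt. Flows and Knowledge Bases are understood to accept a similar performance setting on their own runtime calls, under different parameter names.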
Related articles
- Dec 3, 2024: Introducing latency-optimized inference for foundation models in Amazon Bedrock
- Dec 3, 2024: Amazon Bedrock now supports multi-agent collaboration
- Mar 5, 2025: Announcing latency-optimized inference for Amazon Nova Pro foundation model in Amazon Bedrock
- Apr 30, 2026: Amazon Bedrock AgentCore launches capabilities for optimizing agent performance in preview