
Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

Machine Learning Blog



Amazon has announced the general availability of Amazon Bedrock Model Distillation, a technique that enables organizations to create smaller, more efficient AI models with performance comparable to larger foundation models.

  • Transfers knowledge from large teacher models to smaller student models
  • Focuses on improving agent function calling accuracy
  • Reduces model latency by up to 72%
  • Provides significant cost savings with smaller models
  • Supports an expanded set of models, such as Llama 3.x and Anthropic Claude models

Key benefits include faster inference, lower operational costs, and sustained high accuracy across AI agent applications. Together, these help organizations deploy sophisticated AI experiences more economically.





Related articles

  • Dec 3, 2024: Build faster, more cost-efficient, highly accurate models with Amazon Bedrock Model Distillation (preview)
  • Dec 23, 2024: Amazon Bedrock Agents, Flows, and Knowledge Bases now supports Latency Optimized Models
  • Dec 3, 2025: Amazon Bedrock now supports reinforcement fine-tuning delivering 66% accuracy gains on average over base models
  • Dec 3, 2024: Introducing latency-optimized inference for foundation models in Amazon Bedrock
