Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

Machine Learning Blog

Amazon has announced the general availability of Amazon Bedrock Model Distillation, a technique that enables organizations to create smaller, more efficient AI models with performance comparable to larger foundation models.

Transfers knowledge from large teacher models to smaller student models
Focuses on improving agent function calling accuracy
Reduces model latency by up to 72%
Provides significant cost savings with smaller models
Supports expanded model options like Llama 3.x and Anthropic Claude models

Key benefits include faster inference, lower operational costs, and maintaining high accuracy across various AI agent applications. The technology helps organizations deploy sophisticated AI experiences more economically and efficiently.

Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Dec 3
2024

Build faster, more cost-efficient, highly accurate models with Amazon Bedrock Model Distillation (preview)

Dec 23
2024

Amazon Bedrock Agents, Flows, and Knowledge Bases now supports Latency Optimized Models

Dec 3
2025

Amazon Bedrock now supports reinforcement fine-tuning delivering 66% accuracy gains on average over base models

Dec 3
2024

Introducing latency-optimized inference for foundation models in Amazon Bedrock

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Amazon Bedrock Model Distillation: Boost function calling accuracy while reducing cost and latency

Related articles