Amazon SageMaker AI now supports serverless reinforcement fine-tuning for 12 additional models


Amazon SageMaker AI has added serverless reinforcement fine-tuning support for 12 more open-weight models.

Newly supported models include Qwen, DeepSeek, Llama, and gpt-oss variants
Enables fine-tuning without provisioning or managing infrastructure
Supports supervised fine-tuning (SFT), direct preference optimization (DPO), and reinforcement fine-tuning (RFT)
Reinforcement learning with verifiable rewards (RLVR) improves accuracy on verifiable tasks such as code generation and math
Reinforcement learning from AI feedback (RLAIF) uses AI-generated feedback for quality and safety alignment
Available in US East, US West, Asia Pacific, and EU regions
Pay-per-use pricing model with no cluster setup required
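
To make the RLVR item above concrete: a "verifiable" task is one where a program, rather than a human or another model, can check the output. The sketch below is a minimal illustration in plain Python of what such a reward check might look like for a math problem. It is not SageMaker AI's API; the function names `extract_final_answer` and `verifiable_reward` are hypothetical, and the rule "the last number in the completion is the answer" is an assumption made for brevity.

```python
import re
from typing import Optional


def extract_final_answer(completion: str) -> Optional[str]:
    """Return the last number that appears in the completion, or None.

    Assumption for this sketch: the model states its final answer last.
    """
    matches = re.findall(r"-?\d+(?:\.\d+)?", completion)
    return matches[-1] if matches else None


def verifiable_reward(completion: str, expected: str) -> float:
    """Score 1.0 if the extracted answer equals the expected value, else 0.0."""
    answer = extract_final_answer(completion)
    if answer is None:
        return 0.0  # nothing checkable was produced
    return 1.0 if float(answer) == float(expected) else 0.0


if __name__ == "__main__":
    print(verifiable_reward("6 * 7 is 42, so the answer is 42", "42"))
    print(verifiable_reward("I believe the answer is 41.", "42"))
```

Because the check is deterministic, the same completion always earns the same reward, which is what separates this style of feedback from the AI-generated judgments used in RLAIF.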

SageMaker AI now enables serverless fine-tuning of 12 additional models using advanced reinforcement techniques for domain-specific reasoning tasks.


