Amazon SageMaker AI now supports serverless reinforcement fine-tuning for 12 additional models
Amazon SageMaker AI now supports serverless reinforcement fine-tuning for 12 additional open-weight models.
- Newly supported models include Qwen, DeepSeek, Llama, and gpt-oss variants
- Enables fine-tuning without provisioning or managing infrastructure
- Supports supervised fine-tuning (SFT), direct preference optimization (DPO), and reinforcement fine-tuning (RFT)
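- Reinforcement learning with verifiable rewards (RLVR) improves accuracy on tasks whose outputs can be checked automatically, such as code generation and math
- Reinforcement learning from AI feedback (RLAIF) uses AI-generated feedback for quality and safety alignment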
- Available in US East, US West, Asia Pacific, and EU regions
- Pay-per-use pricing model with no cluster setup required
SageMaker AI now enables serverless fine-tuning of 12 additional models using advanced reinforcement techniques for domain-specific reasoning tasks.
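To make the idea of a "verifiable" task concrete, here is a minimal sketch of the kind of reward check that RLVR relies on for math problems: the model's completion is scored 1.0 if its final number matches a known reference answer and 0.0 otherwise. The function names (`extract_final_number`, `verifiable_reward`) and the "last number in the text" convention are assumptions made for illustration. They are not part of the SageMaker AI API, and the announcement does not describe how the service implements its rewards.

```python
import re
from typing import Optional


def extract_final_number(completion: str) -> Optional[float]:
    """Return the last number that appears in the completion, or None.

    Hypothetical convention: the final number in the text is the answer.
    Thousands separators (commas) are stripped before matching.
    """
    matches = re.findall(r"-?\d+(?:\.\d+)?", completion.replace(",", ""))
    return float(matches[-1]) if matches else None


def verifiable_reward(completion: str, reference: float, tol: float = 1e-6) -> float:
    """Binary reward: 1.0 if the extracted answer matches the reference.

    Illustrative only; a production reward would likely parse a structured
    answer format rather than scanning free text.
    """
    predicted = extract_final_number(completion)
    if predicted is None:
        return 0.0
    return 1.0 if abs(predicted - reference) <= tol else 0.0


if __name__ == "__main__":
    print(verifiable_reward("6 * 7 = 42, so the answer is 42.", 42.0))
    print(verifiable_reward("I am not sure.", 42.0))
```

Because the check is deterministic, the reward needs no human labeling or judge model, which is what separates this setup from RLAIF, where the feedback signal itself comes from another model.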
Related articles
- Apr 22, 2026: Amazon SageMaker AI now supports serverless model customization for Qwen3.5 models
- Dec 3, 2025: New serverless customization in Amazon SageMaker AI accelerates model fine-tuning
- Dec 3, 2025: New serverless model customization capability in Amazon SageMaker AI
- Jun 3, 2026: Amazon SageMaker AI launches multi-turn reinforcement learning for AI agent model customization