Home icon

Amazon SageMaker AI now supports serverless reinforcement fine-tuning for 12 additional models

News



This article announces that Amazon SageMaker AI now supports serverless reinforcement fine-tuning for 12 additional open-weight models.

  • Newly supported models include Qwen, DeepSeek, Llama, and gpt-oss variants
  • Enables fine-tuning without provisioning or managing infrastructure
  • Supports supervised fine-tuning (SFT), direct preference optimization (DPO), and reinforcement fine-tuning (RFT)
  • RLVR improves accuracy on verifiable tasks like code generation and math
  • RLAIF uses AI-generated feedback for quality and safety alignment
  • Available in US East, US West, Asia Pacific, and EU regions
  • Pay-per-use pricing model with no cluster setup required

SageMaker AI now enables serverless fine-tuning of 12 additional models using advanced reinforcement techniques for domain-specific reasoning tasks.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Apr 22
2026
Amazon SageMaker AI now supports serverless model customization for Qwen3.5 models
Dec 3
2025
New serverless customization in Amazon SageMaker AI accelerates model fine-tuning
Dec 3
2025
New serverless model customization capability in Amazon SageMaker AI
Jun 3
2026
Amazon SageMaker AI launches multi-turn reinforcement learning for AI agent model customization

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.