Amazon SageMaker AI now supports Flexible Training Plans capacity for Inference

News

This article announces that Amazon SageMaker AI's Flexible Training Plans (FTP) now support inference endpoints, enabling customers to reserve guaranteed GPU capacity for model evaluation and production workloads.

Reserve specific instance types for inference endpoints with guaranteed capacity
Automatically provision and run endpoints on reserved capacity without infrastructure management
Choose instance types, compute requirements, reservation length, and start date
Reference reservation ARN when creating endpoints for automatic provisioning
Available in US East (N. Virginia), US West (Oregon), and US East (Ohio)

SageMaker AI FTP for inference eliminates infrastructure management overhead, allowing customers to run inference predictably while focusing on model optimization.

Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Jan 14
2026

Transform AI development with new Amazon SageMaker AI model customization and large-scale training capabilities

May 4
2026

Amazon SageMaker AI Now Supports Capacity-Aware Inference with Automatic Instance Fallback

Feb 20
2026

Amazon SageMaker AI in 2025, a year in review part 1: Flexible Training Plans and improvements to price performance for inference workloads

Apr 22
2026

Amazon SageMaker AI now supports optimized generative AI inference recommendations

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Amazon SageMaker AI now supports Flexible Training Plans capacity for Inference

Related articles