Home icon

Amazon SageMaker AI now supports Flexible Training Plans capacity for Inference

News



This article announces that Amazon SageMaker AI's Flexible Training Plans (FTP) now support inference endpoints, enabling customers to reserve guaranteed GPU capacity for model evaluation and production workloads.

  • Reserve specific instance types for inference endpoints with guaranteed capacity
  • Automatically provision and run endpoints on reserved capacity without infrastructure management
  • Choose instance types, compute requirements, reservation length, and start date
  • Reference reservation ARN when creating endpoints for automatic provisioning
  • Available in US East (N. Virginia), US West (Oregon), and US East (Ohio)

SageMaker AI FTP for inference eliminates infrastructure management overhead, allowing customers to run inference predictably while focusing on model optimization.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Jan 14
2026
Transform AI development with new Amazon SageMaker AI model customization and large-scale training capabilities
May 4
2026
Amazon SageMaker AI Now Supports Capacity-Aware Inference with Automatic Instance Fallback
Feb 20
2026
Amazon SageMaker AI in 2025, a year in review part 1: Flexible Training Plans and improvements to price performance for inference workloads
Apr 22
2026
Amazon SageMaker AI now supports optimized generative AI inference recommendations

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.