Home icon

Amazon SageMaker Inference now supports rolling update for inference component endpoints

News



Amazon SageMaker Inference has introduced rolling updates for inference component (IC) endpoints, offering a more efficient way to update machine learning models.

  • Rolling updates allow customers to update endpoints batch by batch, without traffic interruption
  • Replaces previous blue/green update method that required doubling instances
  • Reduces the number of additional instances needed during model updates
  • Helps customers minimize costs and capacity reservation requirements
  • Available in multiple regions across Asia Pacific, Canada, Europe, Middle East, South America, and US

This feature enables more flexible and cost-effective deployment of machine learning models, particularly foundation models, on SageMaker Inference endpoints.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Mar 25
2025
Enhance deployment guardrails with inference component rolling updates for Amazon SageMaker AI inference
May 21
2026
Amazon SageMaker AI now supports OpenAI-compatible APIs for inference endpoints
Apr 21
2022
Amazon SageMaker Serverless Inference is now generally available
Jun 18
2026
Amazon SageMaker AI Announces New observability capability For Inference Endpoints

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.