Home icon

Build and deploy AI inference workflows with new enhancements to the Amazon SageMaker Python SDK

Machine Learning Blog



AWS has introduced new enhancements to the Amazon SageMaker Python SDK for building and deploying AI inference workflows, addressing the growing complexity of AI applications. Key improvements include:

  • Deployment of multiple models within a single SageMaker endpoint
  • Workflow definition using a new workflow mode in the Python SDK
  • Flexible development and deployment options
  • Ability to invoke individual models or entire workflows
  • Simplified dependency management

The new feature introduces a `CustomOrchestrator` class that allows developers to create complex inference workflows using Python, with an example demonstrated using a two-model workflow for IT customer service. Amazon Search is exploring this capability to improve its search ranking infrastructure.

This enhancement aims to simplify the process of building and managing sophisticated AI inference systems, allowing developers to focus on business logic and model integrations.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Dec 6
2024
SageMaker SDK enhances training and inference workflows
May 21
2026
Amazon SageMaker AI now supports OpenAI-compatible APIs for inference endpoints
Dec 3
2024
Speed up your AI inference workloads with new NVIDIA-powered capabilities in Amazon SageMaker
Mar 25
2025
Enhance deployment guardrails with inference component rolling updates for Amazon SageMaker AI inference

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.