Build and deploy AI inference workflows with new enhancements to the Amazon SageMaker Python SDK

Machine Learning Blog

AWS has introduced new enhancements to the Amazon SageMaker Python SDK for building and deploying AI inference workflows, addressing the growing complexity of AI applications. Key improvements include:

Deployment of multiple models within a single SageMaker endpoint
Workflow definition using a new workflow mode in the Python SDK
Flexible development and deployment options
Ability to invoke individual models or entire workflows
Simplified dependency management

The new feature introduces a `CustomOrchestrator` class that allows developers to create complex inference workflows using Python, with an example demonstrated using a two-model workflow for IT customer service. Amazon Search is exploring this capability to improve its search ranking infrastructure.

This enhancement aims to simplify the process of building and managing sophisticated AI inference systems, allowing developers to focus on business logic and model integrations.

Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Dec 6
2024

SageMaker SDK enhances training and inference workflows

May 21
2026

Amazon SageMaker AI now supports OpenAI-compatible APIs for inference endpoints

Dec 3
2024

Speed up your AI inference workloads with new NVIDIA-powered capabilities in Amazon SageMaker

Mar 25
2025

Enhance deployment guardrails with inference component rolling updates for Amazon SageMaker AI inference

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Build and deploy AI inference workflows with new enhancements to the Amazon SageMaker Python SDK

Related articles