Customized model monitoring for near real-time batch inference with Amazon SageMaker

Machine Learning Blog

This article discusses customized model monitoring for near real-time batch inference with Amazon SageMaker. It presents a framework to handle multi-payload inference requests for near real-time inference scenarios.

Specifically, the article covers:

Overview of the solution architecture
Prerequisites for following along
Steps to train an XGBoost model
Creating custom inference code to handle multi-payload requests
Deploying a SageMaker endpoint with data capture enabled
Creating constraints for model quality monitoring
Publishing a custom Docker image for model monitoring
Creating a SageMaker Model Monitor schedule with the custom image
Observing the model monitoring job output and violations
Cleaning up resources

Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Nov 25
2024

Amazon SageMaker launches Multi-Adapter Model Inference

Jul 30
2026

Inference meta-monitoring for Amazon SageMaker AI endpoints with Amazon Quick

Jan 9
2024

Inference Llama 2 models with real-time response streaming using Amazon SageMaker

Jul 7
2026

Monitoring discriminative ML models using Amazon SageMaker AI with MLflow

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Customized model monitoring for near real-time batch inference with Amazon SageMaker

Related articles