Home icon

Customized model monitoring for near real-time batch inference with Amazon SageMaker

Machine Learning Blog



This article discusses customized model monitoring for near real-time batch inference with Amazon SageMaker. It presents a framework to handle multi-payload inference requests for near real-time inference scenarios.

Specifically, the article covers:

  • Overview of the solution architecture
  • Prerequisites for following along
  • Steps to train an XGBoost model
  • Creating custom inference code to handle multi-payload requests
  • Deploying a SageMaker endpoint with data capture enabled
  • Creating constraints for model quality monitoring
  • Publishing a custom Docker image for model monitoring
  • Creating a SageMaker Model Monitor schedule with the custom image
  • Observing the model monitoring job output and violations
  • Cleaning up resources


Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Nov 25
2024
Amazon SageMaker launches Multi-Adapter Model Inference
Jan 9
2024
Inference Llama 2 models with real-time response streaming using Amazon SageMaker
Feb 16
2026
Announcing Amazon SageMaker Inference for custom Amazon Nova models
Feb 19
2024
Run ML inference on unplanned and spiky traffic using Amazon SageMaker multi-model endpoints

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.