Home icon
Scale training and inference of thousands of ML models with Amazon SageMaker

Blog



This article demonstrates how to use Amazon SageMaker to efficiently train and serve thousands of ML models at scale, using an energy forecasting use case with 1,000 customers.

  • SageMaker Processing preprocesses data and creates individual CSV files per customer in S3
  • Training jobs use ShardedByS3Key distribution to shard data across instances without duplication
  • Checkpoints save individual models to S3, avoiding need to unpack large archives
  • Multi-Model Endpoints (MMEs) host multiple models on single endpoint for cost efficiency
  • MMEs automatically serve all models in specified S3 paths without redeployment
  • Frequently used models cached in memory and disk for low-latency inference
  • Solution uses Prophet algorithm for time-series energy consumption forecasting

SageMaker provides a scalable, cost-effective platform for organizations needing to train and deploy thousands of models simultaneously using managed infrastructure.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.