Home icon

Create a SageMaker inference endpoint with custom model & extended container

Machine Learning Blog



This AWS blog post provides a comprehensive guide to creating a custom SageMaker inference endpoint using the NASA Prithvi geospatial AI model. The article demonstrates how to:

  • Extend a SageMaker container image with custom dependencies
  • Create a custom inference.py file for model initialization and prediction
  • Build and deploy a custom model using AWS CodeBuild and SageMaker
  • Create a SageMaker endpoint for real-time inference with GPU acceleration

Key technical steps include:

  • Extending a PyTorch SageMaker container with MMCV library
  • Implementing custom model loading and inference functions
  • Packaging model artifacts in a specific S3 file structure
  • Creating IAM roles and configuring SageMaker endpoint resources

The solution enables deploying complex custom models with unique dependencies on SageMaker, providing flexibility for specialized machine learning use cases.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Mar 10
2025
Amazon SageMaker Inference now supports rolling update for inference component endpoints
Feb 16
2026
Announcing Amazon SageMaker Inference for custom Amazon Nova models
May 21
2026
Amazon SageMaker AI now supports OpenAI-compatible APIs for inference endpoints
Mar 24
2026
Deploy SageMaker AI inference endpoints with set GPU capacity using training plans

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.