
HyperPod now supports Multi-Instance GPU to maximize GPU utilization for generative AI tasks

Machine Learning Blog



This article announces the general availability of GPU partitioning on Amazon SageMaker HyperPod using NVIDIA Multi-Instance GPU (MIG) technology.

  • Run multiple concurrent tasks on a single GPU, minimizing wasted compute and memory resources
  • MIG partitions GPUs into isolated instances with dedicated memory, cache, and compute cores
  • Supports flexible resource allocation across teams with predictable performance and workload isolation
  • Two setup experiences: managed MIG (recommended, configured at the instance group level) and DIY (configured through Kubernetes labels)
  • Compatible with ml.p5en.48xlarge and other supported GPU instances with Ampere/Hopper/Blackwell architectures
  • Integrates with HyperPod features: task governance, observability dashboards, autoscaling, inference operator
  • Practical use cases: resource-guided model serving, mixed workloads, development/testing efficiency
  • Hands-on examples demonstrate concurrent inference, disaggregated inference, and interactive Jupyter notebooks
  • Includes comprehensive monitoring, quota management, and enterprise-grade reliability features

MIG on SageMaker HyperPod enables organizations to maximize GPU infrastructure investment through flexible partitioning, cost optimization, and efficient resource sharing across teams and workloads.
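
To make the idea of scheduling work onto a MIG partition concrete, here is a minimal sketch of a Kubernetes Pod manifest that requests one GPU slice. It is an illustration under stated assumptions, not the setup described in the article: the resource name pattern `nvidia.com/mig-<profile>` follows the naming convention of the NVIDIA Kubernetes device plugin in its "mixed" MIG strategy, the default profile `1g.5gb` is only a placeholder (available profiles depend on the GPU model), and the helper `mig_pod_manifest`, the pod name, and the image are hypothetical. The resource names exposed on a given HyperPod cluster may differ.

```python
import json


def mig_pod_manifest(name, image, mig_profile="1g.5gb", count=1):
    """Build a Pod manifest (as a plain dict) that requests MIG slices.

    Assumption: the cluster advertises MIG partitions as extended
    resources named "nvidia.com/mig-<profile>". Check the node's
    allocatable resources to confirm the names on your cluster.
    """
    resource = f"nvidia.com/mig-{mig_profile}"
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "restartPolicy": "Never",
            "containers": [
                {
                    "name": name,
                    "image": image,  # hypothetical image reference
                    # Extended resources are requested under "limits".
                    "resources": {"limits": {resource: count}},
                }
            ],
        },
    }


if __name__ == "__main__":
    manifest = mig_pod_manifest("mig-demo", "example.com/inference:latest")
    # Kubernetes accepts JSON manifests, so this output can be saved to a file.
    print(json.dumps(manifest, indent=2))
```

Because each pod asks for a named slice rather than a whole GPU, several such pods can share one physical GPU while keeping the dedicated memory and compute that MIG provides.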





Related articles

Nov 24, 2025: Amazon SageMaker HyperPod now supports NVIDIA Multi-Instance GPU (MIG) for generative AI tasks
Nov 21, 2025: Amazon SageMaker HyperPod now supports running IDEs and Notebooks to accelerate AI development
Jul 10, 2025: Amazon SageMaker HyperPod launches model deployments to accelerate the generative AI model development lifecycle
Jun 24, 2025: Amazon SageMaker HyperPod announces P6-B200 instances powered by NVIDIA B200 GPUs
