SageMaker JumpStart now offers optimized deployments for foundation models


This article announces optimized deployments for foundation models in SageMaker JumpStart, simplifying model deployment with pre-configured, task-aware settings.

Deploy foundation models with pre-configured settings tailored to specific use cases
Optimize for cost, throughput, latency, or balanced performance based on workload
Support for 30+ popular models from Meta, Microsoft, Mistral AI, Qwen, Google, and TII
View key metrics such as P50 latency, time to first token (TTFT), and throughput before deployment
Deploy to SageMaker AI Managed Inference endpoints or SageMaker HyperPod clusters
VPC deployment capabilities ensure data control and enterprise-grade security
Available in all AWS Regions where SageMaker JumpStart is supported

SageMaker JumpStart optimized deployments reduce deployment complexity by replacing configuration guesswork with pre-configured, use-case-specific settings, while showing performance metrics before deployment and supporting VPC-based security controls.
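To make the idea of choosing a configuration by optimization goal concrete, here is a minimal sketch in plain Python. It does not call SageMaker or any AWS API. The `DeploymentConfig` class, the `pick_config` function, the configuration names, and every metric value are hypothetical placeholders invented for illustration, and the rank-sum rule used for the "balanced" goal is an assumption, not the method JumpStart uses.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class DeploymentConfig:
    """Hypothetical record of the metrics shown before deployment."""
    name: str
    p50_latency_ms: float
    ttft_ms: float
    throughput_tokens_per_s: float
    cost_per_hour_usd: float


def _ranks(configs: List[DeploymentConfig],
           key: Callable[[DeploymentConfig], float],
           reverse: bool = False) -> Dict[str, int]:
    """Map each config name to its rank for one metric (0 is best)."""
    ordered = sorted(configs, key=key, reverse=reverse)
    return {c.name: i for i, c in enumerate(ordered)}


def pick_config(configs: List[DeploymentConfig], goal: str) -> DeploymentConfig:
    """Pick one config for a goal: cost, throughput, latency, or balanced."""
    if not configs:
        raise ValueError("no configurations to choose from")
    if goal == "latency":
        return min(configs, key=lambda c: c.p50_latency_ms)
    if goal == "throughput":
        return max(configs, key=lambda c: c.throughput_tokens_per_s)
    if goal == "cost":
        return min(configs, key=lambda c: c.cost_per_hour_usd)
    if goal == "balanced":
        # Assumed rule: lowest sum of per-metric ranks wins.
        lat = _ranks(configs, lambda c: c.p50_latency_ms)
        thr = _ranks(configs, lambda c: c.throughput_tokens_per_s, reverse=True)
        cost = _ranks(configs, lambda c: c.cost_per_hour_usd)
        return min(configs, key=lambda c: lat[c.name] + thr[c.name] + cost[c.name])
    raise ValueError(f"unknown goal: {goal!r}")


# Made-up placeholder numbers, not measurements of any real model or instance.
CANDIDATES = [
    DeploymentConfig("config-a", 900.0, 300.0, 400.0, 1.5),
    DeploymentConfig("config-b", 600.0, 200.0, 900.0, 4.0),
    DeploymentConfig("config-c", 350.0, 120.0, 700.0, 12.0),
]

if __name__ == "__main__":
    for goal in ("cost", "throughput", "latency", "balanced"):
        print(goal, "->", pick_config(CANDIDATES, goal).name)
```

With these placeholder values, each single-metric goal selects the configuration that is best on that metric, while the balanced goal selects the one with the best combined ranking. In an actual deployment the settings and metrics come from SageMaker JumpStart itself rather than from hand-written records like these.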


Apr 14, 2026
Use-case based deployments on SageMaker JumpStart

Jun 21, 2024
Amazon SageMaker JumpStart now provides granular access control for foundation models

Jan 29, 2024
Benchmark and optimize endpoint deployment in Amazon SageMaker JumpStart

Mar 26, 2025
Amazon SageMaker JumpStart adds fine-tuning support for models in a private model hub


Related articles