Home icon

Customize your Amazon SageMaker model deployment software and driver versions

News



The article summarizes a new feature introduced by Amazon SageMaker that allows customers to customize the software and driver versions when deploying machine learning models for inference.

Specifically, the article covers:

  • Previously, customers had to use preset software and driver versions defined by SageMaker on the managed instances behind an endpoint.
  • Now, customers can specify the "InferenceAmiVersion" parameter when configuring endpoints to select the combination of software and driver versions (such as Nvidia driver and CUDA version) that best meets their requirements.
  • This allows customers to tailor their hosting environment to meet the performance, compatibility, scalability, and operational requirements of their ML applications.
  • Customers can now downgrade and upgrade driver versions for their endpoints on their own schedule.
  • The feature is available in all regions where SageMaker is available.


Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Nov 26
2024
Apply Amazon SageMaker Studio lifecycle configurations using AWS CDK
Jul 16
2025
Customize Amazon Nova in Amazon SageMaker AI
Dec 3
2025
New serverless model customization capability in Amazon SageMaker AI
Jul 23
2025
Customize Amazon Nova in Amazon SageMaker AI using Direct Preference Optimization

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.