Home icon

New features for AWS Neuron 2.24 include PyTorch 2.7 and inference enhancements

News



AWS has announced the general availability of Neuron 2.24, a release focused on improving deep learning model development and deployment on Inferentia and Trainium-based instances.

  • Introduces support for PyTorch 2.7
  • Adds enhanced inference features like prefix caching and disaggregated inference
  • Supports context parallelism for improved performance on long sequences
  • Provides support for Qwen 2.5 text models
  • Improves integration with Hugging Face Optimum Neuron and PyTorch-based NxD Core backend
  • Available in all AWS Regions with Inferentia and Trainium instances

The release aims to help developers and data scientists accelerate model training and inference, improve efficiency, and simplify deployment of large language models and AI workloads.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Aug 22
2025
Announcing AWS Neuron SDK 2.25.0
Sep 19
2025
Announcing AWS Neuron SDK 2.26.0
May 27
2025
AWS Neuron introduces NxD Inference GA, new features, and improved tools
Dec 22
2025
Announcing AWS Neuron SDK 2.27.0

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.