AWS Neuron introduces NxD Inference GA, new features, and improved tools
AWS has released Neuron 2.23, introducing several key enhancements to its machine learning infrastructure:
- NxD Inference library (NxDI) reaches general availability with Persistent Cache support
- NxD Training library adds Context Parallelism for Llama models and ORPO model alignment
- Neuron Kernel Interface introduces new 32-bit integer operations and performance tuning APIs
- Neuron Profiler offers 5x faster profile result viewing and improved error tracking
- Framework support now includes PyTorch 2.6 and JAX 0.5.3, along with upgraded compatibility with third-party libraries
The release supports training and deploying models on Trn1, Trn2, and Inf2 instances across various AWS pricing models.
Related articles
- Dec 23, 2024: AWS Neuron introduces support for Trainium2 and NxD Inference
- Jul 2, 2025: New features for AWS Neuron 2.24 include PyTorch 2.7 and inference enhancements
- Sep 17, 2024: AWS Neuron introduces Neuron Kernel Interface (NKI), NxD Training, and JAX support for training
- Aug 22, 2025: Announcing AWS Neuron SDK 2.25.0