AWS Neuron adds support for Llama 2, GPT-NeoX, and SDXL generative AI models
News
This article announces AWS Neuron 2.13 release, expanding support for generative AI models on AWS's purpose-built instances.
- Adds Llama 2 and GPT-NeoX model training support
- Enables Stable Diffusion XL and CLIP model inference
- Integrates with PyTorch and TensorFlow frameworks
- Includes compiler, runtime, profiling tools, and libraries
- Supports distributed LLM training via Neuron Reference for Nemo Megatron
- Optimized LLM inference for Llama 2 with Transformers Neuron
- Available on Trn1 and Inf2 instances in three US regions
AWS Neuron 2.13 expands generative AI model support with minimal code changes required for popular ML frameworks.
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.