Amazon EC2 Inf2 instances, optimized for generative AI, now available globally
The article announces the global availability of Amazon EC2 Inf2 instances, which are optimized for generative AI workloads such as text summarization, code generation, video/image generation, speech recognition, and personalization.
Specifically, the article covers:
- Inf2 instances deliver high performance at the lowest cost on Amazon EC2 for generative AI models
- Support for scale-out distributed inference with NeuronLink interconnect
- Up to 2.3 petaflops of compute and 384 GB of total accelerator memory per instance
- Up to 40% better price-performance than comparable EC2 instances
- Native integration with popular machine learning frameworks via AWS Neuron SDK
- Available in four sizes across 8 AWS Regions as On-Demand, Reserved, Spot Instances, or Savings Plans
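To put the 384 GB accelerator memory figure in context, here is a rough sketch that estimates whether a model's weights alone would fit. Only the 384 GB number comes from the article. The function names, the 2 bytes per parameter (a 16-bit data type), the decimal reading of GB, and the example model sizes are assumptions for illustration.

```python
# Back-of-the-envelope check: do a model's weights fit in accelerator memory?
# Only the 384 GB figure is from the article; the rest is assumed.

ACCELERATOR_MEMORY_GB = 384  # total accelerator memory cited in the article


def weights_size_gb(num_params: float, bytes_per_param: int = 2) -> float:
    """Approximate weight footprint in GB (10^9 bytes).

    The default of 2 bytes per parameter assumes a 16-bit data type.
    """
    return num_params * bytes_per_param / 1e9


def fits_in_memory(num_params: float, bytes_per_param: int = 2) -> bool:
    """True if the weights alone fit; activations and caches are ignored."""
    return weights_size_gb(num_params, bytes_per_param) <= ACCELERATOR_MEMORY_GB


if __name__ == "__main__":
    # Hypothetical model sizes, in billions of parameters.
    for billions in (7, 70, 175):
        params = billions * 1e9
        print(f"{billions}B params: ~{weights_size_gb(params):.0f} GB of weights, "
              f"fits: {fits_in_memory(params)}")
```

This counts weights only. Real deployments also need memory for activations and caches, so treat the result as an upper bound on what fits rather than a sizing guide.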