Amazon EC2 P5 instances, optimized for generative AI and HPC, are generally available
News
This article announces the general availability of Amazon EC2 P5 instances, powered by NVIDIA H100 Tensor Core GPUs, designed for generative AI and HPC workloads.
- Delivers up to 6x faster time to solution and 40% lower ML training costs
- Supports training and deploying large language models and diffusion models
- Enables generative AI applications: question answering, code generation, image/video generation
- Suitable for HPC applications: pharmaceutical discovery, seismic analysis, weather forecasting
- Features 2x higher CPU performance, memory, and 4x higher local storage than previous generation
- Provides up to 3,200 Gbps networking with second-generation Elastic Fabric Adapter
- Deployed in EC2 UltraClusters supporting up to 20,000 H100 GPUs
- Available in US East (N. Virginia) and US West (Oregon) regions
P5 instances provide significant performance and cost improvements for demanding AI and HPC applications at scale.
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.