Amazon EMR Serverless eliminates local storage provisioning, reducing data processing costs by up to 20%
Big Data Blog
This article announces serverless storage for Amazon EMR Serverless, eliminating local disk provisioning for Apache Spark workloads and reducing costs by up to 20%.
- Automatically handles shuffle operations without manual disk sizing
- Reduces data processing costs up to 20% with no storage charges
- Eliminates job failures from disk capacity constraints and data skew
- Enables independent compute and storage scaling for elastic resource usage
- Multi-tier aggregation architecture consolidates shuffle data efficiently
- Custom networking stack reduces mesh complexity and connection overhead
- Decouples storage from compute, allowing instant worker release
- Available in multiple AWS regions for EMR release 7.12 and later
- Data encrypted in transit and at rest with job-level isolation
- Best for workloads shuffling 10GB-200GB with unpredictable sizes
Serverless storage shifts data processing infrastructure management by automating shuffle operations, aligning costs to actual usage, and improving reliability for I/O-intensive analytics workloads.
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
Dec 2
2025
2025
Amazon EMR Serverless eliminates local storage provisioning for Apache Spark workloads
Jan 9
2026
2026
Amazon EMR Serverless adds support for job run level cost allocation
Jan 21
2026
2026
Amazon EMR Serverless now supports AWS KMS customer managed keys for encrypting local disks
Mar 10
2026
2026
Reducing costs for shuffle-heavy Apache Spark workloads with serverless storage for Amazon EMR Serverless
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.