Apache Spark encryption performance improvement with Amazon EMR 7.9
Big Data Blog
This article announces Apache Spark encryption performance improvements in Amazon EMR 7.9.0, demonstrating significant speedups for encrypted workloads without code changes.
- EMR 7.9.0 Spark runtime delivers up to 20% faster performance for encrypted workloads
- Supports Spark 3.5.5 with 100% API compatibility with open source Apache Spark
- Optimizations improve shuffle, decryption operations, and memory management
- 20% cost savings achieved through reduced runtime on TPC-DS 3TB benchmark tests
- Encryption protects shuffle files, cached data, and intermediate data on local disk
- Meets compliance requirements for HIPAA, PCI-DSS, GDPR, and FedRAMP regulations
- Detailed benchmarking instructions and scripts provided for reproducibility
EMR 7.9.0 enables faster, more cost-effective encrypted Spark workloads for compliance-heavy industries without requiring application modifications.
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
Jan 27
2026
2026
Secure Apache Spark writes to Amazon S3 on Amazon EMR with dynamic AWS KMS encryption
Aug 8
2024
2024
Amazon EMR 7.2 now supports Apache Spark 3.5.1
May 27
2026
2026
Amazon EMR now supports Apache Spark 4.0.2 in general availability
Jun 9
2026
2026
Announcing general availability of Apache Spark 4.0 on Amazon EMR
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.