
Amazon EMR 7.5 runtime for Apache Spark and Iceberg can run Spark workloads 3.6 times faster than Spark 3.5.3 and Iceberg 1.6.1

Big Data Blog



Amazon has announced the Amazon EMR 7.5 runtime for Apache Spark and Iceberg and reports the following performance improvements for data processing workloads:

  • Runs Spark workloads 3.6 times faster than Spark 3.5.3 and Iceberg 1.6.1
  • Reduces total runtime from 1.54 hours to 0.42 hours on the TPC-DS 3 TB benchmark
  • Improves cost efficiency by 2.9 times, reducing cost from $16.00 to $5.39
  • Scans approximately 3.4 times less data from Amazon S3
  • Represents a 32% performance increase over Amazon EMR 7.1

The performance gains come from continued optimization of DataSource V2 and Spark operators, which makes the EMR 7.5 runtime an attractive option for data processing and analytics workloads.
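The headline ratios in the list above follow from the reported totals. The short sketch below recomputes them; the variable and function names are illustrative only, and the inputs are the rounded figures quoted in this summary, so the results can differ slightly from the published multipliers.

```python
# Figures quoted in the summary above (TPC-DS 3 TB benchmark).
# Names are illustrative; the values are the rounded numbers from the text.
baseline_runtime_hours = 1.54  # Spark 3.5.3 and Iceberg 1.6.1
emr_runtime_hours = 0.42       # Amazon EMR 7.5 runtime
baseline_cost_usd = 16.00
emr_cost_usd = 5.39


def improvement_ratio(baseline: float, improved: float) -> float:
    """Return how many times smaller `improved` is than `baseline`."""
    return baseline / improved


speedup = improvement_ratio(baseline_runtime_hours, emr_runtime_hours)
cost_gain = improvement_ratio(baseline_cost_usd, emr_cost_usd)

print(f"runtime speedup: {speedup:.2f}x")
print(f"cost efficiency: {cost_gain:.2f}x")
```

With these rounded inputs the ratios come out to about 3.67 and 2.97, in line with the reported 3.6 times and 2.9 times once rounding of the published totals is taken into account.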





Related articles

  • Aug 26, 2024: Amazon EMR 7.1 runtime for Apache Spark and Iceberg can run Spark workloads 2.7 times faster than Apache Spark 3.5.1 and Iceberg 1.5.2
  • Nov 27, 2025: Run Apache Spark and Iceberg 4.5x faster than open source Spark with Amazon EMR
  • Nov 27, 2025: Run Apache Spark and Apache Iceberg write jobs 2x faster with Amazon EMR
  • Jun 21, 2024: Run Apache Spark 3.5.1 workloads 4.5 times faster with Amazon EMR runtime for Apache Spark
