How Nielsen uses serverless concepts on Amazon EKS for big data processing with Spark workloads

Architecture Blog

Nielsen Marketing Cloud transformed their big data processing by implementing a serverless-inspired architecture on Amazon EKS for handling Spark workloads. They addressed significant challenges in scaling and performance by reimagining their data processing approach.

Original system suffered from performance degradation and remote data shuffle issues
Moved from large Spark clusters to multiple local mode Spark clusters on EKS
Implemented a workflow using SQS queues, Lambda functions, and EKS pods
Utilized Kubernetes Horizontal Pod Autoscaler to dynamically scale processing capacity
Achieved 130% performance improvement per instance
Reduced processing costs by 55%

The new architecture enables linear scaling, improved efficiency, and significant cost savings by breaking down large data processing jobs into smaller, independent workloads.

Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Mar 10
2026

Reducing costs for shuffle-heavy Apache Spark workloads with serverless storage for Amazon EMR Serverless

Nov 21
2025

Amazon EMR Serverless now supports Apache Spark 4.0.1 (preview)

Dec 2
2025

Amazon EMR Serverless eliminates local storage provisioning for Apache Spark workloads

May 29
2025

Optimizing data lakes with Amazon S3 Tables and Apache Spark on Amazon EKS

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

How Nielsen uses serverless concepts on Amazon EKS for big data processing with Spark workloads

Related articles