Home icon
How Ontraport reduced data processing cost by 80% with AWS Glue

Blog



This article describes how Ontraport, a CRM and automation service, reduced data processing costs by 80% by migrating from Amazon EMR to AWS Glue for their log processing ETL workloads.

  • Ontraport processes 200+ million log records daily from multiple sources
  • Initial Amazon EMR setup required 16-node cluster; AWS Glue reduced complexity significantly
  • AWS Glue's DynamicFrame and crawlers simplified schema discovery and data transformation
  • Processing cost reduced from $500 to $100 per terabyte (80% savings)
  • Storage costs reduced by 92% through Parquet compression and partitioning
  • 10 workers processed 500M records in under 60 minutes; 100 workers handled 10B records in 1 hour
  • Achieved results with only one full-time developer using serverless approach

AWS Glue's serverless model and built-in data processing capabilities enabled Ontraport to optimize costs and accelerate development compared to managing EMR clusters.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.