Home icon

Extract data from Amazon Aurora MySQL to Amazon S3 Tables in Apache Iceberg format

Big Data Blog



This article demonstrates how to extract data from Amazon Aurora MySQL to Amazon S3 Tables in Apache Iceberg format using AWS Glue, with full infrastructure automation via CloudFormation.

  • Automates end-to-end ETL pipeline from Aurora MySQL Serverless v2 to S3 Tables
  • Uses AWS Glue 5.0 with native Iceberg support and S3 Tables REST catalog
  • Single CloudFormation stack provisions infrastructure, loads sample TICKIT data, configures pipeline
  • Enables analytics queries without impacting production database performance
  • Data queryable via Amazon Athena, Redshift Spectrum, Apache Spark
  • Includes VPC endpoints, private subnets, AWS Secrets Manager for security
  • Deployment takes 30-45 minutes; main costs from Glue DPU-hours and S3 Tables storage
  • Sample code available in aws-samples GitHub repository

This solution bridges the gap between relational databases and modern lakehouse architectures, enabling organizations to leverage Apache Iceberg's ACID transactions and schema evolution capabilities.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Mar 24
2025
Using Amazon S3 Tables with Amazon Redshift to query Apache Iceberg tables
Nov 6
2024
Stream real-time data into Apache Iceberg tables in Amazon S3 using Amazon Data Firehose
Oct 1
2024
Amazon Data Firehose delivers data streams into Apache Iceberg format tables in Amazon S3
Jun 20
2025
Stream data from Amazon MSK to Apache Iceberg tables in Amazon S3 and Amazon S3 Tables using Amazon Data Firehose

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.