Extract data from Amazon Aurora MySQL to Amazon S3 Tables in Apache Iceberg format
Big Data Blog
This article demonstrates how to extract data from Amazon Aurora MySQL to Amazon S3 Tables in Apache Iceberg format using AWS Glue, with full infrastructure automation via CloudFormation.
- Automates end-to-end ETL pipeline from Aurora MySQL Serverless v2 to S3 Tables
- Uses AWS Glue 5.0 with native Iceberg support and S3 Tables REST catalog
- Single CloudFormation stack provisions infrastructure, loads sample TICKIT data, configures pipeline
- Enables analytics queries without impacting production database performance
- Data queryable via Amazon Athena, Redshift Spectrum, Apache Spark
- Includes VPC endpoints, private subnets, AWS Secrets Manager for security
- Deployment takes 30-45 minutes; main costs from Glue DPU-hours and S3 Tables storage
- Sample code available in aws-samples GitHub repository
This solution bridges the gap between relational databases and modern lakehouse architectures, enabling organizations to leverage Apache Iceberg's ACID transactions and schema evolution capabilities.
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
2025
2024
2024
2025
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.