Extract data from Amazon Aurora MySQL to Amazon S3 Tables in Apache Iceberg format

Big Data Blog

This article demonstrates how to extract data from Amazon Aurora MySQL to Amazon S3 Tables in Apache Iceberg format using AWS Glue, with full infrastructure automation via CloudFormation.

Automates end-to-end ETL pipeline from Aurora MySQL Serverless v2 to S3 Tables
Uses AWS Glue 5.0 with native Iceberg support and S3 Tables REST catalog
Single CloudFormation stack provisions infrastructure, loads sample TICKIT data, and configures the pipeline
Enables analytics queries without impacting production database performance
Data queryable via Amazon Athena, Redshift Spectrum, Apache Spark
Includes VPC endpoints, private subnets, AWS Secrets Manager for security
Deployment takes 30-45 minutes; main costs from Glue DPU-hours and S3 Tables storage
Sample code available in aws-samples GitHub repository
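
The catalog and source settings summarized above can be sketched as plain configuration builders. This is a minimal sketch, not the code from the aws-samples repository. The function names, the catalog name, the example ARN, and the secret contents are all hypothetical. The Spark property keys follow the pattern commonly documented for the S3 Tables Iceberg REST endpoint and should be checked against the current AWS documentation. The secret layout (host, port, dbname, username, password) is assumed to match the usual Secrets Manager format for RDS credentials.

```python
import json
from typing import Dict


def s3tables_rest_catalog_conf(catalog: str, region: str, table_bucket_arn: str) -> Dict[str, str]:
    """Build Spark properties for an Iceberg REST catalog backed by S3 Tables.

    Assumption: the keys mirror the commonly documented S3 Tables REST
    endpoint setup; verify them before using this in a real Glue job.
    """
    prefix = f"spark.sql.catalog.{catalog}"
    return {
        "spark.sql.extensions": "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions",
        prefix: "org.apache.iceberg.spark.SparkCatalog",
        f"{prefix}.type": "rest",
        f"{prefix}.uri": f"https://s3tables.{region}.amazonaws.com/iceberg",
        f"{prefix}.warehouse": table_bucket_arn,
        # Requests to the REST endpoint are signed with SigV4.
        f"{prefix}.rest.sigv4-enabled": "true",
        f"{prefix}.rest.signing-name": "s3tables",
        f"{prefix}.rest.signing-region": region,
    }


def jdbc_options_from_secret(secret_json: str, table: str) -> Dict[str, str]:
    """Turn a Secrets Manager secret string into Spark JDBC reader options.

    Assumption: the secret is JSON with host, port, dbname, username and
    password fields.
    """
    secret = json.loads(secret_json)
    return {
        "url": f"jdbc:mysql://{secret['host']}:{secret['port']}/{secret['dbname']}",
        "user": secret["username"],
        "password": secret["password"],
        "dbtable": table,
        "driver": "com.mysql.cj.jdbc.Driver",
    }


if __name__ == "__main__":
    # Hypothetical values for illustration only.
    conf = s3tables_rest_catalog_conf(
        "s3tables", "us-east-1",
        "arn:aws:s3tables:us-east-1:111122223333:bucket/example-bucket",
    )
    for key, value in conf.items():
        print(f"{key}={value}")
```

In a Glue job, properties like these would typically be applied with `SparkSession.builder.config(key, value)`, the source table read with `spark.read.format("jdbc").options(**opts).load()`, and the result written with `df.writeTo("<catalog>.<namespace>.<table>").createOrReplace()`. The namespace and table names depend on the deployment.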

This solution bridges the gap between relational databases and modern lakehouse architectures, enabling organizations to leverage Apache Iceberg's ACID transactions and schema evolution capabilities.

Mar 24, 2025

Using Amazon S3 Tables with Amazon Redshift to query Apache Iceberg tables

Nov 6, 2024

Stream real-time data into Apache Iceberg tables in Amazon S3 using Amazon Data Firehose

Oct 1, 2024

Amazon Data Firehose delivers data streams into Apache Iceberg format tables in Amazon S3

Jun 20, 2025

Stream data from Amazon MSK to Apache Iceberg tables in Amazon S3 and Amazon S3 Tables using Amazon Data Firehose

Related articles