Home icon
Create an Apache Hudi-based near-real-time transactional data lake using AWS DMS, Amazon Kinesis, AWS Glue streaming ETL, and data visualization using Amazon QuickSight

Blog



This article demonstrates building a near-real-time transactional data lake using Apache Hudi, AWS DMS, Kinesis, Glue streaming ETL, and QuickSight visualization.

  • AWS DMS captures CDC changes from RDS to Kinesis Data Streams
  • AWS Glue 4.0 streaming jobs read, enrich, and upsert data to S3 in Hudi format
  • Apache Hudi enables ACID transactions, time travel queries, and upsert/delete operations
  • CloudFormation template automates setup of RDS, DMS, Kinesis, Glue jobs, and networking
  • Data validated via Athena queries; visualized in QuickSight dashboards
  • Supports both new record ingestion and real-time updates to existing records

This solution enables organizations to replicate relational database changes to S3 data lakes in near-real-time with full transactional support and analytics capabilities.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.