
Building end-to-end data lineage for one-time and complex queries using Amazon Athena, Amazon Redshift, Amazon Neptune and dbt

Big Data Blog



The article describes how to build end-to-end data lineage for one-time and complex queries with Amazon Athena, Amazon Redshift, Amazon Neptune, and dbt. The solution targets the difficulty of tracking lineage consistently across different query types and data sources.

  • Key challenges include diverse data sources, varying query complexity, and inconsistent lineage tracking
  • Uses dbt for unified data modeling across Amazon Athena and Amazon Redshift
  • Leverages Amazon Neptune graph database to store and analyze complex lineage relationships
  • Employs AWS Step Functions and AWS Lambda for automated lineage generation
  • Provides a comprehensive solution for end-to-end data lineage analysis

The solution offers a flexible, scalable architecture that simplifies data modeling, enhances data governance, and provides deeper insights for decision-making across different data platforms.
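As a rough illustration of how model dependencies can become a lineage graph, the sketch below reads upstream references from a dbt-style manifest and walks them to list everything a model depends on. It assumes the `nodes` / `depends_on.nodes` layout of dbt's `manifest.json`. The sample data, model names, and helper functions are hypothetical, and an in-memory dictionary stands in for a graph database such as Amazon Neptune. It is not the article's implementation.

```python
from collections import deque


def lineage_edges(manifest):
    """Return sorted (upstream, downstream) pairs from a dbt-style manifest dict."""
    edges = []
    for node_id, node in manifest.get("nodes", {}).items():
        for parent in node.get("depends_on", {}).get("nodes", []):
            edges.append((parent, node_id))
    return sorted(edges)


def upstream_of(edges, target):
    """Breadth-first walk over the edges to collect every ancestor of target."""
    parents = {}
    for up, down in edges:
        parents.setdefault(down, []).append(up)
    seen, queue = set(), deque([target])
    while queue:
        current = queue.popleft()
        for parent in parents.get(current, []):
            if parent not in seen:
                seen.add(parent)
                queue.append(parent)
    return sorted(seen)


# Hypothetical manifest fragment: two staging models feed one fact model.
SAMPLE_MANIFEST = {
    "nodes": {
        "model.demo.stg_orders": {
            "depends_on": {"nodes": ["source.demo.raw.orders"]}
        },
        "model.demo.stg_customers": {
            "depends_on": {"nodes": ["source.demo.raw.customers"]}
        },
        "model.demo.fct_orders": {
            "depends_on": {
                "nodes": ["model.demo.stg_orders", "model.demo.stg_customers"]
            }
        },
    }
}

if __name__ == "__main__":
    edges = lineage_edges(SAMPLE_MANIFEST)
    for name in upstream_of(edges, "model.demo.fct_orders"):
        print(name)
```

In a graph database, each pair would typically become an edge between two vertices, so the same upstream question could be answered with a graph traversal instead of the in-memory walk used here.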





Related articles

Dec 3, 2024: Announcing the general availability of data lineage in the next generation of Amazon SageMaker and Amazon DataZone
Jun 27, 2024: Introducing end-to-end data lineage (preview) visualization in Amazon DataZone
Jun 24, 2025: Capture data lineage from dbt, Apache Airflow, and Apache Spark with Amazon SageMaker
Oct 13, 2025: Visualize data lineage using Amazon SageMaker Catalog for Amazon EMR, AWS Glue, and Amazon Redshift
