Home icon

How BMW Group built a serverless terabyte-scale data transformation architecture with dbt and Amazon Athena

Big Data Blog



BMW Group developed a serverless data transformation architecture using dbt and Amazon Athena to address challenges with data processing and scalability in their Cloud Efficiency Analytics (CLEA) team.

  • Replaced complex, rigid data infrastructure with a modular serverless pipeline
  • Uses GitHub Actions for workflow automation and deployment
  • Implemented a three-stage data transformation process: Source, Prepared, and Semantic layers
  • Leverages Athena for efficient querying of large-scale Parquet data
  • Integrated dbt for SQL-based transformations with about 400 models and 100 data tests

Key benefits include improved development agility, cost-efficiency, and the ability to quickly modify data models while maintaining high performance and data quality. The architecture processes around 300 GB daily from a 15 TB data lake.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Feb 8
2024
Cloud-driven enterprise transformation at the BMW Group
Jan 30
2026
BMW Group unlocks insights from petabytes of data with agentic search on AWS
Oct 14
2024
BMW Group fosters data-driven culture with a no-code generative AI data analytics solution on AWS
Nov 7
2024
How BMW Group and Qualcomm built an automated driving platform on AWS

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.