
The AWS Glue Data Catalog now supports storage optimization of Apache Iceberg tables

Big Data Blog



This article discusses the new storage optimization features for Apache Iceberg tables in the AWS Glue Data Catalog. These features let you remove expired snapshots and orphan data files from Iceberg tables, which helps control storage costs and improve query performance.

Specifically, the article covers:

  • Overview of the solution and its architecture
  • Prerequisites and steps to set up resources using AWS CloudFormation
  • How to enable snapshot retention and orphan file deletion on an Iceberg table
  • Validating the solution using the AWS Glue console and AWS CLI
  • Monitoring storage metrics in Amazon S3
  • Cleanup steps
  • Conclusion highlighting the benefits of the new features
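
As a rough illustration of the "enable snapshot retention and orphan file deletion" step, the sketch below builds the two request payloads that could be passed to the boto3 Glue `create_table_optimizer` call. It is not taken from the article: the helper names, the account ID, database, table, role ARN, and the retention values are all hypothetical placeholders, and the exact settings the article uses may differ.

```python
from typing import Any, Dict, List


def build_optimizer_requests(
    catalog_id: str,
    database: str,
    table: str,
    role_arn: str,
    snapshot_retention_days: int = 5,   # assumed value, not from the article
    snapshots_to_retain: int = 1,       # assumed value, not from the article
    orphan_retention_days: int = 3,     # assumed value, not from the article
) -> List[Dict[str, Any]]:
    """Build one request per optimizer type (hypothetical helper)."""
    common = {
        "CatalogId": catalog_id,
        "DatabaseName": database,
        "TableName": table,
    }
    # Snapshot retention: expire old snapshots and clean up their files.
    retention = {
        **common,
        "Type": "retention",
        "TableOptimizerConfiguration": {
            "roleArn": role_arn,
            "enabled": True,
            "retentionConfiguration": {
                "icebergConfiguration": {
                    "snapshotRetentionPeriodInDays": snapshot_retention_days,
                    "numberOfSnapshotsToRetain": snapshots_to_retain,
                    "cleanExpiredFiles": True,
                }
            },
        },
    }
    # Orphan file deletion: remove files no longer referenced by table metadata.
    orphan = {
        **common,
        "Type": "orphan_file_deletion",
        "TableOptimizerConfiguration": {
            "roleArn": role_arn,
            "enabled": True,
            "orphanFileDeletionConfiguration": {
                "icebergConfiguration": {
                    "orphanFileRetentionPeriodInDays": orphan_retention_days,
                }
            },
        },
    }
    return [retention, orphan]


def enable_optimizers(requests: List[Dict[str, Any]]) -> None:
    """Send each request to AWS Glue (needs boto3 and AWS credentials)."""
    import boto3  # imported here so building payloads works without boto3

    glue = boto3.client("glue")
    for request in requests:
        glue.create_table_optimizer(**request)
```

Calling `enable_optimizers(build_optimizer_requests(...))` with real identifiers would submit both optimizers; keeping payload construction separate from the API call makes the settings easy to review before anything is changed on the table.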




Related articles

  • Sep 12, 2024: AWS Glue Data Catalog now supports storage optimization of Apache Iceberg tables
  • Dec 19, 2024: AWS Glue Data Catalog offers advanced automatic optimization for Apache Iceberg tables
  • Nov 21, 2024: AWS Glue Data Catalog now supports Apache Iceberg automatic table optimization through Amazon VPC
  • Nov 21, 2024: AWS Glue Data Catalog supports automatic optimization of Apache Iceberg tables through your Amazon VPC
