Home icon

Read and write S3 Iceberg table using AWS Glue Iceberg Rest Catalog from Open Source Apache Spark

Big Data Blog



This article provides a comprehensive guide to using AWS Glue Iceberg REST Catalog with Apache Spark to read and write Iceberg tables on Amazon S3, leveraging AWS Lake Formation for metadata and access management.

  • Demonstrates how to set up Lake Formation permissions for third-party data access
  • Explains how to configure Apache Spark to interact with Glue Data Catalog using Iceberg Rest API
  • Provides a detailed Python script for creating, inserting, and reading Iceberg tables on S3
  • Shows how to grant access to other data teams using Lake Formation
  • Highlights the flexibility of accessing Iceberg tables across different environments

The solution enables organizations to efficiently manage and process data across data lakes and warehouses using open-source tools and AWS services.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Jan 23
2025
Access Amazon S3 Iceberg tables from Databricks using AWS Glue Iceberg Rest Catalog in Amazon SageMaker Lakehouse
Feb 17
2025
Access data in Amazon S3 Tables using PyIceberg through the AWS Glue Iceberg REST endpoint
Mar 13
2025
Amazon S3 Tables add Apache Iceberg REST Catalog APIs
Dec 18
2025
Create and update Apache Iceberg tables with partitions in the AWS Glue Data Catalog using the AWS SDK and AWS CloudFormation

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.