Read and write S3 Iceberg table using AWS Glue Iceberg Rest Catalog from Open Source Apache Spark

Big Data Blog

This article provides a comprehensive guide to using AWS Glue Iceberg REST Catalog with Apache Spark to read and write Iceberg tables on Amazon S3, leveraging AWS Lake Formation for metadata and access management.

Demonstrates how to set up Lake Formation permissions for third-party data access
Explains how to configure Apache Spark to interact with the AWS Glue Data Catalog through the Iceberg REST API
Provides a detailed Python script for creating, inserting, and reading Iceberg tables on S3
Shows how to grant access to other data teams using Lake Formation
Highlights the flexibility of accessing Iceberg tables across different environments

The solution enables organizations to efficiently manage and process data across data lakes and warehouses using open-source tools and AWS services.
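As a rough illustration of the Spark configuration step summarized above, the sketch below builds the settings that an open-source Spark session would typically need to reach an Iceberg REST catalog with SigV4 signing. It is not taken from the article: the helper name glue_rest_catalog_conf, the catalog name glue_rest, the Region, and the account ID are hypothetical placeholders, and the property keys and endpoint format reflect the Iceberg and AWS Glue documentation as best understood here, so verify them against the versions you run.

```python
from typing import Dict


def glue_rest_catalog_conf(catalog: str, region: str, account_id: str) -> Dict[str, str]:
    """Build Spark settings for an Iceberg REST catalog backed by AWS Glue.

    Assumptions (not from the article): the catalog name is arbitrary, the
    warehouse is the AWS account ID, and requests are SigV4-signed for the
    "glue" service. Check each key against your Iceberg version.
    """
    prefix = f"spark.sql.catalog.{catalog}"
    return {
        # Enables Iceberg SQL extensions in Spark.
        "spark.sql.extensions": "org.apache.iceberg.spark.extensions.IcebergSparkSessionExtensions",
        # Registers the catalog and selects the REST implementation.
        prefix: "org.apache.iceberg.spark.SparkCatalog",
        f"{prefix}.type": "rest",
        # Assumed regional endpoint format for the Glue Iceberg REST API.
        f"{prefix}.uri": f"https://glue.{region}.amazonaws.com/iceberg",
        f"{prefix}.warehouse": account_id,
        # SigV4 signing so AWS can authenticate each REST call.
        f"{prefix}.rest.sigv4-enabled": "true",
        f"{prefix}.rest.signing-name": "glue",
        f"{prefix}.rest.signing-region": region,
        # Read and write table data on S3 through Iceberg's S3 file I/O.
        f"{prefix}.io-impl": "org.apache.iceberg.aws.s3.S3FileIO",
    }


if __name__ == "__main__":
    # Placeholder values; replace with your own Region and account ID.
    conf = glue_rest_catalog_conf("glue_rest", "us-east-1", "123456789012")
    for key, value in sorted(conf.items()):
        print(f"--conf {key}={value}")
```

Each key-value pair can be passed to SparkSession.builder.config(key, value) or supplied as a --conf flag to spark-submit. The Iceberg Spark runtime and AWS bundle JARs must also be on the classpath, and the caller's IAM identity needs the Lake Formation permissions the article describes.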


Jan 23, 2025

Access Amazon S3 Iceberg tables from Databricks using AWS Glue Iceberg Rest Catalog in Amazon SageMaker Lakehouse

Feb 17, 2025

Access data in Amazon S3 Tables using PyIceberg through the AWS Glue Iceberg REST endpoint

Mar 13, 2025

Amazon S3 Tables add Apache Iceberg REST Catalog APIs

Dec 18, 2025

Create and update Apache Iceberg tables with partitions in the AWS Glue Data Catalog using the AWS SDK and AWS CloudFormation



Related articles