Home icon

Develop and test AWS Glue 5.0 jobs locally using a Docker container

Big Data Blog



This article provides a comprehensive guide to developing and testing AWS Glue 5.0 jobs locally using a Docker container. Key highlights include:

  • AWS Glue 5.0 offers a performance-optimized Apache Spark 3.5 runtime for data integration
  • An official AWS Glue Docker image is available in the Amazon ECR Public Gallery
  • The Docker image includes:
    • Amazon Linux 2023
    • AWS Glue ETL Library
    • Apache Spark 3.5.2
    • Open table format libraries
  • Multiple development methods are supported:
    • spark-submit
    • REPL shell (pyspark)
    • pytest
    • Visual Studio Code

The article provides detailed instructions for setting up and running AWS Glue jobs locally, including configuration of AWS credentials, pulling the Docker image, and running containers with different development approaches.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Dec 3
2024
Introducing AWS Glue 5.0
Nov 26
2025
Introducing AWS Glue 5.1
Oct 29
2024
Control your AWS Glue Studio development interface with AWS Glue job mode API property
Mar 14
2024
Use Azure DevOps to deploy AWS Glue jobs in CI/CD pipeline

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.