Extract time series from satellite weather data with AWS Lambda
Blog
This article demonstrates how to extract time series from satellite weather data using AWS Lambda, Step Functions, and AWS Glue for serverless ETL processing.
- Use Lambda functions to process multidimensional satellite data in parallel by day
- Step Functions orchestrates 365 parallel Lambda invocations for year-long dataset processing
- Each Lambda function extracts data for 100 geographical coordinates from daily weather files
- AWS Glue repartitions output data from day-based to geographical point-based partitioning
- Solution processes ~100GB satellite dataset in 1-2 minutes using serverless architecture
- Lambda requires 2048MB memory, 5-minute timeout, and netcdf4/pandas libraries as layers
- Final output: one folder per geographical point containing full-year time series data
This serverless approach enables scalable, cost-effective ETL for large multidimensional datasets without managing infrastructure.
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.