How to optimize querying your data in Amazon S3
Storage Blog
This article discusses three options to efficiently query data stored in Amazon S3: client-side filtering, Amazon Athena, and Amazon S3 Object Lambda.
Specifically, the article covers:
- Client-side filtering: download the S3 object and query it locally with a data analysis library such as Pandas, which is fast when working with a single object or with objects restored from S3 Glacier
- Amazon Athena: run serverless SQL queries directly against data in S3, which suits data lakes and queries that span multiple objects
- Amazon S3 Object Lambda: attach custom code to S3 GET requests so that data is filtered and transformed as it is retrieved
- Detailed walkthroughs with code examples for using each of the three options
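As a rough illustration of the first and third options, the sketch below is not taken from the article. It uses Python's standard `csv` module in place of Pandas to stay dependency-free, and it assumes the object is a non-empty UTF-8 CSV file with a header row. The helper names (`filter_csv`, `query_client_side`, `object_lambda_handler`) and the `status` / `active` filter are hypothetical; the boto3 calls (`get_object`, `write_get_object_response`) and the `getObjectContext` event fields are the standard ones.

```python
import csv
import io


def filter_csv(data: bytes, column: str, value: str) -> bytes:
    """Return only the CSV rows whose `column` equals `value` (header kept)."""
    reader = csv.DictReader(io.StringIO(data.decode("utf-8")))
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=reader.fieldnames, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        if row[column] == value:
            writer.writerow(row)
    return out.getvalue().encode("utf-8")


def query_client_side(bucket: str, key: str, column: str, value: str) -> bytes:
    """Client-side filtering: download the whole object, then filter locally."""
    import boto3  # imported lazily so filter_csv works without the AWS SDK

    body = boto3.client("s3").get_object(Bucket=bucket, Key=key)["Body"].read()
    return filter_csv(body, column, value)


def object_lambda_handler(event, context):
    """S3 Object Lambda: filter the object while the GET request is served."""
    import urllib.request

    import boto3

    ctx = event["getObjectContext"]
    # Presigned URL for the original object, supplied by S3 Object Lambda.
    with urllib.request.urlopen(ctx["inputS3Url"]) as resp:
        original = resp.read()
    # Hypothetical fixed filter; a real function would likely derive it
    # from the request or from configuration.
    filtered = filter_csv(original, "status", "active")
    boto3.client("s3").write_get_object_response(
        Body=filtered,
        RequestRoute=ctx["outputRoute"],
        RequestToken=ctx["outputToken"],
    )
    return {"status_code": 200}
```

The filtering logic is the same in both paths; what differs is where it runs. With client-side filtering the full object crosses the network before any rows are dropped, whereas with S3 Object Lambda the caller only receives the filtered rows.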
Related articles
- May 14, 2026: Optimize Amazon S3 Tables queries with Amazon Redshift
- Dec 4, 2024: How Amazon S3 Tables use compaction to improve query performance by up to 3 times
- Jul 21, 2025: Optimizing vector search using Amazon S3 Vectors and Amazon OpenSearch Service
- Oct 21, 2025: Optimizing recommendations and analytics using Amazon DynamoDB and Amazon S3