How to optimize querying your data in Amazon S3

Storage Blog

This article discusses three options to efficiently query data stored in Amazon S3: client-side filtering, Amazon Athena, and Amazon S3 Object Lambda.

Specifically, the article covers:

Client-side filtering: download the S3 object and query it locally with a data analysis library such as Pandas, which is fast for a single object or for objects restored from S3 Glacier
Amazon Athena: run serverless SQL queries on data in S3, suited to data lakes and to queries that span multiple objects
Amazon S3 Object Lambda: add custom code to S3 GET requests so that data is filtered and processed as it is retrieved
Detailed walkthroughs with code examples for using each of the three options
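
As a rough illustration of the first option, client-side filtering, the sketch below downloads a whole object and filters it in memory. It is not taken from the article: the function names `fetch_object` and `filter_rows`, the CSV layout, and the sample data are all assumptions, and the standard-library `csv` module stands in for Pandas so the filter runs without extra dependencies (with Pandas, `pandas.read_csv` plus a boolean mask would play the same role).

```python
import csv
import io


def filter_rows(csv_bytes, column, value):
    """Return the rows (as dicts) whose `column` equals `value`."""
    reader = csv.DictReader(io.StringIO(csv_bytes.decode("utf-8")))
    return [row for row in reader if row.get(column) == value]


def fetch_object(bucket, key):
    """Download a whole S3 object into memory.

    Hypothetical helper: needs boto3 installed and AWS credentials configured.
    """
    import boto3  # imported lazily so filter_rows works without boto3

    response = boto3.client("s3").get_object(Bucket=bucket, Key=key)
    return response["Body"].read()


# Local stand-in for a downloaded object, so the filter can be tried offline.
sample = b"city,temp\nOslo,4\nLima,19\nOslo,6\n"
matches = filter_rows(sample, "city", "Oslo")
print(matches)
```

Against a real bucket, the bytes would come from `fetch_object("my-bucket", "data.csv")` (placeholder names) instead of `sample`. Because the entire object is transferred before any filtering happens, this pattern fits the single-object case the summary describes rather than queries across many objects.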



May 14, 2026

Optimize Amazon S3 Tables queries with Amazon Redshift

Dec 4, 2024

How Amazon S3 Tables use compaction to improve query performance by up to 3 times

Jul 21, 2025

Optimizing vector search using Amazon S3 Vectors and Amazon OpenSearch Service

Oct 21, 2025

Optimizing recommendations and analytics using Amazon DynamoDB and Amazon S3

Related articles