Improve the performance of generative AI workloads on Amazon Aurora with Optimized Reads and pgvector
Database Blog
This article discusses how using Amazon Aurora Optimized Reads with pgvector can significantly improve the performance of generative AI workloads involving vector similarity search on large datasets that exceed the memory capacity of a database instance.
Specifically, the article covers:
- How Optimized Reads uses local NVMe SSD storage to cache data evicted from memory, reducing reliance on slower network storage and improving read latency and throughput
- Benchmark results showing Optimized Reads instances providing up to 9x higher query throughput and 75-80% lower cost per query compared to instances without Optimized Reads for a 1 billion vector dataset
- Details on the benchmark setup, instance configurations, and performance metrics
- How Optimized Reads allows scaling vector workloads further on the same instance size before upgrading
- Conclusion that Optimized Reads offers a high-performance, cost-effective solution for vector similarity search workloads exceeding instance memory
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
Feb 2
2024
2024
Build generative AI applications with Amazon Aurora and Amazon Bedrock Knowledge Bases
Aug 16
2024
2024
Improve speed and reduce cost for generative AI workloads with a persistent semantic cache in Amazon MemoryDB
Nov 29
2024
2024
Improve the performance of your Generative AI applications with Prompt Optimization on Amazon Bedrock
Dec 17
2025
2025
Improve Aurora PostgreSQL throughput by up to 165% and price-performance ratio by up to 120% using Optimized Reads on AWS Graviton4-based R8gd instances
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.