Running pgvector in production on Amazon Aurora PostgreSQL

Database Blog

This article provides comprehensive operational guidance for running pgvector in production on Amazon Aurora PostgreSQL, covering index selection, scaling strategies, memory management, and observability practices for RAG workloads.

Choose HNSW indexes for most production RAG workloads; skip indexing for small datasets or partitioned schemas requiring 100% recall
Use cosine distance (<=>) for text embeddings or inner product (<#>) for unit-normalized vectors like Amazon Titan embeddings
Enable iterative scans with relaxed_order mode to fix overfiltering in queries combining WHERE clauses with vector search
Implement two-stage retrieval with binary quantization for coarse candidate selection followed by cosine re-ranking
Manage HNSW index churn through scheduled REINDEX CONCURRENTLY, partition-based rebuilds, or append-only patterns with compaction
Size Aurora instances with memory-optimized r-series classes to keep HNSW graphs memory-resident and avoid performance degradation
Monitor BufferCacheHitRatio, query-level statistics via aurora_stat_statements, and custom recall/latency metrics to detect index drift early
Use Amazon RDS Proxy to manage connection pools and prevent work_mem exhaustion under concurrent vector query load

Successful pgvector production deployments require deliberate upfront decisions on index strategy, parameter tuning, capacity planning, and observability aligned with corpus size, write patterns, and recall targets.

Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Apr 9
2025

Announcing pgvector 0.8.0 support in Aurora PostgreSQL

Aug 8
2024

Announcing pgvector 0.7.0 support in Aurora PostgreSQL

Nov 21
2024

Amazon RDS for PostgreSQL supports pgvector 0.8.0

May 28
2025

Supercharging vector search performance and relevance with pgvector 0.8.0 on Amazon Aurora PostgreSQL

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Running pgvector in production on Amazon Aurora PostgreSQL

Related articles