Real-time personalized recommendations with Amazon SageMaker and Amazon-managed Valkey

Database Blog

This article describes an architecture for delivering real-time personalized product recommendations using Amazon SageMaker and Amazon-managed Valkey, addressing the challenges of serving relevant results at scale.

SageMaker hosts sentence transformer models to convert customer queries into semantic vectors
Valkey provides sub-10ms vector similarity search using HNSW indexing on millions of products
Architecture handles latency, throughput, semantic understanding, and personalization requirements
Optional enrichment from DynamoDB layers additional signals like preferences and trending items
Parallel processing and configurable pipelines keep end-to-end latency under 100 milliseconds
Valkey chosen for in-memory performance, rich filtering, and multiple indexes per cluster
Index populated via batch embedding generation and pipelined HSET commands

The solution demonstrates how to combine semantic embeddings, vector search, and optional enrichment to build production-scale recommendation systems that feel instant and personalized.

Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Apr 22
2026

Amazon SageMaker AI launches optimized generative AI inference recommendations

Jul 27
2026

Building a scalable personalized recommendation system on AWS: From batch to real-time

Apr 22
2026

Amazon SageMaker AI now supports optimized generative AI inference recommendations

Jun 28
2024

Amazon SageMaker now supports SageMaker Studio Personalization

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Real-time personalized recommendations with Amazon SageMaker and Amazon-managed Valkey

Related articles