Introducing multimodal retrieval for Amazon Bedrock Knowledge Bases
Machine Learning Blog
This article announces general availability of multimodal retrieval for Amazon Bedrock Knowledge Bases, enabling RAG applications to search text, images, audio, and video content.
- Native support for video and audio alongside text and images in unified workflow
- Amazon Nova Multimodal Embeddings encodes all content types into single vector space
- Bedrock Data Automation converts multimedia to text descriptions and transcripts
- Cross-modal retrieval enables searching with images to find visually similar products
- Video and audio automatically segmented into 5-30 second chunks for embedding
- Timestamps provided for exact temporal location within source videos
- Nova best for visual-driven use cases; Data Automation best for speech-heavy content
- Console walkthrough demonstrates e-commerce product search example
Multimodal retrieval eliminates custom preprocessing complexity for enterprise RAG applications spanning multiple content types.
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
Dec 1
2025
2025
Multimodal retrieval for Bedrock Knowledge Bases now generally available
Dec 4
2024
2024
Amazon Bedrock Knowledge Bases now processes multimodal data
Dec 4
2024
2024
Amazon Bedrock Knowledge Bases now supports structured data retrieval
May 28
2025
2025
Building a multimodal RAG based application using Amazon Bedrock Data Automation and Amazon Bedrock Knowledge Bases
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.