Automate caption creation and search for images at enterprise scale using generative AI and Amazon Kendra
Blog
This article demonstrates how to automate image captioning and enable searchable image repositories using generative AI and Amazon Kendra.
- Generative AI models automatically create textual captions for images, eliminating manual metadata tagging
- Amazon Kendra's Custom Document Enrichment invokes AI models during image ingestion to generate searchable metadata
- Solution combines vision and language models (ViT-GPT2) deployed on Amazon SageMaker with Amazon Textract
- Applicable across ecommerce, healthcare, manufacturing, marketing, and accessibility use cases
- CloudFormation template provided for easy deployment with estimated monthly cost of $129.20
- Users can search images using natural language queries like "dog under umbrella"
This solution automates enterprise-scale image search by leveraging generative AI to create rich, searchable metadata without manual effort.
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.