Build RAG applications using Jina Embeddings v2 on Amazon SageMaker JumpStart
Machine Learning Blog
This article discusses how to build retrieval augmented generation (RAG) applications using Jina Embeddings v2 on Amazon SageMaker JumpStart. RAG is an approach to optimize the output of large language models by referencing an authoritative knowledge base before generating a response.
Specifically, the article covers:
- What is RAG and its benefits
- Advantages of using Jina Embeddings v2 for RAG applications
- Overview of Amazon SageMaker JumpStart
- Steps to deploy Jina Embeddings v2 model on SageMaker JumpStart
- Preparing a dataset and indexing text embeddings
- Prompting a generative LLM endpoint and querying it using the indexed context
- Cleaning up resources after use
- Conclusion highlighting the benefits of this approach
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
Dec 5
2024
2024
Deploy RAG applications on Amazon SageMaker JumpStart using FAISS
Jul 11
2024
2024
Improve RAG accuracy with fine-tuned embedding models on Amazon SageMaker
Nov 18
2024
2024
Build cost-effective RAG applications with Binary Embeddings in Amazon Titan Text Embeddings V2, Amazon OpenSearch Serverless, and Amazon Bedrock Knowledge Bases
Sep 12
2024
2024
Build a RAG-based QnA application using Llama3 models from SageMaker JumpStart
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.