Build RAG applications using Jina Embeddings v2 on Amazon SageMaker JumpStart

Machine Learning Blog

This article discusses how to build retrieval augmented generation (RAG) applications using Jina Embeddings v2 on Amazon SageMaker JumpStart. RAG is an approach to optimize the output of large language models by referencing an authoritative knowledge base before generating a response.

Specifically, the article covers:

What is RAG and its benefits
Advantages of using Jina Embeddings v2 for RAG applications
Overview of Amazon SageMaker JumpStart
Steps to deploy Jina Embeddings v2 model on SageMaker JumpStart
Preparing a dataset and indexing text embeddings
Prompting a generative LLM endpoint and querying it using the indexed context
Cleaning up resources after use
Conclusion highlighting the benefits of this approach

Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Dec 5
2024

Deploy RAG applications on Amazon SageMaker JumpStart using FAISS

Jul 11
2024

Improve RAG accuracy with fine-tuned embedding models on Amazon SageMaker

Nov 18
2024

Build cost-effective RAG applications with Binary Embeddings in Amazon Titan Text Embeddings V2, Amazon OpenSearch Serverless, and Amazon Bedrock Knowledge Bases

Sep 12
2024

Build a RAG-based QnA application using Llama3 models from SageMaker JumpStart

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Build RAG applications using Jina Embeddings v2 on Amazon SageMaker JumpStart

Related articles