Home icon

How Tealium built a chatbot evaluation platform with Ragas and Auto-Instruct using AWS generative AI services

Machine Learning Blog



This article details how Tealium built a chatbot evaluation platform using Ragas and Auto-Instruct with AWS generative AI services, specifically Amazon Bedrock. The solution focuses on improving RAG (Retrieval Augmented Generation) pipelines through comprehensive evaluation and error correction techniques.

  • Developed an evaluation framework using Ragas to assess RAG system performance across metrics like faithfulness, context precision, and answer relevancy
  • Implemented automatic prompt engineering using Auto-Instruct to generate and rank improved instructions
  • Created a human-in-the-loop UI for subject matter experts to provide feedback on model outputs
  • Utilized Amazon Bedrock's flexible foundation models for evaluation and instruction generation
  • Achieved 85% context utilization, 86% faithfulness, and 76% answer relevancy in initial testing

The collaboration demonstrated Amazon Bedrock's potential in powering sophisticated generative AI solutions with improved evaluation and refinement capabilities.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Nov 28
2024
Building your first generative AI conversational experience on AWS
Oct 4
2024
Improving constituent experience using AWS-powered generative AI chatbots
Oct 7
2024
Build a generative AI Slack chat assistant using Amazon Bedrock and Amazon Kendra
Feb 14
2024
Build generative AI chatbots using prompt engineering with Amazon Redshift and Amazon Bedrock

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.