Amazon Bedrock RAG and Model Evaluations now support custom metrics

News

Amazon Bedrock Evaluations now supports custom metrics for evaluating foundation models and retrieval-augmented generation (RAG) systems across different deployment environments.

Offers human-based and programmatic evaluations
Provides built-in metrics like correctness, completeness, and faithfulness
Now allows customers to create custom metrics using LLM-as-a-judge
Enables writing custom judge prompts and defining unique rating scales
Supports injecting dataset or GenAI response data into evaluation prompts
Provides quickstart templates to help users create custom evaluation metrics

Customers can access these new features through the Amazon Bedrock console or Bedrock APIs, allowing for more tailored and precise model and RAG system evaluations.

Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Mar 20
2025

Amazon Bedrock now supports RAG Evaluation (generally available)

Apr 4
2025

Evaluate models or RAG systems using Amazon Bedrock Evaluations – Now generally available

Oct 9
2024

Amazon Bedrock Model Evaluation now supports evaluating custom models

Apr 23
2024

Amazon Bedrock model evaluation is now generally available

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Amazon Bedrock RAG and Model Evaluations now support custom metrics

Related articles