Amazon Bedrock RAG and Model Evaluations now support custom metrics
News
Amazon Bedrock Evaluations now supports custom metrics for evaluating foundation models and retrieval-augmented generation (RAG) systems across different deployment environments.
- Offers human-based and programmatic evaluations
- Provides built-in metrics like correctness, completeness, and faithfulness
- Now allows customers to create custom metrics using LLM-as-a-judge
- Enables writing custom judge prompts and defining unique rating scales
- Supports injecting dataset or GenAI response data into evaluation prompts
- Provides quickstart templates to help users create custom evaluation metrics
Customers can access these new features through the Amazon Bedrock console or Bedrock APIs, allowing for more tailored and precise model and RAG system evaluations.
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
Mar 20
2025
2025
Amazon Bedrock now supports RAG Evaluation (generally available)
Apr 4
2025
2025
Evaluate models or RAG systems using Amazon Bedrock Evaluations – Now generally available
Oct 9
2024
2024
Amazon Bedrock Model Evaluation now supports evaluating custom models
Apr 23
2024
2024
Amazon Bedrock model evaluation is now generally available
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.