Home icon
Amazon Bedrock Model Evaluation now supports evaluating custom models

News



The article discusses Amazon Bedrock Model Evaluation, a service that allows customers to evaluate, compare, and select the best foundation models for their use cases. It supports automatic evaluation with predefined metrics like accuracy, robustness, and toxicity, as well as human evaluation for subjective and custom metrics like friendliness, style, and brand voice alignment.

Specifically, the article covers:

  • Evaluation methods: Automatic evaluation with predefined algorithms and human evaluation workflows
  • Evaluation metrics: Built-in metrics like accuracy, robustness, toxicity, as well as custom metrics
  • New feature: Customers can now evaluate their own custom fine-tuned models from Amazon Bedrock's fine-tuning and continued pretraining jobs
  • Availability: The service is generally available in commercial regions and AWS GovCloud (US-West)
  • Getting started: Sign in to Amazon Bedrock on the AWS Management Console or use the Amazon Bedrock APIs


Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.