Build Strands Agents with SageMaker AI models and MLflow

Machine Learning Blog

This article demonstrates how to build AI agents using Strands Agents SDK with models deployed on Amazon SageMaker AI endpoints, integrated with MLflow for observability and A/B testing.

Deploy foundation models from SageMaker JumpStart as SageMaker AI endpoints
Integrate Strands Agents SDK with SageMaker AI models using OpenAI-compatible APIs
Set up SageMaker Serverless MLflow App for automatic agent tracing and observability
Implement A/B testing by deploying multiple model variants behind single endpoint
Evaluate agent performance using MLflow GenAI evaluation framework with custom scorers
Compare model variants (Qwen3-4B vs Qwen3-8B) using metrics and LLM-based judges
Transition to better-performing model by adjusting traffic weights

This approach enables organizations to build production-grade AI agents with infrastructure control, cost optimization, compliance capabilities, and comprehensive monitoring without relying solely on managed foundation model services.

Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Mar 5
2026

Building custom model provider for Strands Agents with LLMs hosted on SageMaker AI endpoints

May 4
2026

Agent-guided workflows to accelerate model customization in Amazon SageMaker AI

May 16
2025

Introducing Strands Agents, an Open Source AI Agents SDK

Aug 1
2025

Observing and evaluating AI agentic workflows with Strands Agents SDK and Arize AX

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Build Strands Agents with SageMaker AI models and MLflow

Related articles