Home icon

Automate Amazon EKS troubleshooting using an Amazon Bedrock agentic workflow

Machine Learning Blog



This article describes an innovative solution for automating Amazon EKS troubleshooting using Amazon Bedrock multi-agent collaboration. The solution creates an intelligent system that can identify, analyze, and resolve Kubernetes cluster issues with minimal human intervention.

  • Uses multiple specialized AI agents working together:
    • Collaborator agent for workflow orchestration
    • K8sGPT agent for cluster analysis
    • ArgoCD agent for remediation
  • Key technologies include:
  • Amazon Bedrock
  • K8sGPT for cluster analysis
  • ArgoCD for GitOps deployment
  • Demonstrates capabilities such as:
  • Identifying pod failures
  • Analyzing resource constraints
  • Automatically suggesting and implementing fixes

The solution aims to reduce mean time to identify (MTTI) and mean time to resolve (MTTR) for Kubernetes cluster issues, enabling operations teams to focus on innovation rather than manual troubleshooting.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Mar 20
2025
Streamline AWS resource troubleshooting with Amazon Bedrock Agents and AWS Support Automation Workflows
Mar 21
2025
Automate IT operations with Amazon Bedrock Agents
May 20
2025
How to automate incident response for Amazon EKS on Amazon EC2
Feb 25
2025
Accelerate IaC troubleshooting with Amazon Bedrock Agents

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.