Building an end-to-end agentic SRE using AWS DevOps Agent
DevOps & Developer Productivity Blog
This article demonstrates building an end-to-end agentic SRE solution using AWS DevOps Agent for autonomous incident investigation and resolution.
- AWS DevOps Agent autonomously investigates incidents, identifies root causes, and recommends mitigation plans
- Agent Spaces define investigation scope with integrated CloudWatch, Splunk, GitHub, and Slack access
- Webhook integration enables automated incident triggers from CloudWatch alarms via EventBridge and Lambda
- Splunk MCP integration provides centralized log aggregation and analysis capabilities
- DevOps Agent Skills guide investigations using Markdown-based runbooks and documentation
- Mitigation plans include four phases: Prepare, Pre-Validate, Apply, and Post-Validate
- Agent-ready specs enable handoff to coding agents like Kiro for automated fix implementation
- Multi-account architecture separates demo application, Splunk, and DevOps Agent responsibilities
This solution shifts incident response from reactive firefighting to autonomous resolution, reducing MTTR while freeing engineers for innovation.
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
Jun 15
2026
2026
AWS DevOps Agent expands with custom SRE agents and MCP/A2A protocols
Mar 31
2026
2026
Leverage Agentic AI for Autonomous Incident Response with AWS DevOps Agent
Mar 26
2026
2026
Architecting for agentic AI development on AWS
Jan 15
2026
2026
From AI agent prototype to product: Lessons from building AWS DevOps Agent
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.