Home icon

Learn from AWS Fault Injection Service team’s approach to Game Days

AWS Cloud Operations Blog



The article discusses how the AWS Fault Injection Service (FIS) team conducts game day exercises to improve operational resilience and test system reliability.

  • Game days are structured exercises that simulate failure scenarios in pre-production environments
  • Key objectives include testing system responses, training on-call operators, and validating monitoring systems
  • The team follows a comprehensive framework with detailed preparation, execution, and post-event analysis
  • Game days help identify gaps in runbooks, improve incident response times, and build operator confidence
  • Best practices include starting simple, running exercises frequently, and focusing on business-critical services

The approach enables continuous improvement of operational capabilities, helping teams build more resilient systems and more confident operators.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Jun 26
2025
Best practices for utilizing AWS Systems Manager with AWS Fault Injection Service
Jun 23
2025
Simulating partial failures with AWS Fault Injection Service
Sep 3
2024
AWS Fault Injection Service introduces additional safety control
Nov 12
2024
AWS Fault Injection Service now generates experiment reports

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.