
Resilience testing on Amazon ElastiCache with AWS Fault Injection Service

Database Blog



This article explains how to use AWS Fault Injection Service (FIS) to test Amazon ElastiCache resilience by simulating node failures and validating failover mechanisms.

  • AWS FIS enables controlled fault injection experiments on ElastiCache clusters without custom scripts
  • The test setup uses a Multi-AZ enabled Valkey cluster with primary and replica nodes
  • The FIS action interrupts power to the nodes in a specified Availability Zone, triggering automatic failover
  • The replica with the lowest replication lag is promoted to primary; the DNS endpoint updates automatically
  • Monitor failover via CloudWatch logs, ElastiCache events, and cluster status changes
  • Validate that the application handles brief connection interruptions gracefully and falls back to the database
  • Failover typically completes within seconds; applications should expect connection interruptions of roughly 5-15 seconds
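
As a rough illustration of the experiment described above, the sketch below builds a dictionary shaped like an FIS experiment template for an Availability Zone power interruption. It is a minimal sketch, not the article's own template: the function name, the target and action labels, and the example ARNs are hypothetical, and the action ID, resource type, and parameter names reflect our reading of the FIS action reference and should be checked against the current documentation before use.

```python
import json


def build_az_power_interruption_template(replication_group_arn,
                                         availability_zone,
                                         role_arn,
                                         duration="PT5M"):
    """Return a dict shaped like an FIS experiment template.

    Assumption: the action ID, resource type, and parameter names below
    match the FIS action reference; verify them before running anything.
    The labels "cacheReplicationGroups" and "interruptAzPower" are
    arbitrary names chosen for this sketch.
    """
    return {
        "description": "Interrupt power to ElastiCache nodes in one AZ",
        "targets": {
            "cacheReplicationGroups": {
                "resourceType": "aws:elasticache:replicationgroup",
                "resourceArns": [replication_group_arn],
                "selectionMode": "ALL",
                # The AZ whose nodes lose power during the experiment.
                "parameters": {"availabilityZoneIdentifier": availability_zone},
            }
        },
        "actions": {
            "interruptAzPower": {
                "actionId": "aws:elasticache:replicationgroup-interrupt-az-power",
                # ISO 8601 duration of the interruption.
                "parameters": {"duration": duration},
                "targets": {"ReplicationGroups": "cacheReplicationGroups"},
            }
        },
        # No stop condition here; a CloudWatch alarm is the safer choice
        # for anything beyond a sandbox.
        "stopConditions": [{"source": "none"}],
        "roleArn": role_arn,
    }


if __name__ == "__main__":
    template = build_az_power_interruption_template(
        "arn:aws:elasticache:us-east-1:123456789012:replicationgroup:demo",
        "us-east-1a",
        "arn:aws:iam::123456789012:role/fis-demo-role",
    )
    print(json.dumps(template, indent=2))
```

The resulting dictionary could be written to a JSON file for the AWS CLI or passed as keyword arguments to an SDK call that creates experiment templates. Choosing the Availability Zone that currently hosts the primary node is what exercises the failover path.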

This guide helps teams proactively identify weaknesses in caching strategies and validate failover mechanisms before production incidents occur.
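
One way to picture the application-side behavior worth validating during such an experiment is a read path that retries the cache briefly and then falls back to the database. The sketch below assumes hypothetical `cache_get` and `db_get` callables and a hypothetical `CacheUnavailable` exception standing in for whatever connection error a real cache client raises; it is not taken from the article.

```python
import time


class CacheUnavailable(Exception):
    """Hypothetical stand-in for a cache client's connection error."""


def get_with_fallback(key, cache_get, db_get,
                      retries=2, base_delay=0.1, sleep=time.sleep):
    """Read from the cache, retrying briefly, then fall back to the database.

    Returns a (value, source) tuple where source is "cache" or "database".
    """
    for attempt in range(retries + 1):
        try:
            value = cache_get(key)
            if value is not None:
                return value, "cache"
            # A miss means the cache is reachable; stop retrying.
            break
        except CacheUnavailable:
            if attempt < retries:
                # Exponential backoff: base_delay, 2 * base_delay, ...
                sleep(base_delay * (2 ** attempt))
    return db_get(key), "database"


if __name__ == "__main__":
    def failing_cache(key):
        # Simulates the window in which the primary node is unreachable.
        raise CacheUnavailable("primary node unreachable")

    def database(key):
        return f"row-for-{key}"

    print(get_with_fallback("user:42", failing_cache, database))
```

Keeping the retry budget small matters here: with a failover window of several seconds, long retries against the cache would stall requests that the database could have served.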




