Home icon

Introducing new alerts to help users detect and react to blocked job queues in AWS Batch

HPC Blog



The article discusses new features in AWS Batch that help users detect and react to blocked job queues. AWS Batch allows running batch workloads on Amazon ECS and EKS, and sometimes a job at the head of the queue can block other jobs from running due to issues like misconfiguration or lack of resources.

Specifically, the article covers:

  • Common causes of blocked job queues, including insufficient capacity, compute environment limitations, job resource requirements, service role permissions, and invalid compute environments.
  • New CloudWatch Events notifications that Batch sends when it detects a blocked job queue, including the specific reason for the blockage.
  • Using Amazon EventBridge to define automated actions (like terminating the job) when receiving these notifications.
  • A new jobStateTimeLimitActions parameter that allows automatically cancelling a stuck job after a specified time.
  • A table mapping different blocked job queue scenarios to the corresponding CloudWatch event messages and parameters for jobStateTimeLimitActions.


Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Aug 12
2024
AWS Batch adds support for cancelling queued jobs
Jun 3
2024
AWS Batch introduces the Job Queue Snapshot to view jobs at the front of the job queues
Feb 13
2026
AWS Batch now provides Job Queue and Share Utilization Visibility
Feb 4
2026
AWS Batch now provides Array Job Status Summary in ListJobs API

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.