Home icon

A guide to capacity planning for Airflow worker pool in Amazon MWAA

Big Data Blog



This article provides a practical capacity planning framework for Amazon MWAA worker pools, using a financial services scenario with 25% DAG growth to demonstrate how to right-size infrastructure before workload increases hit production.

  • Assess current peak concurrent tasks using CloudWatch RunningTasks and QueuedTasks metrics
  • Calculate required workers: peak tasks ÷ tasks-per-worker × safety buffer (5-15%)
  • Target 85-95% peak utilization to maintain SLA compliance and handle unexpected spikes
  • Monitor five critical metrics: QueuedTasks, RunningTasks, AdditionalWorkers, Worker CPU, Task Duration
  • Set CloudWatch alarms for queue depth, permanent worker detection, and SLA risk alerts
  • Choose strategy: full provisioning (mission-critical), hybrid (balanced), or minimal+scaling (cost-focused)
  • Conduct quarterly reviews plus trigger-based assessments when DAGs grow >10% or SLA breaches occur

Effective capacity planning prevents both under-provisioning (SLA breaches) and over-provisioning (cost overruns) by measuring current state, projecting growth, calculating needs with buffers, and monitoring continuously.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

May 1
2026
A guide to Airflow worker pool optimization in Amazon MWAA
Nov 19
2024
Introducing Amazon MWAA micro environments for Apache Airflow
May 16
2024
Introducing Amazon MWAA support for the Airflow REST API and web server auto scaling
Feb 28
2024
Introducing Amazon MWAA support for Apache Airflow version 2.8.1

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.