Understanding AWS Lambda’s invoke throttling limits
Blog
This article explains AWS Lambda's three invoke throttle limits and how they work together to protect the service and prevent noisy neighbor issues.
- Concurrency limit: Maximum simultaneous in-flight invocations allowed at any time
- TPS limit: Capped at 10x concurrency to protect invoke data plane services
- TPS formula: min(10 x concurrency, concurrency / function duration in seconds)
- Functions under 100ms duration are TPS-limited rather than concurrency-limited
- Burst limit: Uses token bucket algorithm to gradually ramp up concurrency
- Burst prevents sudden cold start surges that stress invoke services
- Token bucket refills at constant rate until reaching maximum capacity
Understanding these three throttle limits helps developers interpret 429 errors and optimize Lambda workload scaling.
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.