Building responsive APIs with Amazon API Gateway response streaming
Compute Blog
This article announces API Gateway response streaming, enabling REST APIs to progressively stream response payloads to clients for improved responsiveness and user experience.
- Streams responses progressively, reducing time-to-first-byte for interactive applications
- Supports LLM-driven applications, large file transfers, and long-running operations
- Works with HTTP-proxy, Lambda proxy, and private integrations
- Supports timeouts up to 15 minutes and responses exceeding 10MB
- Enable via console, OpenAPI spec, CloudFormation, or AWS CLI
- Lambda functions must use streamifyResponse and HttpResponseStream.from() methods
- Compatible with API Gateway security features and observability tools
- First 10MB unrestricted; data beyond 10MB limited to 2MB/s
- Pricing unchanged; each 10MB rounded up billed as single request
API Gateway response streaming significantly improves user experience for AI applications and interactive services by delivering real-time response feedback without architectural complexity.
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
2025
2025
2025
2025
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.