Announcing OpenAI-compatible API support for Amazon SageMaker AI endpoints
Machine Learning Blog
This article announces OpenAI-compatible API support for Amazon SageMaker AI real-time inference endpoints, enabling seamless integration with existing OpenAI SDKs and frameworks.
- SageMaker endpoints now expose `/openai/v1` path for Chat Completions requests
- Use OpenAI SDK, LangChain, or Strands Agents by changing only the endpoint URL
- Bearer token authentication with time-limited tokens (up to 12 hours) from AWS credentials
- Support for single-model endpoints and multi-model deployments via inference components
- Deploy fine-tuned models without code changes to existing applications
- Run agentic workflows entirely on owned SageMaker infrastructure
- Includes IAM permissions, token generation, and deployment examples
SageMaker AI now enables drop-in OpenAI-compatible inference without custom clients or code rewrites, simplifying AI application deployment on dedicated infrastructure.
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
May 21
2026
2026
Amazon SageMaker AI now supports OpenAI-compatible APIs for inference endpoints
Apr 22
2026
2026
Amazon SageMaker AI now supports optimized generative AI inference recommendations
Mar 19
2026
2026
Enhanced metrics for Amazon SageMaker AI endpoints: deeper visibility for better performance
Dec 4
2024
2024
AWS announces Amazon SageMaker Partner AI Apps
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.