Deploy GPT-OSS models with Amazon Bedrock Custom Model Import
Machine Learning Blog
This article explains how to deploy OpenAI's GPT-OSS models on Amazon Bedrock using Custom Model Import while maintaining full API compatibility.
- Amazon Bedrock Custom Model Import now supports GPT-OSS models with 20B and 120B parameters
- GPT-OSS models use mixture-of-experts architecture for efficient reasoning and tool use
- Complete step-by-step deployment: download model, upload to S3, import via Bedrock, invoke with OpenAI API
- Full OpenAI Chat Completions API compatibility enables minimal code changes for migration
- Model import typically completes in 10-15 minutes for 20B parameter models
- Requires ~40GB local disk space, AWS CLI, Hugging Face CLI, and US East 1 region access
- Provides enterprise benefits: cost predictability, data privacy, fine-tuning capabilities, and model control
Organizations can now deploy open-weight GPT-OSS models on AWS infrastructure with enterprise-grade security and scaling, requiring only minimal application code modifications.
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.