Home icon

Unlock cost-effective AI inference using Amazon Bedrock serverless capabilities with an Amazon SageMaker trained model

Machine Learning Blog



This article explains how to use Amazon Bedrock's Custom Model Import feature to deploy custom models trained in Amazon SageMaker for cost-effective AI inference.

  • Amazon Bedrock now supports importing custom models from architectures like Mistral, Flan, and Meta Llama
  • The article demonstrates importing a Hugging Face Flan-T5 Base model trained in SageMaker JumpStart
  • Key steps include training the model in SageMaker Studio and importing it into Amazon Bedrock
  • Benefits include on-demand, scalable, and cost-efficient model inference
  • Supports using custom fine-tuned models through a simple API

The feature enables developers to easily deploy specialized machine learning models with Amazon Bedrock's fully managed infrastructure, focusing on innovation rather than infrastructure management.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Nov 25
2024
Amazon SageMaker introduces Scale Down to Zero for AI inference to help customers save costs
Apr 21
2022
Amazon SageMaker Serverless Inference is now generally available
Oct 29
2024
Automate Amazon Bedrock batch inference: Building a scalable and efficient pipeline
Jul 9
2024
Amazon SageMaker introduces a new generative AI inference optimization capability

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.