Boosting Salesforce Einstein’s code generating model performance with Amazon SageMaker

Machine Learning Blog

The article discusses how Salesforce's Einstein AI Platform team used Amazon SageMaker to boost the performance of their large language model, CodeGen, for code generation tasks. It highlights the specific challenges they faced with hosting CodeGen and the advantages of using SageMaker to address those issues.

Specifically, the article covers:

The challenge of hosting and scaling CodeGen, Salesforce's in-house code generation model
Salesforce's evaluation of different tools and services, and their decision to use Amazon SageMaker
Key SageMaker features that enabled performance optimization, such as specialized deep learning containers, advanced batching strategies, efficient routing strategies, access to high-end GPUs, and rapid iteration and deployment
How SageMaker helped Salesforce achieve a 6,500% increase in throughput and reduced latency for their CodeGen models
New challenges and opportunities encountered during the integration with SageMaker
Key takeaways and lessons learned for optimizing models in SageMaker

Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Apr 17
2025

How Salesforce achieves high-performance model deployment with Amazon SageMaker AI

Aug 15
2025

Optimizing Salesforce’s model endpoints with Amazon SageMaker AI inference components

Sep 17
2024

Accelerating Code Conversion with Amazon SageMaker and IBM Granite Code models

Aug 21
2025

Speed up delivery of ML workloads using Code Editor in Amazon SageMaker Unified Studio

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Boosting Salesforce Einstein’s code generating model performance with Amazon SageMaker

Related articles