Home icon

Boosting Salesforce Einstein’s code generating model performance with Amazon SageMaker

Machine Learning Blog



The article discusses how Salesforce's Einstein AI Platform team used Amazon SageMaker to boost the performance of their large language model, CodeGen, for code generation tasks. It highlights the specific challenges they faced with hosting CodeGen and the advantages of using SageMaker to address those issues.

Specifically, the article covers:

  • The challenge of hosting and scaling CodeGen, Salesforce's in-house code generation model
  • Salesforce's evaluation of different tools and services, and their decision to use Amazon SageMaker
  • Key SageMaker features that enabled performance optimization, such as specialized deep learning containers, advanced batching strategies, efficient routing strategies, access to high-end GPUs, and rapid iteration and deployment
  • How SageMaker helped Salesforce achieve a 6,500% increase in throughput and reduced latency for their CodeGen models
  • New challenges and opportunities encountered during the integration with SageMaker
  • Key takeaways and lessons learned for optimizing models in SageMaker


Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Apr 17
2025
How Salesforce achieves high-performance model deployment with Amazon SageMaker AI
Aug 15
2025
Optimizing Salesforce’s model endpoints with Amazon SageMaker AI inference components
Sep 17
2024
Accelerating Code Conversion with Amazon SageMaker and IBM Granite Code models
Aug 21
2025
Speed up delivery of ML workloads using Code Editor in Amazon SageMaker Unified Studio

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.