Home icon

Sprinklr improves performance by 20% and reduces cost by 25% for machine learning inference on AWS Graviton3

Machine Learning Blog



This article discusses how Sprinklr optimized their machine learning inference performance and reduced costs by migrating to AWS Graviton3 instances.

Specifically, the article covers:

  • Sprinklr's AI scale and challenges with diverse AI workloads
  • Cost-effective ML inference using AWS Graviton3
  • Sprinklr's two-step approach to migration: benchmarking and integration
  • Results: 20% throughput improvement, 30% latency reduction, 25-30% cost savings
  • Conclusion highlighting the benefits of using Graviton3 instances


Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

May 15
2024
Accelerate NLP inference with ONNX Runtime on AWS Graviton processors
Nov 25
2025
Warner Bros. Discovery achieves 60% cost savings and faster ML inference with AWS Graviton
Nov 24
2025
How potential performance upside with AWS Graviton helps reduce your costs further
Feb 29
2024
Accelerating large-scale neural network training on CPUs with ThirdAI and AWS Graviton

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.