Home icon

Accelerating large-scale neural network training on CPUs with ThirdAI and AWS Graviton

Machine Learning Blog



This article discusses the performance of ThirdAI's BOLT engine for training neural networks on AWS Graviton3 CPUs compared to other CPU and GPU instances.

Specifically, the article covers:

  • Introduction to ThirdAI's sparse deep learning engine BOLT for efficient CPU-based neural network training
  • Benchmarks on three tasks: extreme multi-label classification, sentiment analysis, and multi-class text classification
  • AWS Graviton3 CPUs accelerated BOLT training by 30-40% compared to Intel Ice Lake CPUs, with cost-performance benefits
  • BOLT achieved comparable accuracy to GPU-based models like DistilBERT with much faster training times on CPUs
  • Conclusion highlighting the performance and cost advantages of using Graviton3 for ThirdAI's CPU-based deep learning workloads


Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Mar 27
2024
Accelerating simulated quantum annealing on AWS Graviton processors
Jun 17
2024
Accelerate deep learning training and simplify orchestration with AWS Trainium and AWS Batch
Dec 2
2025
Announcing Amazon EC2 Trn3 UltraServers for faster, lower-cost generative AI training
Apr 15
2026
Accelerating physical AI with AWS and NVIDIA: building production-ready applications with simulation and real-world learning

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.