Accelerating large-scale neural network training on CPUs with ThirdAI and AWS Graviton
Machine Learning Blog
This article discusses the performance of ThirdAI's BOLT engine for training neural networks on AWS Graviton3 CPUs compared to other CPU and GPU instances.
Specifically, the article covers:
- Introduction to ThirdAI's sparse deep learning engine BOLT for efficient CPU-based neural network training
- Benchmarks on three tasks: extreme multi-label classification, sentiment analysis, and multi-class text classification
- AWS Graviton3 CPUs accelerated BOLT training by 30-40% compared to Intel Ice Lake CPUs, with cost-performance benefits
- BOLT achieved comparable accuracy to GPU-based models like DistilBERT with much faster training times on CPUs
- Conclusion highlighting the performance and cost advantages of using Graviton3 for ThirdAI's CPU-based deep learning workloads
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
Mar 27
2024
2024
Accelerating simulated quantum annealing on AWS Graviton processors
Jun 17
2024
2024
Accelerate deep learning training and simplify orchestration with AWS Trainium and AWS Batch
Dec 2
2025
2025
Announcing Amazon EC2 Trn3 UltraServers for faster, lower-cost generative AI training
Apr 15
2026
2026
Accelerating physical AI with AWS and NVIDIA: building production-ready applications with simulation and real-world learning
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.