
Optimizing AI implementation costs with Automat-it

Machine Learning Blog



This article details how Automat-it helped a customer optimize AI implementation costs for video intelligence solutions that run YOLOv8 models on Amazon Elastic Kubernetes Service (Amazon EKS).

  • Initial approach used dedicated GPU instances per customer, costing $353 per camera per month
  • Implemented GPU time-slicing to share GPU resources across multiple AI models
  • Used NVIDIA Kubernetes device plugin to enable GPU resource fractionalization
  • Reduced inference costs to $27.81 per camera per month, roughly a twelvefold cost reduction
  • Maintained model performance while running up to 54 pods on a single GPU
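
As a rough illustration of the time-slicing setup listed above, the sketch below builds the kind of sharing configuration the NVIDIA Kubernetes device plugin reads, expressed as a Python dictionary. The `replicas` value of 54 is an assumption borrowed from the pod count above, not a setting confirmed by the article, and the helper name `time_slicing_config` is hypothetical.

```python
import json


def time_slicing_config(replicas: int) -> dict:
    """Build a sharing config that advertises each physical GPU `replicas` times."""
    # Sanity check that belongs to this sketch only, not to the plugin.
    if replicas < 1:
        raise ValueError("replicas must be a positive integer")
    return {
        "version": "v1",
        "sharing": {
            "timeSlicing": {
                "resources": [
                    # Each GPU is exposed as `replicas` schedulable nvidia.com/gpu units.
                    {"name": "nvidia.com/gpu", "replicas": replicas},
                ]
            }
        },
    }


# Assumption: one replica per pod, using the 54-pod figure from the summary.
config = time_slicing_config(54)
print(json.dumps(config, indent=2))
```

In a cluster, this content would typically be supplied to the device plugin as YAML in a ConfigMap; the dictionary form here is only for inspection.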

The solution demonstrates how strategic GPU resource sharing can dramatically lower AI infrastructure costs without compromising performance, using Kubernetes and time-slicing techniques.
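
The cost figures quoted in the summary can be checked with quick arithmetic. This short sketch uses only the two per-camera prices stated above; the variable names are illustrative.

```python
COST_DEDICATED = 353.00  # USD per camera per month, dedicated GPU instances
COST_SHARED = 27.81      # USD per camera per month, with GPU time-slicing

# How many times cheaper the shared setup is per camera.
reduction_factor = COST_DEDICATED / COST_SHARED

# Share of the original per-camera cost that is saved.
savings_pct = (1 - COST_SHARED / COST_DEDICATED) * 100

print(f"reduction factor: {reduction_factor:.1f}x")  # prints "reduction factor: 12.7x"
print(f"savings: {savings_pct:.1f}%")                # prints "savings: 92.1%"
```

The ratio works out to about 12.7, consistent with the "twelvefold" figure in the summary.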





