Optimizing AI implementation costs with Automat-it

Machine Learning Blog

This article details how Automat-it helped a customer optimize AI implementation costs for video intelligence solutions using YOLOv8 models on AWS Elastic Kubernetes Service (EKS).

Initial approach used dedicated GPU instances per customer, costing $353 per camera monthly
Implemented GPU time-slicing to share GPU resources across multiple AI models
Used NVIDIA Kubernetes device plugin to enable GPU resource fractionalization
Reduced inference costs to $27.81 per camera monthly - a twelvefold cost reduction
Successfully maintained model performance while processing up to 54 pods on a single GPU

The solution demonstrates how strategic GPU resource sharing can dramatically lower AI infrastructure costs without compromising performance, using Kubernetes and time-slicing techniques.

Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Mar 28
2025

Optimizing cost for building AI models with Amazon EC2 and SageMaker AI

Mar 18
2025

Optimizing Cost for Generative AI with AWS

Jun 25
2026

You CAN Manage, Forecast, and Evaluate AI Costs

Sep 23
2024

Generative AI Cost Optimization Strategies

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Optimizing AI implementation costs with Automat-it

Related articles