Operationalize generative AI applications on AWS: Part I – Overview of LLMOps solution

AWS for Games Blog

This article is an overview of operationalizing generative AI applications, specifically large language models (LLMs), on AWS using LLMOps (Large Language Model Operations).

Specifically, the article covers:

Use cases for LLMOps in the gaming industry like generating unique NPC dialogues, improving script writing efficiency, and chat/audio moderation
Design patterns for model customization such as fine-tuning, pretraining, and retrieval augmented generation (RAG)
The three main phases of LLMOps: Continuous Integration (CI), Continuous Deployment (CD), and Continuous Tuning (CT)
A high-level architecture using AWS services like API Gateway, Amazon Bedrock, AWS CodePipeline, Amazon OpenSearch Service, and Amazon SageMaker
Methods to deploy the LLMOps solution on AWS

Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Mar 14
2024

Best practices to build generative AI applications on AWS

Jun 6
2024

Operationalize generative AI applications on AWS: Part II – Architecture Deep Dive

Jun 6
2024

Unlocking generative AI opportunities with AWS

Apr 30
2026

AWS Generative AI Model Agility Solution: A comprehensive guide to migrating LLMs for generative AI production

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Operationalize generative AI applications on AWS: Part I – Overview of LLMOps solution

Related articles