Operationalize generative AI applications on AWS: Part I – Overview of LLMOps solution
AWS for Games Blog
This article is an overview of operationalizing generative AI applications, specifically large language models (LLMs), on AWS using LLMOps (Large Language Model Operations).
Specifically, the article covers:
- Use cases for LLMOps in the gaming industry like generating unique NPC dialogues, improving script writing efficiency, and chat/audio moderation
- Design patterns for model customization such as fine-tuning, pretraining, and retrieval augmented generation (RAG)
- The three main phases of LLMOps: Continuous Integration (CI), Continuous Deployment (CD), and Continuous Tuning (CT)
- A high-level architecture using AWS services like API Gateway, Amazon Bedrock, AWS CodePipeline, Amazon OpenSearch Service, and Amazon SageMaker
- Methods to deploy the LLMOps solution on AWS
The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.
Related articles
Mar 14
2024
2024
Best practices to build generative AI applications on AWS
Jun 6
2024
2024
Operationalize generative AI applications on AWS: Part II – Architecture Deep Dive
Jun 6
2024
2024
Unlocking generative AI opportunities with AWS
Apr 30
2026
2026
AWS Generative AI Model Agility Solution: A comprehensive guide to migrating LLMs for generative AI production
The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.