Operationalize generative AI applications on AWS: Part II – Architecture Deep Dive

AWS for Games Blog

This article discusses an architecture for operationalizing generative AI applications on AWS, with a focus on creating dialogs for non-player characters (NPCs) in games. The key points are:

Specifically, the article covers:

Base model inference: Invoking foundation models hosted on Amazon Bedrock via APIs (API Gateway, Lambda) for generating text or image content
Code pipeline: Automating the continuous integration and deployment of the application using AWS CodePipeline, CodeCommit, CodeBuild, and CloudFormation
Continuous fine-tuning: Automating the process of data preprocessing, model fine-tuning, evaluation, and registration using SageMaker Pipelines, Step Functions, and Bedrock
Retrieval Augmented Generation (RAG): Adding context to the model's responses by retrieving relevant information from a vector database (Amazon OpenSearch Service) and augmenting the input prompts
Conclusion: The article provides an overview of the architecture and points to the code and a workshop for further details

Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Apr 24
2024

Let’s Architect! Discovering Generative AI on AWS

Mar 14
2024

Best practices to build generative AI applications on AWS

May 30
2025

Architect a mature generative AI foundation on AWS

Jul 22
2026

Architecting offline-first generative AI applications for edge deployments using AWS services

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Operationalize generative AI applications on AWS: Part II – Architecture Deep Dive

Related articles