Home icon

Operationalize generative AI applications on AWS: Part II – Architecture Deep Dive

AWS for Games Blog



This article discusses an architecture for operationalizing generative AI applications on AWS, with a focus on creating dialogs for non-player characters (NPCs) in games. The key points are:

Specifically, the article covers:

  • Base model inference: Invoking foundation models hosted on Amazon Bedrock via APIs (API Gateway, Lambda) for generating text or image content
  • Code pipeline: Automating the continuous integration and deployment of the application using AWS CodePipeline, CodeCommit, CodeBuild, and CloudFormation
  • Continuous fine-tuning: Automating the process of data preprocessing, model fine-tuning, evaluation, and registration using SageMaker Pipelines, Step Functions, and Bedrock
  • Retrieval Augmented Generation (RAG): Adding context to the model's responses by retrieving relevant information from a vector database (Amazon OpenSearch Service) and augmenting the input prompts
  • Conclusion: The article provides an overview of the architecture and points to the code and a workshop for further details


Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Apr 24
2024
Let’s Architect! Discovering Generative AI on AWS
Mar 14
2024
Best practices to build generative AI applications on AWS
May 30
2025
Architect a mature generative AI foundation on AWS
Apr 2
2024
Operationalize generative AI applications on AWS: Part I – Overview of LLMOps solution

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.