Home icon

How PDI built an enterprise-grade RAG system for AI applications with AWS

Machine Learning Blog



This article describes how PDI Technologies built PDIQ, an enterprise-grade Retrieval Augmented Generation (RAG) system using AWS serverless services to provide employees with AI-powered access to company knowledge.

  • PDIQ consolidates knowledge from disparate sources including websites, Confluence, SharePoint, and Azure DevOps
  • Supports multiple crawler types with flexible authentication methods and configurable refresh schedules
  • Implements image captioning and document summarization to enhance semantic search accuracy
  • Uses dynamic token allocation: 70% content, 10% overlap, 20% summary for optimal embeddings
  • Stores vector embeddings in Aurora PostgreSQL with comprehensive metadata for filtering
  • Enforces zero-trust security with role-based access control and encrypted credentials
  • Improved accuracy approval rate from 60% to 79% through prepended document summaries
  • Delivers faster query resolution, higher customer satisfaction, and reduced operational costs

PDI's PDIQ demonstrates how serverless AWS services enable scalable, cost-effective enterprise RAG systems with advanced document processing and security controls.



Go to article

The AWS News Feed is currently looking for gold sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.

Related articles

Jul 17
2025
Building enterprise-scale RAG applications with Amazon S3 Vectors and DeepSeek R1 on Amazon SageMaker AI
Nov 22
2024
Building RAG-based applications with AWS Amplify AI Kit and Neon Postgres
May 13
2025
Build scalable containerized RAG based generative AI applications in AWS using Amazon EKS with Amazon Bedrock
Feb 19
2025
Well-rounded technical architecture for a RAG implementation on AWS

The AWS News Feed is currently looking for silver sponsors. If you want to support the AWS community and reach a large audience of AWS professionals, consider sponsoring the AWS News Feed.