Skip to main content
AI ArchitectureIntermediate

What is RAG (Retrieval-Augmented Generation)?

RAG combines a language model with a retrieval system, letting the AI search a knowledge base before answering — reducing hallucinations and keeping responses up to date.

TL;DR: RAG combines a language model with a retrieval system, letting the AI search a knowledge base before answering — reducing hallucinations and keeping responses up to date.

The Problem RAG Solves

Standard LLMs have a fixed training cutoff. Ask ChatGPT about last week's news and it either confabulates or says it doesn't know. RAG patches this by retrieving real documents at query time.

training cutoffconfabulationretrieval