← Glossary

RAG (Retrieval-Augmented Generation)

Technique

Definition

An architecture that enhances LLM outputs by first retrieving relevant documents from a knowledge base (via vector search) and injecting them into the prompt. Grounds the model in external, up-to-date facts without requiring retraining.