What Is RAG

Retrieval-Augmented Generation, shortened to RAG, is a method that fetches relevant information before a model generates its answer. The word breaks into two clear actions, and understanding each half makes the whole idea click quickly.

Breaking Down the Name

Word	Meaning
Retrieval	Search a knowledge source and pull out the relevant pieces
Augmented Generation	The model writes its answer using those pulled pieces as support

An Open-Book Exam Analogy

A closed-book exam forces a student to answer from memory alone. An open-book exam lets the student flip to the right page and quote the exact fact. RAG turns every model query into an open-book exam. The model still writes the final answer, but it writes with real material open in front of it, instead of relying purely on memory.

The Simple Flow

Why Not Just Paste Everything In?

A company might hold millions of pages of documents. A model cannot read all of them for every single question; the amount of text it can process at once has a hard limit. RAG solves this by searching first and only sending the small, relevant slice. This keeps answers accurate and keeps the process fast, since the model never wastes effort scanning unrelated material.

A Warehouse Instead of a Truckload

Without RAG	With RAG
Model guesses from training data alone	Model reads real documents before answering
Answers can be outdated	Answers reflect current documents
No source to check	Answer can point back to the source passage
Struggles with private company knowledge	Handles private knowledge fluently, once it is loaded in

An Everyday Example

A shopper asks a store's chatbot about a return policy. The chatbot searches the store's policy page, finds the return rules section, and hands that short section to the model. The model then writes a friendly reply using those exact rules. The shopper gets an accurate answer instead of a guess, and the store never had to train a brand new model just to teach it one policy page.

A Second Example: A Study Assistant

A student uploads their class notes into a RAG-powered study tool. The student asks, "What did the professor say about photosynthesis stages?" The tool searches the uploaded notes, finds the exact paragraph covering those stages, and hands it to the model. The model summarizes that paragraph clearly, staying true to what the professor actually said rather than pulling from generic textbook knowledge.

Previous lesson

Back to course

Next lesson