AI Glossary

What is Retrieval-Augmented Generation?

Retrieval-Augmented Generation (RAG) is an AI architecture pattern that improves LLM responses by first retrieving relevant documents from an external knowledge base, then supplying those documents as context when the model generates an answer. RAG mitigates, but does not eliminate, key LLM limitations including hallucination, outdated knowledge, and missing domain-specific knowledge. The retrieval step typically converts the query and the documents into vector embeddings and uses similarity search to find the closest passages, which are then inserted into the LLM prompt. RAG is widely used in enterprise AI applications where accuracy and up-to-date information are critical.
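
The retrieve-then-prompt flow described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not a production design: the `DOCUMENTS` list, the helper names, and the prompt wording are all hypothetical, and the bag-of-words `embed` function is a toy stand-in for a learned embedding model. The final call to an LLM is left out, since it depends on the provider.

```python
import math
import re
from collections import Counter

# Hypothetical in-memory knowledge base; a real system would typically
# keep embeddings in a vector database instead.
DOCUMENTS = [
    "The refund window for hardware purchases is 30 days.",
    "Support is available by email on weekdays.",
    "Firmware updates are released every quarter.",
]


def embed(text):
    """Toy embedding: word counts (assumption: stands in for a learned model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[term] * b[term] for term in a if term in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def retrieve(query, documents, k=2):
    """Return the k documents most similar to the query."""
    query_vec = embed(query)
    ranked = sorted(
        documents,
        key=lambda doc: cosine_similarity(query_vec, embed(doc)),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(query, passages):
    """Insert the retrieved passages into the prompt as context."""
    context = "\n".join(f"- {p}" for p in passages)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}"
    )


query = "How many days is the refund window?"
passages = retrieve(query, DOCUMENTS, k=1)
prompt = build_prompt(query, passages)
print(prompt)  # This string would then be sent to an LLM of your choice.
```

Because the knowledge lives in `DOCUMENTS` rather than in model weights, updating what the system knows only requires editing that collection and re-embedding it.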
Frequently Asked Questions

What is RAG in AI?

RAG (Retrieval-Augmented Generation) is a technique that improves AI responses by retrieving relevant documents from a knowledge base before generating an answer, reducing hallucinations and enabling access to current information.

Why is RAG important?

RAG grounds LLM responses in retrieved source documents, which reduces hallucinations and lets AI systems draw on proprietary or up-to-date information without retraining the model.
