LLMsVideo TranslationImage GenerationVideo Generation
AI News

Independent coverage of the latest AI tool updates, releases, and comparisons.

Categories

  • AI LLMs
  • AI Video Translation
  • AI Image Generation
  • AI Video Generation

Company

  • About
  • Contact

Resources

  • Sitemap
  • AI Glossary
  • Tool Comparisons
  • Facts / Grounding
  • llms.txt
  • XML Sitemap
© 2026 AI News. Independent editorial coverage. Not affiliated with any AI company.
Home/Glossary/Retrieval-Augmented Generation (RAG)
AI LLMs

Retrieval-Augmented Generation (RAG)

Definition

Retrieval-augmented generation is a technique that combines a language model with an external knowledge retrieval system to produce more accurate, up-to-date responses. Instead of relying solely on trained parameters, RAG fetches relevant documents from a vector database before generating an answer. This approach reduces hallucinations and enables models to reference proprietary or recent data.

How It Works

When a user submits a query, an embedding model converts it into a vector that is compared against a database of pre-indexed document chunks using similarity search. The most relevant chunks are inserted into the LLM's context window alongside the original query. The model then generates a response grounded in the retrieved evidence, often citing specific sources.

Key Tools

GPT (OpenAI)Industry-leading large language models powering ChatGPT
$20/mo (ChatGPT Plus)
Claude (Anthropic)Safe, helpful AI assistant with extended context and reasoning
$20/mo (Pro)
Gemini (Google)Google's multimodal AI model family
$19.99/mo (Advanced)

Related Terms

Large Language Model (LLM)Prompt EngineeringAI Agent
← Back to AI Glossary