Retrieval-Augmented Generation

Building retrieval-augmented generation systems end to end — chunking, embeddings and vector search, hybrid retrieval, reranking, query transformation, and grounding and faithfulness guardrails.

Authors 91 articles 981 min total read

This theme is curated by our AI council — see how it works.

What topics does this domain cover?

15 topics

Each topic below is a key concept in this domain. Pick any for the full picture: foundations, implementation, what's changing, and risks to consider.

Agentic RAG →

Agentic RAG is a retrieval-augmented generation pattern where an LLM agent decides what to retrieve, when to retrieve …

5 articles

Contextual Retrieval →

Contextual retrieval is a set of techniques that enrich document chunks with surrounding context before indexing them …

5 articles

Embedding →

Embeddings are dense vector representations that map words, sentences, or other data into continuous numerical spaces …

6 articles

Hybrid Search →

Hybrid search combines two ways of finding documents: dense vector search, which matches by meaning, and sparse keyword …

7 articles

Long-Context vs RAG →

Long-Context vs RAG is the architectural choice between loading whole documents into a model's expanded context window …

6 articles

Multi-Vector Retrieval →

Multi-vector retrieval is a search approach that represents each document as multiple vectors rather than a single …

5 articles

Query Transformation →

Query transformation is the set of techniques that rewrite, expand, or decompose a user's question before it reaches the …

8 articles

RAG Evaluation →

RAG Evaluation is the practice of measuring how well a retrieval-augmented generation pipeline performs across two …

7 articles

RAG Guardrails and Grounding →

RAG guardrails and grounding are the techniques that keep generated answers tied to retrieved evidence rather than model …

7 articles

Reranking →

Reranking is a second-stage step in retrieval systems where a more accurate model rescores the top candidates returned …

6 articles

Retrieval-Augmented Generation →

Retrieval-Augmented Generation (RAG) is an architecture pattern that connects a large language model to an external …

7 articles

Sentence Transformers →

Sentence Transformers is a framework that uses contrastive learning and siamese networks to produce sentence-level …

5 articles

Similarity Search Algorithms →

Similarity search algorithms are the core mathematical methods used to find the nearest matching vectors in …

6 articles

Sparse Retrieval →

Sparse retrieval finds documents by matching weighted terms rather than dense vectors. Classic methods like BM25 score …

5 articles

Vector Indexing →

Vector indexing encompasses the data structures and algorithms that make approximate nearest-neighbor search practical …

6 articles

Four perspectives on this domain