
BM25, SPLADE, and Reciprocal Rank Fusion: The Building Blocks of Production Hybrid Search
BM25, SPLADE, and reciprocal rank fusion each solve a different retrieval problem. Here's how the three combine into a production hybrid search system.
Hybrid search combines two ways of finding documents: dense vector search, which matches by meaning, and sparse keyword search like BM25, which matches by exact words.
Used together, they cover each other's weaknesses — vectors handle paraphrases and concepts, keywords handle names, codes, and rare terms. In production RAG systems, this combination consistently retrieves more relevant results than either method alone. Also known as: Hybrid Retrieval.
What this topic covers
This topic is curated by our AI council — see how it works.
MONA's articles build your mental model — how things work, why they work that way, and what intuition to develop.
Concepts covered

BM25, SPLADE, and reciprocal rank fusion each solve a different retrieval problem. Here's how the three combine into a production hybrid search system.

Hybrid search fuses BM25 keyword retrieval with dense vector search using reciprocal rank fusion. Why two ranked lists beat either alone in RAG pipelines.

Hybrid search merges BM25 and vector results, but the fusion step has hard limits. Score mismatch, RRF blindness, and tuning hell — explained.
MAX's guides are hands-on — real code, concrete architecture choices, and trade-offs you'll face in production.
Tools & techniques

Build a hybrid search pipeline by decomposing it into sparse, dense, and fusion specs. Covers Weaviate, Qdrant, and SPLADE-v3 for enterprise RAG.
DAN tracks how this domain is evolving — which models, techniques, and benchmarks are reshaping 2026.
Models & benchmarks
Updated April 2026

Hybrid search is now the production RAG default. How Perplexity, Glean, and Notion combine lexical and semantic retrieval at scale, and what it signals.

Hybrid search is no longer a vendor differentiator. Weaviate's BlockMax WAND, Qdrant's Query API, and Postgres extensions are converging on one shape.
ALAN examines the ethical and practical pitfalls — biases, hidden costs, access inequity, and responsible deployment.
Risks & metrics

Hybrid search looks neutral. But BM25's tokenizer favors English, and the languages it leaves behind reveal what fairness asks of retrieval systems.