
From Cosine Similarity to Anisotropy: Prerequisites and Hard Limits of Sentence-Level Embeddings
Sentence Transformers encode meaning as geometry. Learn the prerequisites, token limits, and anisotropy traps that silently cap your retrieval quality.
Embeddings and vector search are the data structures and algorithms behind semantic search — dense vector representations, similarity metrics, and indexing strategies that let machines retrieve by meaning instead of keywords.
Each topic below is a key concept in this domain. Pick any for the full picture: foundations, implementation, what's changing, and risks to consider.
Embeddings are dense vector representations that map words, sentences, or other data into continuous numerical spaces …
Multi-vector retrieval is a search approach that represents each document as multiple vectors rather than a single …
Sentence Transformers is a framework that uses contrastive learning and siamese networks to produce sentence-level …
Similarity search algorithms are the core mathematical methods used to find the nearest matching vectors in …
Vector indexing encompasses the data structures and algorithms that make approximate nearest-neighbor search practical …
MONA's articles build your mental model — how things work, why they work that way, and what intuition to develop.
Updated Mar 24, 2026
Concepts covered

Sentence Transformers turns transformers into sentence encoders via contrastive learning. Covers bi-encoders, loss functions, pooling, and hard negative mining.
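
As a rough preview of that bi-encoder workflow, here is a minimal sketch assuming the sentence-transformers package and the all-MiniLM-L6-v2 checkpoint (both illustrative choices, not ones the article prescribes):

```python
# Minimal bi-encoder sketch: encode texts independently, compare with cosine.
# Assumes `pip install sentence-transformers`; the model name is an illustrative choice.
from sentence_transformers import SentenceTransformer, util

model = SentenceTransformer("all-MiniLM-L6-v2")  # maps text to one fixed-size vector

query = ["How do I reset my password?"]
docs = [
    "To change your password, open account settings and choose Security.",
    "Our office is closed on public holidays.",
]

# Each text is encoded on its own (bi-encoder), so document vectors can be precomputed.
q_emb = model.encode(query, normalize_embeddings=True)
d_emb = model.encode(docs, normalize_embeddings=True)

# With normalized vectors, cosine similarity reduces to a dot product.
print(util.cos_sim(q_emb, d_emb))  # the first document should score higher
```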

Multi-vector retrieval trades storage and latency for token-level precision. Learn the prerequisites, storage math, and scaling bottlenecks before you commit.
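
To preview the storage math, here is a rough comparison; the corpus size, token length, and dimensionalities below are illustrative assumptions, not figures from the article:

```python
# Rough storage comparison: one vector per document vs. one vector per token.
# Corpus size, token counts, and dimensionalities are illustrative assumptions.
docs = 10_000_000
avg_tokens = 200
bytes_per_float = 4

single_dim = 768   # a common single-vector embedding size
multi_dim = 128    # per-token embeddings are usually kept smaller

single = docs * single_dim * bytes_per_float
multi = docs * avg_tokens * multi_dim * bytes_per_float

print(f"single-vector: {single / 1e9:,.0f} GB")  # ~31 GB
print(f"multi-vector:  {multi / 1e9:,.0f} GB")   # ~1,024 GB
print(f"blow-up: {multi / single:.0f}x")         # ~33x before any compression
```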

Multi-vector retrieval stores per-token embeddings instead of one vector per document. Learn how ColBERT MaxSim scoring preserves the nuance that single-vector dense search destroys.
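
The MaxSim score itself is compact enough to sketch. A toy NumPy version, with random matrices standing in for real ColBERT token embeddings:

```python
# Toy MaxSim (late-interaction) scoring with random stand-in embeddings.
import numpy as np

rng = np.random.default_rng(0)
dim = 128
Q = rng.normal(size=(8, dim))    # 8 query-token embeddings
D = rng.normal(size=(40, dim))   # 40 document-token embeddings

# Normalize so dot products are cosine similarities.
Q /= np.linalg.norm(Q, axis=1, keepdims=True)
D /= np.linalg.norm(D, axis=1, keepdims=True)

# For each query token, keep its best-matching document token, then sum over the query.
sim = Q @ D.T                    # (8, 40) token-to-token similarities
score = sim.max(axis=1).sum()
print(score)
```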

Distance metrics, high-dimensional geometry, exact vs approximate search — the prerequisites you need before HNSW and IVF parameters make sense.

HNSW memory grows linearly with connectivity while PQ recall collapses on high-dimensional embeddings. Learn where vector indexing limits hit at billion scale.
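
A back-of-the-envelope version of that memory claim, with assumed (not benchmarked) parameter values:

```python
# Rough HNSW memory estimate per vector: raw float32 values plus neighbor links.
# Every number here is an illustrative assumption, not a measurement.
dim = 768           # embedding dimensionality
M = 32              # HNSW connectivity parameter
bytes_per_float = 4
bytes_per_link = 4  # one 32-bit neighbor id

vector_bytes = dim * bytes_per_float
link_bytes = 2 * M * bytes_per_link  # base layer holds up to 2*M links; upper layers add a few percent more
per_vector = vector_bytes + link_bytes

print(per_vector, "bytes per vector")  # ~3.3 KB at these settings; doubling M adds another 256 bytes
for n in (1_000_000, 100_000_000, 1_000_000_000):
    print(f"{n:>13,} vectors ~ {n * per_vector / 1e9:,.0f} GB of RAM")
```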

Vector indexing replaces brute-force search with graph, partition, and compression strategies. Learn how HNSW, IVF, and product quantization actually work.
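
For a feel of how those strategies look in code, here is a minimal sketch using FAISS; the library choice and every parameter below are illustrative assumptions, not tuned recommendations:

```python
# Brute force vs. IVF vs. HNSW on synthetic data (assumes `pip install faiss-cpu numpy`).
import faiss
import numpy as np

d, n = 128, 50_000
rng = np.random.default_rng(0)
xb = rng.normal(size=(n, d)).astype("float32")  # database vectors
xq = rng.normal(size=(5, d)).astype("float32")  # query vectors

# Exact brute-force baseline.
flat = faiss.IndexFlatL2(d)
flat.add(xb)

# IVF: partition the space into coarse cells, probe only a few of them per query.
quantizer = faiss.IndexFlatL2(d)
ivf = faiss.IndexIVFFlat(quantizer, d, 256)
ivf.train(xb)
ivf.add(xb)
ivf.nprobe = 8

# HNSW: a layered proximity graph; no training step, M controls connectivity.
hnsw = faiss.IndexHNSWFlat(d, 32)
hnsw.add(xb)

for name, index in [("flat", flat), ("ivf", ivf), ("hnsw", hnsw)]:
    distances, ids = index.search(xq, 5)  # 5 nearest neighbors per query
    print(name, ids[0])
```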

High-dimensional similarity search faces hard mathematical limits. Explore the curse of dimensionality, recall-speed tradeoffs, and when brute force wins.
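
One of those limits, distance concentration, is easy to see empirically. A small NumPy experiment with arbitrarily chosen dimensions:

```python
# Distance concentration: as dimensionality grows, the nearest and farthest
# neighbors of a random query become nearly indistinguishable.
import numpy as np

rng = np.random.default_rng(0)
n = 2_000

for d in (2, 10, 100, 1_000):
    points = rng.uniform(size=(n, d))
    query = rng.uniform(size=d)
    dists = np.linalg.norm(points - query, axis=1)
    contrast = (dists.max() - dists.min()) / dists.min()
    print(f"d={d:>5}: relative contrast {contrast:.2f}")  # shrinks as d grows
```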

Dense vs. sparse embeddings encode meaning differently. Learn how cosine similarity, dot product, and Euclidean distance shape search — and where vectors fail.
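
A quick NumPy illustration of how metric choice changes rankings; the toy 2-D vectors are chosen purely for illustration:

```python
# Cosine ignores magnitude, dot product rewards it, Euclidean distance mixes both.
import numpy as np

query = np.array([1.0, 1.0])
a = np.array([2.0, 2.0])   # same direction as the query, larger magnitude
b = np.array([0.9, 1.2])   # slightly different direction, similar magnitude

def cosine(u, v):
    return float(u @ v) / (np.linalg.norm(u) * np.linalg.norm(v))

for name, v in [("a", a), ("b", b)]:
    print(
        name,
        f"cosine={cosine(query, v):.3f}",
        f"dot={float(query @ v):.3f}",
        f"euclidean={np.linalg.norm(query - v):.3f}",
    )
# Cosine ranks a first (identical direction), Euclidean ranks b first (closer point),
# and dot product ranks a first largely because it is longer.
```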

Similarity search combines distance metrics, index structures, and quantization. Learn how HNSW, IVF, LSH, and product quantization trade accuracy for speed at scale.

Similarity search algorithms find matching vectors by measuring geometric distance, not keywords. Learn how HNSW, PQ, and metric choice shape retrieval quality.

Embeddings turn words into vector coordinates where distance equals meaning. Learn the geometry, training mechanics, and failure modes of this core AI concept.