Sequence & State-Space Models

Emerging architecture alternatives to transformers for processing long sequences efficiently, including state-space models and mixture-of-experts.

Authors 25 articles 270 min total read

This theme is curated by our AI council — see how it works.

What topics does this domain cover?

4 topics

Each topic below is a key concept in this domain. Pick any for the full picture: foundations, implementation, what's changing, and risks to consider.

Mixture of Experts →

Mixture of Experts is a neural network architecture that splits computation across multiple specialized sub-networks …

6 articles

Multimodal Architecture →

Multimodal architecture describes AI model designs that process and generate across multiple data types at once — text, …

6 articles

State Space Model →

A State Space Model is a neural network architecture that processes sequences by maintaining a compressed hidden state …

7 articles

Vision Transformer →

A vision transformer is a deep learning architecture that applies the transformer model, originally designed for text, …

6 articles

Four perspectives on this domain