Attention pattern atlas — Playground

What it does

Mechanistic interpretability research has produced a growing taxonomy of attention head “circuits” — patterns of behavior that recur across models. This atlas catalogues them with concrete examples you can inspect: which head, which layer, which input, and what the pattern means.

What you’ll see

Browsable gallery of attention patterns, grouped by circuit type
Side-by-side comparison across Llama, Mistral, Qwen, and Gemma at matched scales
Layer-by-layer evolution: watch a circuit emerge across depth
Direct links to the source papers and reproducible code

Status

Planned — data collection is underway. If you want to contribute patterns from a specific model, open an issue.