What it does
Mechanistic interpretability research has produced a growing taxonomy of attention head “circuits” — patterns of behavior that recur across models. This atlas catalogues them with concrete examples you can inspect: which head, which layer, which input, and what the pattern means.
What you’ll see
- Browsable gallery of attention patterns, grouped by circuit type
- Side-by-side comparison across Llama, Mistral, Qwen, and Gemma at matched scales
- Layer-by-layer evolution: watch a circuit emerge across depth
- Direct links to the source papers and reproducible code
Status
Planned — data collection is underway. If you want to contribute patterns from a specific model, open an issue.