Skip to content
Longterm Wiki
Search
Entities
Research
Policy
Sources
FactBase
About
Internal
Search
⌘K
Research Areas
/
Mechanistic Interpretability
/
Toy Models for Interpretability
Toy Models for Interpretability
Interpretability
active
Data
Small simplified model proxies that capture key deep learning dynamics for interpretability research.
Organizations
2
Key Papers
2
Cluster:
Interpretability
Parent Area:
Mechanistic Interpretability
Tags
function:assurance
scope:technique