Skip to content
Longterm Wiki
Search
Entities
Research
Policy
Sources
FactBase
About
Internal
Search
⌘K
Research Areas
/
Interpretability
/
Linear Probing
Linear Probing
Interpretability
active
Wiki page
Data
Lightweight interpretability using linear classifiers on model activations to detect features.
Organizations
3
Cluster:
Interpretability
Parent Area:
Interpretability
Tags
function:assurance
scope:technique