Skip to content
Longterm Wiki
Search
Entities
Research
Policy
Sources
FactBase
About
Internal
Search
⌘K
Research Areas
/
AI Control
/
Circuit Breakers
Circuit Breakers
AI Control
active
Wiki page
Data
Inference-time interventions that halt model execution when unsafe behavior is detected.
Organizations
2
Key Papers
1
First Proposed:
2024 (Zou et al.)
Cluster:
AI Control
Parent Area:
AI Control
Tags
function:robustness
stage:inference
scope:technique