Longterm Wiki

Corrigibility

Scalable Oversightactive

Research on building AI systems that allow themselves to be corrected, modified, or shut down.

First Proposed: 2015 (Soares et al., MIRI)
Cluster: Scalable Oversight

Tags

function:specificationscope:field