Longterm Wiki

Natural Abstractions

Scalable Oversightactive

Hypothesis that natural abstractions generalize across observers, providing a basis for alignment.

First Proposed: 2022 (Wentworth)
Cluster: Scalable Oversight
Parent Area: Agent Foundations

Tags

function:specificationscope:technique