AI Safety via Debate
Scalable Oversight
active
Using adversarial debate between AI systems to help humans evaluate complex claims.
Tags
function:specification
scope:technique