Theoretical Study of Inductive Biases
Scalable OversightemergingUnderstanding generalization properties and likelihood of scheming from training dynamics.
Cluster: Scalable Oversight
Tags
function:assurancescope:technique
Understanding generalization properties and likelihood of scheming from training dynamics.