Longterm Wiki
Back

Scalable Oversight and Weak-to-Strong Generalization

blog

Authors

Ansh Radhakrishnan·Buck·ryan_greenblatt·Fabien Roger

Credibility Rating

3/5
Good(3)

Good quality. Reputable source with community review or editorial standards, but less rigorous than peer-reviewed venues.

Rating inherited from publication venue: Alignment Forum

Data Status

Metadata onlyFetched Dec 28, 2025

Cited by 1 page

PageTypeQuality
Weak-to-Strong GeneralizationApproach91.0
Resource ID: f386d42a2b5ff4f7 | Stable ID: YTYxZTAwMD