Skip to content
Longterm Wiki
Index
Citation·page:gwern:fn32

Gwern Branwen - Footnote 32

Verdictpartial75%
1 check · 4/3/2026

The claim mentions Gwern analyzing datasets like The Pile for model behavior impacts, which is supported by the source, but it overstates his role as an 'archivist'. The source mentions he explored the contents of datasets like The Pile, but doesn't explicitly call him an archivist. The claim mentions Everitt 2018's work on Bayesian history-based RL (superseded by Everitt & Hutter 2019), which is not mentioned in the source. The claim mentions addressing value alignment to avoid Goodhart's law failures like reward hacking, which is not explicitly mentioned in the source.

Our claim

entire record

No record data available.

Source evidence

1 src · 1 check
partial75%Haiku 4.5 · 4/3/2026

NoteThe claim mentions Gwern analyzing datasets like The Pile for model behavior impacts, which is supported by the source, but it overstates his role as an 'archivist'. The source mentions he explored the contents of datasets like The Pile, but doesn't explicitly call him an archivist. The claim mentions Everitt 2018's work on Bayesian history-based RL (superseded by Everitt & Hutter 2019), which is not mentioned in the source. The claim mentions addressing value alignment to avoid Goodhart's law failures like reward hacking, which is not explicitly mentioned in the source.

Case № page:gwern:fn32Filed 4/3/2026Confidence 75%