Skip to content
Longterm Wiki
benchmark

GPQA Diamond

Metadata

Source Tablebenchmarks
Source IDbdDmOTMoX8
DescriptionGraduate-level Google-Proof Q&A Diamond subset — extremely difficult questions in physics, chemistry, and biology that even domain experts struggle with.
Wiki IDgpqa-diamond
Children
CreatedMar 14, 2026, 12:43 AM
UpdatedMar 24, 2026, 11:24 PM
SyncedMar 24, 2026, 11:24 PM

Record Data

idbdDmOTMoX8
sluggpqa-diamond
nameGPQA Diamond
categoryreasoning
descriptionGraduate-level Google-Proof Q&A Diamond subset — extremely difficult questions in physics, chemistry, and biology that even domain experts struggle with.
website
scoringMethodaccuracy
higherIsBetterYes
introducedDate2023-11
maintainerDavid Rein et al.
sourcearxiv.org/abs/2311.12022
Debug info

Thing ID: bdDmOTMoX8

Source Table: benchmarks

Source ID: bdDmOTMoX8

Wiki ID: gpqa-diamond