Skip to content
Longterm Wiki
benchmark

Humanity's Last Exam

Metadata

Source Tablebenchmarks
Source IDXt4Dv7KAey
DescriptionA benchmark of 2,500+ expert-level questions across dozens of academic disciplines, designed to be the hardest public AI evaluation. Questions contributed by domain experts worldwide.
Source URLlastexam.ai/
Wiki IDhumanitys-last-exam
Children
CreatedMar 14, 2026, 12:43 AM
UpdatedMar 24, 2026, 11:24 PM
SyncedMar 24, 2026, 11:24 PM

Record Data

idXt4Dv7KAey
slughumanitys-last-exam
nameHumanity's Last Exam
categoryreasoning
descriptionA benchmark of 2,500+ expert-level questions across dozens of academic disciplines, designed to be the hardest public AI evaluation. Questions contributed by domain experts worldwide.
websitelastexam.ai/
scoringMethodaccuracy
higherIsBetterYes
introducedDate2025-01
maintainerScale AI / Center for AI Safety
sourcearxiv.org/abs/2501.14249
Debug info

Thing ID: Xt4Dv7KAey

Source Table: benchmarks

Source ID: Xt4Dv7KAey

Wiki ID: humanitys-last-exam