Longterm Wiki
benchmark

SimpleQA

Metadata

Source Table: benchmarks
Source ID: 1O19f6j13Z
Description: A factual question-answering benchmark from OpenAI testing short, fact-seeking questions with verifiable answers. Evaluates factual accuracy and calibration.
Wiki ID: simpleqa
Children:
Created: Mar 14, 2026, 12:43 AM
Updated: Mar 24, 2026, 11:24 PM
Synced: Mar 24, 2026, 11:24 PM

Record Data

id: 1O19f6j13Z
slug: simpleqa
name: SimpleQA
category: knowledge
description: A factual question-answering benchmark from OpenAI testing short, fact-seeking questions with verifiable answers. Evaluates factual accuracy and calibration.
website:
scoringMethod: accuracy
higherIsBetter: Yes
introducedDate: 2024-10
maintainer: OpenAI
source: openai.com/index/introducing-simpleqa/