benchmark
SimpleQA
Metadata
| Source Table | benchmarks |
| Source ID | 1O19f6j13Z |
| Description | A factual question-answering benchmark from OpenAI testing short, fact-seeking questions with verifiable answers. Evaluates factual accuracy and calibration. |
| Wiki ID | simpleqa |
| Children | — |
| Created | Mar 14, 2026, 12:43 AM |
| Updated | Mar 24, 2026, 11:24 PM |
| Synced | Mar 24, 2026, 11:24 PM |
Record Data
id | 1O19f6j13Z |
slug | simpleqa |
name | SimpleQA |
category | knowledge |
description | A factual question-answering benchmark from OpenAI testing short, fact-seeking questions with verifiable answers. Evaluates factual accuracy and calibration. |
website | — |
scoringMethod | accuracy |
higherIsBetter | Yes |
introducedDate | 2024-10 |
maintainer | OpenAI |
source | openai.com/index/introducing-simpleqa/ |
Debug info
Thing ID: 1O19f6j13Z
Source Table: benchmarks
Source ID: 1O19f6j13Z
Wiki ID: simpleqa