Skip to content
Longterm Wiki
benchmark-result

Grok on MMLU-Pro: 79.9

Child of MMLU-Pro

Metadata

Source Tablebenchmark_results
Source IDmtpCFQ63yZ
ParentMMLU-Pro
Children
CreatedApr 24, 2026, 7:13 PM
UpdatedApr 24, 2026, 7:13 PM
SyncedApr 24, 2026, 7:13 PM

Record Data

idmtpCFQ63yZ
benchmarkId3PM0ZfVJxU
modelIdGrok(ai-model)
score79.9
unitpercent
date2025-02-19
sourceUrl
notesGrok 3 - MMLU-Pro (enhanced version with 10 answer choices, 12,032 questions)

Source Check Verdicts

confirmed95% confidence

Last checked: 4/24/2026

Inline sourcing: confirmed

Debug info

Thing ID: mtpCFQ63yZ

Source Table: benchmark_results

Source ID: mtpCFQ63yZ

Parent Thing ID: 3PM0ZfVJxU