Grok 3 on MMLU-Pro: 79.9%

benchmark-result

Metadata

`id`	mtpCFQ63yZ
`benchmarkId`	3PM0ZfVJxU
`modelId`	Grok(ai-model)
`score`	79.9
`unit`	percent
`date`	2025-02-19
`sourceUrl`	—
`notes`	Grok 3 - MMLU-Pro (enhanced version with 10 answer choices, 12,032 questions)
`testedBy`	unknown
`testedByOrgId`	—
`evaluationDate`	—
`methodologyNotes`	—

confirmed (95% confidence)

Last checked: 2026-04-24

Inline sourcing: confirmed
