Claude 3.7 Sonnet on MMLU-Pro: 78.4

benchmark-result

Entity profile Source checks

Child of MMLU-Pro

Metadata

Source Table	`benchmark_results`
Source ID	`K7lJ2nPqW3`
Parent	MMLU-Pro
Children	—
Created	Apr 24, 2026, 7:38 PM
Updated	Apr 24, 2026, 7:38 PM
Synced	Apr 24, 2026, 7:38 PM

Record Data

`id`	K7lJ2nPqW3
`benchmarkId`	3PM0ZfVJxU
`modelId`	Claude 3.7 Sonnet(ai-model)
`score`	78.4
`unit`	percent
`date`	2025-02-24
`sourceUrl`	—
`notes`	MMLU-Pro benchmark
`testedBy`	unknown
`testedByOrgId`	—
`evaluationDate`	—
`methodologyNotes`	—

Source Check Verdicts

confirmed99% confidence

Last checked: 4/24/2026

Inline sourcing: confirmed

Debug info

Thing ID: K7lJ2nPqW3

Source Table: benchmark_results

Source ID: K7lJ2nPqW3

Parent Thing ID: 3PM0ZfVJxU