Skip to content
Longterm Wiki
benchmark-result

Claude 3.7 Sonnet on MMLU-Pro: 78.4

Child of MMLU-Pro

Metadata

Source Tablebenchmark_results
Source IDK7lJ2nPqW3
ParentMMLU-Pro
Children
CreatedApr 24, 2026, 7:38 PM
UpdatedApr 24, 2026, 7:38 PM
SyncedApr 24, 2026, 7:38 PM

Record Data

idK7lJ2nPqW3
benchmarkId3PM0ZfVJxU
modelIdClaude 3.7 Sonnet(ai-model)
score78.4
unitpercent
date2025-02-24
sourceUrl
notesMMLU-Pro benchmark

Source Check Verdicts

confirmed99% confidence

Last checked: 4/24/2026

Inline sourcing: confirmed

Debug info

Thing ID: K7lJ2nPqW3

Source Table: benchmark_results

Source ID: K7lJ2nPqW3

Parent Thing ID: 3PM0ZfVJxU