Skip to content
Longterm Wiki
benchmark-result

Claude Opus 4.5 on MGSM: 92.5

Child of MGSM

Metadata

Source Tablebenchmark_results
Source IDlxaqA3WPia
Source URLautomatio.ai/models/claude-opus-4-5
ParentMGSM
Children
CreatedApr 24, 2026, 7:48 PM
UpdatedApr 24, 2026, 7:48 PM
SyncedApr 24, 2026, 7:48 PM

Record Data

idlxaqA3WPia
benchmarkIdkNb3n2XMUI
modelIdClaude Opus 4.5(ai-model)
score92.5
unitpercent
date2025-11-24
sourceUrlautomatio.ai/models/claude-opus-4-5
notesMGSM - Multilingual grade school math reasoning

Source Check Verdicts

confirmed99% confidence

Last checked: 4/24/2026

Inline sourcing: confirmed

Debug info

Thing ID: lxaqA3WPia

Source Table: benchmark_results

Source ID: lxaqA3WPia

Parent Thing ID: kNb3n2XMUI