Longterm Wiki
benchmark-result

Claude Haiku 4.5 on SWE-bench Verified: 73.3

Child of SWE-bench Verified

Metadata

Source Table: benchmark_results
Source ID: 9iqsCKcMgg
Parent: SWE-bench Verified
Children:
Created: Apr 24, 2026, 6:42 PM
Updated: Apr 24, 2026, 6:42 PM
Synced: Apr 24, 2026, 6:42 PM

Record Data

id: 9iqsCKcMgg
benchmarkId: WOSlsBTTmV
modelId: Claude Haiku 4.5 (ai-model)
score: 73.3
unit: percent
date: 2025-10-15
sourceUrl:
notes: Score reported by Anthropic, averaged over 50 runs. One of the highest scores in its price tier. Tested via Anthropic's agent scaffold.

Source Check Verdicts

confirmed (98% confidence)

Last checked: 4/24/2026

Inline sourcing: confirmed
