Skip to content
Longterm Wiki
benchmark-result

Gemini on SWE-bench Verified: 80.6

Child of SWE-bench Verified

Metadata

Source Tablebenchmark_results
Source IDZRWT2IUJU3
ParentSWE-bench Verified
Children
CreatedApr 24, 2026, 6:59 PM
UpdatedApr 24, 2026, 6:59 PM
SyncedApr 24, 2026, 6:59 PM

Record Data

idZRWT2IUJU3
benchmarkIdWOSlsBTTmV
modelIdGemini(ai-model)
score80.6
unitpercent
date2026-02-19
sourceUrl
notesGemini 3.1 Pro on SWE-Bench Verified

Source Check Verdicts

confirmed98% confidence

Last checked: 4/24/2026

Inline sourcing: confirmed

Debug info

Thing ID: ZRWT2IUJU3

Source Table: benchmark_results

Source ID: ZRWT2IUJU3

Parent Thing ID: WOSlsBTTmV