Gemini 3.1 Pro on SWE-bench Verified: 80.6%

benchmark-result

Metadata

`id`	ZRWT2IUJU3
`benchmarkId`	WOSlsBTTmV
`modelId`	Gemini(ai-model)
`score`	80.6
`unit`	percent
`date`	2026-02-19
`sourceUrl`	—
`notes`	Gemini 3.1 Pro on SWE-bench Verified
`testedBy`	unknown
`testedByOrgId`	—
`evaluationDate`	—
`methodologyNotes`	—

confirmed (98% confidence)

Last checked: 2026-04-24

Inline sourcing: confirmed
