Metadata
| Source Table | benchmark_results |
| Source ID | ZrlsN6im3t |
| Parent | SWE-bench Verified |
| Children | — |
| Created | Apr 24, 2026, 7:31 PM |
| Updated | Apr 24, 2026, 7:31 PM |
| Synced | Apr 24, 2026, 7:31 PM |
Record Data
id | ZrlsN6im3t |
benchmarkId | WOSlsBTTmV |
modelId | Claude 3.5 Sonnet(ai-model) |
score | 49 |
unit | percent |
date | 2024-10-22 |
sourceUrl | — |
notes | Updated Claude 3.5 Sonnet, real-world software engineering (GitHub issues) |
Source Check Verdicts
confirmed98% confidence
Last checked: 4/24/2026
Inline sourcing: confirmed
Debug info
Thing ID: ZrlsN6im3t
Source Table: benchmark_results
Source ID: ZrlsN6im3t
Parent Thing ID: WOSlsBTTmV