Longterm Wiki
benchmark-result

GPT-3.5 Turbo on WinoGrande: 81.6

Child of WinoGrande

Metadata

Source Table: benchmark_results
Source ID: ZuHn95HDWq
Source URL: huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard/discussions/30
Parent: WinoGrande
Children
Created: Apr 24, 2026, 7:07 PM
Updated: Apr 24, 2026, 7:07 PM
Synced: Apr 24, 2026, 7:07 PM

Record Data

id: ZuHn95HDWq
benchmarkId: pIjI7HLxmb
modelId: GPT-3.5 Turbo (ai-model)
score: 81.6
unit: percent
date: 2023-03-15
sourceUrl: huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard/discussions/30
notes: 5-shot evaluation; the GPT-3.5 figure is reported in the GPT-4 technical report
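
For readers who want to work with this record programmatically, the fields above can be sketched as a small typed structure. This is only an illustrative sketch: the class name `BenchmarkResult`, the snake_case field names, and the `as_fraction` helper are assumptions made here, not part of the wiki's actual schema or API, and only a subset of the fields is modeled.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class BenchmarkResult:
    """Hypothetical container mirroring a subset of the Record Data fields."""

    id: str
    benchmark_id: str
    model_id: str
    score: float
    unit: str
    result_date: date
    source_url: str

    def as_fraction(self) -> float:
        """Return the score on a 0-1 scale; only defined for percent units."""
        if self.unit != "percent":
            raise ValueError(f"cannot convert unit {self.unit!r} to a fraction")
        return self.score / 100.0


# Values copied from the record above.
record = BenchmarkResult(
    id="ZuHn95HDWq",
    benchmark_id="pIjI7HLxmb",
    model_id="GPT-3.5 Turbo",
    score=81.6,
    unit="percent",
    result_date=date(2023, 3, 15),
    source_url="huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard/discussions/30",
)
```

Keeping `unit` alongside `score` makes the scale explicit, so results recorded as percentages are not silently mixed with results recorded on a 0-1 scale.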

Source Check Verdicts

confirmed (95% confidence)

Last checked: 4/24/2026

Inline sourcing: confirmed
