Index
sid_bFjrDfX8rQ / WinoGrande: 81.6
Verdictconfirmed95%
1 check · 4/24/2026Inline sourcing: confirmed
Our claim
entire record- Benchmark
- pIjI7HLxmb
- Model
- GPT-3.5 Turbo
- Score
- 81.6
- Unit
- percent
- Date
- March 15, 2023
- Notes
- 5-shot evaluation from GPT-3.5 technical report
Source evidence
1 src · 1 checkconfirmed95%inline-submission · 4/24/2026
Case № ZuHn95HDWqFiled 4/24/2026Confidence 95%