GPT-3.5 Turbo on GSM8K: 57.1%

benchmark-result

Metadata

Source Table	`benchmark_results`
Source ID	`YGWv3H9EOZ`
Source URL	huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard/discussions/30
Parent	GSM8K
Children	—
Created	Apr 24, 2026, 7:07 PM
Updated	Apr 24, 2026, 7:07 PM
Synced	Apr 24, 2026, 7:07 PM

`id`	YGWv3H9EOZ
`benchmarkId`	fjjBrOI3p2
`modelId`	GPT-3.5 Turbo (ai-model)
`score`	57.1
`unit`	percent
`date`	2023-03-15
`sourceUrl`	huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard/discussions/30
`notes`	5-shot evaluation; GPT-3.5 score as reported in the GPT-4 technical report
`testedBy`	unknown
`testedByOrgId`	—
`evaluationDate`	—
`methodologyNotes`	—

confirmed (95% confidence)

Last checked: Apr 29, 2026

1 → confirmed
