Metadata
| Source Table | benchmark_results |
| Source ID | aIFYxY4ywm |
| Source URL | huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard/discussions/30 |
| Parent | TruthfulQA |
| Children | — |
| Created | Apr 24, 2026, 7:07 PM |
| Updated | Apr 24, 2026, 7:07 PM |
| Synced | Apr 24, 2026, 7:07 PM |
Record Data
id | aIFYxY4ywm |
benchmarkId | hCOa5gx2L7 |
modelId | GPT-3.5 Turbo(ai-model) |
score | 47 |
unit | percent |
date | 2023-03-15 |
sourceUrl | huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard/discussions/30 |
notes | 0-shot evaluation from GPT-3.5 technical report |
Source Check Verdicts
confirmed95% confidence
Last checked: 4/24/2026
Inline sourcing: confirmed
Debug info
Thing ID: aIFYxY4ywm
Source Table: benchmark_results
Source ID: aIFYxY4ywm
Parent Thing ID: hCOa5gx2L7