Index
sid_bFjrDfX8rQ / TruthfulQA: 47
Verdictconfirmed95%
1 check · 4/24/2026Inline sourcing: confirmed
Our claim
entire record- Benchmark
- hCOa5gx2L7
- Model
- GPT-3.5 Turbo
- Score
- 47
- Unit
- percent
- Date
- March 15, 2023
- Notes
- 0-shot evaluation from GPT-3.5 technical report
Source evidence
1 src · 1 checkconfirmed95%inline-submission · 4/24/2026
Case № aIFYxY4ywmFiled 4/24/2026Confidence 95%