Index
sid_kWPQCvjKSg / HumanEval: 89
Verdictconfirmed95%
1 check · 4/24/2026Inline sourcing: confirmed
Our claim
entire record- Benchmark
- vxX2rorgxU
- Model
- Llama
- Score
- 89
- Unit
- pass@1
- Date
- July 23, 2024
- Notes
- Llama 3.1 405B Instruct, 0-shot evaluation
Source evidence
1 src · 1 checkconfirmed95%inline-submission · 4/24/2026
Case № npuCCe8ZKXFiled 4/24/2026Confidence 95%