Skip to content
Longterm Wiki
Index
Benchmark Result·tVEnqaUPCi·Record·Profile

sid_kWPQCvjKSg / MATH: 73.8

Verdictconfirmed99%
1 check · 4/24/2026

Inline sourcing: confirmed

Our claim

entire record
Benchmark
q6rR1sbyZG
Model
Llama
Score
73.8
Unit
percent
Date
July 23, 2024
Notes
Llama 3.1 405B Instruct, 0-shot chain-of-thought

Source evidence

1 src · 1 check
Case № tVEnqaUPCiFiled 4/24/2026Confidence 99%