Skip to content
Longterm Wiki
benchmark-result

Llama on HumanEval: 89

Child of HumanEval

Metadata

Source Tablebenchmark_results
Source IDnpuCCe8ZKX
Source URLwww.ibm.com/think/news/meta-releases-llama-3-1-models-405b-parameter-variant
ParentHumanEval
Children
CreatedApr 24, 2026, 7:16 PM
UpdatedApr 24, 2026, 7:16 PM
SyncedApr 24, 2026, 7:16 PM

Record Data

idnpuCCe8ZKX
benchmarkIdvxX2rorgxU
modelIdLlama(ai-model)
score89
unitpass@1
date2024-07-23
sourceUrlwww.ibm.com/think/news/meta-releases-llama-3-1-models-405b-parameter-variant
notesLlama 3.1 405B Instruct, 0-shot evaluation

Source Check Verdicts

confirmed95% confidence

Last checked: 4/24/2026

Inline sourcing: confirmed

Debug info

Thing ID: npuCCe8ZKX

Source Table: benchmark_results

Source ID: npuCCe8ZKX

Parent Thing ID: vxX2rorgxU