Llama on HumanEval: 89

benchmark-result

Metadata

Source Table	`benchmark_results`
Source ID	`npuCCe8ZKX`
Source URL	www.ibm.com/think/news/meta-releases-llama-3-1-models-405b-parameter-variant
Parent	HumanEval
Children	—
Created	Apr 24, 2026, 7:16 PM
Updated	Apr 24, 2026, 7:16 PM
Synced	Apr 24, 2026, 7:16 PM

`id`	npuCCe8ZKX
`benchmarkId`	vxX2rorgxU
`modelId`	Llama(ai-model)
`score`	89
`unit`	pass@1
`date`	2024-07-23
`sourceUrl`	www.ibm.com/think/news/meta-releases-llama-3-1-models-405b-parameter-variant
`notes`	Llama 3.1 405B Instruct, 0-shot evaluation
`testedBy`	unknown
`testedByOrgId`	—
`evaluationDate`	—
`methodologyNotes`	—

confirmed95% confidence

Last checked: 4/29/2026

1 → confirmed

Debug info

Thing ID: npuCCe8ZKX

Source Table: benchmark_results

Source ID: npuCCe8ZKX

Parent Thing ID: vxX2rorgxU