Benchmark Result · I4H3zQ7VJK · Record

sid_tppPAkJqjQ / HumanEval: 92

Verdict: confirmed · 99%

1 check · 4/29/2026

1 → confirmed

Our claim

entire record

Benchmark: vxX2rorgxU
Model: Claude Opus 4.5
Score: 92
Unit: percent
Date: November 24, 2025
Source URL: https://automatio.ai/models/claude-opus-4-5
Notes: HumanEval - Python function implementation benchmark
Tested By: unknown

Source evidence

1 src · 1 check

automatio.ai/models/claude-opus-4-5

confirmed · 99% · inline-submission · 4/24/2026

Case № I4H3zQ7VJK · Filed 4/29/2026 · Confidence 99%