Skip to content
Longterm Wiki
benchmark-result

DeepSeek Models on BBH: 87.5

Child of BBH

Metadata

Source Tablebenchmark_results
Source IDf2cJ0hQVyI
ParentBBH
Children
CreatedApr 24, 2026, 6:54 PM
UpdatedApr 24, 2026, 6:54 PM
SyncedApr 24, 2026, 6:54 PM

Record Data

idf2cJ0hQVyI
benchmarkIdjp1Xu4jbIy
modelIdDeepSeek Models(ai-model)
score87.5
unitpercent
date2024-12-01
sourceUrl
notesDeepSeek-V3 BBH accuracy, 3-shot evaluation

Source Check Verdicts

confirmed99% confidence

Last checked: 4/24/2026

Inline sourcing: confirmed

Debug info

Thing ID: f2cJ0hQVyI

Source Table: benchmark_results

Source ID: f2cJ0hQVyI

Parent Thing ID: jp1Xu4jbIy