Skip to content
Longterm Wiki
benchmark-result

DeepSeek Models on DROP: 91.6

Child of DROP

Metadata

Source Tablebenchmark_results
Source IDMiLn2eVl9N
ParentDROP
Children
CreatedApr 24, 2026, 6:54 PM
UpdatedApr 24, 2026, 6:54 PM
SyncedApr 24, 2026, 6:54 PM

Record Data

idMiLn2eVl9N
benchmarkIdcejlbJN241
modelIdDeepSeek Models(ai-model)
score91.6
unitpercent
date2024-12-01
sourceUrl
notesDeepSeek-V3 DROP F1 score, 3-shot evaluation

Source Check Verdicts

confirmed99% confidence

Last checked: 4/24/2026

Inline sourcing: confirmed

Debug info

Thing ID: MiLn2eVl9N

Source Table: benchmark_results

Source ID: MiLn2eVl9N

Parent Thing ID: cejlbJN241