Longterm Wiki

MMLU

Knowledge

Massive Multitask Language Understanding (MMLU) is a multiple-choice benchmark covering 57 academic subjects, ranging from STEM to the humanities.

Models Tested: 0
Scoring: accuracy
Introduced: 2021-01
Maintainer: Dan Hendrycks et al.
No model scores recorded for this benchmark yet.
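
For reference, accuracy on a multiple-choice benchmark is the fraction of questions where the model's chosen option matches the answer key. The sketch below is illustrative only and is not the maintainers' evaluation code. The function name `accuracy`, the letter labels, and the sample data are all hypothetical.

```python
from typing import Sequence


def accuracy(predictions: Sequence[str], answers: Sequence[str]) -> float:
    """Return the fraction of questions whose predicted choice matches the key."""
    if len(predictions) != len(answers):
        raise ValueError("predictions and answers must have the same length")
    if not answers:
        raise ValueError("cannot score an empty set of questions")
    # Count exact matches between each predicted label and the correct label.
    correct = sum(p == a for p, a in zip(predictions, answers))
    return correct / len(answers)


# Hypothetical data: four questions with choices labelled A to D.
example_predictions = ["A", "C", "B", "D"]
example_answers = ["A", "C", "D", "D"]
example_score = accuracy(example_predictions, example_answers)  # 3 of 4 match
```

Published results may aggregate per-subject accuracies differently (for example, averaging over subjects rather than over all questions), so check the aggregation used before comparing this kind of score with reported numbers.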