MMLU
KnowledgeMassive Multitask Language Understanding — a multiple-choice benchmark covering 57 academic subjects from STEM to humanities.
Models Tested
0
Scoring: accuracy
Introduced: 2021-01
Maintainer: Dan Hendrycks et al.
No model scores recorded for this benchmark yet.