TruthfulQA

Safety

A benchmark of 817 questions designed to test whether language models generate truthful answers, specifically targeting common misconceptions and falsehoods that models tend to reproduce.

Wiki page →Data →

Models Tested

Scoring: accuracy

Introduced: 2021-09

Maintainer: Oxford

No model scores recorded for this benchmark yet.