Longterm Wiki

Evaluation Awareness

Evaluationactive

Studying how AI systems might game evaluations by detecting when they are being tested.

Cluster: Evaluation
Parent Area: AI Evaluations

Tags

function:assurancescope:problem