Evaluation Awareness
Status: active
Studying how AI systems might game evaluations by detecting when they are being tested.
Cluster: Evaluation
Parent Area: AI Evaluations
Tags
function:assurance, scope:problem