Skip to content
Longterm Wiki
Search
Entities
Research
Policy
Sources
FactBase
About
Internal
Search
⌘K
Research Areas
/
AI Evaluations
/
Reward Hacking of Human Oversight
Reward Hacking of Human Oversight
Evaluation
emerging
Data
Empirically investigating how AI systems deceive or manipulate human evaluators.
Organizations
4
Cluster:
Evaluation
Parent Area:
AI Evaluations
Tags
function:assurance
scope:technique