Longterm Wiki

Co-authored Risks from Learned Optimization (2019) introducing mesa-optimization and deceptive alignment; led Sleeper Agents and Alignment Faking research at Anthropic; 3,400+ citations

Evan Hubinger: Notable For

Core Data

Entity: Evan Hubinger
Property: Notable For
Formatted Value: Co-authored Risks from Learned Optimization (2019) introducing mesa-optimization and deceptive alignment; led Sleeper Agents and Alignment Faking research at Anthropic; 3,400+ citations
Raw Value: Co-authored Risks from Learned Optimization (2019) introducing mesa-optimization and deceptive alignment; led Sleeper Agents and Alignment Faking research at Anthropic; 3,400+ citations
Value Type: text
Unit:
As Of: Mar 2026
Valid End:
Expired? No

Source

Source URL:
Source Quote:
Notes: Enriched from wiki page

Debug Info

Fact ID: f_eH6tN2pQ5v
Subject ID: XnqqKHiNmw
Property ID: notable-for
Derived From:
YAML File: packages/kb/data/things/XnqqKHiNmw.yaml