Skip to content
Longterm Wiki

Sleeper Agents Research

TeamActive
Anthropic·2024-01present·Wiki page →

Investigating whether AI systems can maintain hidden behaviors through training. Seminal paper on deceptive alignment.

No detailed data available for this division yet.

Sleeper Agents Research | Anthropic | Divisions | Longterm Wiki