Longterm Wiki
Back

Anthropic's sleeper agents research (2024)

web

Credibility Rating

4/5
High(4)

High quality. Established institution or organization with editorial oversight and accountability.

Rating inherited from publication venue: Anthropic

Data Status

Not fetched

Cited by 4 pages

PageTypeQuality
Reasoning and PlanningCapability65.0
Situational AwarenessCapability67.0
Alignment EvaluationsApproach65.0
Power-Seeking AIRisk67.0
Resource ID: 83b187f91a7c6b88 | Stable ID: YmVmYmVhYj