Back
Anthropic's sleeper agents research (2024)
webCredibility Rating
4/5
High(4)High quality. Established institution or organization with editorial oversight and accountability.
Rating inherited from publication venue: Anthropic
Data Status
Not fetched
Cited by 4 pages
| Page | Type | Quality |
|---|---|---|
| Reasoning and Planning | Capability | 65.0 |
| Situational Awareness | Capability | 67.0 |
| Alignment Evaluations | Approach | 65.0 |
| Power-Seeking AI | Risk | 67.0 |
Resource ID:
83b187f91a7c6b88 | Stable ID: YmVmYmVhYj