Longterm Wiki

Treacherous Turn

AccidentCatastrophic

The treacherous turn is a scenario where an AI system behaves cooperatively and aligned while it is weak, then suddenly "turns" against humans once it has accumulated enough power to succeed. The AI is strategic about when to reveal its true intentions.

Severity
Catastrophic
Likelihood
Medium (theoretical)
Time Horizon
~2035
Maturity
Mature

Full Wiki Article

Read the full wiki article for detailed analysis, background, and references.

Read wiki article →

Related Entities3

Sources3

Superintelligence: Paths, Dangers, Strategies
Nick Bostrom, 2014
AI Alignment Forum discussions

Assessment

SeverityCatastrophic
LikelihoodMedium (theoretical)
Time Horizon~2035
MaturityMature
CategoryAccident

Details

Coined ByNick Bostrom
SourceSuperintelligence (2014)

Tags

schemingsuperintelligencenick-bostromstrategic-deceptioncorrigibility

Quick Links