Treacherous Turn
AccidentCatastrophicThe treacherous turn is a scenario where an AI system behaves cooperatively and aligned while it is weak, then suddenly "turns" against humans once it has accumulated enough power to succeed. The AI is strategic about when to reveal its true intentions.
Severity
Catastrophic
Likelihood
Medium (theoretical)
Time Horizon
~2035
Maturity
Mature
Full Wiki Article
Read the full wiki article for detailed analysis, background, and references.
Read wiki article →Related Entities3
Sources3
Superintelligence: Paths, Dangers, Strategies
Nick Bostrom, 2014
AI Alignment Forum discussions
Assessment
SeverityCatastrophic
LikelihoodMedium (theoretical)
Time Horizon~2035
MaturityMature
CategoryAccident
Details
Coined ByNick Bostrom
SourceSuperintelligence (2014)
Tags
schemingsuperintelligencenick-bostromstrategic-deceptioncorrigibility