AI Risks

Overview

This section documents the potential risks from advanced AI systems, organized into four major categories based on the source and nature of the risk.

Risk Categories

Accident Risks

Unintended failures from AI systems pursuing misaligned goals:

Scheming - AI strategically concealing misaligned goals
Deceptive Alignment - Models appearing aligned during training
Mesa-Optimization - Learned optimizers with misaligned objectives
Goal Misgeneralization - Objectives that fail in deployment
Power-Seeking - Instrumental convergence toward acquiring resources

Misuse Risks

Deliberate harmful applications of AI capabilities:

Bioweapons - AI-assisted biological weapon development
Cyberweapons - Automated cyber attacks and vulnerabilities
Disinformation - Large-scale manipulation campaigns
Autonomous Weapons - Lethal autonomous systems

Structural Risks

Systemic issues from how AI development is organized:

Racing Dynamics - Competitive pressure reducing safety investment
Concentration of Power - Dangerous accumulation of AI capabilities
Lock-in - Irreversible entrenchment of values or structures
Economic Disruption - Labor market and economic instability

Epistemic Risks

Threats to society's ability to know and reason:

Trust Decline - Erosion of institutional and interpersonal trust
Authentication Collapse - Inability to verify authentic content
Expertise Atrophy - Loss of human capability through AI dependence

How Risks Connect

Many risks interact and compound. For example:

Racing dynamics → reduced safety testing → higher accident risk
Disinformation → trust decline → reduced coordination capacity
Power concentration → lock-in potential → governance failures

See the Risk Interaction Matrix for detailed analysis.

AI Risks

Overview

Risk Categories

Accident Risks

Misuse Risks

Structural Risks

Epistemic Risks

How Risks Connect

Related Wiki Pages

Top Related Pages

Cyberweapons Risk

Bioweapons Risk

Scheming

Deceptive Alignment

AI-Induced Expertise Atrophy

Risks

Analysis

Concepts

AI Risks

Overview

Risk Categories

Accident RisksRiskSchemingScheming—strategic AI deception during training—has transitioned from theoretical concern to observed behavior across all major frontier models (o1: 37% alignment faking, Claude: 14% harmful compli...Quality: 74/100

Misuse RisksRiskBioweapons RiskComprehensive synthesis of AI-bioweapons evidence through early 2026, including the FRI expert survey finding 5x risk increase from AI capabilities (0.3% → 1.5% annual epidemic probability), Anthro...Quality: 91/100

Structural RisksRiskAI Development Racing DynamicsRacing dynamics analysis shows competitive pressure has shortened safety evaluation timelines by 40-60% since ChatGPT's launch, with commercial labs reducing safety work from 12 weeks to 4-6 weeks....Quality: 72/100

Epistemic RisksRiskAI-Driven Trust DeclineUS government trust declined from 73% (1958) to 17% (2025), with AI deepfakes projected to reach 8M by 2025 accelerating erosion through the 'liar's dividend' effect—where synthetic content possibi...Quality: 55/100

How Risks Connect

Related Wiki Pages

Top Related Pages

Cyberweapons Risk

Bioweapons Risk

Scheming

Deceptive Alignment

AI-Induced Expertise Atrophy

Risks

Analysis

Concepts

Accident Risks

Misuse Risks

Structural Risks

Epistemic Risks