Longterm Wiki

Agent Foundations

Agent foundations research (MIRI's mathematical frameworks for aligned agency) faces low tractability after 10+ years with core problems unsolved, leading to MIRI's 2024 strategic pivot away from the field. Assessment shows ~15-25% probability the work is essential, 60-75% confidence in low tractabi

Related Pages

Top Related Pages

Risks

Instrumental ConvergenceDeceptive AlignmentSharp Left TurnMesa-OptimizationCorrigibility Failure

Analysis

Capability-Alignment Race Model

Other

AI ControlScalable OversightInterpretabilityNeel NandaCorrigibility

Concepts

Governance-Focused WorldviewAlignment Theoretical Overview

Organizations

Manifest (Forecasting Conference)

Historical

Deep Learning Revolution EraThe MIRI Era

Key Debates

Technical AI Safety Research