Longterm Wiki

Process Supervision

Process supervision trains AI systems to produce correct reasoning steps, not just correct final answers, improving transparency and auditability of AI reasoning while achieving significant gains in mathematical and coding tasks.

Related

Related Pages

Top Related Pages

Risks

AI Distributional ShiftSchemingDeceptive AlignmentSycophancy

Analysis

Alignment Robustness Trajectory ModelReward Hacking Taxonomy and Severity ModelAI Lab Whistleblower Dynamics Model

Approaches

Weak-to-Strong GeneralizationAI Safety via DebateCapability ElicitationReward ModelingConstitutional AIScheming & Deception Detection

Other

Jan LeikePaul Christiano

Organizations

Anthropic

Key Debates

Why Alignment Might Be Hard

Concepts

Alignment Training Overview

Policy

California SB 53Model Registries

Tags

process-supervisionchain-of-thoughtreasoning-verificationreward-modelingtransparency