AI Safety Intervention Portfolio
Strategic overview of AI safety interventions, analyzing ~$650M in annual investment across 1,100 FTEs. It maps 13+ interventions against 4 risk categories using ITN (importance, tractability, neglectedness) prioritization, and finds that 85% of external funding comes from 5 sources and that the safety/capabilities ratio is 0.5-1.3%.
Related
AI Evaluations
This page analyzes AI safety evaluations and red-teaming as a risk mitigation strategy.
Interpretability
Understanding AI systems by reverse-engineering their internal computations to detect deception and verify alignment.
Responsible Scaling Policies
Responsible Scaling Policies (RSPs) are voluntary commitments by AI labs to pause scaling when capability or safety thresholds are crossed.
Coefficient Giving
Coefficient Giving (formerly Open Philanthropy) is a major philanthropic organization that has directed over $4 billion in grants since 2014, inc...
AI Safety Field Building Analysis
This analysis examines AI safety field-building interventions, including education programs (ARENA, MATS, BlueDot).