Policy & Governance
Policy approaches establish organizational frameworks for responsible AI development.
Scaling Policies:
- Responsible Scaling Policies (RSPs)RspComprehensive analysis of Responsible Scaling Policies showing 20 companies with published frameworks as of Dec 2025, with SaferAI grading major policies 1.9-2.2/5 for specificity. Evidence suggest...Quality: 62/100: Commitments tied to capability thresholds
Model Documentation:
- Model SpecificationsModel SpecModel specifications are explicit documents defining AI behavior, now published by all major frontier labs (Anthropic, OpenAI, Google, Meta) as of 2025. While they improve transparency and enable e...Quality: 50/100: Documenting intended behavior and limitations
Evaluation Governance:
- Evaluation GovernanceEvals GovernanceEvals-based deployment gates create formal checkpoints requiring AI systems to pass safety evaluations before deployment, with EU AI Act imposing fines up to EUR 35M/7% turnover and UK AISI testing...Quality: 66/100: How evaluations inform deployment decisions
Development Speed:
- Pause/MoratoriumPause MoratoriumComprehensive analysis of pause/moratorium proposals finding they would provide very high safety benefits if implemented (buying time for safety research to close the growing capability-safety gap)...Quality: 72/100: Arguments for slowing AI development