Longterm Wiki

Responsible Scaling Policy

AnthropicResearch Areasresponsible-scaling-policy

Record Metadata

Record Keyresponsible-scaling-policy
EntityAnthropic
CollectionResearch Areas(6 records total)
SchemaMajor research initiatives and focus areas.
YAML Filepackages/kb/data/things/mK9pX3rQ7n.yaml

Fields

NameResponsible Scaling Policy
DescriptionFramework for evaluating and mitigating risks at each capability level
StartedSep 2023
Key Publicationanthropic.com
NotesCurrently on v3.0 (Feb 2026); ASL-2 to ASL-3 framework

Other Records in Research Areas (5)

KeyNameDescriptionTeam Size
mechanistic-interpretabilityMechanistic InterpretabilityUnderstanding neural network internals through reverse-engineering50
constitutional-aiConstitutional AITraining AI systems to follow principles through self-critique and RLAIF
alignment-scienceAlignment ScienceScalable oversight, weak-to-strong generalization, robustness to jailbreaks
sleeper-agentsSleeper Agents ResearchInvestigating whether AI systems can maintain hidden behaviors through training
ai-welfareAI Welfare ResearchInvestigating moral status and welfare considerations for AI systems
Record: responsible-scaling-policy | Longterm Wiki