Longterm Wiki

Constitutional AI Paper

AnthropicSafety Milestonesconstitutional-ai-paper

Record Metadata

Record Keyconstitutional-ai-paper
EntityAnthropic
CollectionSafety Milestones(11 records total)
SchemaSignificant safety research publications and policy milestones.
YAML Filepackages/kb/data/things/mK9pX3rQ7n.yaml

Fields

NameConstitutional AI Paper
DateDec 2022
Typeresearch-paper
DescriptionFoundational paper on training AI systems to follow principles through self-critique
Sourcearxiv.org

Other Records in Safety Milestones (10)

KeyNameDateType
rsp-v1Responsible Scaling Policy v1.0Sep 2023policy-update
sleeper-agents-paperSleeper Agents PaperJan 2024research-paper
scaling-monosemanticityScaling MonosemanticityMay 2024research-paper
rsp-v2RSP v2.0Oct 2024policy-update
alignment-faking-paperAlignment Faking PaperDec 2024research-paper
constitutional-classifiersConstitutional Classifiers ChallengeFeb 2025red-team
circuit-tracingCircuit Tracing / Attribution GraphsMar 2025research-paper
asl-3-activationASL-3 ActivationMay 2025safety-eval
constitution-publishedClaude's Constitution PublishedJan 2026policy-update
rsp-v3RSP v3.0 (Frontier Safety Roadmaps)Feb 2026policy-update
Record: constitutional-ai-paper | Longterm Wiki