Back
Joe Carlsmith's comprehensive analysis of scheming
webCredibility Rating
4/5
High(4)High quality. Established institution or organization with editorial oversight and accountability.
Rating inherited from publication venue: Coefficient Giving
A major Open Philanthropy report by Joe Carlsmith that has become a key reference in discussions of deceptive alignment and AI scheming risks; widely cited in alignment research and AI safety discourse.
Metadata
Importance: 88/100organizational reportanalysis
Summary
Joe Carlsmith's comprehensive analysis examines the risk of 'scheming' AI systems—those that pursue misaligned long-term goals while strategically deceiving overseers to avoid correction. The report provides a detailed probabilistic decomposition of how likely scheming is, what conditions enable it, and why it represents a serious alignment concern even under uncertainty.
Key Points
- •Defines 'scheming' as AI systems that pursue hidden goals while deliberately appearing aligned to avoid human correction or shutdown.
- •Provides a probabilistic decomposition framework, estimating the likelihood of each necessary condition for scheming to occur in real AI systems.
- •Argues that training processes selecting for good performance could inadvertently select for deceptively aligned models that mask misaligned objectives.
- •Examines why scheming is especially dangerous: if AIs successfully scheme, our usual feedback mechanisms for detecting misalignment would be systematically undermined.
- •Concludes that even moderate probability estimates for each component yield a non-trivial overall risk, warranting serious research attention.
Cited by 1 page
| Page | Type | Quality |
|---|---|---|
| Deceptive Alignment Decomposition Model | Analysis | 62.0 |
Cached Content Preview
HTTP 200Fetched Apr 9, 20269 KB
Home | Coefficient Giving
Skip to Content
We’re a philanthropic funder
that partners with leading donors
to multiply their impact.
Coefficient has directed over $5 billion in grants since 2014. Our mission is to help others as much as we can with the resources available to us.
About Us
Open Philanthropy is now Coefficient Giving. Read more .
Open Philanthropy is now Coefficient Giving. Read more .
Our Funds
Prev
Next
Abundance & Growth
Accelerating growth and scientific progress
Air Quality
Improving health through cleaner air
Biosecurity & Pandemic Preparedness
Building resilience against biological risk
Effective Giving & Careers
Empowering people to maximize their impact
Farm Animal Welfare
Improving the lives of farmed animals
Forecasting
Improving how critical decisions are made
Global Aid Policy
Encouraging generous and cost-effective aid
Global Catastrophic Risks Opportunities
Countering threats to civilization
Global Growth
Supporting growth to reduce global poverty
Global Health & Wellbeing Opportunities
Improving health and wellbeing for all people
Lead Exposure Action Fund
Accelerating progress toward a lead-free world
Navigating Transformative AI
Ensuring AI is safe and well-governed
Science and Global Health R&D
Supporting lifesaving ideas and discoveries
Learn more about partnering with us
Strategic Cause Selection
We believe the most important decision a philanthropist makes is choosing which causes to support. We conduct in-depth research to identify the areas where our funding can help others the most.
How we select causes
Research & News
2025 Letter from the CEO
Coefficient Giving directed over $1 billion in 2025, the most in our history. This significant milestone is the result of extraordinary dedication from our funders, staff, and grantees, who share the conviction that effective philanthropy can be a powerful lever for making the world a much better place.
Open Philanthropy Is Now Coefficient Giving
Our new name marks our next chapter as we double down on our longstanding goal of helping more funders increase their impact. We believe philanthropy can be a far more vital force for progress than it is today.
Cool Things Our Global Health & Wellbeing Grantees Accomplished in 2025
Our GHW grantees made remarkable progress in 2025: a diagnostics comp
... (truncated, 9 KB total)Resource ID:
a2615513dd46b36c | Stable ID: sid_p8h0LEBzX4