AI Safety Organizations (Overview)
Overview
The AI safety organizational landscape spans dedicated alignment research labs, policy think tanks, advocacy groups, and field-building institutions. These organizations aim to reduce catastrophic and existential risks from advanced AI systems through technical research, governance advocacy, talent development, and public engagement.
Funding is heavily concentrated through a small number of major funders, most prominently Coefficient Giving (formerly Open Philanthropy), which has provided grants to the majority of organizations listed on this page. This concentration produces a relatively coordinated funding environment, with most grantees sharing compatible research agendas and norms, while also reducing the diversification of funding sources across the field.
Note: The AI safety organizational landscape evolves rapidly. Headcount, budget, and focus area descriptions reflect available information as of mid-2025 and may not capture recent changes. Check individual entity pages for the most current details.
Alignment Research Labs
Dedicated organizations conducting technical AI safety research:
ARC (Alignment Research Center): Founded by Paul Christiano; focuses on alignment evaluation and theoretical alignment research
METR: Evaluates dangerous capabilities in frontier AI models; spun out of ARC
Apollo Research: Focuses on detecting and understanding deceptive AI behavior, including scheming evaluations
Redwood Research: Alignment research lab working on interpretability, adversarial training, and AI control
Conjecture: Alignment research and product company based in London
FAR AI: Researches robustness, adversarial attacks, and alignment failures in AI systems
Palisade Research: Focuses on practical AI safety evaluation and red-teaming
Seldon Lab: Works on alignment approaches and safety evaluations
Goodfire: Interpretability-focused startup building tools for understanding neural networks
MIRI (Machine Intelligence Research Institute): Pioneer in AI alignment theory; founded 2000
Policy and Governance Organizations
Think tanks and research centers focused on AI governance and policy:
GovAI: Oxford-based research center focused on AI governance
CSET (Center for Security and Emerging Technology): Georgetown think tank producing policy-relevant research on AI and emerging technologies
CSER (Centre for the Study of Existential Risk): Cambridge-based research center studying existential risks, including from AI
Secure AI Project: Advocacy organization focused on AI safety policy
ControlAI: Advocacy organization pushing for stronger AI regulation and safety standards
Pause AI: Grassroots advocacy movement calling for a pause on frontier AI development
Frontier Model Forum: Industry-led consortium for frontier AI safety, founded by Anthropic, Google DeepMind, Microsoft, and OpenAI. The forum's stated mission centers on safety research and best-practice sharing; observers differ on the extent to which it functions as a coordination body versus an industry advocacy vehicle
Field-Building and Talent Development
Organizations supporting the growth of the AI safety field:
80,000 Hours: Career advisory organization directing talent toward high-impact careers, including AI safety
MATS (ML Alignment Theory Scholars): Training program connecting aspiring alignment researchers with mentors
Lightning Rod Labs: Works on AI safety infrastructure and tooling
AI Futures Project: Research and analysis on AI development trajectories and safety considerations
Research and Analysis
Organizations focused on understanding AI progress and risks:
Epoch AI: Tracks AI compute trends, model capabilities, and training data
CAIS (Center for AI Safety): Conducts AI safety research and field-building; hosts a compute cluster for safety research
CHAI (Center for Human-Compatible AI): UC Berkeley research center founded by Stuart Russell, focusing on human-compatible AI
Budget and Headcount Comparison
For funders and researchers evaluating organizational capacity and capital efficiency, comparative budget and headcount data can help identify where additional resources may be most impactful and how different organizations structure their research operations. The table below aggregates publicly available estimates across nine prominent independent AI safety organizations.
All figures are estimates derived from IRS Form 990 filings (via ProPublica Nonprofit Explorer), Coefficient Giving (formerly Open Philanthropy) grant disclosures, LinkedIn headcount data, and news reports. Figures are approximate, may lag actual values by one to two years, and should be treated as indicative rather than authoritative. The "Est. Budget per Staff Member/year" column is calculated using the midpoint of the headcount range and counts all staff, not researchers only.
| Organization | Annual Budget (Est.) | Headcount (Est.) | Est. Budget per Staff Member/Year | Primary Funder | Focus Area |
|---|---|---|---|---|---|
| MIRI | ≈$5M | 10–15 | ≈$400K | Coefficient Giving | Alignment theory |
| ARC | ≈$8M | 20–30 | ≈$320K | Coefficient Giving | Alignment research & evaluation |
| METR | ≈$5M | 20–30 | ≈$200K | Coefficient Giving | Dangerous capability evaluation |
| CAIS | ≈$5M | 15–20 | ≈$286K | Coefficient Giving | Research & field-building |
| Redwood Research | ≈$10M | 30–40 | ≈$286K | Coefficient Giving | Interpretability & AI control |
| Apollo Research | ≈$4M | 15–20 | ≈$229K | Coefficient Giving | Deceptive alignment & scheming |
| Conjecture | ≈$5M | 30–40 | ≈$143K | Mixed (VC + grants) | Alignment research & products |
| FAR AI | ≈$3M | 10–15 | ≈$240K | Coefficient Giving | Robustness & adversarial ML |
| GovAI | ≈$5M | 20–30 | ≈$200K | Coefficient Giving | AI governance & policy |
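The derived per-staff column can be reproduced directly from the table. A minimal sketch in Python, using the rounded estimates above rather than authoritative figures:

```python
# Reproduces the "Est. Budget per Staff Member/Year" column from the
# rounded public estimates in the table above; actual filings may differ.
orgs = {
    # name: (annual budget in USD, (headcount low, headcount high))
    "MIRI":             (5_000_000, (10, 15)),
    "ARC":              (8_000_000, (20, 30)),
    "METR":             (5_000_000, (20, 30)),
    "CAIS":             (5_000_000, (15, 20)),
    "Redwood Research": (10_000_000, (30, 40)),
    "Apollo Research":  (4_000_000, (15, 20)),
    "Conjecture":       (5_000_000, (30, 40)),
    "FAR AI":           (3_000_000, (10, 15)),
    "GovAI":            (5_000_000, (20, 30)),
}

for name, (budget, (low, high)) in orgs.items():
    midpoint = (low + high) / 2      # midpoint of the headcount range
    per_staff = budget / midpoint    # counts all staff, not researchers only
    print(f"{name:<18} ≈${per_staff / 1_000:.0f}K per staff member/year")
```

Running this recovers the table values (e.g., MIRI: $5M / 12.5 ≈ $400K; Conjecture: $5M / 35 ≈ $143K), which is why orgs with identical budgets but larger teams show lower ratios.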
The budget-per-staff figures reflect meaningful variation in organizational structure. Organizations with lower ratios (e.g., Conjecture) typically employ a higher proportion of non-researcher staff or operate hybrid research-product models, whereas those with higher ratios (e.g., MIRI) tend toward smaller, senior-heavy research teams. These figures should not be interpreted as proxies for research quality or output volume.
Key Patterns
Specialization trend: The field has moved from generalist safety organizations, such as MIRI and the Future of Humanity Institute (FHI, which closed in 2024), toward more specialized roles: dedicated evaluation labs (METR, Apollo Research), interpretability startups (Goodfire), policy research centers (ControlAI, GovAI), and talent pipelines (MATS, 80,000 Hours).
Industry-adjacent positioning: Organizations in this landscape occupy a range of positions relative to frontier AI developers. Some, such as the Frontier Model Forum, Redwood Research, and Apollo Research, maintain active collaborative relationships with frontier labs. Others, including Pause AI and ControlAI, advocate for regulatory constraints on AI development and position themselves independently of industry partnerships. Proponents of each approach offer different accounts of how safety outcomes are best achieved.
Funding concentration: As illustrated in the budget table above, most organizations in this cluster report Coefficient Giving as their primary funder. This pattern is visible across alignment research, governance research, and field-building organizations alike.