Frontier AI Labs Comparison
Read this before the table
Epistemic status. Opinionated synthesis, not a peer-reviewed scorecard. Ratings reflect publicly observable signals through early 2026; internal practices that aren't in the public record are not assessed.
Direction of ratings. For every safety dimension, HIGH means more safety-positive (more investment, more restraint, more transparency, stronger alignment, more detail). The capability "Tier" column is descriptive, not evaluative — being a frontier lab is not in itself good or bad. UNCLEAR means we don't have enough public signal to rate — not that the underlying practice is bad.
Limits. Ratings age fast — frontier labs reorganize, ship new policies, and lose senior staff on quarterly timescales. Update cadence here is ~14 days; always check the linked references for current state. We weigh observable artifacts (published RSPs, system cards, attrition patterns) more heavily than press statements. Other scorecards weight differently — see FLI AI Safety Index, METR Common Elements, and AI Lab Watch for independent rubrics.
Frontier tier classification | Dedicated safety / alignment headcount and research output, relative to org size | Public posture and revealed behavior on competitive racing. HIGH = restrained. | System cards, RSPs, training detail, pre-deployment AISI access | Regulator / AISI / summit engagement. HIGH = constructive engagement. | CEO / co-founder public stance and revealed prioritization of safety | Internal safety advocacy, alignment-research output, attrition pattern | Detail and operational specificity of published Responsible Scaling Policy or equivalent | Notable signals | References | |
|---|---|---|---|---|---|---|---|---|---|---|
Founded 2021 by ex-OpenAI safety leads; explicitly safety-mission lab. Claude family. | FRONTIER Claude Opus / Sonnet competitive at the top tier through 2025-2026. | HIGH Largest dedicated alignment/interpretability staffing share among frontier labs; public RSP and ASL framework; governed by the Long-Term Benefit Trust. | MEDIUM Dario Amodei has openly argued the 'race to safety' framing — explicit goal to stay at frontier rather than slow down. Commercial trajectory has accelerated. | HIGH Detailed RSP v2, comprehensive system cards, public interpretability research, some training/compute disclosure. | HIGH US AISI MOU, UK AISI pre-deployment access, congressional testimony, supports compute-based regulation. | HIGH All seven co-founders left OpenAI over safety disagreements. Dario and Daniela Amodei publicly EA-aligned; 80% Giving Pledge. | HIGH Heavy alignment-research output; employee donation matching program; few high-profile safety-driven departures relative to peers. | HIGH RSP v2 (Oct 2024) — explicit capability thresholds, deployment/training pause conditions, board governance, internal red-team requirements. |
| |
Founded 2015 as nonprofit; now capped-profit. GPT family, o-series reasoning, Sora. | FRONTIER GPT-4o / o-series at the top tier; routinely trades leadership with Anthropic and Google. | MEDIUM Preparedness team and Safety Advisory Group exist; Superalignment team disbanded May 2024. Safety-to-capabilities ratio has fallen since 2023 reorg. | LOW Sam Altman has publicly framed fast deployment as inevitable. Preparedness Framework includes deployment-gating language but execution has been criticized as lagging. | MEDIUM Publishes system cards, model spec, Preparedness Framework. Withholds training data, compute, and full safety-eval results. | MEDIUM Altman 2023 Senate testimony; AISI partnership; participated in summit commitments. Downgraded from HIGH because OpenAI also lobbied against parts of EU AI Act and SB 1047 — engagement is selective, not consistently constructive. | LOW Founding safety mission walked back; chief scientist Sutskever departed; Jan Leike departed citing 'safety culture taking a back seat'; November 2023 board episode. | LOW Pattern of high-profile safety departures 2023-2025 (Sutskever, Leike, Saunders, Krueger, Lehman, Kokotajlo, Brundage). June 2024 'Right to Warn' open letter from former employees on whistleblower protections. | MEDIUM Preparedness Framework v1 (Dec 2023, updated 2025) — thresholds for cyber, persuasion, CBRN, autonomy. Less prescriptive on operational tripwires than Anthropic's RSP. |
| |
Merger of DeepMind and Google Brain (2023). Gemini family; AlphaFold / AlphaGo lineage. | FRONTIER Gemini 2.x and reasoning models competitive at top tier. | MEDIUM Frontier Safety + AGI Safety teams; substantial absolute headcount but smaller fraction of total org than Anthropic. Long alignment research history. | MEDIUM Demis Hassabis publicly safety-conscious. 2023 DeepMind/Brain merger driven in part by competitive pressure; Gemini cadence has accelerated. | MEDIUM Frontier Safety Framework, model cards, AGI safety paper. Less prescriptive than Anthropic's RSP; commercial constraints from Google reduce disclosure. | HIGH UK + US AISI partnerships; Hassabis advises UK Government; participated in Bletchley / Seoul / Paris summit commitments. | MEDIUM Hassabis publicly safety-focused since DeepMind founding; original ethics board legacy. Operating constraints from Google parent. | MEDIUM Strong public alignment-research output. Some internal-protest history (Project Maven, Gemini launch). Less visible attrition pattern than OpenAI. | MEDIUM Frontier Safety Framework v2 (Feb 2025) — capability thresholds and deployment mitigations, less operationally specific than RSP v2. |
| |
Includes FAIR (Yann LeCun) and Meta GenAI. Llama family; open-weights strategy. | NEAR-FRONTIER Llama 3.x / 4 competitive in their tiers but consistently behind closed-weights frontier. | LOW Small explicit safety team relative to size; LeCun publicly skeptical of catastrophic AI x-risk framings. | LOW Open-weights release strategy maximizes diffusion. LeCun has publicly minimized frontier-risk concerns; Zuckerberg framing emphasizes openness and competition with China. | MEDIUM Open weights and detailed technical reports are unusually transparent on capability artifacts. But this rubric weights transparency toward operational safety detail — RSPs, system cards, capability thresholds — none of which Meta publishes. HIGH on openness, LOW on safety-operational transparency; netted to MEDIUM. | MEDIUM Participated at summits. Opposed parts of EU AI Act addressing open-source models. Less proactive on safety-specific policy. | LOW LeCun's public position downplays catastrophic risk. Strategy is openness-first. | LOW Few public safety-driven departures or open letters from FAIR / GenAI. Less visible internal safety advocacy. | NONE No published RSP or capability-threshold framework as of early 2026. |
| |
Founded 2023 by Elon Musk. Grok family; Colossus GPU build-out in Memphis. | FRONTIER Grok 3 (Feb 2025) competitive on public benchmarks; massive compute build-out. | LOW Small explicit safety team; thin public safety-research output; Risk Management Framework published Feb 2025 is brief. | LOW Founded explicitly to race; Musk publicly framed xAI as 'anti-woke' alternative to OpenAI. Grok released with deliberately minimal output guardrails. | LOW Sparse model documentation, no detailed safety evaluations published, brief system cards. | MEDIUM Musk publicly supported SB 1047. xAI participated in Seoul Summit. Less ongoing engagement than the frontier-3. | LOW Musk's positioning has shifted over time; pattern at xAI is to ship capability before safety scaffolding. | UNCLEAR Too early to characterize; little public information on internal safety dissent. | LOW Risk Management Framework (Feb 2025) — published but thin on operational thresholds, tripwires, and governance specifics. |
| |
Mustafa Suleyman-led division (Mar 2024). Phi series + Copilot products + OpenAI partnership. | NEAR-FRONTIER Phi small-model line is strong; frontier-capability access primarily via OpenAI partnership rather than in-house pretraining. | MEDIUM Microsoft Responsible AI Office and large governance apparatus; less in-house alignment-research output than frontier-3. | LOW Commercial deployment-first strategy via Copilot ubiquity. Bing chat / Sydney incidents (2023) reflect prioritizing speed. | MEDIUM Responsible AI Standard published; Phi model cards exist; less RSP-style capability disclosure. | HIGH Brad Smith leads global policy advocacy; AISI partner; UN AI advisory body participation. | MEDIUM Suleyman publicly safety-aware; Microsoft RAI track record mixed (Tay 2016, Sydney 2023). Brad Smith is policy-active. | UNCLEAR Large enterprise; few visible safety-driven departures, also less visibility into internal advocacy. | LOW Responsible AI Standard is an enterprise-policy framework; no RSP-style capability-threshold tripwire system. |
| |
Hangzhou-based, spun out of High-Flyer Capital. DeepSeek V3 / R1 reasoning models, open-weights. | FRONTIER DeepSeek V3 / R1 (early 2025) reached frontier reasoning performance at notably lower training cost. | LOW Minimal public safety apparatus. Some alignment-related discussion in technical reports; no dedicated safety org disclosure. | LOW Strategy pushes capability frontier with permissive open-weights release. R1 reasoning model released with weights and method. | MEDIUM Weights, technical methodology, and some training detail unusually open. No RSP / capability-threshold framework and limited safety-eval disclosure — netted to MEDIUM on this rubric's safety-weighted definition of transparency. | UNCLEAR Operates under PRC regulatory framework (algorithm registration, content moderation). Limited Western governance interaction. | UNCLEAR Few public safety statements from leadership in English-language venues. | UNCLEAR Limited public information on internal safety culture or dissent. | NONE No public RSP, framework, or capability-threshold commitment. |
| |
Alibaba Cloud / DAMO Academy. Qwen family — strong open-weights releases. | NEAR-FRONTIER Qwen 2.5 / 3 competitive in their tiers; multimodal and code variants strong; below absolute frontier on hardest reasoning. | LOW DAMO Academy has safety research streams; explicit frontier-safety framing limited in English-language public output. | LOW Aggressive open-weights cadence across sizes; rapid release tempo. | MEDIUM Weights released and detailed technical reports. No RSP / capability framework and limited safety-eval depth — HIGH on capability openness, LOW on safety-operational transparency; netted to MEDIUM. | UNCLEAR Operates under PRC regulatory framework. Limited Western policy interaction by Alibaba AI specifically. | UNCLEAR Few public safety-framed statements from leadership. | UNCLEAR Limited public information. | NONE No public RSP-equivalent framework. |
| |
Paris-based, founded 2023. Open and commercial models; European AI champion positioning. | NEAR-FRONTIER Mid-sized models competitive in efficiency tier; below absolute frontier on largest closed models. | LOW Minimal public safety research output; no dedicated safety org publicly disclosed. | LOW Open-weights strategy; European-champion competitive framing. | MEDIUM Model cards exist; safety detail thinner. | MEDIUM Active in EU AI Act lobbying — primarily to reduce obligations on open-source frontier models. | UNCLEAR Limited public statements on existential safety; not a central messaging focus. | UNCLEAR Too small / too recent for clear pattern. | NONE No published RSP. |
| |
Toronto-based, founded 2019 by ex-Google Brain (Aidan Gomez). Enterprise / RAG focus. | NEAR-FRONTIER Command R / R+ competitive in enterprise / RAG tier; below frontier on hardest reasoning. Strategy is enterprise-focused, not frontier-chasing. | MEDIUM Cohere For AI research lab; published responsible-AI / multilingual safety work. Smaller commercial scale than frontier-3. | MEDIUM Enterprise positioning de-emphasizes the frontier race; deliberate focus on smaller, deployable models. | MEDIUM Model and use-policy docs available; some safety research published; not full weight releases. | HIGH Active in Canadian and international AI governance discussion; Aidan Gomez public commentary on policy. | MEDIUM Some public safety framing alongside commercial focus. | UNCLEAR Limited public signal. | LOW Acceptable Use and responsible-use docs; no published RSP-style capability-threshold framework. |
| |
AI21 Labs Tel Aviv-based, founded 2017. Jurassic, Jamba (SSM hybrid); enterprise focus. | SPECIALIZED Jamba (SSM-Transformer hybrid) is technically distinctive but not at general-purpose frontier; enterprise-specialized. | LOW Minimal public safety apparatus. | MEDIUM Enterprise-focused; less frontier-race framing than the closed-weights frontier-3. | MEDIUM Jamba weights released; technical reports detailed. Safety-eval disclosure limited. | UNCLEAR Limited Western-frontier policy visibility. | UNCLEAR Limited public statements on existential safety. | UNCLEAR Limited public information. | NONE No published RSP-equivalent framework. |
| |
Founded 2022 by Mustafa Suleyman, Karén Simonyan, Reid Hoffman. Most of team absorbed into Microsoft AI March 2024. | DEFUNCT Frontier ambitions abandoned March 2024; remaining entity pivoted to enterprise. Original Pi-model team is now Microsoft AI under Suleyman. | UNCLEAR Not applicable to current entity; pre-absorption safety apparatus was modest. | UNCLEAR Pre-absorption was a fast scaler; current entity is no longer frontier. | LOW Minimal public technical detail in either form. | UNCLEAR Suleyman has carried his policy engagement into Microsoft AI. | UNCLEAR Not applicable to current entity. | UNCLEAR Not applicable post-absorption. | NONE No public RSP-equivalent framework, before or after absorption. |
|
What this table is — and isn't
It's a cross-lab comparison overlay on top of the FactBase / TableBase organization data this wiki already maintains. Hard numbers (funding, valuations, headcount) live in the per-org pages at /organizations. This table is the layer above that — the editorial judgment about safety practice, anchored where possible in observable artifacts.
It is not:
- A precise scoring system. The HIGH / MEDIUM / LOW levels are buckets, not measurements.
- A prediction. We rate revealed behavior to date, not where each lab is heading.
- A safety endorsement for HIGH-rated labs. A lab can score well on every column here and still be a net contributor to risk (because the underlying activity is high-risk, or because revealed safety practice can degrade fast).
- Definitive. Other lab-scorecard efforts (FLI, METR, AI Lab Watch) use different rubrics and reach different conclusions on the same labs. We cite ours, not theirs.
Why these eight dimensions
The dimensions are chosen to track what a third-party observer can verify from public artifacts, on the theory that the gap between stated safety commitment and revealed safety behavior is more useful than either alone:
- Capability tier — descriptive, not evaluative. Distinguishes labs whose safety choices actually matter at the global frontier from those whose choices matter mostly to their customers.
- Safety investment — dedicated alignment / interpretability / red-team headcount and research output, relative to org size. Absolute headcount favors hyperscalers; ratio is more meaningful.
- Race posture restraint — public framing plus revealed behavior on competitive deployment. We mark HIGH when restraint is both stated and demonstrated; LOW when racing is explicit or when revealed cadence contradicts safety framing.
- Transparency — RSPs, system cards, training detail, AISI pre-deployment access. Weighted toward operationally specific artifacts (capability thresholds, tripwires) over PR commitments.
- Governance engagement — constructive engagement with regulators, AISIs, summits. We don't credit a lab for testifying if they also lobbied against the same legislation.
- Leadership safety alignment — CEO / co-founder public stance plus revealed prioritization (resourcing, who reports to whom, what gets shipped when leadership disagrees).
- Employee safety culture — internal safety advocacy, public alignment-research output, attrition patterns. High-profile safety-driven departures are weighed heavily because they're a costly signal.
- RSP / framework quality — operational specificity. We treat a vague public-relations document as worse than no policy, because the former forecloses scrutiny.
What HIGH means
HIGH is safety-positive, uniformly. HIGH safety investment means more investment; HIGH transparency means more disclosure; HIGH race restraint means more restraint; HIGH leadership alignment means stronger alignment. This lets you scan the table left-to-right and read green/amber/red as a rough safety signal — at the cost of forcing some unidimensional judgments onto multidimensional realities. The per-cell notes try to recover the nuance.
We use UNCLEAR rather than guessing when public signal is thin (especially for Chinese-jurisdiction labs and very small / very new labs). Treat UNCLEAR rows as "we don't know," not as "probably bad."
Cross-references
- Anthropic Stakeholders Table — single-org-deep-dive prototype.
- /organizations — organization directory with FactBase facts (funding rounds, valuations, headcount).
- FLI AI Safety Index (Winter 2025) — independent scorecard, different rubric.
- METR: Common Elements — frontier safety policy comparison.
- AI Lab Watch — third-party scorecard with detailed rubric.
Methodology — how each rating is anchored
Every rating in the table has a one- or two-sentence note linking it to an observable signal. Where multiple signals exist, the "Notable signals" column carries the longer evidence list and the "References" column links sources.
When evidence conflicts (e.g., a public safety statement contradicted by revealed deployment cadence), we weight revealed behavior over stated commitments, on the theory that the costly-signal side carries more information. This is why labs that publish strong-sounding safety frameworks can still receive a MEDIUM or LOW rating on race-posture restraint — the frameworks are real, but so is the deployment cadence.
For attribution-light claims (e.g., "small explicit safety team"), the absence of public confirmation is itself informative: frontier labs that invest heavily in safety typically advertise it. We mark these LOW, not UNCLEAR.
Limitations
- Sample is small — 12 labs, eight dimensions, hand-curated. Per-cell error bars are wide.
- Western-centric public signal — Chinese-jurisdiction labs operate under a different regulatory and disclosure regime; UNCLEAR ratings there reflect our limited visibility, not necessarily their internal practice.
- Race-posture restraint is the most contested column — labs and observers disagree about what counts as racing. We use "race-conscious" as MEDIUM and reserve LOW for labs whose public framing explicitly endorses racing.
- Ratings age fast — quarterly cadence at minimum. Anything older than three months should be re-verified before being cited.
Suggested re-reads, in order of likely turnover: leadership safety alignment > race-posture restraint > RSP quality > safety investment > transparency > everything else.
References
The Future of Life Institute evaluated eight major AI companies across 35 safety indicators, finding widespread deficiencies in risk management and existential safety practices. Even top performers Anthropic and OpenAI received only marginal passing grades, highlighting systemic gaps across the industry in preparedness for advanced AI risks.
METR analyzes the common structural elements across frontier AI safety policies published by major AI companies, identifying shared frameworks around capability thresholds, model evaluations, weight security, deployment mitigations, and accountability mechanisms. The December 2025 version covers twelve companies including Anthropic, OpenAI, Google DeepMind, Meta, and others, and incorporates references to the EU AI Act's General-Purpose AI Code of Practice and California's Senate Bill 53.
AI Lab Watch is a publicly maintained scorecard tracking what major AI companies are doing to address safety risks, scoring them across weighted categories including risk assessment, scheming risk prevention, safety research, misuse prevention, and security preparedness. Created by Zach Stein-Perlman, it provides comparative transparency on AI lab safety practices. As of September 2025, the site is no longer actively maintained.