Skip to content
Longterm Wiki
Updated 2026-05-11HistoryData
Page StatusContentTable
Edited 5 weeks ago972 wordsUpdated biweeklyOverdue by 20 days
Content5/12
SummaryScheduleEntityEdit history
Tables0/ ~5Diagrams0Int. links3/ ~5Ext. links9/ ~3Footnotes0/ ~2References3/ ~2Quotes0Accuracy0
Issues1
Links9 links could use <R> components

Frontier AI Labs Comparison

Read this before the table

Epistemic status. Opinionated synthesis, not a peer-reviewed scorecard. Ratings reflect publicly observable signals through early 2026; internal practices that aren't in the public record are not assessed.

Direction of ratings. For every safety dimension, HIGH means more safety-positive (more investment, more restraint, more transparency, stronger alignment, more detail). The capability "Tier" column is descriptive, not evaluative — being a frontier lab is not in itself good or bad. UNCLEAR means we don't have enough public signal to rate — not that the underlying practice is bad.

Limits. Ratings age fast — frontier labs reorganize, ship new policies, and lose senior staff on quarterly timescales. Update cadence here is ~14 days; always check the linked references for current state. We weigh observable artifacts (published RSPs, system cards, attrition patterns) more heavily than press statements. Other scorecards weight differently — see FLI AI Safety Index, METR Common Elements, and AI Lab Watch for independent rubrics.

Columns:|
Frontier tier classification
Dedicated safety / alignment headcount and research output, relative to org size
Public posture and revealed behavior on competitive racing. HIGH = restrained.
System cards, RSPs, training detail, pre-deployment AISI access
Regulator / AISI / summit engagement. HIGH = constructive engagement.
CEO / co-founder public stance and revealed prioritization of safety
Internal safety advocacy, alignment-research output, attrition pattern
Detail and operational specificity of published Responsible Scaling Policy or equivalent
Notable signalsReferences
Founded 2021 by ex-OpenAI safety leads; explicitly safety-mission lab. Claude family.
FRONTIER
Claude Opus / Sonnet competitive at the top tier through 2025-2026.
HIGH
Largest dedicated alignment/interpretability staffing share among frontier labs; public RSP and ASL framework; governed by the Long-Term Benefit Trust.
MEDIUM
Dario Amodei has openly argued the 'race to safety' framing — explicit goal to stay at frontier rather than slow down. Commercial trajectory has accelerated.
HIGH
Detailed RSP v2, comprehensive system cards, public interpretability research, some training/compute disclosure.
HIGH
US AISI MOU, UK AISI pre-deployment access, congressional testimony, supports compute-based regulation.
HIGH
All seven co-founders left OpenAI over safety disagreements. Dario and Daniela Amodei publicly EA-aligned; 80% Giving Pledge.
HIGH
Heavy alignment-research output; employee donation matching program; few high-profile safety-driven departures relative to peers.
HIGH
RSP v2 (Oct 2024) — explicit capability thresholds, deployment/training pause conditions, board governance, internal red-team requirements.
  • RSP v2 published Oct 2024 with detailed ASL-3/ASL-4 tripwires.
  • Pre-deployment access agreements with US and UK AISIs.
  • Public interpretability program with regular technical releases.
Founded 2015 as nonprofit; now capped-profit. GPT family, o-series reasoning, Sora.
FRONTIER
GPT-4o / o-series at the top tier; routinely trades leadership with Anthropic and Google.
MEDIUM
Preparedness team and Safety Advisory Group exist; Superalignment team disbanded May 2024. Safety-to-capabilities ratio has fallen since 2023 reorg.
LOW
Sam Altman has publicly framed fast deployment as inevitable. Preparedness Framework includes deployment-gating language but execution has been criticized as lagging.
MEDIUM
Publishes system cards, model spec, Preparedness Framework. Withholds training data, compute, and full safety-eval results.
MEDIUM
Altman 2023 Senate testimony; AISI partnership; participated in summit commitments. Downgraded from HIGH because OpenAI also lobbied against parts of EU AI Act and SB 1047 — engagement is selective, not consistently constructive.
LOW
Founding safety mission walked back; chief scientist Sutskever departed; Jan Leike departed citing 'safety culture taking a back seat'; November 2023 board episode.
LOW
Pattern of high-profile safety departures 2023-2025 (Sutskever, Leike, Saunders, Krueger, Lehman, Kokotajlo, Brundage). June 2024 'Right to Warn' open letter from former employees on whistleblower protections.
MEDIUM
Preparedness Framework v1 (Dec 2023, updated 2025) — thresholds for cyber, persuasion, CBRN, autonomy. Less prescriptive on operational tripwires than Anthropic's RSP.
  • Superalignment team disbanded May 2024.
  • June 2024 'Right to Warn' open letter signed by 13 former and current employees.
  • Preparedness Framework updated 2025 but Safety Advisory Group decisions opaque.
Merger of DeepMind and Google Brain (2023). Gemini family; AlphaFold / AlphaGo lineage.
FRONTIER
Gemini 2.x and reasoning models competitive at top tier.
MEDIUM
Frontier Safety + AGI Safety teams; substantial absolute headcount but smaller fraction of total org than Anthropic. Long alignment research history.
MEDIUM
Demis Hassabis publicly safety-conscious. 2023 DeepMind/Brain merger driven in part by competitive pressure; Gemini cadence has accelerated.
MEDIUM
Frontier Safety Framework, model cards, AGI safety paper. Less prescriptive than Anthropic's RSP; commercial constraints from Google reduce disclosure.
HIGH
UK + US AISI partnerships; Hassabis advises UK Government; participated in Bletchley / Seoul / Paris summit commitments.
MEDIUM
Hassabis publicly safety-focused since DeepMind founding; original ethics board legacy. Operating constraints from Google parent.
MEDIUM
Strong public alignment-research output. Some internal-protest history (Project Maven, Gemini launch). Less visible attrition pattern than OpenAI.
MEDIUM
Frontier Safety Framework v2 (Feb 2025) — capability thresholds and deployment mitigations, less operationally specific than RSP v2.
  • Frontier Safety Framework v2 (Feb 2025).
  • AGI safety position paper (2025) outlining technical research agenda.
  • Pre-deployment AISI access for Gemini frontier releases.
Includes FAIR (Yann LeCun) and Meta GenAI. Llama family; open-weights strategy.
NEAR-FRONTIER
Llama 3.x / 4 competitive in their tiers but consistently behind closed-weights frontier.
LOW
Small explicit safety team relative to size; LeCun publicly skeptical of catastrophic AI x-risk framings.
LOW
Open-weights release strategy maximizes diffusion. LeCun has publicly minimized frontier-risk concerns; Zuckerberg framing emphasizes openness and competition with China.
MEDIUM
Open weights and detailed technical reports are unusually transparent on capability artifacts. But this rubric weights transparency toward operational safety detail — RSPs, system cards, capability thresholds — none of which Meta publishes. HIGH on openness, LOW on safety-operational transparency; netted to MEDIUM.
MEDIUM
Participated at summits. Opposed parts of EU AI Act addressing open-source models. Less proactive on safety-specific policy.
LOW
LeCun's public position downplays catastrophic risk. Strategy is openness-first.
LOW
Few public safety-driven departures or open letters from FAIR / GenAI. Less visible internal safety advocacy.
NONE
No published RSP or capability-threshold framework as of early 2026.
  • Llama 4 release continued open-weights strategy.
  • LeCun has repeatedly publicly minimized AGI / x-risk framings.
  • Acceptable Use Policy exists for Llama; no upstream capability thresholds.
Founded 2023 by Elon Musk. Grok family; Colossus GPU build-out in Memphis.
FRONTIER
Grok 3 (Feb 2025) competitive on public benchmarks; massive compute build-out.
LOW
Small explicit safety team; thin public safety-research output; Risk Management Framework published Feb 2025 is brief.
LOW
Founded explicitly to race; Musk publicly framed xAI as 'anti-woke' alternative to OpenAI. Grok released with deliberately minimal output guardrails.
LOW
Sparse model documentation, no detailed safety evaluations published, brief system cards.
MEDIUM
Musk publicly supported SB 1047. xAI participated in Seoul Summit. Less ongoing engagement than the frontier-3.
LOW
Musk's positioning has shifted over time; pattern at xAI is to ship capability before safety scaffolding.
UNCLEAR
Too early to characterize; little public information on internal safety dissent.
LOW
Risk Management Framework (Feb 2025) — published but thin on operational thresholds, tripwires, and governance specifics.
  • Risk Management Framework published Feb 2025; criticized for brevity and lack of tripwires.
  • Grok image-generation launches with notably few content filters.
  • Memphis Colossus cluster built rapidly; environmental and grid-impact concerns raised by local press.
Mustafa Suleyman-led division (Mar 2024). Phi series + Copilot products + OpenAI partnership.
NEAR-FRONTIER
Phi small-model line is strong; frontier-capability access primarily via OpenAI partnership rather than in-house pretraining.
MEDIUM
Microsoft Responsible AI Office and large governance apparatus; less in-house alignment-research output than frontier-3.
LOW
Commercial deployment-first strategy via Copilot ubiquity. Bing chat / Sydney incidents (2023) reflect prioritizing speed.
MEDIUM
Responsible AI Standard published; Phi model cards exist; less RSP-style capability disclosure.
HIGH
Brad Smith leads global policy advocacy; AISI partner; UN AI advisory body participation.
MEDIUM
Suleyman publicly safety-aware; Microsoft RAI track record mixed (Tay 2016, Sydney 2023). Brad Smith is policy-active.
UNCLEAR
Large enterprise; few visible safety-driven departures, also less visibility into internal advocacy.
LOW
Responsible AI Standard is an enterprise-policy framework; no RSP-style capability-threshold tripwire system.
  • Inflection AI team (incl. Suleyman) absorbed Mar 2024 to lead Microsoft AI.
  • Responsible AI Transparency Report published annually.
  • Significant deployment via Copilot, Office, Azure — sets the de facto safety floor for enterprise AI.
Hangzhou-based, spun out of High-Flyer Capital. DeepSeek V3 / R1 reasoning models, open-weights.
FRONTIER
DeepSeek V3 / R1 (early 2025) reached frontier reasoning performance at notably lower training cost.
LOW
Minimal public safety apparatus. Some alignment-related discussion in technical reports; no dedicated safety org disclosure.
LOW
Strategy pushes capability frontier with permissive open-weights release. R1 reasoning model released with weights and method.
MEDIUM
Weights, technical methodology, and some training detail unusually open. No RSP / capability-threshold framework and limited safety-eval disclosure — netted to MEDIUM on this rubric's safety-weighted definition of transparency.
UNCLEAR
Operates under PRC regulatory framework (algorithm registration, content moderation). Limited Western governance interaction.
UNCLEAR
Few public safety statements from leadership in English-language venues.
UNCLEAR
Limited public information on internal safety culture or dissent.
NONE
No public RSP, framework, or capability-threshold commitment.
  • DeepSeek R1 release (Jan 2025) shifted assumptions about training-cost frontiers.
  • Operates under Chinese generative AI regulations including algorithm filings.
  • No counterpart to the Western RSP / FSF framework family.
Alibaba Cloud / DAMO Academy. Qwen family — strong open-weights releases.
NEAR-FRONTIER
Qwen 2.5 / 3 competitive in their tiers; multimodal and code variants strong; below absolute frontier on hardest reasoning.
LOW
DAMO Academy has safety research streams; explicit frontier-safety framing limited in English-language public output.
LOW
Aggressive open-weights cadence across sizes; rapid release tempo.
MEDIUM
Weights released and detailed technical reports. No RSP / capability framework and limited safety-eval depth — HIGH on capability openness, LOW on safety-operational transparency; netted to MEDIUM.
UNCLEAR
Operates under PRC regulatory framework. Limited Western policy interaction by Alibaba AI specifically.
UNCLEAR
Few public safety-framed statements from leadership.
UNCLEAR
Limited public information.
NONE
No public RSP-equivalent framework.
  • Qwen 2.5 / 3 open-weights releases competitive with peer-tier closed models.
  • Operating under PRC content / algorithmic regulation.
  • Strong code and multilingual variants; less emphasis on safety scaffolding in English-language docs.
Paris-based, founded 2023. Open and commercial models; European AI champion positioning.
NEAR-FRONTIER
Mid-sized models competitive in efficiency tier; below absolute frontier on largest closed models.
LOW
Minimal public safety research output; no dedicated safety org publicly disclosed.
LOW
Open-weights strategy; European-champion competitive framing.
MEDIUM
Model cards exist; safety detail thinner.
MEDIUM
Active in EU AI Act lobbying — primarily to reduce obligations on open-source frontier models.
UNCLEAR
Limited public statements on existential safety; not a central messaging focus.
UNCLEAR
Too small / too recent for clear pattern.
NONE
No published RSP.
  • Active lobbying against parts of EU AI Act applying to GPAI models.
  • Open-weights releases (Mixtral, Pixtral, Codestral).
  • Microsoft partnership 2024 for Azure distribution.
Toronto-based, founded 2019 by ex-Google Brain (Aidan Gomez). Enterprise / RAG focus.
NEAR-FRONTIER
Command R / R+ competitive in enterprise / RAG tier; below frontier on hardest reasoning. Strategy is enterprise-focused, not frontier-chasing.
MEDIUM
Cohere For AI research lab; published responsible-AI / multilingual safety work. Smaller commercial scale than frontier-3.
MEDIUM
Enterprise positioning de-emphasizes the frontier race; deliberate focus on smaller, deployable models.
MEDIUM
Model and use-policy docs available; some safety research published; not full weight releases.
HIGH
Active in Canadian and international AI governance discussion; Aidan Gomez public commentary on policy.
MEDIUM
Some public safety framing alongside commercial focus.
UNCLEAR
Limited public signal.
LOW
Acceptable Use and responsible-use docs; no published RSP-style capability-threshold framework.
  • Cohere For AI publishes open multilingual safety research.
  • Enterprise / RAG positioning rather than chat-product frontier race.
  • Aidan Gomez active in policy commentary; co-founded the lab as a frontier alternative focused on deployability.
AI21 Labs
Tel Aviv-based, founded 2017. Jurassic, Jamba (SSM hybrid); enterprise focus.
SPECIALIZED
Jamba (SSM-Transformer hybrid) is technically distinctive but not at general-purpose frontier; enterprise-specialized.
LOW
Minimal public safety apparatus.
MEDIUM
Enterprise-focused; less frontier-race framing than the closed-weights frontier-3.
MEDIUM
Jamba weights released; technical reports detailed. Safety-eval disclosure limited.
UNCLEAR
Limited Western-frontier policy visibility.
UNCLEAR
Limited public statements on existential safety.
UNCLEAR
Limited public information.
NONE
No published RSP-equivalent framework.
  • Jamba 1.5 released 2024 — first production-quality SSM hybrid.
  • Enterprise focus rather than frontier chat product.
  • Limited public engagement with frontier-safety governance frameworks.
Founded 2022 by Mustafa Suleyman, Karén Simonyan, Reid Hoffman. Most of team absorbed into Microsoft AI March 2024.
DEFUNCT
Frontier ambitions abandoned March 2024; remaining entity pivoted to enterprise. Original Pi-model team is now Microsoft AI under Suleyman.
UNCLEAR
Not applicable to current entity; pre-absorption safety apparatus was modest.
UNCLEAR
Pre-absorption was a fast scaler; current entity is no longer frontier.
LOW
Minimal public technical detail in either form.
UNCLEAR
Suleyman has carried his policy engagement into Microsoft AI.
UNCLEAR
Not applicable to current entity.
UNCLEAR
Not applicable post-absorption.
NONE
No public RSP-equivalent framework, before or after absorption.
  • March 2024: Suleyman + most technical team move to Microsoft AI.
  • Inflection entity remains, pivoted to B2B enterprise inference.
  • Frequently cited as the canonical 'frontier lab gets absorbed by hyperscaler' case.
12 labs across 5 categories. Ratings updated to early 2026; check references for current state. See linked organization pages for FactBase facts (funding, headcount, leadership history).

What this table is — and isn't

It's a cross-lab comparison overlay on top of the FactBase / TableBase organization data this wiki already maintains. Hard numbers (funding, valuations, headcount) live in the per-org pages at /organizations. This table is the layer above that — the editorial judgment about safety practice, anchored where possible in observable artifacts.

It is not:

  • A precise scoring system. The HIGH / MEDIUM / LOW levels are buckets, not measurements.
  • A prediction. We rate revealed behavior to date, not where each lab is heading.
  • A safety endorsement for HIGH-rated labs. A lab can score well on every column here and still be a net contributor to risk (because the underlying activity is high-risk, or because revealed safety practice can degrade fast).
  • Definitive. Other lab-scorecard efforts (FLI, METR, AI Lab Watch) use different rubrics and reach different conclusions on the same labs. We cite ours, not theirs.

Why these eight dimensions

The dimensions are chosen to track what a third-party observer can verify from public artifacts, on the theory that the gap between stated safety commitment and revealed safety behavior is more useful than either alone:

  • Capability tier — descriptive, not evaluative. Distinguishes labs whose safety choices actually matter at the global frontier from those whose choices matter mostly to their customers.
  • Safety investment — dedicated alignment / interpretability / red-team headcount and research output, relative to org size. Absolute headcount favors hyperscalers; ratio is more meaningful.
  • Race posture restraint — public framing plus revealed behavior on competitive deployment. We mark HIGH when restraint is both stated and demonstrated; LOW when racing is explicit or when revealed cadence contradicts safety framing.
  • Transparency — RSPs, system cards, training detail, AISI pre-deployment access. Weighted toward operationally specific artifacts (capability thresholds, tripwires) over PR commitments.
  • Governance engagement — constructive engagement with regulators, AISIs, summits. We don't credit a lab for testifying if they also lobbied against the same legislation.
  • Leadership safety alignment — CEO / co-founder public stance plus revealed prioritization (resourcing, who reports to whom, what gets shipped when leadership disagrees).
  • Employee safety culture — internal safety advocacy, public alignment-research output, attrition patterns. High-profile safety-driven departures are weighed heavily because they're a costly signal.
  • RSP / framework quality — operational specificity. We treat a vague public-relations document as worse than no policy, because the former forecloses scrutiny.

What HIGH means

HIGH is safety-positive, uniformly. HIGH safety investment means more investment; HIGH transparency means more disclosure; HIGH race restraint means more restraint; HIGH leadership alignment means stronger alignment. This lets you scan the table left-to-right and read green/amber/red as a rough safety signal — at the cost of forcing some unidimensional judgments onto multidimensional realities. The per-cell notes try to recover the nuance.

We use UNCLEAR rather than guessing when public signal is thin (especially for Chinese-jurisdiction labs and very small / very new labs). Treat UNCLEAR rows as "we don't know," not as "probably bad."

Cross-references

Methodology — how each rating is anchored

Every rating in the table has a one- or two-sentence note linking it to an observable signal. Where multiple signals exist, the "Notable signals" column carries the longer evidence list and the "References" column links sources.

When evidence conflicts (e.g., a public safety statement contradicted by revealed deployment cadence), we weight revealed behavior over stated commitments, on the theory that the costly-signal side carries more information. This is why labs that publish strong-sounding safety frameworks can still receive a MEDIUM or LOW rating on race-posture restraint — the frameworks are real, but so is the deployment cadence.

For attribution-light claims (e.g., "small explicit safety team"), the absence of public confirmation is itself informative: frontier labs that invest heavily in safety typically advertise it. We mark these LOW, not UNCLEAR.

Limitations

  • Sample is small — 12 labs, eight dimensions, hand-curated. Per-cell error bars are wide.
  • Western-centric public signal — Chinese-jurisdiction labs operate under a different regulatory and disclosure regime; UNCLEAR ratings there reflect our limited visibility, not necessarily their internal practice.
  • Race-posture restraint is the most contested column — labs and observers disagree about what counts as racing. We use "race-conscious" as MEDIUM and reserve LOW for labs whose public framing explicitly endorses racing.
  • Ratings age fast — quarterly cadence at minimum. Anything older than three months should be re-verified before being cited.

Suggested re-reads, in order of likely turnover: leadership safety alignment > race-posture restraint > RSP quality > safety investment > transparency > everything else.

References

1AI Safety Index Winter 2025Future of Life Institute

The Future of Life Institute evaluated eight major AI companies across 35 safety indicators, finding widespread deficiencies in risk management and existential safety practices. Even top performers Anthropic and OpenAI received only marginal passing grades, highlighting systemic gaps across the industry in preparedness for advanced AI risks.

★★★☆☆

METR analyzes the common structural elements across frontier AI safety policies published by major AI companies, identifying shared frameworks around capability thresholds, model evaluations, weight security, deployment mitigations, and accountability mechanisms. The December 2025 version covers twelve companies including Anthropic, OpenAI, Google DeepMind, Meta, and others, and incorporates references to the EU AI Act's General-Purpose AI Code of Practice and California's Senate Bill 53.

★★★★☆
3AI Lab Watchailabwatch.org

AI Lab Watch is a publicly maintained scorecard tracking what major AI companies are doing to address safety risks, scoring them across weighted categories including risk assessment, scheming risk prevention, safety research, misuse prevention, and security preparedness. Created by Zach Stein-Perlman, it provides comparative transparency on AI lab safety practices. As of September 2025, the site is no longer actively maintained.