Longterm Wiki

16 companies committed to publish frontier AI safety protocols

web

This article covers a significant 2024 international AI governance milestone in which leading AI companies made voluntary public commitments to publish frontier model safety protocols, relevant to tracking industry self-regulation efforts.

Metadata

Importance: 62/100 · blog post · news

Summary

At the AI Seoul Summit 2024, 16 major AI companies committed to publishing frontier AI safety protocols, building on the 2023 Bletchley Declaration. The commitments set out how companies should identify and manage severe risks from frontier AI models, including red-teaming, evaluations, and risk thresholds.

Key Points

  • 16 frontier AI companies signed safety commitments at the AI Seoul Summit in May 2024, pledging to publish frontier model safety frameworks.
  • Commitments include publishing safety policies before or alongside new frontier model releases, covering risk assessment and mitigation.
  • Companies agreed to define thresholds at which identified risks would be deemed too severe to deploy or continue developing a model.
  • The commitments build on the 2023 Bletchley Declaration and represent a step toward voluntary industry self-governance on AI safety.
  • Signatories include major labs such as OpenAI, Google DeepMind, Anthropic, Meta, and Microsoft.

Cited by 1 page

Page | Type | Quality
AI Lab Safety Culture | Approach | 62.0

Cached Content Preview

HTTP 200 · Fetched Apr 9, 2026 · 6 KB
AI Policy Corner: Frontier AI Safety Commitments, AI Seoul Summit 2024 | Montreal AI Ethics Institute

 ✍️ By Alexander Wilhelm. 

 Alexander is a PhD Student in Political Science and a Graduate Affiliate at the Governance and Responsible AI Lab (GRAIL), Purdue University.

 📌 Editor’s Note: This article is part of our AI Policy Corner series, a collaboration between the Montreal AI Ethics Institute (MAIEI) and the Governance and Responsible AI Lab (GRAIL) at Purdue University. The series provides concise insights into critical AI policy developments from the local to international levels, helping our readers stay informed about the evolving landscape of AI governance.

 Frontier AI Safety Commitments, AI Seoul Summit 2024

 Discussions between governments, civil society, and companies on the ‘safe’ development of AI have advanced through collaborations such as the AI Safety Summit 2023, held in the UK, and the AI Seoul Summit 2024. Led by the United Kingdom and the Republic of Korea, the Seoul Summit resulted in a framework of commitments, known as the Frontier AI Safety Commitments, which 20 organizations, including Anthropic, Microsoft, NVIDIA, and OpenAI, have agreed to. These commitments required signatories to publish “a safety framework focused on severe risks” by the AI Action Summit in France in February 2025 (see The AI Ethics Brief #158 for more on the Paris AI Action Summit). However, rhetoric at the Paris Summit emphasized the benefits of AI rather than its potential harms and risks, raising questions about the future of the three goals outlined in the Frontier AI Safety Commitments.

 Three outcomes of the Frontier AI Safety Commitments

 Outcome 1: Organisations effectively identify, assess and manage risks when developing and deploying their frontier AI models and systems. 

 
 Signatories to the Commitments agree to identify risks relevant to their frontier models, including risks detected by external entities and governments. Frontier models are defined within the Commitments as “highly capable general-purpose AI models or systems that can perform a wide variety of tasks and match or exceed the capabilities present in the most advanced models.” Multiple stakeholders are expected to collaborate in identifying unacceptable levels of risk in frontier models and to justify those thresholds once they are set. Risk mitigations should then be planned to keep risks within the acceptable levels, with a commitment not to develop or deploy models whose risks cannot be brought below the thresholds.

 

 Outcome 2: Organisations are accountable for safely developing and deploying their frontier AI models and systems. 

 
 Groups that voluntarily pledge to join the Frontier AI Safety Commitments must update their p

... (truncated, 6 KB total)
Resource ID: c7bf226bdc483bf6 | Stable ID: sid_q6CxYn8UQs