Evals-Based Deployment Gates
Evals-based deployment gates require AI models to pass safety evaluations before deployment or capability scaling. The EU AI Act mandates conformity assessments for high-risk systems, with fines of up to EUR 35 million or 7% of global turnover, while the UK AI Safety Institute (AISI) has evaluated more than 30 frontier models.
Related Legislation
| Name | Status |
|---|---|
| EU AI Act | in-effect |
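The gating mechanism described above can be sketched as a simple pre-deployment check. This is a minimal illustration only: the categories, scores, and thresholds below are hypothetical assumptions, not criteria drawn from the EU AI Act, UK AISI, or any lab's actual policy.

```python
from dataclasses import dataclass

@dataclass
class EvalResult:
    """One safety evaluation outcome (all values illustrative)."""
    category: str      # e.g. "autonomous-replication", "cyber-offense"
    score: float       # measured risk score, 0.0 to 1.0
    threshold: float   # maximum score permitted for deployment

def deployment_gate(results: list[EvalResult]) -> tuple[bool, list[str]]:
    """Return (approved, failures): deployment is blocked if any
    evaluation's score exceeds its risk threshold."""
    failures = [r.category for r in results if r.score > r.threshold]
    return (len(failures) == 0, failures)

# Hypothetical pre-deployment run
results = [
    EvalResult("autonomous-replication", 0.12, 0.30),
    EvalResult("cyber-offense", 0.45, 0.30),
]
approved, failures = deployment_gate(results)
print(approved, failures)  # False ['cyber-offense']
```

In practice, a gate like this would sit in a release pipeline and block the deployment step (or a capability-scaling decision) until every evaluation passes its threshold.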
Related Pages
METR
Model Evaluation and Threat Research (METR) conducts dangerous-capability evaluations of frontier AI models, testing for risks such as autonomous replication and offensive cyber capabilities.
Anthropic
An AI safety company founded by former OpenAI researchers that develops frontier AI models while pursuing safety research, including the Claude model family.
EU AI Act
The world's first comprehensive AI regulation, adopting a risk-based approach to regulating foundation models and general-purpose AI systems.
Responsible Scaling Policies
Industry self-regulation frameworks establishing capability thresholds that trigger safety evaluations; Anthropic's ASL-3 tier, for example, is tied to bioweapon-related capability thresholds.
AI Evaluation
Methods and frameworks for evaluating AI system safety, capabilities, and alignment properties before deployment, including dangerous-capability detection.
Quick Facts
- Introduced: 2023
- Status: Active; EU binding, UK/lab commitments voluntary
- Scope: International