AI Model Specifications

Wiki article →

Model specifications are explicit documents defining AI behavior, now published by all major frontier labs (Anthropic, OpenAI, Google, Meta) as of 2025. While they improve transparency and enable external scrutiny, they face a fundamental spec-reality gap—specifications don't guarantee implementatio

Top Related Pages

Approach

Constitutional AI

Anthropic's Constitutional AI (CAI) methodology uses explicit principles and AI-generated feedback to train safer language models, demonstrating 3-...

Organization

Anthropic

An AI safety company founded by former OpenAI researchers that develops frontier AI models while pursuing safety research, including the Claude mod...

Safety Agenda

AI Evaluations

This page analyzes AI safety evaluations and red-teaming as a risk mitigation strategy. Current evidence shows evals reduce detectable dangerous ca...

Organization

OpenAI

Leading AI lab that developed GPT models and ChatGPT, analyzing organizational evolution from non-profit research to commercial AGI development ami...

Policy

Responsible Scaling Policies

Responsible Scaling Policies (RSPs) are voluntary commitments by AI labs to pause scaling when capability or safety thresholds are crossed. As of D...

AI Model Specifications

Related Pages

Top Related Pages

Constitutional AI

Anthropic

AI Evaluations

OpenAI

Responsible Scaling Policies

Organizations

Approaches

Concepts

Safety Research

Policy

Quick Facts