Longterm Wiki

AI Model Specifications

Model specifications are explicit documents defining intended AI behavior. As of 2025, all major frontier labs (Anthropic, OpenAI, Google, Meta) have published them. While they improve transparency and enable external scrutiny, they face a fundamental spec-reality gap: a specification does not guarantee that the deployed model implements it, and current mechanisms for verifying compliance are limited.

Related Wiki Pages

Top Related Pages

Approaches

Cooperative AI, Evals-Based Deployment Gates, AI Output Filtering, Preference Optimization Methods

Other

RLHF, AI Control

Organizations

Google DeepMind

Concepts

Alignment Policy Overview