Adversarial Robustness

Concept: Testing and improving AI systems' resilience to adversarial inputs and attacks.
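One standard way to probe this resilience is the Fast Gradient Sign Method (FGSM), which nudges an input in the direction that most increases the model's loss. The sketch below is purely illustrative: the linear classifier, its weights, and the input values are all assumptions chosen for a self-contained example, not any real system's parameters.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def predict(w, b, x):
    """Probability a linear classifier assigns to label 1."""
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)

def fgsm_perturb(x, w, b, y, eps):
    """Shift x by eps in the sign of the loss gradient w.r.t. the input.

    For binary cross-entropy on a linear model, d(loss)/dx = (p - y) * w.
    """
    p = predict(w, b, x)
    grad_x = [(p - y) * wi for wi in w]
    return [xi + eps * (1.0 if g > 0 else -1.0) for xi, g in zip(x, grad_x)]

# Toy classifier and a clean input with true label y = 1 (assumed values).
w, b = [2.0, -1.0], 0.0
x, y = [1.0, 0.5], 1.0

x_adv = fgsm_perturb(x, w, b, y, eps=0.25)
p_clean = predict(w, b, x)    # confidence on the clean input
p_adv = predict(w, b, x_adv)  # confidence after the adversarial nudge
```

Under this toy setup the perturbed input lowers the model's confidence in the true label (`p_adv < p_clean`); adversarial robustness testing measures how small a perturbation suffices, and robustness training aims to keep that gap small.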
Models

- Alignment Robustness Trajectory Model (Analysis): This model estimates alignment robustness degrades from 50-65% at GPT-4 level to 15-30% at 100x capability, with a critical 'alignment valley' at 10-30x where systems are dangerous but can't help s...
- Safety-Capability Tradeoff Model (Analysis): Analyzes when AI safety measures conflict with capabilities, finding most interventions impose 5-15% capability cost but RLHF actually improves usability +10-30%. Under strong racing dynamics (60-7...
- AI Acceleration Tradeoff Model (Analysis): Quantitative framework for evaluating how changes to AI development speed affect existential risk and long-term value. Models the marginal impact of acceleration/deceleration on P(existential catas...
Key Debates
- Technical AI Safety Research (Crux): Technical AI safety research encompasses six major agendas (mechanistic interpretability, scalable oversight, AI control, evaluations, agent foundations, and robustness) with 500+ researchers and ...
Organizations
- RAND Corporation: Nonprofit global policy think tank. Active in AI policy, security studies, and technology assessment.
- Center for a New American Security: Bipartisan national security and defense policy think tank.
Concepts
- Capability Evaluations: Systematic assessment of AI systems' abilities, especially dangerous capabilities like deception, manipulation, or autonomous operation.
- AI Content Moderation: Filtering and managing AI-generated or AI-mediated content.