Longterm Wiki
CO

Chris Olah

Also known as: Christopher Olah

Pioneer of neural network interpretability and visualization; co-founder of Anthropic; creator of Distill.pub and the Circuits thread at Transformer Circuits

Current Role
Co-founder, Interpretability
Organization
Anthropic

Expert Positions2 topics

TopicViewEstimateConfidenceDate
How Hard Is Alignment?Tractable via interpretabilitySolvable with sufficient transparency toolsmedium2021
Will Advanced AI Be Deceptive?Detectable with interpretabilityInterpretability provides a "mulligan" to catch deceptionmediumDec 2021

Career History3

Co-founder; Research Lead, Mechanistic InterpretabilityFounderCurrent
Jan 2021 – present
Research Scientist
2018 – Jan 2021
Research Scientist
Google Brain
2015 – 2018

Education

Attended University of Toronto (did not complete degree); Thiel Fellow

Publications & Resources4

Concrete Problems in AI Safety
2016PaperTechnical Safety
Understanding LSTM Networks
2015Blog PostInterpretability

Links

Organization Roles1

AnthropicFounderCurrent
Research Lead, Mechanistic Interpretability
Jan 2021 – present

Facts8

People
Employed ByAnthropic
Role / TitleCo-founder, Interpretability
Biographical
EducationAttended University of Toronto (did not complete degree); Thiel Fellow
Notable ForPioneer of neural network interpretability and visualization; co-founder of Anthropic; creator of Distill.pub and the Circuits thread at Transformer Circuits
Social Media@ch402
GitHubhttps://github.com/colah
Google Scholarhttps://scholar.google.com/citations?user=vKAKE1gAAAAJ
General
Websitehttps://colah.github.io
View all facts in KB explorer →