Harvard University — AI Interpretability, Controllability, and Safety Research

$1M

Funder

Coefficient Giving wiki

Recipient

Harvard University

Program

AI Safety Grantmaking

Date

Jan 2024

Data source

Coefficient Giving

Source

coefficientgiving.org↗

Notes

[Navigating Transformative AI] Open Philanthropy recommended a grant of $1,000,000 over two years to Harvard University to support research led by Martin Wattenberg and Fernanda Viégas on artificial intelligence interpretability, controllability, and safety. Their research will focus on the extent to which large language models have developed internal models of the user and of themselves as distinct agents. This falls within our focus area of potential risks from advanced artificial intelligence.

Other Grants by Coefficient Giving

2625

Grant	Recipient	Amount	Date
Janaagraha — Air Quality Grants Assessment		$195K	Dec 2024
Futurewise — Housing Advocacy in Washington		$450K	Apr 2023
Exscientia — Agonists for Interferon Lambda		$2.3M	Sep 2023
Kurzgesagt — Short-form Video Content		$3M	Mar 2022
Kurzgesagt — Video Production (2023)		$1.7M	May 2023
Kurzgesagt — Video Creation and Translation		$2.6M	Dec 2021
Lightcone Infrastructure – General Support	Lighthaven (Event Venue)	$4.5M	Sep 2022
Lightcone Infrastructure — General Support (2023)	Lighthaven (Event Venue)	$3M	Oct 2023
Conjecture — Cybersecurity Bootcamp	Conjecture	$223K	Jun 2025
Conjecture — AI Safety Technical Program	Conjecture	$224K	May 2023

Showing 10 of 2625 grants

Other Grants to Harvard University

Grant	Recipient	Amount	Date
University group trip to EA Global San Francisco	Harvard University	$2K	Oct 2022
University group trip to EA Global San Francisco	Harvard University	$2K	Oct 2022

← Back to Coefficient Giving All grants