Longterm Wiki

6-month salary to interpret neurons in language models & build tools to accelerate this process. The aim is to understand all features and circuits in a model and use this understanding to predict out of distribution performance in high-stake situations.

$40K
Funder
Recipient
Logan Smith
Program
Date
Jan 2023
Source
Notes

[Long-Term Future Fund] 6-month salary to interpret neurons in language models & build tools to accelerate this process. The aim is to understand all features and circuits in a model and use this understanding to predict out of distribution performance in high-stake situations.

Other Grants by Long-Term Future Fund (LTFF)

544
Showing 10 of 544 grants
6-month salary to interpret neurons in language models & build tools to accelerate this process. The aim is to understand all features and circuits in a model and use this understanding to predict out of distribution performance in high-stake situations. | Grants | Longterm Wiki