Longterm Wiki

Developing noise-injection methods to reveal and reduce deceptive behaviors in language models prior to deployment

Amount: $40K
Funder: Long-Term Future Fund (LTFF)
Recipient: Adelin Kassler
Date: Jul 2024
Notes: [Long-Term Future Fund] Developing noise-injection methods to reveal and reduce deceptive behaviors in language models prior to deployment
