Mitigating Reward Hacking Through RL Training Interventions

$7.9K

Funder

Recipient

Aria Wong

Program

Manifund Regranting

Date

Feb 2026

Data source

Source

manifund.org↗

Notes

Technical AI safety

Other Grants by Manifund

353

Grant	Recipient	Amount	Date
Train great open-source sparse autoencoders	Tom McGrath	$4K	May 2024
Develop an accessible, low-cost system for single-cell imaging in multiple regions of freely moving organisms	Andrew Luskin	$25K	Feb 2024
Luthien	Jai Dhyani	$170K	Mar 2025
Investigating the Effects of IF in the reversal of Type 2 Diabetes Mellitus.	Ambreen Deol	$8.5K	Mar 2024
Connect For Animals: a platform for ending factory farming	Steven Rouk	$21K	Aug 2024
Split Personality Training	Florian Dietz	$3K	Apr 2025
Holly Elmore organizing people for a frontier AI moratorium	Holly Elmore	$5.3K	Jul 2023
Lead-acid battery recycling in Philippines	Micaella Rogers	$50K	Oct 2025
AI-Plans.com	Kabir Kumar	$5.4K	Jan 2024
Adjacent News	Lucas Kohorst	$1K	Aug 2024

Showing 10 of 353 grants

← Back to Manifund All grants