Longterm Wiki

Benchmarking and comparing different evaluation awareness metrics

$3K
Funder
Recipient
Hieu Minh Nguyen
Program
Date
Aug 2025
Source
Notes

[Technical AI safety, AI governance] LLMs often know when they are being evaluated. We’ll do a study comparing various methods to measure and monitor this capability.

Other Grants by Manifund

376
Showing 10 of 376 grants
Benchmarking and comparing different evaluation awareness metrics | Grants | Longterm Wiki