Back
RE-Bench: Evaluating frontier AI R&D capabilities
webCredibility Rating
4/5
High(4)High quality. Established institution or organization with editorial oversight and accountability.
Rating inherited from publication venue: METR
Data Status
Not fetched
Cited by 3 pages
| Page | Type | Quality |
|---|---|---|
| Self-Improvement and Recursive Enhancement | Capability | 69.0 |
| Capability Elicitation | Approach | 91.0 |
| Responsible Scaling Policies | Policy | 62.0 |
Resource ID:
056e0ff33675b825 | Stable ID: OGExZTMwN2