publication
The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
Metadata
| Source Table | publications |
| Source ID | 2G9YlHLXK6 |
| Description | Nathaniel Li, Alexander Pan, Anjali Gopal et al., 2024 |
| Source URL | www.wmdp.ai/ |
| Parent | Center for AI Safety (CAIS) |
| Children | — |
| Created | Mar 23, 2026, 2:46 PM |
| Updated | Mar 23, 2026, 2:46 PM |
| Synced | Mar 23, 2026, 2:46 PM |
Record Data
id | 2G9YlHLXK6 |
entityId | Center for AI Safety (CAIS)(organization) |
entityDisplayName | — |
resourceId | — |
title | The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning |
authors | Nathaniel Li, Alexander Pan, Anjali Gopal et al. |
url | www.wmdp.ai/ |
venue | — |
publishedDate | 2024 |
publicationType | paper |
citationCount | — |
isFlagship | Yes |
abstract | — |
source | www.wmdp.ai/ |
notes | ICML 2024. Biosecurity/cybersecurity knowledge unlearning. |
Source Check Verdicts
confirmed95% confidence
Last checked: 4/3/2026
All key fields in the record are confirmed by the source text. The title matches exactly. The authors Nathaniel Li, Alexander Pan, and Anjali Gopal are confirmed as the first three authors (the 'et al.' appropriately indicates additional authors, which the citation block confirms). The publication year 2024 is confirmed. The URL https://www.wmdp.ai/ is the official website shown in the source. The publication type 'paper' is confirmed by the arXiv reference (2403.03218) and the citation format. No contradictions exist.
Debug info
Thing ID: 2G9YlHLXK6
Source Table: publications
Source ID: 2G9YlHLXK6