Skip to content
Longterm Wiki
publication

The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning

Metadata

Source Tablepublications
Source ID2G9YlHLXK6
DescriptionNathaniel Li, Alexander Pan, Anjali Gopal et al., 2024
Source URLwww.wmdp.ai/
ParentCenter for AI Safety (CAIS)
Children
CreatedMar 23, 2026, 2:46 PM
UpdatedMar 23, 2026, 2:46 PM
SyncedMar 23, 2026, 2:46 PM

Record Data

id2G9YlHLXK6
entityIdCenter for AI Safety (CAIS)(organization)
entityDisplayName
resourceId
titleThe WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning
authorsNathaniel Li, Alexander Pan, Anjali Gopal et al.
urlwww.wmdp.ai/
venue
publishedDate2024
publicationTypepaper
citationCount
isFlagshipYes
abstract
sourcewww.wmdp.ai/
notesICML 2024. Biosecurity/cybersecurity knowledge unlearning.

Source Check Verdicts

confirmed95% confidence

Last checked: 4/3/2026

All key fields in the record are confirmed by the source text. The title matches exactly. The authors Nathaniel Li, Alexander Pan, and Anjali Gopal are confirmed as the first three authors (the 'et al.' appropriately indicates additional authors, which the citation block confirms). The publication year 2024 is confirmed. The URL https://www.wmdp.ai/ is the official website shown in the source. The publication type 'paper' is confirmed by the arXiv reference (2403.03218) and the citation format. No contradictions exist.

Debug info

Thing ID: 2G9YlHLXK6

Source Table: publications

Source ID: 2G9YlHLXK6