HarmBench: A Standardized Evaluation Framework for Automated Red Teaming
The source text is merely a JavaScript error message from the HarmBench website and does not contain any substantive information about the publication itself. It does not confirm or contradict any of the claimed metadata (title, authors, publication date, publication type). While the URL matches the claimed domain, the source text itself provides no verifiable content about the paper's details. To verify this record, actual publication metadata (from arXiv, a conference proceedings, or similar) would be needed.
Our claim
entire record
- Title: HarmBench: A Standardized Evaluation Framework for Automated Red Teaming
- Authors: Mantas Mazeika, Long Phan, Xuwang Yin et al.
- Published Date: 2024
- Publication Type: paper
- Is Flagship: Yes
- Source: https://harmbench.org/
- Notes: ICML 2024
Source evidence
1 src · 1 check