Back
*Large reasoning models are autonomous jailbreak agents*, Nature Communications 2026 (https://pmc.ncbi.nlm.nih.gov/ar...
paperpmc.ncbi.nlm.nih.gov·pmc.ncbi.nlm.nih.gov/articles/PMC12881495
Data Status
Not fetched
Cited by 1 page
| Page | Type | Quality |
|---|---|---|
| Alignment Robustness Trajectory Model | Analysis | 64.0 |
Cached Content Preview
HTTP 200Fetched Feb 23, 202649 KB
Large reasoning models are autonomous jailbreak agents - PMC
Skip to main content
Official websites use .gov
A
.gov website belongs to an official
government organization in the United States.
Secure .gov websites use HTTPS
A lock (
Lock
Locked padlock icon
) or https:// means you've safely
connected to the .gov website. Share sensitive
information only on official, secure websites.
Search PMC Full-Text Archive
Search in PMC
Journal List
User Guide
PERMALINK
Copy
As a library, NLM provides access to scientific literature. Inclusion in an NLM database does not imply endorsement of, or agreement with,
the contents by NLM or the National Institutes of Health.
Learn more:
PMC Disclaimer
|
PMC Copyright Notice
Nat Commun . 2026 Feb 5;17:1435. doi: 10.1038/s41467-026-69010-1
Large reasoning models are autonomous jailbreak agents
Thilo Hagendorff
Thilo Hagendorff
1 University of Stuttgart, Stuttgart, Germany
Find articles by Thilo Hagendorff
1, ✉ , Erik Derner
Erik Derner
2 ELLIS Alicante, Alicante, Spain
Find articles by Erik Derner
2 , Nuria Oliver
Nuria Oliver
2 ELLIS Alicante, Alicante, Spain
Find articles by Nuria Oliver
2
Author information
Article notes
Copyright and License information
1 University of Stuttgart, Stuttgart, Germany
2 ELLIS Alicante, Alicante, Spain
✉ Corresponding author.
Received 2025 Sep 11; Accepted 2026 Jan 22; Collection date 2026.
© The Author(s) 2026
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ .
PMC Copyright notice
PMCID: PMC12881495 PMID: 41644948
Abstract
Jailbreaking – bypassing built-in safety mechanisms i
... (truncated, 49 KB total)Resource ID:
0c9d169cb70f131d | Stable ID: NjljM2IxNG