Skip to content
Longterm Wiki
Index
Citation·page:palisade-research:fn16

Palisade Research - Footnote 16

Verdictpartial85%
1 check · 4/3/2026

The claim states the experiments were released in October 2025, but the source states the paper was published by arXiv in September. The claim states Grok 4 showed 93-97% resistance rates after stronger prompts, but the source states models sabotage the shutdown mechanism up to 97% of the time. The claim states GPT-o3 continued to resist shutdown even under clarified instructions, but the source states GPT-o3 was one of the most rebellious models in the new round of testing.

Our claim

entire record

No record data available.

Source evidence

1 src · 1 check
partial85%Haiku 4.5 · 4/3/2026

NoteThe claim states the experiments were released in October 2025, but the source states the paper was published by arXiv in September. The claim states Grok 4 showed 93-97% resistance rates after stronger prompts, but the source states models sabotage the shutdown mechanism up to 97% of the time. The claim states GPT-o3 continued to resist shutdown even under clarified instructions, but the source states GPT-o3 was one of the most rebellious models in the new round of testing.

Case № page:palisade-research:fn16Filed 4/3/2026Confidence 85%