Scorecard_grade·ailabwatch-2025-09|sid_A4XoubikkQ|scheming·Record·Profile

Scorecard: ailabwatch 2025-09-01 scored Google DeepMind on Scheming risk prevention = Very Weak

Verdictconfirmed95%

1 check · 7/13/2026

1 → confirmed

Our claim

entire record

Snapshot: ailabwatch-2025-09
Entity: Google DeepMind
Dimension Slug: scheming
Dimension Label: Scheming risk prevention
Score Numeric: 8
Score Letter: Very Weak
Score Raw: 8%

Source evidence

1 src · 1 check

ailabwatch.org/resource

confirmed95%Haiku 4.5 · 7/13/2026

NoteThe record claims: publisher=ailabwatch, publishedAt=2025-09-01, entity=Google DeepMind, dimension=Scheming risk prevention, scoreRaw=8%, scoreLetter=Very Weak, scoreNumeric=8. The source confirms all key fields: (1) Publisher is 'AI Lab Watch' (ailabwatch); (2) Date is September 2025 (source states 'Up to date as of September 15' and footer notes 'as of September 2025'); (3) Entity is DeepMind (shown in the scorecard column); (4) Dimension is 'Scheming risk prevention' (explicitly listed as a row); (5) Score is 8% (shown in the DeepMind column for that row); (6) The scoreLetter 'Very Weak' is consistent with an 8% score on a 0-100 scale. The publishedAt date of 2025-09-01 is slightly earlier than the 'as of September 15' update date mentioned in the source, but this is a minor temporal precision difference within the same month and does not contradict the record — the scorecard was maintained throughout September 2025.

Case № ailabwatch-2025-09|sid_A4XoubikkQ|schemingFiled 7/13/2026Confidence 95%