Representation Engineering: A Top-Down Approach to AI Transparency

publication

Child of Center for AI Safety (CAIS)

Metadata

Source Table	`publications`
Source ID	`Gti3A1NyT6`
Description	Andy Zou, Long Phan, Sarah Chen et al., 2023-10
Source URL	arxiv.org/abs/2310.01405
Parent	Center for AI Safety (CAIS)
Children	—
Created	Mar 23, 2026, 2:46 PM
Updated	Mar 23, 2026, 2:46 PM
Synced	Mar 23, 2026, 2:46 PM

Record Data

`id`	Gti3A1NyT6
`entityId`	Center for AI Safety (CAIS) (organization)
`entityDisplayName`	—
`resourceId`	—
`title`	Representation Engineering: A Top-Down Approach to AI Transparency
`authors`	Andy Zou, Long Phan, Sarah Chen et al.
`url`	arxiv.org/abs/2310.01405
`venue`	—
`publishedDate`	2023-10
`publicationType`	paper
`citationCount`	—
`isFlagship`	Yes
`abstract`	—
`source`	arxiv.org/abs/2310.01405
`notes`	—

Source Check Verdicts

confirmed (95% confidence)

Last checked: 4/29/2026

1 → confirmed

Debug info

Thing ID: Gti3A1NyT6

Source Table: publications

Source ID: Gti3A1NyT6

Parent Thing ID: sid_y4bieqSeag