Longterm Wiki
resource

Training a Helpful and Harmless Assistant with Reinforcement ...

Metadata

Source Table: resources
Source ID: b868fe3bc816c0f9
Source URL: anthropic.com/research/training-a-helpful-and-harmless-assistant-with-reinforcement-learning-from-human-feedback
Children
Created: Apr 10, 2026, 7:48 PM
Updated: Apr 10, 2026, 7:48 PM
Synced: Apr 10, 2026, 7:48 PM

Record Data

id: b868fe3bc816c0f9
url: anthropic.com/research/training-a-helpful-and-harmless-assistant-with-reinforcem…
title: Training a Helpful and Harmless Assistant with Reinforcement ...
type: web
summary
review
abstract
keyPoints
publicationId
authors
authorEntityIds
publishedDate
tags: []
localFilename
credibilityOverride
fetchedAt
contentHash
stableIdsid_68522SoGMg
fetchStatus
lastFetchedAt
archiveUrl
stance
contextNote
resourcePurpose
resourceSubtype
typeMetadata
publisherEntityId
relatedEntityIds
enrichmentStatus
enrichmentDate
importanceScore
contentLifecycle