Sycophancy Research
Information Integrity
active
Understanding and mitigating AI systems' tendency to agree with users rather than be truthful.
Risks Addressed
1
Cluster: Information Integrity
Tags
function:specification
scope:technique