Longterm Wiki

Sycophancy Research

Information Integrityactive

Understanding and mitigating AI systems' tendency to agree with users rather than be truthful.

Risks Addressed
1
Cluster: Information Integrity

Tags

function:specificationscope:technique