OpenAI
Company Blog
GPT developer, leading AI lab
Credibility Rating: 4/5 (High). High quality: an established institution or organization with editorial oversight and accountability.
Tracked Domains (1)
openai.com
Resources (62)
| Resource | Type | Date | Tier | Cited by |
|---|---|---|---|---|
| OpenAI | web | - | S | 24 |
| OpenAI Preparedness Framework | web | - | S | 19 |
| OpenAI: Model Behavior | paper | - | S | 15 |
| OpenAI Safety Updates | web | - | S | 13 |
| Preparedness Framework | web | - | S | 7 |
| Weak-to-strong generalization | web | - | S | 7 |
| OpenAI Preparedness | web | - | S | 6 |
| Resisting Sycophancy: OpenAI | paper | - | S | 5 |
| announced December 2024 | web | - | S | 5 |
| OpenAI | web | - | S | 4 |
| Superalignment team | web | - | S | 4 |
| OpenAI Superalignment Fast Grants | web | - | S | 4 |
| 2025 OpenAI-Anthropic joint evaluation | web | - | S | 4 |
| SWE-bench Verified - OpenAI | web | - | S | 4 |
| OpenAI's o1 | web | - | S | 3 |
| OpenAI CoT Monitoring | web | - | S | 3 |
| OpenAI | web | - | S | 3 |
| Extracting Concepts from GPT-4 | web | - | S | 3 |
| OpenAI on detection limits | web | - | S | 2 |
| OpenAI | web | - | S | 2 |
| Sora quality | web | - | S | 2 |
| GPT-4 | web | - | S | 2 |
| OpenAI's alignment research | web | - | S | 2 |
| OpenAI Goodhart Measurement | web | - | S | 2 |
| ChatGPT launch | web | - | S | 2 |
(Page 1 of 3; showing 24 of 62 resources.)
Citing Pages (109)
AI Accident Risk Cruxes, Agentic AI, AGI Development, AGI Timeline, AI-Assisted Alignment, AI Alignment, Alignment Evaluations, Apollo Research, Alignment Research Center, Authentication Collapse, Bioweapons Risk, AI Uplift Assessment Model, Center for AI Safety, Capabilities-to-Safety Pipeline Model, Capability Elicitation, AI Capability Threshold Model, Autonomous Coding, AI-Driven Concentration of Power, Constitutional AI, Corporate AI Safety Responses, Corrigibility Failure, Cyberweapons Risk, Dangerous Capability Evaluations, Deceptive Alignment, AI Safety Defense in Depth Model, AI Disinformation, Emergent Capabilities, Epistemic Sycophancy, EU AI Act, Eval Saturation & The Evals Gap, AI Evaluations, Evals-Based Deployment Gates, AI Evaluation, Goal Misgeneralization Probability Model, AI Governance and Policy, Heavy Scaffolding / Agentic Systems, AI-Human Hybrid Systems, Instrumental Convergence, Instrumental Convergence Framework, Is Interpretability Sufficient for Safety?, AI Safety Intervention Effectiveness Matrix, AI Safety Intervention Portfolio, AI Knowledge Monopoly, AI Lab Safety Culture, Large Language Models, AI Value Lock-in, Long-Horizon Autonomous Tasks, Mesa-Optimization, Mesa-Optimization Risk Analysis, Metaculus, Minimal Scaffolding, AI Misuse Risk Cruxes, Third-Party Model Auditing, Multipolar Trap Dynamics Model, OpenAI, Optimistic Alignment Worldview, Paul Christiano, Should We Pause AI Development?, Persuasion and Social Manipulation, Process Supervision, AI Proliferation, AI Proliferation Risk Model, AI Development Racing Dynamics, Racing Dynamics Impact Model, Reasoning and Planning, Red Teaming, AI Alignment Research Agendas, Responsible Scaling Policies (RSPs), Reward Hacking, Reward Hacking Taxonomy and Severity Model, AI Risk Activation Timeline Model, AI Risk Cascade Pathways Model, AI Risk Interaction Network Model, RLHF, Responsible Scaling Policies, AI Safety Cases, AI Safety Research Allocation Model, AI Safety Research Value Model, AI Safety Researcher Gap Model, Sam Altman, AI Capability Sandbagging, Sandboxing / Containment, Scalable Eval Approaches, Scalable Oversight, Is Scaling All You Need?, AI Scaling Laws, Scheming, Scheming & Deception Detection, Scheming Likelihood Assessment, Self-Improvement and Recursive Enhancement, Sharp Left Turn, Situational Awareness, Sleeper Agent Detection, AI Safety Solution Cruxes, Sparse Autoencoders (SAEs), AI Model Steganography, Sycophancy, AI Safety Technical Pathway Decomposition, Technical AI Safety Research, Compute Thresholds, Tool Use and Computer Use, Treacherous Turn, Voluntary AI Safety Commitments, AI Risk Warning Signs Model, Weak-to-Strong Generalization, Why Alignment Might Be Hard, World Models + Planning, Worldview-Intervention Mapping
Publication ID: openai