Philip Tetlock (Forecasting Pioneer)
Philip Tetlock is a psychologist who revolutionized forecasting research by demonstrating that expert predictions often perform no better than chance, while identifying systematic methods and 'superforecasters' who achieve superior accuracy. His work has significant implications for AI safety and existential risk assessment, though it faces challenges when applied to long-term, low-probability events with limited feedback loops.
Quick Assessment
| Dimension | Assessment |
|---|---|
| Primary Achievement | Pioneered forecasting tournaments demonstrating that systematic methods outperform expert intuition; identified "superforecasters" with superior accuracy |
| Key Publications | Expert Political Judgment (2005), Superforecasting (2015) |
| Institutional Affiliation | Leonore Annenberg University Professor at University of Pennsylvania (Wharton and Psychology) |
| Major Projects | Good Judgment Project (IARPA tournament winner, 2011-2015), Forecasting Research Institute |
| Influence on AI Safety | Methods applied to existential risk assessment; adversarial collaboration on AI forecasting; EA community adoption of forecasting practices |
| Key Finding | Most expert predictions perform no better than chance; "fox-like" integrative thinkers outperform "hedgehog" theorists |
Key Links
| Source | Link |
|---|---|
| Wikiquote | en.wikiquote.org |
| Wikipedia | en.wikipedia.org |
Overview
Philip E. Tetlock is a Canadian-born psychologist who revolutionized the study of forecasting accuracy through decades of research demonstrating that expert predictions on political and economic events are often no better than random chance, while identifying systematic methods to achieve superior forecasting performance[1][2]. As the Leonore Annenberg University Professor at the University of Pennsylvania with cross-appointments at the Wharton School and School of Arts and Sciences, Tetlock has authored over 200 peer-reviewed articles and nine books examining judgment, decision-making, and prediction accuracy[3][4].
Tetlock's most influential work emerged from forecasting tournaments he initiated during the Cold War through the National Academy of Sciences Committee for the Prevention of Nuclear War, analyzing over 82,000 predictions from 284 experts between 1984 and 2003[5][6]. This research culminated in his landmark 2005 book Expert Political Judgment, which documented that experts with access to classified information performed no better than Berkeley undergraduates or "dart-throwing chimpanzees" on long-range forecasts[7][8]. However, Tetlock also identified a minority of superior forecasters—"foxes" who integrate diverse perspectives rather than "hedgehogs" who apply single theories—leading to his co-founding of the Good Judgment Project with Barbara Mellers and Don Moore[9].
The Good Judgment Project won a four-year IARPA-sponsored forecasting tournament (2011-2015) involving thousands of forecasters making over one million predictions on geopolitical events[10][11]. The project identified "superforecasters"—ordinary citizens whose accuracy exceeded that of intelligence analysts with access to classified information by 60-85%[12][13]. This work established systematic methods for improving prediction accuracy, including training protocols, team dynamics, and aggregation algorithms that have influenced intelligence agencies, forecasting platforms like Metaculus, and the effective altruism community's approach to decision-making under uncertainty[14][15].
History and Academic Career
Education and Early Career
Tetlock was born in Toronto, Canada, and grew up in Winnipeg and Vancouver[16]. He received his B.A. in psychology from the University of British Columbia in 1975, followed by an M.A. in 1976, working with Peter Suedfeld on content analysis of diplomatic communications[17][18]. He completed his Ph.D. in psychology at Yale University in 1979 under the supervision of Phoebe C. Ellsworth[19].
From 1979 to 1995, Tetlock served on the psychology faculty at the University of California, Berkeley, directing the Institute of Personality and Social Research from 1988 to 1995[20]. He then held the Harold E. Burtt Endowed Chair in Psychology and Political Science at Ohio State University (1996-2001) before returning to Berkeley as the Mitchell Endowed Chair at the Haas School of Business (2001-2010)[21][22]. In December 2010, he was appointed Leonore Annenberg University Professor of Democracy and Citizenship at the University of Pennsylvania, becoming a Penn Integrates Knowledge (PIK) Professor with joint appointments in Psychology, Management, and the Annenberg School for Communication[23][24].
Origins of Forecasting Research
Tetlock's forecasting research originated from his work on the National Academy of Sciences Committee for the Prevention of Nuclear War in the early 1980s, during Cold War tensions[25]. He became concerned that public debate on nuclear policy relied heavily on vague, unverifiable predictions that could not be systematically evaluated[26]. This led him to create his first forecasting tournament to test expert predictions scientifically[27].
Between 1984 and 2003, Tetlock conducted small-scale forecasting tournaments with 284 experts—including government officials, professors, and journalists, spanning ideologies from Marxists to free-market advocates—on geopolitical outcomes[28][29]. These experts made predictions about events such as the Soviet Union's collapse, the future of apartheid in South Africa, and Middle East peace prospects. The results formed the empirical basis for his 2005 book Expert Political Judgment: How Good Is It? How Can We Know?, published by Princeton University Press[30].
Good Judgment Project
The publication of Expert Political Judgment directly influenced U.S. intelligence agencies to create a four-year geopolitical forecasting tournament sponsored by IARPA (Intelligence Advanced Research Projects Activity)[31]. From 2011 to 2015, Tetlock co-led the winning team—the Good Judgment Project—with his spouse Barbara Mellers and UC Berkeley colleague Don Moore[32][33]. The multidisciplinary team included experts in statistics, computer science, economics, psychology, and political science[34].
The project involved thousands of forecasters making over one million predictions on geopolitical questions[35]. It identified "superforecasters"—high-performing individuals who consistently outperformed both average forecasters and professional intelligence analysts with access to classified information[36]. According to analysis of the project's results, superforecasters were approximately 60-85% more accurate than average forecasters and could distinguish 10-15 degrees of uncertainty while maintaining calibration across hundreds of events[37][38].
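Accuracy comparisons like these are typically scored with proper scoring rules, most prominently the Brier score: the mean squared error between probabilistic forecasts and binary outcomes. A minimal sketch (the numbers below are illustrative, not Good Judgment Project data):

```python
# Brier score for binary-outcome forecasts: lower is better.
# A perfectly sharp, always-correct forecaster scores 0.0;
# unconditionally answering 50% on every question scores 0.25.

def brier_score(forecasts, outcomes):
    """forecasts: probabilities in [0, 1]; outcomes: 1 if the event occurred, else 0."""
    return sum((f - o) ** 2 for f, o in zip(forecasts, outcomes)) / len(forecasts)

sharp = brier_score([0.9, 0.8, 0.1], [1, 1, 0])    # confident and right: 0.02
hedged = brier_score([0.5, 0.5, 0.5], [1, 1, 0])   # always 50%: 0.25
```

The "granular probabilities" finding corresponds to the fact that a forecaster who can reliably say 0.8 rather than 0.5 when the event happens 80% of the time is rewarded by this rule.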
The Good Judgment Project's success led to the founding of Good Judgment Inc., a consultancy co-founded by Tetlock that offers bespoke forecasting services, workshops for private clients, and the Good Judgment Open platform for crowd-based forecasts[39][40]. The project's methods have been adapted for use by U.S. intelligence agencies and inspired forecasting platforms including Metaculus and INFER-Public[41].
Research Contributions
The Fox-Hedgehog Distinction
One of Tetlock's most influential conceptual contributions is the distinction between "fox-like" and "hedgehog-like" thinkers, inspired by Isaiah Berlin's essay "The Hedgehog and the Fox"[42]. Hedgehogs organize their thinking around a single grand theory or ideology and make bold, confident predictions. Foxes, by contrast, are modest, self-critical thinkers who draw on diverse perspectives and remain skeptical of grand theories[43].
Tetlock's research demonstrated that fox-like forecasters consistently outperformed hedgehog forecasters, particularly on long-range forecasts[44]. Foxes showed greater willingness to update their beliefs in response to evidence and were more accurate across a wider range of prediction domains[45]. However, early critiques noted that while foxes outperformed hedgehogs, they only modestly exceeded simple benchmarks like extrapolation algorithms, rather than achieving substantial superiority over baseline models[46].
Superforecasting Methodology
The Good Judgment Project identified specific attributes and practices associated with superior forecasting performance. Superforecasters typically exhibit:
- Probabilistic thinking: Ability to think in granular probabilities rather than binary yes/no predictions
- Active open-mindedness: Willingness to consider alternative hypotheses and update beliefs based on evidence
- Intellectual humility: Recognition of uncertainty and limits of their knowledge
- Pattern recognition: Skill at identifying relevant historical analogies
- Team collaboration: Ability to productively combine perspectives with other forecasters
- Regular practice: Consistent engagement with forecasting questions to refine judgment[47][48]
Tetlock's research demonstrated that forecasting accuracy could be improved through training programs focusing on these cognitive habits, team structures that facilitate information sharing, and aggregation algorithms that appropriately weight the judgments of top performers[49][50]. The project developed techniques including extremizing weighted averages (adjusting crowd predictions to account for shared information) and Bayesian question clusters (breaking complex forecasts into component questions)[51][52].
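The extremizing step can be sketched in a few lines: because individual forecasters hold overlapping information, the crowd mean is systematically too close to 50%, so the aggregate is pushed away from 0.5 in log-odds space. The transform and the constant `a` below are illustrative choices, not the project's published parameters:

```python
import math

def extremize(probs, a=2.5):
    """Push the mean crowd probability away from 0.5 in log-odds space.

    a > 1 extremizes; a = 1 leaves the mean unchanged. The value 2.5 is
    an illustrative tuning constant, not a published GJP parameter.
    """
    p = sum(probs) / len(probs)
    p = min(max(p, 1e-9), 1 - 1e-9)       # guard against log(0)
    log_odds = math.log(p / (1 - p))       # map (0, 1) -> (-inf, inf)
    return 1 / (1 + math.exp(-a * log_odds))  # inverse transform (sigmoid)

crowd = [0.65, 0.7, 0.6, 0.75]   # individually cautious forecasts, mean 0.675
print(extremize(crowd))          # pushed above the simple mean
```

A symmetric crowd at exactly 50% is left untouched (log-odds of zero), which is the intended behavior: only shared leaning gets amplified.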
Accountability and Judgment
Beyond forecasting accuracy, Tetlock has extensively researched how accountability affects judgment and decision-making. His 2006 paper "Conflicts of Interest and the Case of Auditor Independence: Moral Seduction and Strategic Issue Cycling" (co-authored with Don Moore, Lloyd Tanlu, and Max Bazerman) analyzed how conflicts of interest in auditing contributed to scandals like Enron and WorldCom[53][54]. The paper introduced "moral seduction theory"—the idea that professionals can be unknowingly compromised by conflicts of interest at the micro level—and "issue-cycle theory," which explains how such conflicts persist at the macro level in major accounting firms[55].
Tetlock has warned that accountability mechanisms can degrade into "bureaucratic rituals" or "Potemkin villages"—symbolic facades designed to deflect critics rather than genuinely improve decision-making[56]. His work emphasizes that outcome accountability requires careful, calibrated implementation through controlled evaluation rather than simple demands to "hold rascals accountable"[57].
Application to Existential Risk and AI Safety
Forecasting Research Institute and X-Risk
In 2022, Tetlock became President and Chief Scientist of the Forecasting Research Institute (FRI), which received over $6 million in funding from Coefficient Giving (formerly Open Philanthropy) for developing forecasting techniques applicable to global catastrophic and existential risks[58][59]. In June-October 2022, FRI organized an "Existential Risk Persuasion Tournament" involving 169 participants—80 subject-matter experts and 89 superforecasters—estimating probabilities of catastrophe (≥10% of humanity killed) or extinction (fewer than 1,000 humans surviving) by 2030, 2050, and 2100[60].
Tetlock has acknowledged significant challenges in applying forecasting methods to existential risks, including the lack of feedback loops for learning from errors on long-term predictions, the difficulty of recruiting sufficient expertise, and the potential for information hazards when discussing specific risk scenarios[61]. His recent research explores "hybrid persuasion-forecasting tournaments" that combine expert argumentation with probabilistic forecasting to improve judgments about low-probability, high-impact events[62].
AI Forecasting Work
Tetlock has engaged directly with AI governance concerns through multiple initiatives. With Ezra Karger and others, he conducted a survey of 135 AI safety and governance researchers on advanced AI risks[63]. More recently, his team conducted a two-month intensive adversarial collaboration focused on identifying short-term "cruxes"—key questions about AI that could be resolved by 2030—to explore how far disagreements about AI risks can be resolved through structured debate[64].
His 2025 research published in ACM Transactions on Interactive Intelligent Systems examined how large language models can achieve forecasting accuracy comparable to human forecasters when predictions are combined, raising questions about both AI capabilities in prediction tasks and the potential role of AI systems in risk assessment[65]. This work suggests that AI-augmented forecasting—combining human judgment with machine learning—may offer advantages over either approach alone for certain types of predictions[66].
Influence on Effective Altruism
Tetlock has become a prominent figure in the effective altruism (EA) community, with "Tetlock-style judgmental forecasting" notably more popular within EA than in broader contexts[67]. Coefficient Giving (formerly Open Philanthropy) has directly supported forecasting infrastructure influenced by Tetlock's research, funding FRI, Metaculus, and INFER (a program supporting the use of forecasting by U.S. policymakers)[68]. Founders Pledge has evaluated Tetlock's forecasting research on existential risk as high-impact work suitable for philanthropic support[69].
Tetlock has participated in multiple EA Global conferences through fireside chats and Q&A sessions, discussing topics including prediction algorithms, long-term future considerations, epistemic modesty, and the mechanics of belief updating[70][71]. His work on identifying cognitive biases, tracking prediction accuracy, and conducting systematic post-mortems provides methodological tools relevant to assessing the low-probability, high-impact scenarios central to EA priorities[72].
Criticisms and Limitations
Methodological Concerns
Critics have raised several concerns about the scope and interpretation of Tetlock's forecasting research. While fox-like forecasters outperform hedgehog forecasters, early analyses noted that foxes still only modestly exceed simple benchmarks like extrapolation algorithms, raising questions about whether the framework sufficiently distinguishes skill from noise[73][74]. Hedgehogs performed worse than basic models—in some tests, slightly below random chance—but the practical significance of foxes' advantage over simple algorithms remains debated[75].
Tetlock's research confronts inherent challenges in evaluating predictions, including exogenous shocks and missing variables that can undermine sound analyses or lend undue credit to improbable theories[76]. Arbitrary prediction windows (such as 5 versus 10 years for Soviet collapse forecasts) can distort evaluations of forecaster accuracy[77]. Domains involving high combinatorial complexity—such as AI risk debates or complex simulations—reveal blind spots even in skilled forecasters, as the number of relevant variables exceeds human cognitive capacity[78].
A persistent limitation identified by Tetlock himself is that experts without regular accuracy feedback struggle to convert causal knowledge into probabilistic forecasts[79]. This challenge is particularly acute for long-term existential risk forecasts, where feedback loops for learning from errors may not exist until after catastrophic outcomes[80].
Misinterpretation and Misuse
Tetlock has expressed frustration that his research has been misinterpreted and misused to justify dismissing expert opinion entirely, rather than to improve forecasting practices[81]. He particularly criticized how political figures like Michael Gove cited Expert Political Judgment to justify ignoring expert consensus on Brexit consequences, characterizing this as a "dangerous misreading" of his findings[82]. Tetlock emphasized that "it's not that I'm saying that the experts are going to be right, but I would say completely ignoring them is dangerous"[83].
Populist "know-nothingism" represents a misreading of Tetlock's work, which demonstrates problems with expert forecasting—including systematic overconfidence and reluctance to change minds—without implying that expert opinion should be completely discounted[84]. His more recent work, including Superforecasting, emphasizes that forecasting accuracy can be improved through better methodology and training, rather than arguing that prediction is fundamentally impossible[85].
Accountability Mechanisms
Tetlock's proposals for improving forecaster accountability face significant practical challenges. Appointing respected arbiters to evaluate pundit accuracy runs into difficulties ensuring perceived fairness amid partisan divisions[86]. Process accountability—requiring forecasters to document their reasoning and methods—can degrade into bureaucratic rituals or symbolic facades ("Potemkin villages") rather than genuine improvement, as observed in domains from public education to intelligence analysis[87]. Outcome accountability, while valuable, requires complex and calibrated implementation through controlled evaluation rather than simple demands for accountability[88].
Scope Limitations
Forecasters are valued for multiple purposes beyond pure accuracy, including ideological comfort, entertainment value, and regret minimization (as in pandemic preparedness)[89]. Fox-like thinking helps navigate these conflicting values but is not solely about predictive performance. Tetlock acknowledges that forecasting serves multiple social functions, and that activists may be tempted to exaggerate risks (framing certainty as group commitment) while ideological groups may exclude those who express doubt[90].
Some critics argue that Tetlock's findings about expert underperformance, while methodologically sound for short- and medium-term forecasts, have been inappropriately extrapolated to long-range planning domains. Tetlock himself has expressed skepticism about very long-term forecasts (such as IPCC projections to 2100), noting that wide estimate spreads and the lack of feedback mechanisms limit the applicability of his methods to century-scale predictions[91][92].
Recent Developments
Tetlock continues active research and institutional involvement in forecasting. In January 2026, he was appointed to the Board of Directors of ForecastEx, Interactive Brokers' prediction market platform, where his expertise in forecasting and decision-making under uncertainty aligns with the platform's mission to help market participants trade probabilities of future outcomes[93][94].
Recent publications include "AI-Augmented predictions: LLM assistants improve human forecasting accuracy" (2025) in ACM Transactions on Interactive Intelligent Systems, "Subjective-probability forecasts of existential risk: Initial results from a hybrid persuasion-forecasting tournament" (2025) in the International Journal of Forecasting, and "Long-range subjective-probability forecasts of slow-motion variables in world politics: Exploring limits on expert judgment" (2024) in Futures and Foresight Science[95][96].
According to the Financial Times in October 2025, superforecasters associated with the Good Judgment Project proved 30% more accurate on average than futures markets and continued to beat market predictions on Federal Reserve decisions, demonstrating the continued relevance of Tetlock's forecasting methods[97]. Tetlock received significant media attention throughout 2024-2025, with appearances and coverage in outlets including the Financial Times, Bloomberg, Forbes, Newsweek, The Guardian, and Times Radio[98].
Key Uncertainties
Several important questions remain about the scope and applicability of Tetlock's forecasting methods:
Scalability to existential risks: How well do forecasting techniques validated on short and medium-term geopolitical questions transfer to low-probability, high-impact scenarios with limited historical precedent? The lack of feedback loops for century-scale predictions presents fundamental challenges for evaluating and improving long-term forecasts.
AI augmentation limits: As large language models achieve forecasting accuracy comparable to human forecasters, what is the optimal division of labor between human and machine intelligence in prediction tasks? Recent research suggests hybrid approaches may be superior, but the specific conditions favoring human versus AI forecasting remain unclear.
Institutional adoption barriers: Despite demonstrated accuracy improvements, why have forecasting tournaments and superforecaster methods seen limited adoption outside intelligence agencies and specialized platforms? Organizational resistance, incentive misalignment, and the multiple non-accuracy functions that expert predictions serve may present barriers beyond methodological validation.
Long-term forecast calibration: Can any systematic methods achieve meaningful calibration for predictions extending decades or centuries into the future, or are such forecasts inherently limited by irreducible uncertainty and the absence of feedback mechanisms for learning?
Information hazards in risk assessment: How should forecasting tournaments balance the value of detailed, specific predictions about existential risks against the potential for such forecasts to provide roadmaps for malicious actors or create self-fulfilling prophecies?
Sources
Footnotes
1. Philip E. Tetlock, PhD | Annenberg School for Communication at the University of Pennsylvania ↩
2. Philip Tetlock: Forecaster, author, and renowned social psychologist | The Decision Lab ↩
3. Philip Tetlock - PIK Professors - University of Pennsylvania ↩
4. Philip Tetlock | Alliance for Decision Education ↩
5. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
6. Philip Tetlock - PIK Professors - University of Pennsylvania ↩
7. Philip Tetlock: Forecaster, author, and renowned social psychologist | The Decision Lab ↩
8. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
9. Philip Tetlock - PIK Professors - University of Pennsylvania ↩
10. Philip Tetlock - PIK Professors - University of Pennsylvania ↩
11. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
12. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
13. How to win at forecasting - Philip Tetlock | Edge.org ↩
14. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
15. Philip Tetlock - PIK Professors - University of Pennsylvania ↩
16. Philip E. Tetlock - Wikipedia ↩
17. Philip E. Tetlock - Wikipedia ↩
18. Philip Tetlock wins Grawemeyer Award (2008) ↩
19. Philip E. Tetlock - Wikipedia ↩
20. Philip E. Tetlock - Wikipedia ↩
21. Philip E. Tetlock - Wikipedia ↩
22. Philip Tetlock - PIK Professors - University of Pennsylvania ↩
23. Philip Tetlock - PIK Professors - University of Pennsylvania ↩
24. Philip E. Tetlock, PhD | Annenberg School for Communication at the University of Pennsylvania ↩
25. Philip E. Tetlock - Wikipedia ↩
26. Philip Tetlock - PIK Professors - University of Pennsylvania ↩
27. Philip Tetlock - PIK Professors - University of Pennsylvania ↩
28. Philip E. Tetlock - Wikipedia ↩
29. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
30. Philip E. Tetlock - Wikipedia ↩
31. Philip Tetlock - PIK Professors - University of Pennsylvania ↩
32. Philip Tetlock - PIK Professors - University of Pennsylvania ↩
33. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
34. Philip Tetlock - PIK Professors - University of Pennsylvania ↩
35. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
36. Philip Tetlock - PIK Professors - University of Pennsylvania ↩
37. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
38. How to win at forecasting - Philip Tetlock | Edge.org ↩
39. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
40. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
41. How to win at forecasting - Philip Tetlock | Edge.org ↩
42. Philip Tetlock: Forecaster, author, and renowned social psychologist | The Decision Lab ↩
43. How to win at forecasting - Philip Tetlock | Edge.org ↩
44. Philip Tetlock: Forecaster, author, and renowned social psychologist | The Decision Lab ↩
45. Overcoming Our Aversion to Acknowledging Our Ignorance | Cato Unbound ↩
46. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
47. Philip Tetlock: Forecaster, author, and renowned social psychologist | The Decision Lab ↩
48. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
49. Evidence on good forecasting practices from the Good Judgment Project | AI Impacts ↩
50. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
51. How to win at forecasting - Philip Tetlock | Edge.org ↩
52. Conflicts of Interest and the Case of Auditor Independence (PDF) ↩
53. Conflicts of Interest and the Case of Auditor Independence | Semantic Scholar ↩
54. Conflicts of Interest and the Case of Auditor Independence | Semantic Scholar ↩
55. Evaluating Intelligence: A Competent Authority | National Academies ↩
56. Evaluating Intelligence: A Competent Authority | National Academies ↩
57. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
58. New Coefficient Giving Grantmaking Program: Forecasting | EA Forum ↩
59. Philip E. Tetlock - Wikipedia ↩
60. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
61. Philip Tetlock Faculty Page | University of Pennsylvania Psychology ↩
62. AI Risk Surveys | AI Impacts Wiki ↩
63. Adversarial Collaboration on AI Risk | Wiley Online Library ↩
64. Philip Tetlock Faculty Page | University of Pennsylvania Psychology ↩
65. Philip Tetlock Faculty Page | Wharton School ↩
66. Why is EA so enthusiastic about forecasting? | EA Forum ↩
67. Why is EA so enthusiastic about forecasting? | EA Forum ↩
68. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
69. Philip Tetlock Fireside Chat | EA Forum ↩
70. Interview with Prof Tetlock on epistemic modesty | EA Forum ↩
71. Prof. Philip Tetlock's Forecasting Research | Founders Pledge ↩
72. Overcoming Our Aversion to Acknowledging Our Ignorance | Cato Unbound ↩
73. Philip Tetlock: Forecaster, author, and renowned social psychologist | The Decision Lab ↩
74. Overcoming Our Aversion to Acknowledging Our Ignorance | Cato Unbound ↩
75. Evaluating Intelligence: A Competent Authority | National Academies ↩
76. Evaluating Intelligence: A Competent Authority | National Academies ↩
77. Philip Tetlock on forecasting and existential risks | 80,000 Hours Podcast ↩
78. Adversarial Collaboration on AI Risk | Wiley Online Library ↩
79. Philip Tetlock on forecasting and existential risks | 80,000 Hours Podcast ↩
80. Philip Tetlock on forecasting and existential risks | 80,000 Hours Podcast ↩
81. Philip Tetlock on forecasting and existential risks | 80,000 Hours Podcast ↩
82. Philip Tetlock on forecasting and existential risks | 80,000 Hours Podcast ↩
-
Philip Tetlock on forecasting and existential risks | 80,000 Hours Podcast — Philip Tetlock on forecasting and existential risks | 80,000 Hours Podcast ↩
-
Philip Tetlock interview | Conversations with Tyler — Philip Tetlock interview | Conversations with Tyler ↩
-
Overcoming Our Aversion to Acknowledging Our Ignorance | Cato Unbound — Overcoming Our Aversion to Acknowledging Our Ignorance | Cato Unbound ↩
-
Evaluating Intelligence: A Competent Authority | National Academies — Evaluating Intelligence: A Competent Authority | National Academies ↩
-
Evaluating Intelligence: A Competent Authority | National Academies — Evaluating Intelligence: A Competent Authority | National Academies ↩
-
Philip Tetlock interview | Conversations with Tyler — Philip Tetlock interview | Conversations with Tyler ↩
-
Philip Tetlock on forecasting and existential risks | 80,000 Hours Podcast — Philip Tetlock on forecasting and existential risks | 80,000 Hours Podcast ↩
-
Fireside chat with Philip Tetlock | Effective Altruism — Fireside chat with Philip Tetlock | Effective Altruism ↩
-
Philip Tetlock on forecasting and existential risks | 80,000 Hours Podcast — Philip Tetlock on forecasting and existential risks | 80,000 Hours Podcast ↩
-
ForecastEx Appoints Philip Tetlock to Board | Business Wire — ForecastEx Appoints Philip Tetlock to Board | Business Wire ↩
-
ForecastEx Appoints Philip Tetlock to Board | Barchart — ForecastEx Appoints Philip Tetlock to Board | Barchart ↩
-
Philip Tetlock Faculty Page | University of Pennsylvania Psychology — Philip Tetlock Faculty Page | University of Pennsylvania Psychology ↩
-
Philip Tetlock Faculty Page | University of Pennsylvania Psychology — Philip Tetlock Faculty Page | University of Pennsylvania Psychology ↩
References
“Good Judgment Inc is now making this winning approach to harnessing the wisdom of the crowd available for commercial use.”
The source does not explicitly state that Good Judgment Inc. offers workshops for private clients. The source does not explicitly state that Philip Tetlock co-founded Good Judgment Inc.
“For example, they ran an RCT to test the effect of a short training program on forecasting accuracy.”
“It was psychologist Philip Tetlock who demonstrated that, generally, the accuracy of our predictions is no better than chance, which means that flipping a coin is just as good as our best guess.”
““Superforecasters have continued to beat the market so far this year when it comes to anticipating Fed decisions, as they had also in 2023 and 2024,” writes Financial Times data journalist Joel Suss for FT’s exclusive Monetary Policy Radar service.”
The claim that superforecasters were '30% more accurate on average than futures markets' is not directly supported by the source. The source only states that superforecasters 'continued to beat the market so far this year when it comes to anticipating Fed decisions'. The claim mentions 'the continued relevance of Tetlock's forecasting methods', but the source does not explicitly mention this.
“Monetary Policy Radar: ‘Superforecasters’ tend to beat the market,” Financial Times (October 2025): “Superforecasters have continued to beat the market so far this year when it comes to anticipating Fed decisions, as they had also in 2023 and 2024,” writes Financial Times data journalist Joel Suss for FT’s exclusive Monetary Policy Radar service.
The claim covers 2024-2025, but the source also includes media attention from 2023 and 2026. The claim states that Tetlock received the media attention, but the source focuses on Superforecasting and Good Judgment.
“His best-known work, Expert Political Judgment: How Good Is It? How Can We Know? (Princeton University Press, 2005), argued that “expert” predictions of political and economic trends are no more reliable than those of non-experts, based on a 20-year study of more than 82,000 predictions by 284 experts.”
The source does not explicitly state that Tetlock is Canadian-born, only that he attended the University of British Columbia. The claim states that Tetlock identified 'systematic methods to achieve superior forecasting performance,' but the source only mentions that his book argued expert predictions are no more reliable than non-experts.
“Philip Tetlock, Ph.D. Leonore Annenberg University Professor of Psychology and Management, Wharton School of Business and School of Arts and Sciences”
The source does not mention the date of appointment (December 2010). The source states that Tetlock is the Leonore Annenberg University Professor of Psychology and Management, not Democracy and Citizenship. The source does not explicitly state that Tetlock is a Penn Integrates Knowledge (PIK) Professor.
“Professor Philip Tetlock was named the Leonore Annenberg University Professor of Democracy and Citizenship in December 2010.”
The source does not mention that Tetlock has cross-appointments at the Wharton School and School of Arts and Sciences. The source does not state that Tetlock has authored over 200 peer-reviewed articles. The source only lists 3 books by Tetlock, not nine.
“Tetlock created the first ever forecasting competition during the Cold War when he grew increasingly concerned that public debate was dominated by vague, unverifiable predictions.”
The source does not mention the National Academy of Sciences Committee on Nuclear War Prevention. The source does not mention the analysis of 82,000 predictions from 284 experts. The source does not specify the time frame of 1984 to 2003.
“That work also served as a pilot for The Good Judgment Project , a prediction tournament among five universities sponsored by the U.S. intelligence community. Tetlock, along with his Penn colleague and spouse, Barbara Mellers , and his UC Berkeley colleague Don Moore, co-led that contest’s winning team, which includes experts in statistics, computer science, economics, psychology, and political science.”
The source does not mention the terms 'foxes' and 'hedgehogs'. The source does not explicitly state that Tetlock identified a minority of superior forecasters.
“Phil serves on the faculty at the University of Pennsylvania as Annenberg University Professor, with appointments in the Wharton School in Management and the School of Arts and Sciences in Psychology.”
The source says Tetlock has written or co-written 10 books, not 9. The source says Tetlock is the Annenberg University Professor, not the Leonore Annenberg University Professor.
“We expect that additional funding at this time would help Professor Tetlock and his collaborators expand their research on forecasting global catastrophic risks. They are seeking support for tackling the ten methodological challenges to X-risk forecasting outlined in their recent paper:
- “Managing Rigor-Relevance Trade-Offs” and finding ways to make forecasting more actionable for decision-makers.
- “Crafting Incisive Forecasting Questions” by building “Bayesian question clusters” and “conditional trees” to enhance question relevance.
- “Incentivizing Persuasive, Predictively Powerful Explanations” to ensure that forecasts are more than just a number.
- “Incentivizing True Reports about X-Risk Mitigation” and developing methods to measure the effect sizes of policy interventions.
- “Recruiting the Right Talent” for second-generation forecasting research.
- “Motivating the Talent”.
- “Picking Probability-Elicitation Tools and Scoring Rules” to craft the appropriate measures to forecast low-probability high-impact events.
- “Helping People Prepare for Distinctive Analytic Challenges of X-Risk Assessment”.
- “Benchmarking against External Standards”.
- “Managing Information Hazards” and making sure that predictions of risks don’t do more harm than good.”
The claim mentions 'lack of feedback loops for learning from errors on long-term predictions', 'the difficulty of recruiting sufficient expertise', and 'the potential for information hazards'. While the source mentions recruiting the right talent and managing information hazards, it does not explicitly mention the lack of feedback loops for learning from errors on long-term predictions.
“Professor Tetlock identified high-performing forecasters, who were consistently able to finish at the top of the tournaments they entered, whom he and his collaborators dubbed “superforecasters.” The superforecasting team, which Professor Tetlock called the Good Judgment Project, beat teams of other experts and intelligence professionals to win the IARPA tournament.”
The source does not contain any information about the accuracy of superforecasters compared to average forecasters or their ability to distinguish degrees of uncertainty.
“Between 1984 and 2003, Professor Tetlock ran a number of forecasting tournaments in which predictions about future events were solicited from hundreds of experts.”
The source does not mention that consistent engagement with forecasting questions refines judgment.
“In our tournament, we've skimmed off the very best forecasters in the first year, the top two percent. We call them "super forecasters."”
The claim states that superforecasters exceeded intelligence analysts by 60-85%, but the source does not provide these specific numbers.
“In our tournament, we've skimmed off the very best forecasters in the first year, the top two percent. We call them "super forecasters."”
The source does not explicitly state that superforecasters were 'approximately 60-85% more accurate than average forecasters'. It only mentions that they are 'far more accurate than I would have ever supposed possible'. The source does not explicitly state that superforecasters demonstrated the ability to distinguish '10-15 degrees of uncertainty'. It does mention that they are skillful at 'finding information, synthesizing it, and applying it, and then updating the response to new information.'
“The Intelligence Advance Research Projects Agency about two years ago committed to supporting five university based research teams and funded their efforts to recruit forecasters, set up websites for eliciting forecasts, hire statisticians for aggregating forecasts, and conduct a variety of experiments on factors that might either make forecasters more accurate or less accurate.”
The source mentions weighted averaging, but refers to it as a method of aggregating individual forecasts, not of adjusting crowd predictions to account for shared information. The source mentions Bayesian belief adjustment, but not Bayesian question clusters.
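The weighted-averaging step the source describes can be sketched in a few lines. A minimal sketch only: the forecasters' probabilities and the accuracy-based weights below are hypothetical, and this is not the Good Judgment Project's actual aggregation algorithm:

```python
def weighted_average(probs, weights):
    # Combine individual probability forecasts into one crowd forecast,
    # giving more influence to forecasters with better track records.
    return sum(p * w for p, w in zip(probs, weights)) / sum(weights)

probs = [0.60, 0.70, 0.85]   # three forecasters' probabilities (hypothetical)
weights = [1.0, 2.0, 3.0]    # hypothetical accuracy-derived weights
print(round(weighted_average(probs, weights), 3))  # 0.758
```

Unequal weights pull the aggregate toward historically more accurate forecasters, which is the intuition behind weighting by past performance.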
“One group would actually have been beaten rather soundly even by the chimp, not to mention the more formidable extrapolation algorithm. The other would have beaten the chimp and sometimes even the extrapolation algorithm, although not by a wide margin.”
The source does not explicitly state that foxes only 'modestly' exceeded simple benchmarks. It says they sometimes beat extrapolation algorithms, but not by a wide margin. The claim mentions 'extrapolation algorithms' as a benchmark, while the source mentions 'simple extrapolation algorithms'.
“One could say that this latter cluster of experts had real predictive insight, however modest.”
The claim states that early analyses noted that foxes only modestly exceed simple benchmarks like extrapolation algorithms, but the source says that foxes sometimes exceed extrapolation algorithms, not that they only modestly exceed them. The claim raises questions about whether the framework sufficiently distinguishes skill from noise, but the source does not explicitly raise this question.
“One group would actually have been beaten rather soundly even by the chimp, not to mention the more formidable extrapolation algorithm.”
The source states that hedgehogs would have been beaten by the chimp, not that they performed worse than basic models. The source states that simple extrapolation algorithms performed better than the average expert, not basic models. The source does not mention that the practical significance of foxes' advantage over simple algorithms remains debated.
“Proponents of outcome accountability also worry that: (1) process accountability can readily ossify into bureaucratic rituals and mutual backscratching—Potemkin-village facades of process accountability and rigor designed to deflect annoying questions from external critics (Edelman, 1992; Meyer and Rowan, 1977); and (2) process accountability can distract analysts from the central task of understanding the external world by squandering cognitive resources on impression management aimed at convincing superiors of how rigorous their analytical processes are (Lazear, 1989).”
“If “accountability cures” exist for what ails intelligence analysis, those cures will need to be far more complex and carefully calibrated than cries for “greater accountability” imply—and will need to be implemented in carefully controlled and phased field research trials to ensure that the desired effects outweigh the undesired.”
“Exogenous shocks or missing information on key variables that cause lower probability outcomes to occur—and cast into false doubt fundamentally sound analyses of causal dynamics. Exogenous shocks that cause credit to be assigned to far-fetched theories.”
“This follows our October 2021 support ($275,000) for planning work by FRI Chief Scientist Philip Tetlock, and falls within our work on global catastrophic risks ( writeup ) two grants totaling $6,305,675 over three years to support the Forecasting Research Institute (FRI)’s work on projects to advance the science of forecasting as a tool to improve public policy and reduce existential risk.”
The source does not mention Tetlock becoming President of FRI, only Chief Scientist. The source states that FRI received grants totaling $6,305,675, not 'over $6 million'.
“Karger, E., Jacobs, Z., Rosenberg, J. & Tetlock, P. E. (2025). Subjective-probability forecasts of existential risk: Initial Results from a hybrid persuasion-forecasting tournament. International Journal of Forecasting .”
“Schoenegger, Coombs, S., Karger, E. & Tetlock, P.E. (2025). AI-Augmented predictions: LLM assistants improve human forecasting accuracy. Association for Computing Machinery (ACM): Transactions on interactive intelligent systems.”
“Schoenegger, Coombs, S., Karger, E. & Tetlock, P.E. (2025). AI-Augmented predictions: LLM assistants improve human forecasting accuracy. Association for Computing Machinery (ACM): Transactions on interactive intelligent systems.
Karger, E., Jacobs, Z., Rosenberg, J. & Tetlock, P. E. (2025). Subjective-probability forecasts of existential risk: Initial Results from a hybrid persuasion-forecasting tournament. International Journal of Forecasting.
Tetlock, P. E., Karvetski, C., Satopää, V. A., & Chen, K. (2024). Long-range subjective-probability forecasts of slow-motion variables in world politics: Exploring limits on expert judgment. Futures and Foresight Science, 6(1), e157. doi:10.1002/ffo2.157”
“New research from Wharton's Philip Tetlock finds that combining predictions from large language models can achieve accuracy on par with human forecasters.”
“Open Philanthropy has funded the Forecasting Research Institute (research), Metaculus (forecasting platform) and INFER (a program to support the use of forecasting by US policymakers).”
“why is Tetlock-style judgmental forecasting so popular within EA, but not that popular outside of it?”
“Labenz: I've certainly heard some sincerely pro-Trump positions at EA Global in the past.”
“Am I a believer in climate change or am I disbeliever, if I say, “Well, when I think about the UN intergovernmental panel on Climate Change forecasts for the year 2100, the global surface temperature forecasts, I’m 72% confident that they’re within plus or minus 0.3 degrees centigrade in their projections”?”
The source does not explicitly mention 'lack of feedback mechanisms' as a limitation. The source refers to 'UN intergovernmental panel on Climate Change forecasts for year 2100' rather than 'IPCC projections to 2100'.
“I think that’s a great question and I was working with that more or less that assumption myself, but it seems that for the counterfactual questions that are being posed in a simulation that is as complex as Civ5 where the combinatorics are staggering and the number of possible states of civilizations and variables probably is greater than number of atoms in the universe, that even very skilled Civ5 players will have serious blind spots that can be exploited by clever question posers.”
“And I think that’s one of the key reasons why forecasting tournaments are hard sell. I think people… forecasts do not just serve an accuracy function, people aren’t just interested in accuracy, they’re interested in fitting in, they want to avoid embarrassment, they don’t want their friends to call them names, I don’t want to be called a denialist or a racist or whatever other kind of thing I might be… whatever the epithet you might incur by assigning a probability on the wrong side of maybe.”
“I mean I was always a big fan of Monty Python and John Cleese. I think John Cleese was a brilliant comedian, he may still be a brilliant comedian, but the John Cleese, Michael Gove perspective that Expert Political Judgment somehow justified not listening to expert opinion about the consequences of Brexit struck me as somewhat dangerous misreading of the book.”
“the John Cleese, Michael Gove perspective that Expert Political Judgment somehow justified not listening to expert opinion about the consequences of Brexit struck me as a somewhat dangerous misreading of the book.”
“It’s not that I’m saying that the experts are going to be right, but I would say completely ignoring them is dangerous.”
“People also look to forecasters for ideological assurance, entertainment, and to minimize regret–such as that caused by not taking a global pandemic seriously enough.”
“Accuracy is only one of the things we want from forecasters, says Philip Tetlock, a professor at the University of Pennsylvania and co-author of Superforecasting: The Art and Science of Prediction . People also look to forecasters for ideological assurance, entertainment, and to minimize regret–such as that caused by not taking a global pandemic seriously enough.”
“These were aggressive, long-range forecasts way beyond anything we look at. In doing political judgment work, our longest forecast is for five to 10 years, and in our work with IARPA, the longest forecasts are 18 to 24 months. Most of them are 12 months ahead or less.”
“Interactive Brokers (Nasdaq: IBKR), an automated global electronic broker, announced the appointment of Dr. Philip Tetlock to the Board of Directors of ForecastEx. Dr. Tetlock is internationally recognized for his groundbreaking expertise in forecasting, probability-based judgment, and decision-making under uncertainty, which closely aligns with ForecastEx’s prediction market model.”
The article is dated January 22, 2026, whereas the claim says only that he was appointed in January 2026, without the exact date. The claim states that ForecastEx is Interactive Brokers' prediction market platform, but the source describes ForecastEx as a wholly-owned subsidiary of Interactive Brokers.