Reference Class Forecasting Depends on How You Define “Similar”
From the AI IQ Blog
by Paul Boudreau
Technology offers an incredible opportunity to improve project performance. This blog shares the latest research and how organizations are implementing AI into their project methodology. Come with an open mind, increase your knowledge, share your concerns, and become a project manager with new skills to offer an organization.
Recent Posts
Using AI to Improve Team Communication (Without Losing Trust)
Start with AI, not a Project Framework.
Will the PMO Become the Center of AI Adoption in Organizations?
Project Manager Accountability in the Era of AI
Reference Class Forecasting Depends on How You Define “Similar”
Categories
AI,
Artificial Intelligence,
Ethics,
Machine learning,
Natural language processing,
procurement,
Scope Management
Date
Reference Class Forecasting (RCF), popularized by Bent Flyvbjerg, is used to address optimism bias in project cost and schedule estimation. The logic is simple: compare your project to similar completed projects and adjust expectations based on their actual outcomes. It is one of the most influential forecasting innovations in project governance. But a foundational question that requires more attention is: how is similarity defined in the first place?
In practice, reference classes are often formed using broad administrative categories such as “light rail,” “metro,” or “heavy rail.” Yet these similarity criteria are frequently underspecified, even though they determine the empirical distribution from which percentile uplifts are calculated. Before any statistical adjustment occurs, a methodological decision has already shaped the forecast.
In my recent study using a dataset of U.S. mass transit projects, I examined the sensitivity of RCF to alternative reference class formation. Rather than relying on predefined categories, I applied unsupervised clustering techniques to construct alternative reference classes based on structural attributes such as track length, number of stations, underground proportion, and rolling stock. Cost and schedule outcomes were excluded to preserve the outside-view logic.
The results showed that changing similarity groupings altered cost and schedule distributions. Percentile-based uplifts shifted, dispersion patterns changed, and the implied contingency requirements varied across alternative clusters. In other words, RCF outcomes proved structurally contingent on how similarity was operationalized, not merely on statistical adjustment.
This does not undermine RCF. The behavioral foundations established by scholars such as Daniel Kahneman and Amos Tversky remain essential. The outside view is still one of the strongest correctives to optimism bias in major projects. However, the findings suggest that the credibility of RCF depends not only on selecting the appropriate percentile (P50, P80, etc.), but also on transparently justifying how the reference class was formed.
For practitioners and governance bodies, this has important implications. Reference class formation should be treated as a methodological decision requiring documentation and testing. Specifically:
- Similarity criteria should be explicitly defined and justified.
- Alternative class constructions should be tested for sensitivity.
- Forecast reports should disclose how grouping decisions influence uplift outcomes.
When RCF is embedded in funding approvals and public accountability processes, similarity is not a technical detail. It shapes the empirical foundation of budget envelopes and schedule expectations. The central insight is straightforward: reference class forecasting depends on reference class formation. Those decisions about formation quietly determine the distribution relied on and, ultimately, the confidence in the forecast.
Posted on: May 04, 2026 08:00 AM |
Permalink
Comments (2)
Please login or join to subscribe to this item
Kwiyuh Michael Wepngong
Community Champion
Financial Management Specialist | US Peace Corps
Yaounde, Centre, Cameroon
Excellent perspective—this piece thoughtfully highlights that forecasting accuracy depends not only on data, but on the methodology used to construct the reference class itself.
Please Login/Register to leave a comment.
|
"Let us be thankful for fools. But for them the rest of us could not succeed."
- Mark Twain
|