Reference Class Forecasting Depends on How You Define “Similar”

Technology offers an incredible opportunity to improve project performance. This blog shares the latest research and how organizations are implementing AI into their project methodology. Come with an open mind, increase your knowledge, share your concerns, and become a project manager with new skills to offer an organization.

AI | Machine learning | Natural language processing | show all posts

About this Blog

RSS

Date

Reference Class Forecasting (RCF), popularized by Bent Flyvbjerg, is used to address optimism bias in project cost and schedule estimation. The logic is simple: compare your project to similar completed projects and adjust expectations based on their actual outcomes. It is one of the most influential forecasting innovations in project governance. But a foundational question that requires more attention is: how is similarity defined in the first place?

In practice, reference classes are often formed using broad administrative categories such as “light rail,” “metro,” or “heavy rail.” Yet these similarity criteria are frequently underspecified, even though they determine the empirical distribution from which percentile uplifts are calculated. Before any statistical adjustment occurs, a methodological decision has already shaped the forecast.

In my recent study using a dataset of U.S. mass transit projects, I examined the sensitivity of RCF to alternative reference class formation. Rather than relying on predefined categories, I applied unsupervised clustering techniques to construct alternative reference classes based on structural attributes such as track length, number of stations, underground proportion, and rolling stock. Cost and schedule outcomes were excluded to preserve the outside-view logic.

The results showed that changing similarity groupings altered cost and schedule distributions. Percentile-based uplifts shifted, dispersion patterns changed, and the implied contingency requirements varied across alternative clusters. In other words, RCF outcomes proved structurally contingent on how similarity was operationalized, not merely on statistical adjustment.

This does not undermine RCF. The behavioral foundations established by scholars such as Daniel Kahneman and Amos Tversky remain essential. The outside view is still one of the strongest correctives to optimism bias in major projects. However, the findings suggest that the credibility of RCF depends not only on selecting the appropriate percentile (P50, P80, etc.), but also on transparently justifying how the reference class was formed.

For practitioners and governance bodies, this has important implications. Reference class formation should be treated as a methodological decision requiring documentation and testing. Specifically:

Similarity criteria should be explicitly defined and justified.
Alternative class constructions should be tested for sensitivity.
Forecast reports should disclose how grouping decisions influence uplift outcomes.

When RCF is embedded in funding approvals and public accountability processes, similarity is not a technical detail. It shapes the empirical foundation of budget envelopes and schedule expectations. The central insight is straightforward: reference class forecasting depends on reference class formation. Those decisions about formation quietly determine the distribution relied on and, ultimately, the confidence in the forecast.

Posted on: May 04, 2026 08:00 AM | Permalink

Comments (2)

Please login or join to subscribe to this item

Kwiyuh Michael Wepngong

Community Champion

Financial Management Specialist | US Peace Corps Yaounde, Centre, Cameroon

Thanks for sharng this

Posted: May 6, 2026 8:48 AM

Shakeel Anwar Bhatti Abu Dhabi, , United Arab Emirates

Excellent perspective—this piece thoughtfully highlights that forecasting accuracy depends not only on data, but on the methodology used to construct the reference class itself.

Posted: May 10, 2026 4:04 AM

Please Login/Register to leave a comment.

Reference Class Forecasting Depends on How You Define “Similar”

About this Blog

RSS

Recent Posts

Categories

Date

Comments (2)

Reference Class Forecasting Depends on How You Define “Similar”

About this Blog

RSS

Recent Posts

Categories

Date

Comments (2)

Sponsors

Newsletters