Week 4 Flyvbjerg - 2006

FROM NOBEL PRIZE TO PROJECT
MANAGEMENT: GETTING RISKS RIGHT

BENT FLYVBJERG, Aalborg University, Denmark
The American Planning Association Endorses Reference Class Forecasting

ABSTRACT n April 2005, based on a study of inaccuracy in demand forecasts for public
A major source of risk in project manage-

ment is inaccurate forecasts of project
costs, demand, and other impacts. The
I works projects by Flyvbjerg, Holm, and Buhl (2005), the American Planning
Association (APA) officially endorsed a promising new forecasting method
called “reference class forecasting” and made the strong recommendation that
paper presents a promising new planners should never rely solely on conventional forecasting techniques when
approach to mitigating such risk based making forecasts:
on theories of decision-making under APA encourages planners to use reference class forecasting in addition to
uncertainty, which won the 2002 Nobel traditional methods as a way to improve accuracy. The reference class fore-
Prize in economics. First, the paper docu-
ments inaccuracy and risk in project man-
casting method is beneficial for non-routine projects... Planners should never
agement. Second, it explains inaccuracy rely solely on civil engineering technology as a way to generate project fore-
in terms of optimism bias and strategic casts (American Planning Association, 2005).
misrepresentation. Third, the theoretical Reference class forecasting is based on theories of decision-making under
basis is presented for a promising new uncertainty that won Princeton psychologist Daniel Kahneman the Nobel prize in
method called “reference class forecast-
ing,” which achieves accuracy by basing
economics in 2002 (Kahneman, 1994; Kahneman & Tversky, 1979a; 1979b).
forecasts on actual performance in a ref- Reference class forecasting promises more accuracy in forecasts by taking a so-
erence class of comparable projects and called “outside view” on prospects being forecasted, while conventional forecast-
thereby bypassing both optimism bias ing takes an inside view. The outside view on a given project is based on
and strategic misrepresentation. Fourth, knowledge about actual performance in a reference class of comparable projects.
the paper presents the first instance of
practical reference class forecasting,
Where Flyvbjerg, Holm, and Buhl (2005) briefly outlined the idea of reference
which concerns cost forecasts for large class forecasting, this paper presents the first instance of reference class forecasting
transportation infrastructure projects. in practical project management. The emphasis will be on transportation project
Finally, potentials for and barriers to ref- management, because this is where the first instance of reference class forecasting
erence class forecasting are assessed. occurred. It should be mentioned at the outset, however, that comparative research
Keywords: risk management; project
shows that the problems, causes, and cures identified for transportation apply to
forecasting; forecast models a wide range of other project types, including concert halls, museums, sports are-
nas, exhibit and convention centers, urban renewal, power plants, dams, water
©2006 by the Project Management Institute projects, IT systems, oil and gas extraction projects, aerospace projects, new pro-
Vol. 37, No. 3, 5-15, ISSN 8756-9728/03
duction plants, and the development of new products and new markets (Altshuler
& Luberoff, 2003; Flyvbjerg, 2005; Flyvbjerg, Bruzelius, & Rothengatter, 2003, pp.
18–19; Flyvbjerg, Holm, & Buhl, 2002, p. 286).
A U G U S T 2006 P R O J E C T M A N A G E M E N T J O U R N A L 5
Inaccuracy in Forecasts ture investments, the consequence is and the use of inappropriate forecast-
Forecasts of cost, demand, and other inaccuracy to the second degree. ing models (Vanston & Vanston, 2004,
impacts of planned projects have Benefit-cost ratios are often wrong, not p. 33). However, when such explana-
remained constantly and remarkably only by a few percent but by several fac- tions are put to empirical test, they do
inaccurate for decades. No improve- tors. This is especially the case for rail not account well for the available data.
ment in forecasting accuracy seems to projects (Flyvbjerg, Bruzelius, & First, if technical explanations were
have taken place, despite all claims of Rothengatter, 2003, pp. 37–41). As a valid, one would expect the distribu-
improved forecasting models, better consequence, estimates of viability are tion of inaccuracies to be normal or
data, etc. (Flyvbjerg, Bruzelius, & often misleading, as are socioeconomic near-normal with an average near zero.
Rothengatter, 2003; Flyvbjerg, Holm, & and environmental appraisals, the accura- Actual distributions of inaccuracies are
Buhl, 2002; 2005). For transportation cy of which are all heavily dependent on consistently and significantly non-nor-
infrastructure projects, inaccuracy in demand and cost forecasts. These results mal with averages that are significantly
cost forecasts in constant prices is on point to a significant problem in trans- different from zero. Thus the problem
average 44.7% for rail, 33.8% for portation project management: More is bias and not inaccuracy as such.
bridges and tunnels, and 20.4% for often than not, the information that man- Second, if imperfect data and models
roads (see Table 1).1 For the 70-year agers use to decide whether to invest in were main explanations of inaccura-
period for which cost data are available, new projects is highly inaccurate and cies, one would expect an improve-
accuracy in cost forecasts has not biased, making projects highly risky. ment in accuracy over time, because in
improved. Average inaccuracy for rail Comparative studies show that trans- a professional setting errors and their
passenger forecasts is –51.4%, with 84% portation projects are no worse than other sources would be recognized and
of all rail projects being wrong by more project types in this respect (Flyvbjerg, addressed, for instance, through refer-
than ±20%. For roads, average inaccura- Bruzelius, & Rothengatter, 2003). ee processes with scholarly journals
cy in traffic forecasts is 9.5%, with half and similar expert critical reviews.
of all road forecasts being wrong by Explaining Inaccuracy Undoubtedly, substantial resources
more than ±20% (see Table 2). For the Flyvbjerg, Holm, and Buhl (2002; have been spent over several decades
30-year period for which demand data 2004; 2005) and Flyvbjerg and Cowi on improving data and forecasting
are available, accuracy in rail and road (2004) tested technical, psychological, models. Nevertheless, this has had no
traffic forecasts has not improved. and political-economic explanations effect on the accuracy of forecasts, as
When cost and demand forecasts for inaccuracy in forecasting. Technical demonstrated. This indicates that
are combined, for instance in the cost- explanations are common in the liter- something other than poor data and
benefit analyses that are typically used ature, and they explain inaccuracy in models is at play in generating inaccu-
to justify large transportation infrastruc- terms of unreliable or outdated data rate forecasts, a finding that has been
corroborated by interviews with fore-
Type of Average Standard Level of casters (Flyvbjerg & Cowi, 2004;
Project Inaccuracy Deviation Significance Flyvbjerg & Lovallo, in progress;
(%) p Wachs, 1990).
Psychological and political explana-
Rail 44.7 38.4 <0.001 tions better account for inaccurate fore-
Bridges and tunnels 33.8 62.4 0.004 casts. Psychological explanations
Road 20.4 29.9 <0.001 account for inaccuracy in terms of opti-
mism bias; that is, a cognitive predispo-
Source: Flyvbjerg database on large-scale infrastructure projects. sition found with most people to judge
Table 1: Inaccuracy in cost forecasts for rail, bridges, tunnels, and roads, respectively future events in a more positive light
(construction costs, constant prices) than is warranted by actual experience.
Political explanations, on the other
Rail Road hand, explain inaccuracy in terms of
strategic misrepresentation. Here, when
Average inaccuracy (%) -51.4 (sd=28.1) 9.5 (sd=44.3) forecasting the outcomes of projects,
Percentage of projects with inaccuracies 84 50 forecasters and managers deliberately
larger than ±20% and strategically overestimate benefits
Percentage of projects with inaccuracies 72 25 and underestimate costs in order to
larger than ±40% increase the likelihood that it is their
projects, and not the competition’s, that
Percentage of projects with inaccuracies 40 13
larger than ±60% gain approval and funding. Strategic
misrepresentation can be traced to polit-
Source: Flyvbjerg database on large-scale infrastructure projects. ical and organizational pressures; for
Table 2: Inaccuracy in forecasts of rail passenger and road vehicle traffic instance, competition for scarce funds or
6 A U G U S T 2006 P R O J E C T M A N A G E M E N T J O U R N A L
jockeying for position. Optimism bias
and strategic misrepresentation both Deception
involve deception, but where the latter is
intentional—i.e., lying—the first is not.
Optimism bias is self-deception.
Although the two types of explanation
Explanatory Power
are different, the result is the same: inac-
curate forecasts and inflated benefit-cost
ratios. However, the cures for optimism
bias are different from the cures for
strategic misrepresentation, as we will
see next.
Explanations of inaccuracy in terms
of optimism bias have been developed Delusion
by Kahneman and Tversky (1979a) and
Lovallo and Kahneman (2003).
Political and Organizational Pressure
Explanations in terms of strategic mis-
representation have been set forth by Figure 1: Explanatory power of optimism bias and strategic misrepresentation, respectively,
Wachs (1989; 1990) and Flyvbjerg, in accounting for forecasting inaccuracy as function of political and organizational pressure
Holm, and Buhl (2002; 2005). As illus-
trated schematically in Figure 1, expla- The Planning Fallacy and Reference ation. Human judgment, including
nations in terms of optimism bias have Class Forecasting forecasts, is biased. Reference class
their relative merit in situations where The theoretical and methodological forecasting is a method for unbiasing
political and organizational pressures foundations of reference class forecast- forecasts.
are absent or low, whereas such explana- ing were first described by Kahneman Kahneman and Tversky (1979a;
tions hold less power in situations and Tversky (1979b) and later by 1979b) found human judgment to be
where political pressures are high. Lovallo and Kahneman (2003). generally optimistic due to overconfi-
Conversely, explanations in terms of Reference class forecasting was origi- dence and insufficient regard to distri-
strategic misrepresentation have their nally developed to compensate for the butional information. Thus, people
relative merit where political and orga- type of cognitive bias that Kahneman will underestimate the costs, comple-
nizational pressures are high, while they and Tversky found in their work on tion times, and risks of planned
become immaterial when such pres- decision-making under uncertainty, actions, whereas they will overestimate
sures are not present. Thus the two types which won Kahneman the 2002 Nobel the benefits of the same actions.
of explanation complement, rather than prize in economics (Kahneman, 1994; Lovallo and Kahneman (2003, p. 58)
compete with one another: one is strong Kahneman & Tversky, 1979a). This call such common behavior the “plan-
where the other is weak, and both expla- work showed that errors of judgment ning fallacy” and argue that it stems
nations are necessary to understand the are often systematic and predictable from actors taking an “inside view,”
phenomenon at hand—the pervasive- rather than random, manifesting bias focusing on the constituents of the
ness of inaccuracy in forecasting—and rather than confusion, and that any specific planned action rather than on
how to curb it. corrective prescription should reflect the outcomes of similar already-com-
In what follows, we present a fore- this. They also found that many errors pleted actions. Kahneman and Tversky
casting method called “reference class of judgment are shared by experts and (1979b) argued that the prevalent ten-
forecasting,” which bypasses human laypeople alike. Finally, they found dency to underweigh or ignore distrib-
bias—including optimism bias and that errors remain compelling even utional information is perhaps the
strategic misrepresentation—by cutting when one is fully aware of their nature. major source of error in forecasting.
directly to outcomes. In experimental Thus, awareness of a perceptual or cog- “The analysts should therefore make
research carried out by Daniel nitive illusion does not by itself pro- every effort to frame the forecasting
Kahneman and others, this method has duce a more accurate perception of problem so as to facilitate utilizing all
been demonstrated to be more accurate reality, according to Kahneman and the distributional information that is
than conventional forecasting methods Tversky (1979b, p. 314). Awareness available,” say Kahneman and Tversky
(Kahneman, 1994; Kahneman & may, however, enable one to identify (1979b, p. 316). This may be consid-
Tversky, 1979a; 1979b; Lovallo & situations in which the normal faith in ered the single most important piece of
Kahneman, 2003). First, we explain the one’s impressions must be suspended advice regarding how to increase accu-
theoretical and methodological founda- and in which judgment should be con- racy in forecasting through improved
tions for reference class forecasting, then trolled by a more critical evaluation of methods. Using such distributional
we present the first instance of reference the evidence. Reference class forecast- information from other ventures simi-
class forecasting in project management. ing is a method for such critical evalu- lar to that being forecasted is called
taking an “outside view,” and it is the months needed to finish and report ones, were overly optimistic. The out-
cure to the planning fallacy. Reference the project. The estimates ranged from side view is the one provoked by the
class forecasting is a method for sys- 18 to 30 months. One of the team question to the curriculum expert. It
tematically taking an outside view on members—a distinguished expert in completely ignored the details of the
planned actions. curriculum development—was then project at hand, and it involved no
More specifically, reference class posed a challenge by another team attempt at forecasting the events that
forecasting for a particular project member to recall as many projects sim- would influence the project’s future
requires the following three steps: ilar to theirs as possible, and to think course. Instead, it examined the experi-
1. Identification of a relevant refer- of these projects as they were in a stage ences of a class of similar projects, laid
ence class of past, similar proj- comparable to their project. “How out a rough distribution of outcomes
ects. The class must be broad long did it take them at that point to for this reference class, and then posi-
enough to be statistically mean- reach completion?,” the expert was tioned the current project in that dis-
ingful, but narrow enough to be asked. After a while he answered, with tribution. The resulting forecast, as it
truly comparable with the spe- some discomfort, that not all the com- turned out, was much more accurate.
cific project. parable teams he could think of ever The contrast between inside and
2. Establishing a probability distri- did complete their task. About 40% of outside views has been confirmed by
bution for the selected reference them eventually gave up. Of those systematic research (Gilovich, Griffin,
class. This requires access to remaining, the expert could not think & Kahneman, 2002). The research
credible, empirical data for a of any that completed their task in less shows that when people are asked sim-
sufficient number of projects than seven years, nor of any that took ple questions requiring them to take
within the reference class to more than 10. The expert was then an outside view, their forecasts become
make statistically meaningful asked if he had reason to believe that significantly more accurate. For exam-
conclusions. the present team was more skilled in ple, a group of students enrolling at a
3. Comparing the specific project curriculum development than the ear- college were asked to rate their future
with the reference class distribu- lier ones had been. The expert said no, academic performance relative to their
tion, in order to establish the he did not see any relevant factor that peers in their major. On average, these
most likely outcome for the spe- distinguished this team favorably from students expected to perform better
cific project. the teams that he had been thinking than 84% of their peers, which is logi-
about. His impression was that the cally impossible. The forecasts were
Thus, reference class forecasting present team was slightly below aver- biased by overconfidence. Another
does not try to forecast the specific age in terms of resources and potential. group of incoming students from the
uncertain events that will affect the According to Kahneman, the wise deci- same major were asked about their
particular project, but instead places sion at this point would probably have entrance scores and their peers’ scores
the project in a statistical distribution been for the team to break up. Instead, before being asked about their expect-
of outcomes from the class of reference the members ignored the pessimistic ed performance. This simple diversion
projects. In statisticians’ vernacular, information and proceeded with the into relevant outside-view informa-
reference class forecasting consists of project. They finally completed the tion, which both groups of subjects
regressing forecasters’ best guesses project eight years later, and their were aware of, reduced the second
toward the average of the reference efforts went largely wasted—the result- group’s average expected performance
class and expanding their estimate of ing curriculum was rarely used. ratings by 20%. That is still overconfi-
credible interval toward the correspon- In this example, the curriculum dent, but it is much more realistic than
ding interval for the class (Kahneman expert made two forecasts for the same the forecast made by the first group
& Tversky, 1979b, p. 326). problem and arrived at very different (Lovallo & Kahneman, 2003, p. 61).
Daniel Kahneman relates the fol- answers. The first forecast was the However, most individuals and
lowing story about curriculum plan- inside view; the second was the outside organizations are inclined to adopt the
ning to illustrate how reference class view, or the reference class forecast. inside view in planning new projects.
forecasting works (Lovallo & The inside view is the one that the This is the conventional and intuitive
Kahneman, 2003, p. 61). Some years expert and the other team members approach. The traditional way to think
ago, Kahneman was involved in a proj- adopted. They made forecasts by focus- about a complex project is to focus on
ect to develop a curriculum for a new ing tightly on the project at hand, and the project itself and its details, to
subject area for high schools in Israel. considering its objective, the resources bring to bear what one knows about it,
The project was carried out by a team they brought to it, and the obstacles to paying special attention to its unique
of academics and teachers. In time, the its completion. They constructed in or unusual features, trying to predict
team began to discuss how long the their minds scenarios of their coming the events that will influence its future.
project would take to complete. progress and extrapolated current The thought of going out and gather-
Everyone on the team was asked to trends into the future. The resulting ing simple statistics about related proj-
write on a slip of paper the number of forecasts, even the most conservative ects seldom enters a manager’s mind.
This is the case in general, according to First Instance of Reference Class benefits, and duration of large public
Lovallo and Kahneman (2003, pp. Forecasting in Practice procurement (HM Treasury, 2003a;
61–62). And it is certainly the case for The first instance of reference class fore- 2003b).
cost and demand forecasting in trans- casting in practice may be found in In response to the Treasury’s
portation infrastructure projects. Of Flyvbjerg and Cowi (2004): Green Book and its recommendations,
the several-hundred forecasts reviewed “Procedures for Dealing with Optimism the U.K. Department for Transport
3
in Flyvbjerg, Bruzelius, and Bias in Transport Planning.” Based on decided to collect the type of data
Rothengatter (2003) and Flyvbjerg, this study in the summer of 2004, the which the Treasury recommended, and
Holm, and Buhl (2002; 2005), not one U.K. Department for Transport and HM on that basis to develop a methodolo-
was a reference class forecast.2 Treasury decided to employ the method gy for dealing with optimism bias in
Although understandable, project as part of project appraisal for large the planning and management of
managers’ preference for the inside view transportation projects. transportation projects. The
over the outside view is unfortunate. The immediate background to Department for Transport appointed
When both forecasting methods are this decision was the revision to The Bent Flyvbjerg in association with
applied with equal skill, the outside Green Book by HM Treasury in 2003 Cowi to undertake this assignment as
view is much more likely to produce a that identified for large public pro- regards costing of large transportation
realistic estimate. That is because it curement a demonstrated, systematic procurement. The main aims of the
bypasses cognitive and political biases tendency for project appraisers to be assignment were two; first, to provide
such as optimism bias and strategic overly optimistic: empirically based optimism bias
misrepresentation, and cuts directly to “There is a demonstrated, system- uplifts for selected reference classes of
outcomes. In the outside view, project atic tendency for project appraisers to transportation infrastructure projects,
managers and forecasters are not be overly optimistic. To redress this and, second, to provide guidance on
required to make scenarios, imagine tendency, appraisers should make using the established uplifts to pro-
events, or gauge their own and others’ explicit, empirically based adjustments duce more realistic forecasts of capital
levels of ability and control, so they can- to the estimates of a project’s costs, expenditures in individual projects
not get all these things wrong. Human benefits, and duration … [I]t is recom- (Flyvbjerg & Cowi, 2004). Uplifts
bias is bypassed. Surely the outside mended that these adjustments be would be established for capital expen-
view, being based on historical prece- based on data from past projects or ditures based on the full business case
dent, may fail to predict extreme out- similar projects elsewhere” (HM (time of decision to build).
comes; that is, those that lie outside all Treasury, 2003b, p. 1). The types of transportation
historical precedents. But for most proj- Such optimism was seen as an schemes under the direct and indirect
ects, the outside view will produce more impediment to prudent fiscal plan- responsibility of the U.K. Department
accurate results. In contrast, a focus on ning, for the government as a whole for Transport were divided into a num-
inside details is the road to inaccuracy. and for individual departments within ber of distinct categories in which sta-
The comparative advantage of the government. To redress this tendency, tistical tests, benchmarkings, and other
outside view is most pronounced for HM Treasury recommended that analyses showed that the risk of cost
non-routine projects, understood as appraisers involved in large public pro- overruns within each category may be
projects that managers and decision- curement should make explicit, empir- treated as statistically similar. For each
makers in a certain locale or organiza- ically based adjustments to the category, a reference class of projects
tion have never attempted before—like estimates of a project’s costs, benefits, was then established as the basis for
building new plants or infrastructure, or and duration. HM Treasury recom- reference class forecasting, as required
catering to new types of demand. It is in mended that these adjustments be by step 1 in the three-step procedure
the planning of such new efforts that based on data from past projects or for reference class forecasting previous-
the biases toward optimism and strate- similar projects elsewhere, and adjust- ly described. The specific categories
gic misrepresentation are likely to be ed for the unique characteristics of the and the types of project allocated to
largest. To be sure, choosing the right project at hand. In the absence of a each category are shown in Table 3.
reference class of comparative past proj- more specific evidence base, HM For each category of projects, a ref-
ects becomes more difficult when man- Treasury encouraged government erence class of completed, comparable
agers are forecasting initiatives for departments to collect valid and reli- transportation infrastructure projects
which precedents are not easily found; able data to inform future estimates of was used to establish probability distri-
for instance, the introduction of new optimism, and in the meantime to use butions for cost overruns for new proj-
and unfamiliar technologies. However, the best available data. The Treasury let ects similar in scope and risks to the
most projects are both non-routine it be understood that in the future the projects in the reference class, as
locally and use well-known technolo- allocation of funds for large public required by step 2 in reference class
gies. Such projects are, therefore, partic- procurement would be dependent on forecasting. For roads, for example, a
ularly likely to benefit from the outside valid adjustments of optimism in class of 172 completed and compara-
view and reference class forecasting. order to secure valid estimates of costs, ble projects was used to establish the
Category Types of Projects cost overrun, the higher the uplift. For
instance, with a willingness to accept a
Roads Motorway 50% risk for cost overrun in a road
Trunk roads project, the required uplift for this
Local roads
Bicycle facilities project would be 15%. If the
Pedestrian facilities Department for Transport were willing
Park and ride to accept only a 10% risk for cost over-
Bus lane schemes run, then the required uplift would be
Guided buses on wheels 45%. In comparison, for rail, with a
Rail Metro willingness to accept a 50% risk for
Light rail cost overrun, the required uplift would
Guided buses on tracks be 40%. If the Department for
Conventional rail
High speed rail Transport were willing to accept only a
10% risk for cost overrun, then the
Fixed links Bridges required uplift would be 68% for rail.
Tunnels
All three figures share the same basic S-
Building projects Stations shape, but at different levels, demon-
Terminal buildings strating that the required uplifts are
IT projects IT system development significantly different for different
Standard civil engineering Included for reference purposes only project categories for a given level of
risk of cost overrun. The figures also
Non-standard civil engineering Included for reference purposes only show that the cost for additional
reductions in the risk of cost overrun is
Table 3: Categories and types of projects used as basis for reference class forecasting
different for the three types of projects,
with risk reduction becoming increas-
ingly expensive (rising marginal costs)
Distribution of Cost Overruns for roads and fixed links below 20%
Roads risk, whereas for rail the cost of
100% increased risk reduction rises more
90% slowly, albeit from a high level.
Share of Projects with Given
80% Table 4 presents an overview of

applicable optimism bias uplifts for
Max. Cost Overrun
70%
60% the 50% and 80% percentiles for all
the project categories listed in Table 3.
50%
The 50% percentile is pertinent to the
40% investor with a large project portfolio,
30% where cost overruns on one project
20% may be offset by cost savings on anoth-
10% er. The 80% percentile—correspon-
0% ding to a risk of cost overrun of
-20% 0% 20% 40% 60% 80% 20%—is the level of risk that the U.K.
Cost Overrun VS Budget Department for Transport is typically
willing to accept for large investments
Source: Flyvbjerg database on large-scale infrastructure projects. in local transportation infrastructure.
Figure 2: Probability distribution of cost overrun for roads, constant prices (N=172) The established uplifts for opti-
mism bias should be applied to esti-
probability distribution of cost overrun is substantial for all three project mated budgets at the time of decision
runs shown in Figure 2. The share of types, but highest for rail, followed by to build a project. In the U.K., the
projects with a given maximum cost bridges and tunnels, and with the low- approval stage for a large transporta-
overrun is shown in the figure. For est risk for roads. tion project is equivalent to the time of
instance, 40% of projects have a maxi- Based on the probability distribu- presenting the business case for the
mum cost overrun of 10%; 80% of tions described, the required uplifts project to the Department for
projects a maximum overrun of 32%, needed to carry out step 3 in a refer- Transport with a view to obtaining the
etc. For rail, the probability distribu- ence class forecast may be calculated as go or no-go for that project.
tion is shown in Figure 3, and for shown in Figures 5–7. The uplifts refer If, for instance, a group of project
bridges and tunnels in Figure 4. The to cost overrun calculated in constant managers were preparing the business
figures show that the risk of cost over- prices. The lower the acceptable risk for case for a new motorway, and if they or
Applicable Optimism colleagues were for the projects in the
Bias Uplifts reference class would the managers be
Category Types of Projects justified in using lower uplifts than
50% 80% those previously described.
percentile percentile Conversely, if there is evidence that the
Roads Motorway project managers are worse at estimat-
Trunk roads ing costs than their colleagues, then
Local roads higher uplifts should be used.
Bicycle facilities 15% 32% The methodology previously
Pedestrian facilities
Park and ride described for systematic, practical ref-
Bus lane schemes erence class forecasting for transporta-
Guided buses on wheels tion projects was developed in
Rail Metro 2003–2004, with publication by the
Light rail Department of Transport in August
Guided buses on tracks 40% 57% 2004. From this date on, local author-
Conventional rail ities applying for funding for trans-
High speed rail
portation projects with the
Fixed links Bridges 23% 55% Department for Transport or with HM
Tunnels Treasury were required to take into
Building projects Stations account optimism bias by using uplifts
Terminal buildings 4-51%* as previously described and as laid out
IT projects IT system development 10-200%* in more detail in guidelines from the
two ministries.
Standard civil Included for reference purposes only
engineering 3-44%*
Forecasting Costs for the
Non-standard Included for reference purposes only Edinburgh Tram
civil engineering 6-66%*
In October 2004, the first instance of
*Based on Mott MacDonald (2002, p. 32) no probability distribution available. practical use of the uplifts was record-
Table 4: Applicable capital expenditure optimism bia uplifts for 50% and 80% percentiles, ed, in the planning of the Edinburgh
constant prices Tram Line 2. Ove Arup and Partners
Scotland (2004) had been appointed
their client had decided that the risk of budget, then the final budget would be by the Scottish Parliament’s Edinburgh
cost overrun must be less than 20%, £420 million. Tram Bill Committee to provide a
then they would use an uplift of 32% It follows that the 50% percentile review of the Edinburgh Tram Line 2
on their estimated capital expenditure should be used only in instances business case developed on behalf of
budget. Thus, if the initially estimated where investors are willing to take a Transport Initiatives Edinburgh.
budget were £100 million, then the high degree of risk that cost overrun Transport Initiatives Edinburgh is the
final budget—taking into account will occur and/or in situations where project promoter and is a private limit-
optimism bias at the 80%-level— investors are funding a large number ed company owned by the City of
would be £132 million (£1 = $1.8). If of projects, and where cost savings Edinburgh Council established to
the project managers or their client (underruns) on one project may be deliver major transport projects for the
decided instead that a 50% risk of cost used to cover the costs of overruns on Council. The Scottish Executive is a
overrun was acceptable, then the uplift other projects. The upper percentiles main funder of the Edinburgh Tram,
would be 15% and the final budget (80–90%) should be used when having made an Executive Grant of
£115 million. investors want a high degree of certain- £375 million (US$670 million) toward
Similarly, if a group of project ty that cost overrun will not occur; for lines 1 and 2, of which Transport
managers were preparing the business instance, in stand-alone projects with Initiatives Edinburgh proposed spend-
case for a metro rail project, and if they no access to additional funds beyond ing £165 million toward Line 2.
or their client had decided that with the approved budget. Other percentiles As part of their review, Ove Arup
80% certainty they wanted to stay may be employed to reflect other assessed whether the business case for
within budget, then they would use an degrees of willingness to accept risk Tram Line 2 had adequately taken into
uplift on capital costs of 57%. An ini- and the associated uplifts as shown in account optimism bias as regards capi-
tial capital expenditure budget of £300 Figures 5–7. tal costs. The business case had esti-
million would then become a final Only if project managers have evi- mated a base cost of £255 million and
budget of £504 million. If the project dence to substantiate that they would an additional allowance for contin-
managers or their client required only be significantly better at estimating gency and optimism bias of £64 mil-
50% certainty they would stay within costs for the project at hand than their lion—or 25%—resulting in total
distributional information made avail-
Distribution of Cost Overruns
Rail able by the U.K. Department for
100% Transport, Ove Arup was able to take
90% an outside view on the Edinburgh
Tram Line 2 capital cost forecast and
80%
thus unbias what appeared to be a
Max. Cost Overrun
70% biased forecast. As a result, Ove Arup’s

60% client, The Scottish Parliament, was
50% provided with a more reliable estimate
40% of what the true costs of Line 2 was
30% likely to be.
20%
Potentials and Barriers for Reference
10% Class Forecasting
0% As previously mentioned, two types of
-20% 0% 20% 40% 60% 80% 100% explanation best account for forecasting
Cost Overrun VS Budget inaccuracy: optimism bias and strategic
Source: Flyvbjerg database on large-scale infrastructure projects.
misrepresentation. Reference class fore-
casting was originally developed to miti-
Figure 3: Probability distribution of cost overrun for rail, constant prices (N=46)
gate optimism bias, but reference class
forecasting may help mitigate any type of
capital costs of approximately £320 mism bias uplifts be applied to the human bias, including strategic bias,
million. Ove Arup concluded about budget at the time of decision to build, because the method bypasses such bias
this overall estimate of capital costs which typically equates to business by cutting directly to empirical outcomes
that it seemed to have been rigorously case submission. (In addition, Tram and building forecasts on these. Even so,
prepared using a database of costs, Line 2 had not yet even reached the the potentials for and barriers to refer-
comparison to other U.K. light rail outline business case stage, indicating ence class forecasting will be different in
schemes, and reconciliations with ear- that risks would be substantially high- situations in which (1) optimism bias is
lier project estimates. Ove Arup found, er at this early stage, as would corre- the main cause of inaccuracy as com-
however, that the following potential sponding uplifts. On that basis, Arup pared to situations in which (2) strategic
additional costs needed to be consid- concluded that “it is considered that misrepresentation is the reason for inac-
ered in determining the overall capital current optimism bias uplifts [for Tram curacy. We therefore need to distinguish
costs: £26 million for future expendi- Line 2] may have been underestimat- between these two types of situations
ture on replacement and renewals and ed” [Ove Arup & Partners Scotland, when endeavoring to apply reference
£20 million as a notional allowance 2004, p. 27].) class forecasting in practice.
for a capital sum to cover risks of Finally, Ove Arup mentioned that In the first type of situation—in
future revenue shortfalls, amounting the Department for Transport guid- which optimism bias is the main cause
to an increase in total capital costs of ance does allow for optimism bias to of inaccuracy—we may assume that
14.4% (Ove Arup and Partners be adjusted downward if strong evi- managers and forecasters are making
Scotland, 2004, pp. 15–16). dence of improved risk mitigation can honest mistakes and have an interest
Using the U.K. Department for be demonstrated. According to Ove in improving accuracy. Consider, for
Transport uplifts for optimism bias Arup, this may be the case if advanced example, the students who were asked
previously presented on the base costs, risk analysis has been applied, but this to estimate their future academic per-
Ove Arup then calculated the 80th per- was not the case for Tram Line 2. Ove formance relative to their peers. We
centile value for total capital costs— Arup therefore concluded that “the may reasonably believe that the stu-
the value at which the likelihood of justification for reduced Department dents did not deliberately misrepresent
staying within budget is 80%—to be for Transport optimism bias uplifts their estimates, because they had no
£400 million (i.e., £255 million x would appear to be weak” (Ove Arup interest in doing so and were not
1.57). The 50th percentile for total & Partners Scotland, 2004, pp. exposed to pressures that would push
capital costs—the value at which the 27–28). Thus, the overall conclusion them in that direction. The students
likelihood of staying within budget is of Ove Arup was that the promoter’s made honest mistakes, which pro-
50%—was £357 million (i.e., £255 capital cost estimate of approximately duced honest, if biased, numbers
million x 1.4). Ove Arup remarked that £320 million was optimistic. Most regarding performance. And, indeed,
these estimates of total capital costs likely Tram Line 2 would cost signifi- when students were asked to take into
were likely to be conservative—that is, cantly more. account outside-view information, we
low—because the U.K. Department for By framing the forecasting prob- saw that the accuracy of their estimates
Transport recommends that its opti- lem to allow the use of the empirical improved substantially. In this type of
situation—when forecasters are hon-
Distribution of Cost Overruns
estly trying to gauge the future—the Fixed Links
potential for using the outside view 100%
and reference class forecasting will be 90%
good. Forecasters will be welcoming

80%
the method and barriers will be low,
Max. Cost Overrun

70%
because no one has reason to be
against a methodology that will 60%
improve their forecasts. 50%
In the second type of situation— 40%
in which strategic misrepresentation is 30%
the main cause of inaccuracy—differ-
20%
ences between estimated and actual
10%
costs and benefits are best explained
by political and organizational pres- 0%
-20% 30% 80% 130% 180%
sures. Here, managers and forecasters
would still need reference class fore- Cost Overrun VS Budget
casting if accuracy were to be Source: Flyvbjerg database on large-scale infrastructure projects.
improved, but managers and forecast-
Figure 4: Probability distribution of cost overrun for fixed links, constant prices (N=34)
ers may not be interested in this
because inaccuracy is deliberate.
Biased forecasts serve strategic purpos- Required Uplift
es that dominate the commitment to Roads
accuracy and truth. Consider, for 70%
example, city managers with responsi- 60%
bility for estimating costs and benefits
of urban rail projects. Here, the 50%
Required Uplift
assumption of innocence regarding

outcomes typically cannot be upheld. 40%
Cities compete fiercely for approval 30%
and for scarce national funds for such
projects, and pressures are strong to 20%
present projects as favorably as possi-
ble; that is, with low costs and high 10%
benefits, in order to beat the competi- 0%
tion. There is no incentive for the indi- 0% 10% 20% 30% 40% 50% 60%
vidual city to unbias its forecasts, but Acceptable Chance of Cost Overrun
quite the opposite. Unless all other
cities also unbias, the individual city Source: Flyvbjerg database on large-scale infrastructure projects.
would lose out in the competition for Figure 5: Required uplift for roads as function of the maximum acceptable level of risk for cost
funds. Project managers are on record overrun, constant prices (N=172)
confirming that this is a common situ-
ation (Flyvbjerg & Cowi, 2004, pp. reward accurate forecasts and punish higher the stakes, and the higher the
36–58; Flyvbjerg & Lovallo, in inaccurate ones. Forecasters and pro- level of political and organizational
progress). The result is the same as in moters should be made to carry the pressures, the more pronounced
the case of optimism: actors promote full risks of their forecasts. Their will be the need for such measures
ventures that are unlikely to perform as work should be reviewed by inde- of accountability. Flyvbjerg,
promised. But the causes are different, pendent bodies such as national Bruzelius, and Rothengatter (2003)
as are possible cures. auditors or independent analysts, and Flyvbjerg, Holm, and Buhl
In this type of situation, the and such bodies would need refer- (2005) further detailed the design
potential for reference class forecasting ence class forecasting to do their of such measures and how they may
is low—the demand for accuracy is work. Projects with inflated bene- be implemented in practical project
simply not there—and barriers are fit-cost ratios should be stopped or management.
high. In order to lower barriers, and placed on hold. Professional and The existence of strategic misrepre-
thus create room for reference class even criminal penalties should be con- sentation does not exclude the simul-
forecasting, measures of accountability sidered for people who consistently taneous existence of optimism bias,
must be implemented that would produce misleading forecasts. The and vice versa. In fact, it is realistic to
tion is perceived as being available to
Required Uplift
Rail the project and its contractors. This
90% makes it important to combine the
introduction of reference class fore-
80%
casting and optimism bias uplifts with
70% tight contracts, and maintained incen-
60% tives for promoters to undertake good
Required Uplift
50% quantified risk assessment and exercise

prudent cost control during project
40%
implementation. How this may be
30% done is described in Flyvbjerg and
20% Cowi (2004).
10%
Notes
0% 1 Inaccuracy is measured in percent as
0% 10% 20% 30% 40% 50% 60%
(actual outcome/forecast outcome -1) x
Acceptable Chance of Cost Overrun 100. The base year of a forecast for a
Source: Flyvbjerg database on large-scale infrastructure projects.
project is the time of decision to build
that project. An inaccuracy of 0 indi-
Figure 6: Required uplift for rail as function of the maximum acceptable level of risk for cost
cates perfect accuracy. Cost is measured
overrun, constant prices (N=46)
as construction costs. Demand is meas-
ured as number of vehicles for roads
Required Uplift and number of passengers for rail.
Fixed Links
160% 2 The closest thing to an outside view
140% in large infrastructure forecasting is
Gordon and Wilson’s (1984) use of
120%
regression analysis on an international
Required Uplift
100% cross section of light-rail projects to

80% forecast patronage in a number of
light-rail schemes in North America.
60%
3 The fact that this is, indeed, the first
40%
instance of practical reference class
20% forecasting has been confirmed with
0% Daniel Kahneman and Dan Lovallo,
0% 10% 20% 30% 40% 50% 60% who also knows of no other instances
Acceptable Chance of Cost Overrun of practical reference class forecasting.
Personal communications with Daniel
Source: Flyvbjerg database on large-scale infrastructure projects. Kahneman and Dan Lovallo, author’s
Figure 7: Required uplift for fixed links as function of the maximum acceptable level of risk for archives.
cost overrun, constant prices (N=34)
References
expect such co-existence in forecasting ciencies and overspending. Reserves Altshuler, A., & Luberoff, D.
in large and complex projects and will be spent simply because they are (2003). Mega-projects: The changing poli-
organizations. This again underscores there, as the saying goes in the con- tics of urban public investment.
the point that improved forecasting struction business. For instance, it is Washington, DC: Brookings Institution.
methods—here, reference class fore- important to recognize that, for the American Planning Association.
casting—and measures of accountabil- previously mentioned examples, the (2005). JAPA article calls on planners to
ity must go hand in hand if the introduction of reference class fore- help end inaccuracies in public project rev-
attempt to arrive at more accurate forecasting and optimism-bias uplifts enue forecasting. Retrieved April 7, 2005,
casts is to be effective. would establish total budget reserva- from http://www.planning.org/newsre-
Finally, it could be argued that in tions (including uplifts) which for leases/2005/ftp040705.htm
some cases the use of reference class some projects would be more than Flyvbjerg, B. (2005, Spring/Summer).
forecasting may result in such large adequate. This may in itself create an Design by deception: The politics of
reserves set aside for a project that this incentive which works against firm megaproject approval. Harvard Design
would in itself lead to risks of ineffi- cost control if the total budget reserva- Magazine, 22, 50–59.
Flyvbjerg, B., Bruzelius, N., & Gilovich, T., Griffin, D., & corrective procedures. In S. Makridakis &
Rothengatter, W. (2003). Megaprojects and Kahneman, D. (Eds.). (2002). S. C. Wheelwright (Eds.), Studies in the
risk: An anatomy of ambition. Cambridge, Heuristics and biases: The psychology of management sciences: Forecasting (p. 12)
MA: Cambridge University Press. intuitive judgment. Cambridge, MA: Amsterdam: North Holland.
Flyvbjerg, B., & Cowi. (2004). Cambridge University Press. Lovallo, D., & Kahneman, D.
Procedures for dealing with optimism bias Gordon, P., & Wilson, R. (1984). (2003, July). Delusions of success:
in transport planning: Guidance docu- The determinants of light-rail transit How optimism undermines executives’
ment. London: UK Department for demand: An international cross-sec- decisions. Harvard Business Review,
Transport. tional comparison. Transportation 56–63.
Flyvbjerg, B., Holm, M. K. S., & Research A, 18A(2), 135–140. MacDonald, M. (2002). Review of
Buhl, S. L. (2002, Summer). HM Treasury. (2003a). The green large public procurement in the UK.
Underestimating costs in public works book: Appraisal and evaluation in central gov- Study for HM Treasury. London: HM
projects: Error or lie? Journal of the ernment, treasury guidance. London: TSO. Treasury.
American Planning Association, 68(3), HM Treasury. (2003b). Ove Arup & Partners Scotland.
279–295. Supplementary green book guidance: (2004). Scottish parliament, Edinburgh
Flyvbjerg, B., Holm, M. K. S., & Optimism bias. London: HM Treasury. tram line 2 review of business case. West
Buhl, S. L. (2004). What causes cost Kahneman, D. (1994). New chal- Lothian, Scotland: Author.
overrun in transport infrastructure lenges to the rationality assumption. Vanston, J. H., & Vanston, L. K.
projects? Transport Reviews, 24(1), Journal of Institutional and Theoretical (2004). Testing the tea leaves:
3–18. Economics, 150, 18–36. Evaluating the validity of forecasts.
Flyvbjerg, B., Holm, M. K. S., & Kahneman, D., & Lovallo, D. Research-Technology Management,
Buhl, S. L. (2005, Spring).How (1993). Timid choices and bold fore- 47(5), 33–39.
(in)accurate are demand forecasts in casts: A cognitive perspective on risk Wachs, M. (1989). When planners
public works projects? The case of taking. Management Science, 39, 17–31. lie with numbers. Journal of the
transportation. Journal of the American Kahneman, D., & Tversky, A. American Planning Association, 55(4),
Planning Association, 71(2), 131-146. (1979a). Prospect theory: An analysis 476–479.
Flyvbjerg, B., & Lovallo, D. (n.d.). of decisions under risk. Econometrica, Wachs, M. (1990). Ethics and
Delusion and deception: Two models for 47, 313–327. advocacy in forecasting for public pol-
explaining executive disaster. Kahneman, D., & Tversky, A. icy. Business and Professional Ethics
Unpublished manuscript. (1979b). Intuitive prediction: Biases and Journal, 9(1–2), 141–157.
BENT FLYVBJERG is professor of planning at the Department of Development and Planning at Aalborg University,
Denmark. He has doctor of technology from Aalborg University and holds a PhD in urban geography and planning from
Aarhus University, Denmark. He was twice a visiting Fulbright Scholar to the U.S., where he did research at the University
of California at Los Angeles and Berkeley and at Harvard University. He has been a visiting fellow with the European
University Institute in Florence. He has written several books: Megaprojects and Risk: An Anatomy of Ambition, Making
Social Science Matter, and Rationality and Power: Democracy in Practice. His books and articles have been translated into
15 languages. His main research interest is urban policy and planning. He is currently conducting research on
megaprojects, phronetic planning research, and the relationship between truth and lying in policy and planning. He has
two decades of practical experience from working as a policy and planning adviser and consultant to more than 30 public
and private organizations, including the EU Commission, the United Nations, national and local government, auditors
general, and private companies. His work covers both developed and developing nations. He has been adviser to the
government of Denmark in formulating national policies for transportation, environment, and research. He is founding
chairman of the Geography Program at Aalborg University, co-founder of the university’s Program in Planning and
Environment, and founding director of the university’s Research Program on Large Infrastructure Projects. He has received
numerous honors and awards, including the Danish National Science Council Distinguished Research Scholarship
(equivalent to the MacArthur Fellowship). In 2002, Queen Margrethe II of Denmark conferred upon Bent Flyvbjerg the
Knighthood of the Order of the Dannebrog for his professional accomplishments.

Week 4 Flyvbjerg - 2006

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Week 4 Flyvbjerg - 2006

Uploaded by

Copyright:

Available Formats

FROM NOBEL PRIZE TO PROJECT

MANAGEMENT: GETTING RISKS RIGHT

The American Planning Association Endorses Reference Class Forecasting

A major source of risk in project manage-

80% Table 4 presents an overview of

70% biased forecast. As a result, Ove Arup’s

Share of Projects with Given

Max. Cost Overrun

assumption of innocence regarding

50% quantified risk assessment and exercise

100% cross section of light-rail projects to

You might also like