Professional Documents
Culture Documents
Learning Objectives
1.
Extrapolation
z
z
2.
3.
Part 1:
Extrapolation
Forecasting Context
z Uncertainty
with
Longer forecast horizon
z Smaller areas
z
z Extrapolation
Extrapolation Technique
z Fit
Assumptions
z Use
Advantages / Benefits
z Computational
simplicity
z Transparent methodology
z Ease of application
z May work for
Large areas
z Short time horizons
z Slow grow areas
z
Disadvantages / Risks
z Does
z Ignores
Klostermans Technique
z
Klosterman
z
z
z
z
Linear Function
Linear Function: Y=10X+100
450
Dependent Variable
400
350
300
250
200
150
100
50
0
0
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
= a + bX
where
a Y is the intercept and b is the slope
26
27
28
29
30
Geometric Function
Dependent Variable
Geometric Function:
X
=
10
(
1
.
1
)
YC
200
180
160
140
120
100
80
60
40
20
0
0
9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30
Inde pe nde nt Variable
X
=
ab
YC
where
a is the intercept
b is the growth rate plus one
Parabolic Function
Parabolic Function
2
=
10
+
1
.
5
X
+
2
X
YC
Dependent Variable
2,000
1,500
1,000
500
0
0
10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30
Independent Variable
= a + bX + cX
where
a is the Y intercept and b is the slope
Modified Exponential
Modified Exponential Function:
250
Dependent Variable
200
150
100
50
0
0
9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30
Independent Variable
X
=
c
+
ab
YC
where
a is c minus the Y intercept
b is the ratio of successive growth increments (constant)
c is the asymptotic value
Gompertz Function
Gompertz Function
120
= 100 * (0.9)
0.8 X
80
60
40
20
20
18
16
14
12
10
-2
-4
-6
-8
-1
0
-1
2
-1
4
-1
6
-1
8
0
-2
0
Dependent Variable
100
Independent Variable
Y C = ca
bX
Logistic Function
Logistic Function
YC =
0.120
1
10 + (0.5) * (0.5) x
Dependent Variable
0.100
0.080
0.060
0.040
0.020
Independent Variable
1
Y C = c + ab x
If
1. b is between 0 and 1 and
2. a < 0
Then
1. Curve takes the S shape and
2. 1/c is the asymptotic value (upper limit) and
3. 0 is the lower limit
20
18
16
14
12
10
-2
-4
-6
-8
-1
0
-1
2
-1
4
-1
6
-1
8
-2
0
0.000
by Stephen R. Lawrence of
University of Colorado
http://leeds-faculty.colorado.edu/Lawrence/Tools/SCurve/scurve.xls
z Algorithm
z Judgment
Part 2:
Simple Linear Regression
z Given
Meanings of Coefficients
z
z
0 No relationship
1 Perfect fit
R : correlation coefficient
z
z
Fitted Value
Deviation
Observed Value
i =1
i =1
a x i + bn =a y i
n
i =1
i =1
a x i + b x i = x i y i
i =1
10
R = 0.9577
8
6
4
2
Linear (y)
0
0
10
15
20
5
41
a
b
27.5
299
27.5
299
Inverse
a
b
a
b
-0.0574
0.6709
0.514706
1.279412
0.0070
-0.0574
SSE = [ y i f ( x i )]
i =1
SSE
r = 1
SST
2
Where:
SST = [ y i y ]
i =1
Values of r2
z 0<r2<1
z As r2 approaches
Calculating r2 in Excel
I
x
1
2
3
4
5
2
4
7
11
17
y
2.00
3.50
4.50
8.00
9.50
5.5
Error
Error^2
f(x)
y-f(x)
(y-f(x))^2 y-(AveY) (y-(AveY))^2
2.308824 -0.308824 0.095372
-3.50
12.25
3.338235 0.161765 0.026168
-2.00
4.00
4.882353 -0.382353 0.146194
-1.00
1.00
6.941176 1.058824 1.121107
2.50
6.25
10.029412 -0.529412 0.280277
4.00
16.00
1.67
39.50
Average of Y
SSE
SST
R^2 = 1 - (SSE/SST)
0.957743857 is the R squared
2
3.5
4.5
8
9.5
12
10
SUMMARY OUTPUT
8
Regression Statistics
Multiple R
0.978643887
R Square
0.957743857
Adjusted R Square
0.943658476
Standard Error
0.745903847
Observations
5
Y
Predicted Y
4
2
0
0
10
X Variable
1
15
20
ANOVA
df
Regression
Residual
Total
Intercept
X Variable 1
1
3
4
SS
MS
F
Significance F
37.83088235 37.83088 67.99559
0.003734402
1.669117647 0.556373
39.5
z There
Part 3:
Calculating Forecasted Values
from Fitted Functions in Excel
http://home.business.utah.edu/bebrpsp/URPL5020/Trend/CalcExtrap.xls
Worksheet examples
Linear
Go to Options tab
Forecast period
Display equation
Display R-squared
Note: You can
customize trend
label
Click OK
y = 29481.450370525x + 316996.533799534
2
R = 0.962332124
State of Utah
Trend Forecast
1,000,000
500,000
0
1940
1950
1960
1970
1980
1990
2000
2010
2020
y = 29,481.45x + 31,6996.53
2
3,000,000
2,500,000
2,000,000
R = 0.962
1,500,000
State of Utah
Trend Forecast
1,000,000
500,000
0
1940
1950
1960
1970
1980
1990
2000
2010
2020
Y = population
X = time marker substitute the year (?)
29481.450370525*(2020)+ 316996.533799534
= 59,869,526
This is much too high.
1940
1950
1960
1970
1980
1990
2000
150,000
100,000
50,000
0
-50,000
-100,000
-150,000
-200,000
-250,000
-300,000
-350,000
1940
1950
1960
1970
1980
1990
2000
Ratio Methods
z
z
z
z
z
Extrapolation - Summary
z
Just because a function fits (high r2) does not mean that the
extrapolation is reasonable.
Make your assumptions explicit
Generally there are growth limits at some point
Create your own functions in excel based on your knowledge
of the area (growth limits, etc.)
Use Excel to fit a trend line and extrapolate it into the future