Professional Documents
Culture Documents
3,000,000
2,500,000
2,000,000
1,500,000
1,000,000
Ecommerce
500,000
0
1995 2000 2005 2010 2015
Year
– Source: census.gov
432000
Sales in Million (USD)
428000
424000
420000
416000
412000
1 2 3 4 5 6 7 8 9 10 11 12
Month (Jan=1, Feb=2, ...)
◆ E.g. in 2013, sales are higher during the later half of the year (source:
census.gov)
◆ Seasonality effects often come from predictable annual events (cultural,
weather)
– Thanksgiving sales in the US, Boxing Day sales in Canada, UK and Australia
– Diwali Sales in India, Chinese New Year Sales.
– Ski sales in winter.
OPS Analytics MOOC, Week 1, Session 3
Forecasting when there is Trend
◆ In such cases, where the data reveals trend let us examine two
ways of forecasting the future demand.
◆ Linear Regression
◆ Main forecasting idea is to fit a line that has a slope to capture the
trend in data.
Model: 𝐷t = 𝑎 + 𝑏𝑡
◆ Linear Regression
Source:nps.gov
◆ National Park Service cares about the data for several purposes
– To plan for camping or backpacking permits.
– To plan for adequate emergency services.
– To plan food and fuel services (for visitors and cars).
– To budget revenues and costs.
– To understand ecological footprint.
◆ In YellowstoneTemplate.xlsx template,
◆ Using last 50 data points we get the following best fit trend line,
𝐷t = −53,525,580 + 28248𝑡
50-Year Trendline
4000000
3500000
3000000
2500000
y = 28,248x - 53,525,580
2000000
1500000
1000000
500000
0
1960 1970 1980 1990 2000 2010 2020
𝐷t = −57,401,021 + 30191𝑡
30-Year Trendline
4000000
3500000
3000000
Number of visitors
2500000
y = 30,191x - 57,401,021
2000000
1500000
1000000
500000
0
1980 1985 1990 1995 2000 2005 2010 2015 2020
year
◆ Recognizing Seasonality
30
25
20
visitors 15
Seasonal
10
0
0 2 4 6 8 10 12 14 period
– ∑i 𝑐i = 𝑁 .
◆ Step 1: Sample Mean. Compute the sample mean of the entire data set.
MEAN 14.33
30
25
20
visitors 15
Seasonal
10
0
0 2 4 6 8 10 12 14 period
MEAN 14.33
30
25
20
visitors 15
Seasonal
10
0
0 2 4 6 8 10 12 14 period
MEAN 14.33
30
25
20
15
Seasonal
visitors 10
0
0 2 4 6 8 10 12 14 period
MEAN 14.333
◆ The monthly load factor data (2003-2013) shows both trend and seasonality.
– You can see airlines are getting more crowded over the years.
– You also see that there some months are more crowded than others.
◆ Using the Template, I follow the previous 4 steps again to generate the de-
seasonalized data.
◆ Using MA(4). The average of last 4 data points from year 2014:
13.91
– De-seasonal forecast = 13.91
MEAN 14.333
– long term forecasting (assuming same trend, etc.) is fraught with pitfalls
since technology changes might occur.