Professional Documents
Culture Documents
Eurostat
Topics
Presented by
yvind Langsrud
Statistics Norway
80
60
40
a1
100
120
140
2004
2006
2008
Time
2010
2012
40
60
80
100
120
140
2004
2006
2008
Time
2010
2012
40
60
80
100
120
140
2004
2006
2008
Time
2010
2012
80
100
120
140
160
Monthly
time series example
Original series: Retail sales volume index
2000
2002
2004
2006
2008
2010
2012
2014
120
140
160
y = a + b*time + e
80
100
a = -6619.731
b = 3.351223
2000
2002
2004
2006
2008
2010
2012
2014
y = a + b*time + cmonth + e
80
100
120
140
160
2000
2002
2004
2006
2008
2010
2012
2014
140
80
y = a + b*time + cmonth + e
100
120
160
2000
2002
2004
2006
2008
2010
2012
a = -6468.505
b = 3.275956
c =
mnd0
mnd2
-9.19620250 -16.59062737
mnd7
mnd8
1.84439111
4.62139480
mnd3
-6.79790939
mnd9
-2.56494236
mnd4
-8.51090569
mnd10
-0.04409251
mnd5
-1.18890200
mnd11
1.53598811
mnd6
6.33881598
mnd12
30.55299181
yt = Tt + St + It
2014
yt = Tt + St + It
80
100
120
140
160
2000
2002
2004
2006
2008
2010
2012
2014
yt = Tt + St + It
Seasonality from "the dirty model"
-10
10
20
30
Seasonality
2000
2002
2004
2006
2008
2010
2012
2014
yt = Tt + St + It
Seasonal adjustment by "the dirty model"
80
100
120
140
160
2000
2002
2004
2006
2008
2010
2012
2014
yt = Tt + St + It
Irregular component by "the dirty model"
-5
10
Irregular componet
2000
2002
2004
2006
2008
2010
2012
2014
yt = Tt St It
yt is not the original series but a series that is corrected for holiday and
trading day effects (calendar adjusted)
yt = Tt St It
100
120
140
160
80
2000
2002
2004
2006
2008
2010
2012
2014
Seasonal factors
0.9
1.0
1.1
1.2
1.3
yt = Tt St It
2000
2005
2010
2015
Irregular componet
0.97
0.98
0.99
1.00
1.01
1.02
yt = Tt St It
2000
2002
2004
2006
2008
2010
2012
2014
yt = Tt St It
80
100
120
140
160
2000
2002
2004
2006
2008
2010
2012
2014
Multiplicative model: yt = Tt St It
Additive model: yt = Tt + St + It
90
100
110
2002
2004
2006
2008
2010
2012
90
100
110
120
2000
2002
2004
2006
2008
2010
2012
2014
110
115
120
2007
2008
2009
2010
2011
2012
Smoothing
by averaging
P = (Y + Y + Y )/3
t-1
t+1
115
120
110
2007
2008
2009
2010
2011
2012
110
115
120
2007
2008
2009
2010
2011
2012
110
115
120
2007
2008
2009
2010
2011
2012
2x12 filter
[1/2,1,1,1,1,1,1,1,1,1,1,1,1/2]/12
Also called a centred 12-term moving average
Question to the audience:
Why is this filter of special interest?
Henderson filters
Finding filters with good properties is an
interesting topic
Hederson (1916) introduces the so-called
Henderson filters
X-12-ARIMA uses this type of filter to calculate
the trend
The filter length determines the degree of
smoothing
110
115
120
2007
2008
2009
2010
2011
2012
110
115
120
2007
2008
2009
2010
2011
2012
110
115
120
2007
2008
2009
2010
2011
2012
110
115
120
2007
2008
2009
2010
2011
2012
Question to the audience: Why does the filtered series stop in 2009?
110
115
120
2007
2008
2009
2010
2011
2012
1.2
1.1
1.0
0.9
Finding the
seasonal
component
by filtering
1.3
2000
2002
2004
2006
2008
2010
2012
2014
X12-ARIMA or SEATS
Both method can be viewed as filtering techniques
X12-ARIMA
A non-parametric method
No model assumed
SEATS
The components are assumed to follow ARIMA models
The filters are derived from modelling
Possible to do inference and to make forecasts with
confidence intervals
So why the name X12-ARIMA when this method is the one
that is not based on ARIMA?
Answer on the next slide
Example of
regression variables
in reg-ARIMA
modelling
Easter
2000 and 2001: Easter in
April
2008: Easter in March
2002: 4 of 5 Norwegian
Easter days in March
Trading day
Six parameters needed to
model seven days
Mon: Number of Mondays
minus Number of Sundays
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Dec
Jan
Feb
Mar
Apr
May
:
Mar
Apr
May
Jun
2000
2000
2000
2000
2000
2000
2000
2000
2000
2000
2000
2000
2001
2001
2001
2001
2001
2001
2001
2001
2001
2001
2001
2001
2002
2002
2002
2002
2002
:
2008
2008
2008
2008
Question to the
audience:
Regression Model
-------------------------------------------------------------Parameter
Standard
Variable
Estimate
Error
t-value
-------------------------------------------------------------Trading Day
Mon
-0.0019
0.00193
-1.00
Tue
0.0064
0.00194
3.31
Wed
0.0018
0.00190
0.94
Thu
-0.0016
0.00195
-0.81
Fri
0.0138
0.00188
7.37
Sat
0.0034
0.00193
1.73
*Sun (derived)
-0.0219
0.00196
-11.16
Why exactly
equal t-values?
Regression Model
-------------------------------------------------------------Parameter
Standard
Variable
Estimate
Error
t-value
-------------------------------------------------------------Trading Day
Weekday
0.0036
0.00053
6.87
**Sat/Sun (derived)
-0.0090
0.00131
-6.87
Outliers
An extreme observation caused by a special event can
be problematic
Can influence the modelling in a negative way
Parameter estimates
Forecasts
Decomposition
Solution
Include the outlier as a dummy variable in the reg-ARIMA
modelling
.0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0.
90
100
110
120
2000
2002
2004
2006
2008
2010
2012
2014
85
90
95
100
Data with level shift: Seasonally adjusted (blue) and trend (red)
2000
2002
2004
2006
2008
2010
2012
2014
Presented by
Lszl Sajtos
Hungarian Central Statistical Office
Topics
Seasonal adjustment step-by-step
(A few) issues on seasonal adjustment
Preliminary results
If results are acceptable
Output data
Expert information
Collecting expert data from the sections about datasets (potential outliers,
methodological changes, changes in exterior factors (e.g. law), connections
to other time series and sectors)
Seasonality
144
136
128
Seems additive
120
112
104
96
88
80
Probably outliers
72
64
date
56
J an2000
J an2002
J an2004
J an2006
J an2008
J an2010
J an2012
J an2014
Automatic test
Verification
Graphical analysis
Consideration based on
professional reasons
Elimination
Little significance
Significance
Non-significance or absence
Keep
Consideration based on
professional reasons
Elimination
Available expert
information
STEP 1
Less significant, but
professionally
reasonable
Significant
Keep it
Monitoring
Stability
Not
significant
Consideration based on
professional reasons
Eliminate
it
Good results
Keep model
Airline model
Manual settings
Decomposition (Step 6)
Software tools
Eliminating deterministic
effects
Decomposition
Additive
Multiplicative
Log-additive
Ljung-Box test
Box-Pierce test
2.
3.
Documentation
required!
Detailed analysis
Manual settings
satisfying
Quality diagnostics
not
Manual settings
Dissemination
(STEP 9)
Automated module
Save of output
Diagnostic, outlier %
The results
Quality diagnostics
Consistency issues
Data presentation
Revision
Revision
Revision
Unadjusted
data
SA data
Reasons:
Reasons:
Revision strategies
Goal: preserving accuracy, taking new information into consideration while
avoiding large changes
Strategies:
Extreme
Extreme types
types
Current
Concurrent
Alternative
Alternative types
types
Partial
concurrent
Controlled
current
Horizon of revision
Question: How many months of data should be revised?
Practices:
Consistency issues
Linkages in economy and
among time
series;expectations of users;
errors; etc.
Issues
Time consistency issue
Temporal constraints
Cross-sectional
constraints
e.g. GDP
Sources of inconsistency:
Benchmarking
Benchmark: typically annual data
Aim: Providing time consistency, the techniques operate with the
sum of modified sub-annual series
Benchmarking
Pro-rating
method
Denton
method
Pro-rating method
How it works: multiplies the sub-annual values by the
corresponding annual proportional discrepancies
Example: Three observations (), requirement:
Corrected values: ;
Denton method
How it works: Based on quadratic optimalization
Advantages:
Aggregation consistency
Aggregate series: time series consists of several components
(e.g. industrial series)
Goal: The aggregate series should equal to the sum of their
components
Problem: Non-linear seasonal adjustment process
Indirect SA
Direct SA