Professional Documents
Culture Documents
Lecture 1
1
What is Time Series?
Mr. Google:
Time Series are an old idea, the city of Barcelona stores data about its citizens
extensively since 13th Century
From https://mapr.com/blog/time-series-data-new-big-data/ 4
What Time Series Are
or
How is Time Series Data Different
from all Other Data ?
5
What Time Series Are
or
How is Time Series Data Different
from all Other Data ?
6
Example:
from
http://www.businessinsider.com/chocolate-consumption-vs-nobel-prizes-2014-4?r=UK&IR=T 7
• Time Series Data: a series of values recorded in time
http://shorthair.candra.info/domestic-shorthair-cat-weight-chart.html
9
Times series forecasting is performed in nearly every
organization that works with quantifiable data:
• Retails stores use it to forecast sales.
• Energy companies use it to forecast reserves, production, demand, and
prices.
• Educational institutions use it to forecast enrollment.
• International financial organizations such as the World Bank and
International Monetary Fund use it to forecast inflation and economic
activity.
• Transportation companies use time series forecasting to forecast
future travel.
• Banks and lending institutions use it to forecast new homes purchases.
• Venture capital firms use it to forecast market potential/evaluate
business plans.
• Meteorologists use it to predict precipitation, temperatures, etc.
10
A comment on time scales:
14
Sqrt(US Population), 1790-1990, ten year intervals
Commands used in R:
> uspop <- read.table(“ ”)
> pop=ts(uspop)
> sqrtpop=sqrt(pop)
> plot(sqrtpop)
15
International Airline Data.
Monthly totals of international passengers (1/1949 --12/1960)
18
Examples from
PROJECT W’17 Projects:
EXAMPLES
19
ARIMA Modeling of the Consumer Prize Index
20
ARIMA Modeling of the Consumer Prize Index
21
PROJECT: Time Series Analysis on Monthly Milk Production
Based on SARIMA Model
Data: Monthly milk production (pounds per cow) from Jan. 1962- Dec. 1975 22
PROJECT: Atmospheric CO2 Concentration with Forecasts
24
STEPS IN TIME SERIES ANALYSIS
• Model Identification
– Time Series plot of the series
– Check for the existence of a trend or seasonality
– Check for the sharp changes in behavior
– Check for possible outliers
• Remove the trend and the seasonal component to get stationary
residuals.
• Estimation
– MME
– MLE
• Diagnostic Checking
– Normality of error terms
– Independency of error terms
– Constant error variance (Homoscedasticity)
• Forecasting
– Exponential smoothing methods
– Minimum MSE forecasting
25
CHARACTERISTICS OF A SERIES
• For a time series Yt , t 0,1,2,
THE MEAN FUNCTION:
t EYt Exists iff E Yt .
The expected value of the process at time t.
0 0
26
CHARACTERISTICS OF A SERIES
• THE AUTOCOVARIANCE FUNCTION:
t ,s CovYt , Ys EYt t Ys s
E YtYs t s ; t , s 0, 1, 2,
Covariance between the value at time t and the value at time s of a stochastic process Yt.
27
EXAMPLE
• Moving average process: Let ti.i.d.(0, 1),
and
Xt = t + 0.5 t−1
28
EXAMPLE
• RANDOM WALK: Let e1,e2,… be a sequence of
i.i.d. rvs with 0 mean and variance e . The
2
i 1
m
i 1
2
n i 1
i 2 j 1
Var ciYti ci Var Yti 2 ci c j Cov Yti , Yt j
30
JOINT PDF OF A TIME SERIES
• Remember that
FX1 x1 : the marginal cdf
f X 1 x1 : the marginal pdf
FX 1 , X 2 ,, X n x1 , x2 ,, xn : the joint cdf
f X 1 , X 2 ,, X n x1 , x2 ,, xn : the joint pdf
31
JOINT PDF OF A TIME SERIES
• For the observed time series, say we have two
points, t and s.
• The marginal pdfs: fYt yt and fYs ys
32
JOINT PDF OF A TIME SERIES
• Since we have only one observation for each r.v.
Yt, inference is too complicated if distributions
(or moments) change for all t (i.e. change over
time). So, we need a simplification.
12
r.v.
10 Y4
8
6 r.v.
Y6
4
0
1 2 3 4 5 6 7 8 9 10 11 12
33
JOINT PDF OF A TIME SERIES
• To be able to identify the structure of the
series, we need the joint pdf of Y1, Y2,…, Yn.
However, we have only one sample. That is,
one observation from each random variable.
Therefore, it is very difficult to identify the
joint distribution. Hence, we need an
assumption to simplify our problem. This
simplifying assumption is known as
STATIONARITY.
34
STATIONARITY
• The most vital and common assumption in
time series analysis.
• The basic idea of stationarity is that the
probability laws governing the process do not
change with time.
• The process is in statistical equilibrium.
35
TYPES OF STATIONARITY
• STRICT (STRONG OR COMPLETE) STATIONARY
PROCESS: Consider a finite set of r.v.s.
Yt1 ,Yt2 ,,Ytn from a stochastic process
Y w, t ; t 0,1,2,.
• The n-dimensional distribution function is
defined by
FYt ,Yt ,,Yt yt1 , yt2 ,, ytn Pw : Yt1 y1 ,, Ytn yn
1 2 n
37
STRONG STATIONARITY
n-th order stationarity in distribution = strong stationarity
38
STRONG STATIONARITY
• So, for a strong stationary process
i) fYt1 ,,Ytn y1,, yn fYt1k ,,Ytn k y1,, yn
ii) EYt EYt k t t k ,t , k
Expected value of a series is constant over time, not a function of time
10
8
Y1 Y2 Y3 ……………………………………….... Yn
6
4
2 2 2
2 2
0 t
1 2 3 4 5 6 7 8 9 10 11 12
CovY2 , Y1 21 1
CovY3 , Y2 3 2 1
CovYn , Yn 1 n ( n 1) 1 Affected from time lag, k
CovY3 , Y1 31 2
CovY1, Y3 13 2
40
STRONG STATIONARITY
v) CorrYt , Ys CorrYt k , Ys k t ,s t k ,s k , t , k
t s t k s k h
Let t=t-k and s=t,
t ,t k t k ,t k , t , k
• It is usually impossible to verify a distribution
particularly a joint distribution function from
an observed time series. So, we use weaker
sense of stationarity.
41
WEAK STATIONARITY
• WEAK (COVARIANCE) STATIONARITY OR
STATIONARITY IN WIDE SENSE: A time series is
said to be covariance stationary if its first and
second order moments are unaffected by a
change of time origin.
• That is, we have constant mean and variance
with covariance and correlation beings
functions of the time difference only.
42
WEAK STATIONARITY
43
EXAMPLE
• Consider a time series {Yt} where
Yt=et
and eti.i.d.(0,2). Is the process stationary?
44
EXAMPLE
45
EXAMPLE
• RANDOM WALK
Yt e1 e2 et
where eti.i.d.(0,2). Is the process {Yt}
stationary?
46
EXAMPLE
• Suppose that time series has the form
Yt a bt et
where a and b are constants and
eti.i.d.(0,2).
(Also known as a linear trend)
Is {Yt} stationary?
47
EXAMPLE
Yt 1 et
t
48
STRONG VERSUS WEAK STATIONARITY
• Strict stationarity means that the joint distribution only
depends on the ‘difference’ h, not the time (t1, . . . , tk).
• Finite variance is not assumed in the definition of strong
stationarity, therefore, strict stationarity does not
necessarily imply weak stationarity. For example, processes
like i.i.d. Cauchy are strictly stationary but not weak
stationary.
• A nonlinear function of a strict stationary variable is still
strictly stationary, but this is not true for weak stationary.
For example, the square of a covariance stationary process
may not have finite variance.
• Weak stationarity usually does not imply strict stationarity
as higher moments of the process may depend on time t.
49
STRONG VERSUS WEAK STATIONARITY
• If process {Xt} is a Gaussian time series, which
means that the distribution functions of {Xt}
are all multivariate Normal, weak stationary
also implies strict stationary. This is because a
multivariate Normal distribution is fully
characterized by its first two moments.
50
STRONG VERSUS WEAK STATIONARITY
• For example, a white noise is stationary but may
not be strict stationary, but a Gaussian white
noise is strict stationary. Also, general white
noise only implies uncorrelation while Gaussian
white noise also implies independence. Because
if a process is Gaussian, uncorrelation implies
independence. Therefore, a Gaussian white
noise is just i.i.d. N(0, 2).
51
STATIONARITY AND NONSTATIONARITY
• Stationary and nonstationary processes are
very different in their properties, and they
require different inference procedures. We
will discuss this in detail through this course.
52