Professional Documents
Culture Documents
https://learning.edx.org/course/course-v1:MITx+6.419x+2T2023/block-v1:MITx+6.419x+2T2023+type@sequential+block@timeseries_lec1/block-v1:MITx+6.419x+2T2023+type@vertical+block@timeseries_lec1-tab2 1/6
11/22/23, 12:00 PM Introduction to Time Series: Trend, Seasonality, Stationarity, Autocovariance | Module 4: Time Series | Data Analysis: Statistical Modeling and Computation in Applications | edX
A time series is a special kind of statistical data, specifically, it is a collection of numerical measurements, called
observations
that are indexed by a time stamp . These time stamps must form a deterministic sequence and be
regularly spaced in time with equal intervals between any two adjacent stamps. For the purposes of statistical
inferences with these data, the observations are modeled mathematically as realizations of a corresponding
series (i.e., sequence) of random variables
that are defined on some common probability space . As with other types of statistical data, what is
observed by the statistician is a particular outcome of a specialized probability model:
To gain some intuition for our discussion of time series, consider the following examples:
Economic data: stock prices, inflation rate, GDP, employment rate, interest rate, exchange rates;
Biometric data: heart rate, blood pressure, weight, fMRI;
Environmental data: temperature, precipitation, pressure, humidity, pollution;
Sound data: speech, music;
In all these examples, as well as in a general time series, data take the form of discrete measurements of a real
world phenomena that evolves continuously in time. A general probabilistic model to describe such phenomena
is called a stochastic process , which is simply a collection of random variables indexed by either a
continuous or discrete time parameter . So, a time series data set can be thought of as a single realization of a
stochastic process. Each random variable has a marginal distribution . The process as a whole
also has a probability law, which can be thought of as the joint distribution of all the 's. (Interestingly, this joint
distribution of the entire process, whether continuous or discrete, is completely characterized by the collection
of all the finite-dimensional marginal distributions of random vectors where ranges
over all integers and time indexes range over all distinct time stamps.)
Solution:
The last option is correct A time series is a collection of many distinct and dependent random variables There is
https://learning.edx.org/course/course-v1:MITx+6.419x+2T2023/block-v1:MITx+6.419x+2T2023+type@sequential+block@timeseries_lec1/block-v1:MITx+6.419x+2T2023+type@vertical+block@timeseries_lec1-tab2 3/6
11/22/23, 12:00 PM
The last option is correct. A time series is a collection of many distinct and dependent random variables. There is
Introduction to Time Series: Trend, Seasonality, Stationarity, Autocovariance | Module 4: Time Series | Data Analysis: Statistical Modeling and Computation in Applications | edX
one random variable for each time stamp. The time stamps form a regular deterministic sequence, i.e. an
arithmetic progression.
Solution:
The first choice is not a time series because the indexes of the observations (and their corresponding random
variables) are not time stamps. This is an example of cross-section data.
The second choice is a time series, because the data are indexed by a time stamp. Note that these time stamps
are separated by a fixed number of trading hours (9.30 a.m. to 4 p.m. Eastern time for the U.S. stock market).
The third choice is not a time series because the time interval between two consecutive orders on a stock
exchange is itself a random variable. However, if we take the time parameter to be the order number, rather than
the actual physical time (e.g. the number of milliseconds since the opening of the trading day), then we do have
a time series.
The fourth choice is not a time series data because the indexes are locations, not time stamps. This is an
example of spatial data.
The fifth choice is a time series.
The sixth choice is not a time series because the intervals between arrivals of two consecutive patients is
random. Also, a time series would typically be a measurement taken from the same physical process, e.g. the
same patient, rather than independent physical processes.
1 observation
100 observations
50 observations
Solution:
A time series is a collection of 100 distinct random variables. This means that s can have
different marginal distributions, and an arbitrary dependence structure. Since there is only one copy of , and
we observe only a single realization of the time series (because we cannot go back in time and replay the history
of the real world), there is typically only one single observation of in a time series data set.
Previous Next
https://learning.edx.org/course/course-v1:MITx+6.419x+2T2023/block-v1:MITx+6.419x+2T2023+type@sequential+block@timeseries_lec1/block-v1:MITx+6.419x+2T2023+type@vertical+block@timeseries_lec1-tab2 5/6
11/22/23, 12:00 PM Introduction to Time Series: Trend, Seasonality, Stationarity, Autocovariance | Module 4: Time Series | Data Analysis: Statistical Modeling and Computation in Applications | edX
edX
About
Affiliates
edX for Business
Open edX
Careers
News
Legal
Terms of Service & Honor Code
Privacy Policy
Accessibility Policy
Trademark Policy
Sitemap
Cookie Policy
Your Privacy Choices
Connect
Idea Hub
Contact Us
Help Center
Security
Media Kit
https://learning.edx.org/course/course-v1:MITx+6.419x+2T2023/block-v1:MITx+6.419x+2T2023+type@sequential+block@timeseries_lec1/block-v1:MITx+6.419x+2T2023+type@vertical+block@timeseries_lec1-tab2 6/6