Professional Documents
Culture Documents
Ratjomose P. Machema
rp.machema@nul.ls
EC6041: Econometric Theory and Applications
Table of contents
Contents
1 What is Econometrics? 1
1 What is Econometrics?
Definitions of Econometrics
1
– at best, it only indicates the direction of change between variables
– Real world problems almost always require prediction of the size of
change
∗ e.g. How much will the farm price of beef increase as average con-
sumer income increases by R1000 per month?
Definitions of Econometrics
Definitions of Econometrics
2
2 Types of Econometric Models?
Definitions of Econometrics
Broadly, econometric models can be classified into a number of categories.
A first class of models describes relationships between present and past.
• Mainly built to get forecasts for future values and the corresponding uncertainty or volatility.
• How does the short-term interest rate depend on its own history?
A second type of model considers relationships btwn economic quantities over a certain time period.
• Gives us information on how (aggregate) economic quantities fluctuate over time in relation to other
quantities.
• What happens to the long-term interest rate if the monetary authority adjusts the short-term one?
Definitions of Econometrics
Third are models that describe relationships between different variables measured at a given point in time for
different units (e.g. households or firms).
– Analyse to what extent differences in household savings can be attributed to differences in house-
hold income.
• How much more would a given household, or the average household, save if income were to increase by
1%?
A class that considers relationships between different variables measured for different units over a longer time
span (at least two periods).
• Usually requires panel data, repeated observations over the same units.
• Ideally suited for analysing policy changes on an individual level, provided that it can be assumed that
the structure of the model is constant into the (near) future.
3
Definitions of Econometrics
• The job of econometrics is to specify and quantify these relationships.
• Another job of the econometrician is to judge whether the resulting model
is ‘appropriate’.
– That is, to check whether the assumptions made to motivate the es-
timators (and their properties) are correct, and to check whether the
model can be used for its intended purpose.
• The number of econometric techniques that can be used is numerous, and
their validity often depends crucially upon the validity of the underlying
assumptions.
• Thus our approach in this course is to guide you through this forest of es-
timation and testing procedures, not by describing the beauty of all possible
trees.
Definitions of Econometrics
• An distinction is made between micro-econometrics and macro-econometrics.
– Micro-metrics is characterized by its analysis of cross section and
panel data and by its focus on individual consumers, firms, and micro-
level decision makers.
– Macro-metrics is involved in the analysis of time-series data, usually
of broad aggregates such as price levels, the money supply, exchange
rates, output, investment, economic growth, and so on.
4
Steps for a Successful Empirical Study
Step 1: Careful formulation of the question of interest
• Be very precise in posing the question you hope to answer. For example,
– does attending lectures in college lead to better grades (on average)?
∗ If the severity of punishment for certain crimes increases, do crime
rates fall on average?
5
Example of metrics model
We can specify an econometric model for the crop yield/farming training example as
The constants β0 , β1 , β2 , andβ3 are the parameters of the econometric model, and it is these
(especially β3 in this example) that we hope to estimate.
• Ideally we will be able to collect data on crop yield, educ, exper, and training from a large
group of farmers.
• The last term in the equation (µ) called the error term or disturbance. It plays a very
important role in econometrics.
– It represents all other factors that can affect farmer’s crop yield, intelligence, mo-
tivation, and so on.
– The error term can also capture measurement problems in one or more of the vari-
ables.
– We could add soil quality, climate and any other variables that determine crop yield,
but we can never eliminate µ entirely.
– In fact, dealing with this error term or disturbance term is perhaps the most im-
portant component of any econometric analysis.
Hypotheses
• Once an econometric model such as (2) has been specified, various hypotheses
of interest can be stated in terms of the unknown parameters.
• We will want to use statistical methods, and data, to estimate and test
hypotheses about the parameters.
• For example,
– The hypothesis that farmer training has no effect on crop yield is β3 = 0.
– The hypothesis that one year of farming experience is worth one year
of formal education is β1 = β2 .
6
∗ A different set of data would produce a different answer.Therefore,
the conceptual framework for the empirical analysis needs to provide
both an answer to the question and a measure of how precise the
answer is. That is, construct confidence intervals for the paramet-
ers.
– In some cases, the econometric model is used to make predictions in
either the testing of a theory or the study of a policy’s impact.
Macro
• data resulting from a pooling or aggregating over individuals, households, or firms at the
local, state, or national levels.
• outcomes such as prices or income that may be expressed as numbers or some transfor-
mation of them, such as real prices or per capita income.
Qualitative
• outcomes that are of an “either-or” situation. For example, a consumer either did or did
not make a purchase of a particular good, or a person either is or is not married.
Economic Data
7
4.1 Cross-sectional data
Cross-sectional data
• Cross-sectional data set are collected on individuals/farmers, families, firms, farms, or
some other units at a given point in time.
– (Or, at least, time does not play a crucial role. Interview dates for surveys may vary
somewhat.)
– For example, several families may be surveyed during different weeks within a year.
• In this course, we will assume that a cross-sectional data set represents a random sample.
– That is, each unit in the population has the same chance of appearing the sample,
and the draws are statistically independent of one another.
– For example, if we obtain information on wages, education, experience, and other
characteristics by randomly drawing 500 people from the working population, then
we have a random sample from the population of all working people.
With cross-sectional data, we can learn about relationships among variables by studying differences
across people, firms, or other economic entities during a single time period.
• Another important difference with cross-sectional data is that we cannot assume outcomes
are independent across observation (that is, across time).
– For example, knowing what gross domestic product is in 2009 tells us a lot about
its likely range in 2010.
– When we apply econometric methods to time series data, we will have to recog-
nize that the observations are correlated across time. Sometimes this correlation is
substantial.
By tracking a single entity over time, time series data can be used to study the evolution of
variables over time and to forecast future values of those variables.
8
4.3 Independently Pooled Cross Sections
Pooled Cross Sections
• A data set consisting of independently pooled cross sections means that we have
collect cross-sectional data at different points in time and pool them together.
– For example, we may randomly sample from the producers of crop/grains in the
country 1990, 2000, and 2010.
– Our goal may be to see how the importance of input subsidies on productivity has
changed over time.
• If we obtain a random sample in each year it would be very small compared to the
entire population.
– It would be very rare that the same person would appear twice; if someone appears
twice nothing is harmed by ignoring that fact.
• Pooled cross sections are very useful for policy analysis - to study an intervention.
• Because the observations are independent (both within and across time periods),
pooled cross sections can be analyzed much like a single cross section.
– However, one often explicitly accounts for the different time periods in analyzing
the data.
• Superficially, a panel data set has a structure similar to a pooled cross section.
• The key difference is that with a panel data set, the same units (people,
houses, schools, and so on) are followed over time.
• Panel data, also called longitudinal data, are data for multiple entities
in which each entity is observed at two or more time periods.
• Following the same units over time has advantages when trying to infer
causality.