Professional Documents
Culture Documents
Econometrics
BUSI2053
The Simple Linear
Regression Model I:
Specification and
Estimation
Lecture Outline
• An Economic Model
• An Econometric Model
• The Nature of Regression Analysis
• Estimating the Regression Parameters
• Assumptions of OLS and G-M condition
• Properties of the OLS estimator
• Suggested Reading:
• Chapter 2, 3 & 4.1, 4.2: Hill, R.C., Griffiths W.E. and Lim, G.C. Principles of
Econometrics, fourth edition, Wiley, 2012 (pp. 39-110)
• Chapter 5-6: Westhoff (2013) An introduction to econometrics: a self-contained
approach, MIT Press, 2013
• Chapter 1-3: Gujarati, D.N. and Porter D.C. Basic econometrics, 5th ed., McGraw-Hill,
2009 (pp. 34-117)
• Chapter 1 & 2: Dougherty, Christopher. Introduction to econometrics. 4th ed. Oxford
University Press, 2011 (pp.83-147)
03/28/2023 Myint Moe Chit, BUSI2053 2
Before we move on,
• Random variable
• Conditional probability
• Population parameters (Fixed values)
• Sample statistics (Random variables)
• Expected value rules
• Variance and covariance rules
• Estimator and estimate
• Properties of an estimator
Exam Scores
60
50
40
30
20
10
0
0 10 20 30 40 50 60 70 80 90 100
Attendance Rate
Probability distribution of
food expenditure y given
income x = $1000
03/28/2023 Myint Moe Chit, BUSI2053 6
An Economic Model to Econometric Model
• In econometrics, we recognise that real-world expenditures
are random variables, and we want to use data to learn
about the relationship between income and expenditure.
Probability distributions of
food expenditures y given
incomes x = $1000 and x = $2000
03/28/2023 Myint Moe Chit, BUSI2053 7
Notes on Regression Analysis 1
Regression versus Correlation
• The correlation analysis measures the strength
or degree of linear association between two
variables that are assumed to be random
Average expenditure
an economic model and then a
corresponding econometric model
• This econometric model is also called a 𝐸¿
regression model
– The simple regression model can be Δ 𝐸(𝑦∨𝑥)
written as Δ𝑥 Δ 𝐸(𝑦 ∨𝑥) 𝑑𝐸 (𝑦∨𝑥)
𝛽 2= =
𝐸 ( 𝑦 ∨𝑥 ) =𝜇 𝑦 ∨𝑥 =𝛽 1+ 𝛽2 𝑥 𝛽1 Δ𝑥 𝑑𝑥
Income y
where β1 is the intercept and β2 is the
slope and they are population
dE(y|x)/dx denotes the derivative of the
parameters Quiz 1
expected value of y given an x value
03/28/2023 Myint Moe Chit, BUSI2053 9
An Econometric Model
• The model describes economic 𝑓 (𝑦)
behaviour in the population of our
interest re
it u
• If we were to sample household d
pen
expenditures at other levels of income ex
ood
the regression function describes the F
𝛽 1+ 𝛽2 𝑥=𝐸 ¿
mean value of household expenditure for
each level of income
• Note: represents the model for expected
value of y. For the actual value of y, the
model can be written as: where is Ho
us e ho
random error term. ld i
n co
m𝑥e
𝑒2
𝑦2
𝑦1 𝑒1
The relationship among y, e
and the true regression line
𝑥1
03/28/2023 𝑥2 𝑥3
Myint Moe Chit, BUSI2053 𝑥4 𝑥 11
Introducing the Error Term
• The random error term is defined as 𝑦
𝑦4
𝑒= 𝑦 − 𝐸 ( 𝑦 ∨𝑥 ) =𝑦 − β 1 − β 2 𝑥 𝑒4
𝐸¿
𝑦3
• Rearranging gives a population 𝑒3
regression model.
𝑒2
𝑦 =β 1+ β 2 𝑥+𝑒 𝑦2
𝑒1
𝑦1
• The expected value of the error term,
given x, is 𝑥1 𝑥2 𝑥3 𝑥4 𝑥
𝑒^ 𝑖= 𝑦 𝑖 − ^𝑦 𝑖= 𝑦 𝑖 −𝑏1 −𝑏 2 𝑥 𝑖
Be mindful of differences in notations in
different textbooks.
03/28/2023 Myint Moe Chit, BUSI2053 18
Estimating the Regression Parameters (exercise)
x = Number of children in a
1 2 3 4 3*4 5 household
xi y i ( 𝑥𝑖 − 𝑥
¯()𝑦 𝑖 − 𝑦
¯) ( 𝑥𝑖 − 𝑥¯ )2 ^
𝑦 =𝑏1 +𝑏2 𝑥𝑖 𝑒^
y = Number of visits to KFC
2 4 -2 -1 2 4 in a month
3 3 -1 -2 2 1
5 4 1 -1 -1 1
6 7 2 2 4 4
5 4 1 -1 -1 1
4 7 0 2 0 0
4 6 0 1 0 0
3 5 -1 0 0 1
¯𝑥 =4 𝑦¯ =5 ∑=6 ∑=12
𝑏1= ¯𝑦 −𝑏 2 ¯𝑥 =¿
03/28/2023 Myint Moe Chit, BUSI2053 19
Estimating the Regression Parameters
Estimates for the Food Expenditure Function
The population regression model
y =weekly food expenditure in $
𝑦 =β 1+ β 2 𝑥+𝑒
^𝑦 =83.42+10.21 𝑥 The fitted regression line
^𝑦 𝑖 =83.42+10.21 𝑥𝑖
𝑦¯ =283.57 The sample regression model
𝑦 𝑖 =8 3.42+1 0.21 𝑥𝑖 + 𝑒^ 𝑖
¯𝑥 =19.60
Quiz 3
x =weekly income in $100
03/28/2023 Myint Moe Chit, BUSI2053 20
Notes on Regression Analysis 3
Regression versus Causation
• Although regression analysis deals with the
dependence of one variable on other variables, it
does not necessarily imply causation.
SR2: The expected value of the random error e is: 𝐸(𝑒 𝑖 )=0
2
SR3: The variance of the random error e is: var (¿𝑒𝑖 )=𝜎 =var(¿ 𝑦 𝑖 )¿¿
SR4: The covariance between any pair of random errors, ei and ej is:
cov (¿𝑒𝑖 , 𝑒 𝑗 )=cov(¿ 𝑦 𝑖 , 𝑦 𝑗 )=0 ¿¿
SR5: The variable x is not random, and must take at least two
different values
2
SR6: Optional 𝑒 𝑁 (0,σ )
03/28/2023 Myint Moe Chit, BUSI2053 22
Properties of least square estimator
Gauss-Markov condition
Under the assumptions SR1-SR5 of the linear regression model, the OLS estimators b1
and b2 have the smallest variance of all linear and unbiased estimators of β1 and β2.
They are the Best Linear Unbiased Estimators (BLUE) of β1 and β2.
1. The estimators b1 and b2 are “best” compared to similar estimators because they
have the minimum variance.
2. The estimators b1 and b2 (not specific b1 and b2) are linear and unbiased.
3. If any of these assumptions are not true, then b1 and b2 are not BLUE