Professional Documents
Culture Documents
1 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
d) Some factors may be omitted due to their small influence on the dependent variables
e) Even if all factors are known, the available data may not be adequate for the measure
of all factors influencing a relationship
ii. The erratic nature of human beings:- The human behavior may deviate from the
normal situation to a certain extent in unpredictable way.
iii. Misspecification of the mathematical model:- we may wrongly specified the
relationship between variables. We may form linear function to non- linearly related
relationships or we may use a single equation models for simultaneously determined
relationships.
iv. Error of aggregation: - Aggregation of data introduces error in relationship. In
many of Economics data are available in aggregate form ex. Consumption, income
etc is found in aggregate form which we are added magnitudes referring to
individuals where behavior is dissimilar.
v. Errors of measurement:- when we are collecting data we may commit errors of
measurement
In order to take in to account the above source of error we introduce in econometric functions a
random term variable which is usually denoted by the latter U & is called error term, random
disturbance term or stochastic term of the function. By introducing this random term variable in
the function the model will be just like equation number (2.3). The relationship between variables
will be split in to two parts.
Example From equation (2.3)
+Yt represents the exact relationships explained by the line
Apart represented by the random term Ui is the unexplained part by the line. This
can be explained using the following graph.
Y
Yn Ct= = +yt
Un
Ct Consumption
Un
Yn
U2
Y1 X
0 Current income (Yt)
Figure 1
The line Ct: = +Yt shows /explain/ the exact relation ship between consumption & income but
other variables that affect consumption expenditure are scattered around the straight line. Then
the true relationship is explained by the scatter of observations between Ct &Yt.
2 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
Ct = +Yt + Ut ------------------------ (2. 4)
Variation in Explained
+= ++ Unexplained
consumption + variation+ variation
+ +
To estimate this equation we need data on Ct, Yt+&Ut, since Ut is never observed like other
variables (Ct & Yt) we should guess the value of „U‟, that is we should make some assumptions
+
about the shape of each Ui (mean, S.E, Covariance etc)
E(Ui) = 0 or ui 0
i 0
iii) Homoscedasticity: (Constant Variance). The variation of each Ui around all values of the
explanatory value is the same i.e. the deviation of Ui around the straight line (in figure 1) is
remain the same var (Ui)= u
2
iv) The variable Ui has a normal distribution with mean zero & variance of Ui.
Ui is N(0, u )
2
v) Ui is serially independent:- the value of U in one period is not depend up on the value of
Ui in other period of time means the co-variance between Ui & Uj is equal to zero
Cov (UiUj) = 0
= E [Ui-0] [Uj-0]
= E(Ui) E(Uj)
Yi = Xi Ui
Mean of Yi (Expected value of Yi) can be found as follow
E(Yi)= E[ Xi Ui ]
E(Yi) = Xi E (Ui)
Where E (Ui) = 0 by assumption
E(Y) = Xi ----- is the mean value of the dependent variable Yi
Variance of Yi =
Var (Yi) = E [Yi-E (Yi)]2
Substitute in place of E(Y) = Xi
Var (Yi) =E [Yi-E ( X ) ]2
Again in place of Yi substitute Yi = Xi Ui
Var (Yi) = E Xi Ui X i 2
Var (Yi)= E(Ui)2
From our previous assumption the variance of Ui is equal to E (Ui)2 =u2 then
4 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
2.3 Estimation of the model
The relationship
Yi = Xi Ui 2.5
Holds for population of the values X&Y. Since these values of the population are unknown we do
not know the exact numerical values of & β' s. To calculate or obtain the numerical values of
& β we took sample observations for Y & X. By substituting these values in the population
regression we obtain sample regression which gives an estimated value of & β given by
ˆ & ˆ respectively then the sample regression line is given by
^ ^
ei 2
2 (Yi ˆ ˆXi ) 0 2.12
ˆ
5 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
ei 2
2 (Yi ˆ ˆXi ) Xi 0 2.13
ˆ
First take equation number 2.12to find the value of ̂
ˆ nˆ
2 Yi 2nˆ 2ˆ Xi 0
2 Yi 2nˆ 2ˆ Xi =0
ˆ
Yi ˆ Xi
n n
Yi Y
n
&
Xi x
n
ˆ
ˆ Y x 2.14
Take equation number (2.13) to find the value of ˆ
YiXi ˆ Xi ˆ Xi 2
2.15
YiXi (Y ˆx ) Xi ˆ Xi 2
YiXi Y Xi ˆx Xi ˆ Xi 2
6 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
YXi Y Xi ˆx Xi ˆ Xi 2
We know that Y
Y & x X substituted
n n
Yi Xi ˆ Xi
YiXi n n Xi ˆ Xi 2
Yi Xi ( Xi ) 2
n YiXi Yi Xi
ˆ 2.16
n Xi ( X )
2 2
The numerical value of ˆ & ˆ can be found in deviation forms. To write the above equation
number 2.16 in deviation form
Take the numerator which is
n Xi Yi Yi Xi
Yi Xi
Added & subtracted
n Xi Yi Yi Xi n Xi Yi Yi X Xi Yi Xi Yi
= n Xi Yi Yi Xi Xi Yi Xi Yi
= n Xi Yi Yi Xi Xi Yi Xi Yi
= n Xi Yi
Yi Xi n Xi Yi n X n Y
n
n
n n
Take n in common
n XiYi Y X x Y nxy
n Xi x Yi y -------------------------2.17
n Xi 2 2 Xi Xi Xi
2
n Xi 2 2nx X n 2 x 2
n Xi 2
2 x X nx 2
n ( xi 2 x ) 2 2.18
7 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
n ( X 1 X )(Y1 Y )
ˆ
n X 1 X ) 2
X1 X xi & Yi Y yi Substitute in the above equation
n xyi
ˆ
n xi 2
ˆ
xiyi 2.19
xi 2
Total variation
Y
Explained variation
8 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
Suppose a researcher may have Yi=+βXi+ Ui model. To estimate this model he took some
sample observation to estimate the value of & β. In his estimation all the data may fall below,
above or on the line. Then using R2 he can observe that whether the regressions line will give the
best fit for these data or not.
Yi is the observed sample value
Y is the mean value of the sample
Yˆ is the estimated regression line using sample data
Yi - Y shows by how much the actual sample value is deviating from the sample mean
value. This is called total variation represent by small y.
Yˆi Y Explain by how much the estimated values are deviating from the sample mean
value. This is called explained variation & represent by ŷi
Y Yˆi This also shows that the difference between the actual value of Yi & the
estimated value of Yi ( Yˆi ). This is called unexplained variation represent by ei.
Therefore;
Total variation: yi Yˆi Y 2.20
Explained variation yˆ Yˆ Y 2.21
Unexplained variation ei Yi Yˆi 2.22
Sum it over each equation & squared it. We will have
yi 2
(Yi Y ) 2 2.23
We square it because the sum of deviation of any variable around its mean value is zero then to
avoid this we make it squared.
yˆi (Yˆi Y )
2 2
2.24
ei (Yi Yˆ )
2 2
2.25
We can write equation no (2.20) as follows
yi Yˆi Y Yi yi Y 2.26
From equation (2.21)
yˆi Yˆ Y Yˆi yˆi Y 2.27
Substitute these equations in equation number 2.21
ei = Yi- Yˆ from the above equation (2.26 &2.27 ).
Yi yi Y
&
Yˆ yˆ Y
ei yi Y yˆi Y
ei yi Y yˆ Y
ei yi yˆi
yi ei yˆi 2.28
This shows that each deviation of the sample observed values of Y from its mean Yi Yˆ yi
consists of two components
9 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
yˆi Yˆi Y which shows the explained amount by the regression line
i.
ii. ei Yi Yˆ = the unexplained variation by the regression line
By Taking equation number 2.28 yi yˆi ei Sum it over
yi ( yˆi ei )
i 1
2
i 1
2
yi yˆi 2 yˆei ei
2 2 2
2.29
ˆ
yˆei ˆ xiyi ˆ xi
2 2 2
= ˆ xyi ˆ xi 2
=
xiyi xiyi xiyi xi 2
xi 2
xi
2
xiyi
ŷiei xi xiyi yixi 0
2
yˆ 2
ei 2
yi 2
yi 2
yi 2
1
yˆ 2
ei 2
yi 2
yi 2
yˆ 2
1 r2
ei 2.33
2
yi 2
Then r 2
yˆi 2.34
2
yi 2
r 2
1
ei 2
2.35
yi 2
ˆ
2.5 The relationship between r2 and the slope coefficient .
We know that
r 2
yˆi 2
.
yi 2
ˆ
xiyi
xi 2
ˆ xi 2
xiyi
.
xi 2
yi 2
11 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
r 2 ˆ
xyi .
yi 2
Or if you further substitute ˆ again =
xiyi
xi 2
r 2 ˆ
xiyi . xiyi = ( xiyi ) 2
2.36
xi yi xi . yi
2 2 2 2
r 2 yˆi 2
. or 1
e 2
or
xiyi or ̂ xi 2
or
( xiyi ) 2
yi 2
yi 2
yi yi
2 2
xi . yi
2 2
Limiting values of r2
r 2
1
ei 2
yi 2
3) If the regression line does not explain any part of the variation then
ei 2
r 2 will be zero.
12 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
Mean & Variance of the least square estimates
Mean of ˆ
We can obtain mean value of ˆ from equation number 2.20
ˆ xiyi .
xi 2
ˆ xi (Yi Y ) .
xi 2
xiYi Y xi .
xi 2
By definition it is known that the sum of any variable deviations from its mean is equal to zero.
Then xi 0 then Y xi 0
xiYi . = xiYi 2.37
ˆ xi
xi 2 2
The value of the independent variables is a set of fixed values, which do not change from sample
xi
to sample. Then will be a constant number & lets‟ represent it by K & equation 2.37 can
xi 2
be written as
ˆ kiYi 2.38
We know that from equation number 2.5
x 0 Because the sum of any variables deviations from its mean is zero
Then
xi ki 0
xi 2
Again kiXi 1
xiX
KiXi xi 2.40
2
13 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
xi Xi x Substitute in the above equation 2.40
( X x)x
kiXi
xi 2
X x x 2.41
2
xi 2
X 2
2
n
n
( X ) 2
X 2
X 2
2
n
n.
n2
( X ) 2 X 2
X 2
n
2
n2
X X
X 2 2 n X n X
X 2 2x X x X
x =X2 2
x X substitute in the denominator of equation number 2.41
X 2
x X
KiXi X 1
2
x X
Then kiXi 1
xi Xi x
Xi xi x
Substitute in the equation kiXi in place of Xi
xi
ki( xi x ) We know that ki
xi 2
xi ( xi x ) xi x xi
=
2
xi xi 2 2
xi xi again by definition xi 0
2
=
xi xi 2 2
14 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
kiXi xi
2
xi
1
2
xi substitute it
Again ki
xi 2
ˆ
xiui
xi 2
15 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
Var Yi E[Ui]2 u 2 - - - - - - - - - - - - - - - - - - -2.44
From equation no 2.43 we have the following
Var ( ˆ ) = ki 2 Var ( Yi )
From equation no 2.44 Var (Yi ) u 2
Var ( ˆ ) ki 2u 2
2
xi
u ki - - - - - We know that ki E
2 2
2
2
xi
2
xi
= u 2
xi
2
xi 2
= u
( xi 2 ) 2
1
Var ( ( ˆ ) 2 - - - - - - - - - - - - - - -- - - - - - -2.45
xi 2
2
Now we can say that ˆ has a mean of & Variance of
xi 2
u
,
Then ˆ ~N xi
Mean of ̂
xiyi
From Equation no 2.14 we have ˆ yˆ ˆxˆ a gain from equation 2.19 ˆ & from
2
xi
equation number 2.38 we have ̂ kiyi substituted
ˆ y ( kiYi) x
y x kiYi
Yi
x kiYi
n
Take Yi & as common b/c x & ki are constant
1
̂ x kiYi - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - - - 2.45
n
Take the expected Value of ̂ since n, x & ki are constant the expected value of a
constant is a constant it self.
1
E (ˆ ) x ki (Yi ) W know that
n
E (Yi ) E ( xi Ui)
xi (i)
By definition E (ui) 0 then it will be xi
16 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
1
E (ˆ ) x ki( Xi )
n
1
x ki X Xix ki
n n
n xi
x ki x i kixi]
n n
x ki x kiXi
We know that ki 0 & kixi 1
x x
E (ˆ ) - - - - - - - - - - - - - - - - - - - -- - - - - - -- -- - 2.46
Variance of ̂
1
Var (ˆ ) E[ E (ˆ )]2 we know that from equation no 2.45 that ˆ x ki yi
n
then substitute it
1
Var (ˆ ) Var E[ x ki]Yi
n
Since n, x & ki are constant number their variance is constant.
2
1
Var (ˆ x ku var Yii
n
We know that Var Yi= u from equation no 2.43 substitute in place of Var Yi. Then we will
2
have.
2
1
Var (ˆ ) u 2 x ki
n
1 1
= u 2 2 x 2 ki 2 2 x ki
n n
We know that the summation over a constant number is equal to multiplying the constant number
by n
n 2 x ki
u 2 x 2 ki 2
n n
We proved that ki 0
xi xi 2
2
1
Again ki 2
2
xi ( xi xi 2
2
S.E ( ˆ ) var( ˆ ) ˆ
2
(2.47)
xi 2
S.E ( ˆ ) var(ˆ ) ˆ 2
X 2
(2.48)
n xi 2
ˆ ei 1
2
S.E ( ) .
n 2 xi 2
S.E ( ˆ )
ei . X 2 2
18 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
Interpretation of S.E
Once we obtain the standard deviations of ˆ & ˆ using equation number 2.47 & 2.48
respectively& by comparing these estimated values with their s.e we can interpret the results as
follows.
Economic interpretation
If we have Yi ˆ ˆxi
Case A) Acceptance of the null hypothesis means
Accept Ho = i=0
ˆ
Reject the alternative that H1 = i 0. This will occur if S.E ( ˆ )
2
It means the estimation are statistically insignificant or
- We accept the null hypothesis that the true parameter is zero or
- The independent variable is insignificant.
- The sample parameter do not explain the population parameter or
- The independent variable do not influence the dependent variables (no relationship
between the dependent & independent variables)
Then the above equation will be written as
Yi ̂
ˆ
Because the value of Ho = is not different zero or the slope of the line is zero
i. Again in case of the intercept ̂
If we accept the null hypothesis HO= =0 & reject the alternative H1 0 .
ˆ
This will occur if S.E ( ˆ )
2
And the meaning of ̂ is statistically insignificant or the equation will not have intercept or the
intercept is not differed from zero. Then
Since ˆ 0
Yi ̂xi
Case B) Reject the null hypothesis & accept the alternative
i. Reject Ho = ˆ 0 =0 & accept the alternative that H1 1 0 .
ˆ
This will occur if S.E ( ˆ )
2
It means
- The estimates are statistically significant
- The estimates are significantly different from zero
- The independent variables will influence the dependent variable or there is a relation ship
between the dependent & independent variables.
ii. Reject HO= ̂ =0 & accept the alternative that H1 ˆ 0 . This will be observed if
ˆ
S.E( ˆ )
2
Again it means the equation will have intercept or ̂ is statistically significant
19 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
Geometric interpretation of the S.E. test
ˆ
1) If we find that if S.E ( ˆ ) we accept he null hypothesis that HO= ̂ =0 & reject the
2
alternative H1 ˆ 0 . In this case if we have the equation Yi ˆ ˆxi since ˆ 0 we
will write this equation as Yi ̂xi and we do not have the intercept. Then the equation will
pass through the origin (fig. a)
Yi ̂xi Yi ˆ ˆxi
0
X
0 0
Fig (a) fig. (b)
ˆ
2) If we find S.E. ( ˆ ) we will reject the null hypothesis that HO= ̂ =0 & accept the
2
alternative H1 ˆ 0 . Then in this case we will have intercept term & the equation
will be Yi ˆ ˆxi (fig. b)
ˆ
3) If we find that S.E ( ˆ ) we accept the null hypothesis that Ho = ˆ0 =0 & reject
2
the alternative that H1 1 0 . Then the equation will be Yi ̂ because ˆ 0 set
20 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
Y
Yˆ ˆ ˆxi
Y ̂i
0 0 X
X Fig. d
Fig. c
ˆ
4) If we find that S.E ( ˆ ) we reject the null hypothesis Ho = ˆ =0 & accept the
2
alternative that H ˆ 0 . Then the equation will be Yˆ ˆ ˆxi
1 1
21 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
ˆ
b) If t we accept the null hypothesis Ho = ˆ0 =0 & reject the
S .E ( ˆ )
alternatives H1 ˆ1 0 .
Ho = ˆ0 =0
H1 ˆ1 0 . Acceptance region
H1 ˆ1 0 .
Rejection region
Rejection region
-2.228 0 + 2.228
Figure (e)
Interpretation of t test
ˆ
If t it means reject the null hypothesis Ho = ˆ0 =0
S .E ( ˆ )
Accept t the alternative H1 ˆ1 0
The estimated values are statistically significant or
The sample observation can explain the population parameters.
There is a relation between the dependent & independent variables.
Then the t-value will lie in the critical region (shaded) Suppose if we chose 5% level of
significance and the total number of sample size is 12 & the number of estimated variables are
~
( ˆ & ) only two. Then the d.f will be 12.-2=10. Again if we are undertaking two tall test then
5%
we will have 0.025 in each side (see figure e)
2
Then to find the t-value at 5% (0.25 each side) & 10 d.f. We can read from the t-table as follows
a) In the first raw find 0.025
b) In the first column find 10 then at the intersection point of the raw & the column
you will get t-table value 2.228.
ˆ
Again if t it means accept Ho = ˆ0 =0
S .E ( ˆ )
22 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
Reject the alternative hypothesis H 1 ˆ1 0
The estimated values are statistically insignificant
No relationship between the dependent & independent variables
The sample observations do not explain the population parameters.
Then the t-table value will lie in the acceptance region
Simple inspection to calculate t-test
ˆ
At 5% level of significance we can follow the following rules. Calculate t-values by .
S .E ( ˆ )
Then if this value is greater than +2 or less than -2 we reject the null hypothesis Ho = ˆ =0 & 0
accept the alternative H 1 ˆ1 0 . And the t-value will lie in the critical region. If calculated
ˆ
t smaller than +2 & greater than -2 or (if -2<t<2) we accept the null hypothesis &
S .E ( ˆ )
reject the alternative.
ˆi
t 2.48
S ( ˆ )
With in( n-k) df.
ˆ = estimated value from the sample
= is the population parameter
If we choose any confidence level say 95%. We find from the t-table i.e the value
+ t0.025 with d.f of (n-k).
P {-t0.025<t<t+0.025} = 0.95-------------------------------------------2.49
Substitute equation number 2.48 in equation number 2.49
ˆi
P{-t0.025 <t0.025} = 0.95
S .E ( ˆ )
23 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
Multiply both sides by S.E. ( ˆ )
P{-t0.025 (S.E. ( ˆ ) ) ˆ <t0.025.S.E. ( ˆ ) } = 0.95
i ˆ t 0.025S.E.(ˆ ) 2.50
This confidence interval shows that the unknown population parameter i will lie with in the
defined limit 95 times out of 100. OR we are 95% confident that the unknown population
parameter i will lie with in this limit.
Ex.1.
In Table 2.1 the investment expenditure and the long run interest rate over the ten year period is
given.Test the hypothesis that investment is interest elastic by fitting regression line to the data
given & conducting the relevant test of significance. To answer the question we follow the steps
of econometric methodology.
i. The economic theory states that “investment is interest sensitive.”
ii. Mathematical model of the theory
a) Selecting variables:- Investment is the dependent variable (endogenous) and interest rate
is exogenously determined (independent variable)
b) A priori expectation of the sign of the parameters:- There is a negative (inverse
relationship) between Investment & rate of interest
c) Magnitude of the parameter:- The theory says that investment is interest elastic then the
coefficient is greater than one.
d) Specification of the model:- This will be a single equation model which investment will
be affected by rate of interest but investment will not affect rate of interest. The
relationship between investment and rate of interest is assumed to be linear (This will be
determined by the researcher).
Then
Yt Xt
Where Yt= is the level of interest
Xt = is rate of interest, >1 & is the intercept
iii. Specification of the Econometric model.
There are other variables which will affect investment these are marginal efficiency of capital
(MEC), saving, consumption, political stability etc. Since these & other variables are not
incorporated in the mathematical model of investment function we can capture these
variables by incorporating a random term Ui in our model.
Yt Xt +Ui
Then by adding the random (error, or stochastic or disturbance) term Ui we convert the
mathematical (exact) relationship between investment & rate of interesting in to in exact
relationship of Econometric models.
iv. Obtaining data:- A sample of 10 years observation data are given to estimate the
model & the type of data are time series data.
v. Estimation of the econometric model
The economic relationship is explained using a single equation model then the most
appropriate method of estimating this equation is OLS method. To estimate this model we use
table 2.1 & obtaining the following results in deviation forms.
yixi 1.42 xi 2
0.00064
yi 25826.4
2
ei 2
25,654
24 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
yi Y 753.4
xi x 0.056
n n
1.424
ˆ
2225
0.00064
ˆ Yˆ ˆxˆ From equation number 2.14
ˆ 753.4 (1.424 x0.056) 878
Then the estimated regression line will be
Yˆ 878 2225 Xi
vi. Evaluation of the estimates:-
After estimation & obtaining values of the coefficient ( ˆ & ˆ ) of the variable we should have to
evaluate the results obtained using Econometrics method
a) Economic interpretation of the results:- at this stage check whether the obtained results
are economically meaningful or not.
Yˆ 878 2225xi
i. If interest rate is zero i.e Xt =0 then Yˆ 878 means if the interest rate is zero investment
will be equal to 878 birr. This is the interpretation of the constant term ̂ in case of
linear equation
ii. ˆi Indicates that if rate of interest increase/decrease by 1 birr then investment will
decrease/increase on the average by 2225 birr. Therefore it passes the economic criterion
because it explains the inverse relationship between investment & interest rate.
b) Since the model passes a prior economic criterion, the next step is to test the reliability of
the estimated parameters using statistical tests using r2, & S.E tests.
i. the correlation coefficient test r2
To estimate r2 we can use the formula of 2.31-2.35 let‟s us
r 2 ˆ
xiyi
yi 2
We know that ˆ = -2.225 xiyi 5.424. yi 2
25,826.4
2.225(1.424)
r2 0.123
25.8264.4
This means 12.3% of the change investment is accounted (explained) by interest rate &
the remaining 87.71, is not explained by rate of interest but by some other factors
represented by Ui in our model.
ii. Standard error test:-
S.E ( ˆ ) ˆu 2 x 2
n2
25 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
S.E( ˆ )
ei xi 2 2
n 2 n xi 2
S.E ( ( ˆ )
ei 2
=
22658
=
22658
=2103.661
n2 xi 2
8(0.00064) 0.00512
Having obtained the value of S.E (ˆ ) and S.E ( ( ˆ ) we can undertake the S.E Test using
hypothesis testing.
ˆ
Test of S.E (ˆ ) : If S .E (ˆ ) then we can reject the null hypothesis Ho= ˆ 0 &
2
accept the alternative that H1= ˆ 0 .
ˆ 878 & S.E (ˆ ) = 133.046
ˆ 876
= 439> 133.046
2 2
ˆ
Then S .E (ˆ ) we can conclude that rejecting the null hypothesis & accept the
2
alternative.
Test of S.E ( ˆ ) : If S .E ( ˆ ) we can accept the null hypothesis and reject alternative.
2
2225
S .E ( ˆ ) - 2103.661 & = = 1112.5
2 2
Since S .E ( ˆ ) we accept the null hypothesis & reject the alternative
2
27 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
Table 2.1
Invest- Rates of
ment interest (Yi Y ) ( Xi x ) ei =
59 804 0.045 50.6 -0.011 2560.36 0.000121 -0.5566 777.875 26.125 682.5156
60 836 0.045 82.6 -0.011 6822.76 0.000121 -0.9086 777.875 58.125 3378.516
61 765 0.055 11.6 -0.001 134.56 0.000001 -0.0116 755.625 9.375 87.89
62 777 0.06 23.6 0.004 556.96 0.000016 0.0944 744.5 32.5 1056.25
63 711 0.06 -42.4 0.004 1797.76 0.000016 -0.1696 744.5 -33.5 1122.25
64 755 0.06 1.6 0.004 2.56 0.000016 0.0064 744.5 10.5 110.25
65 745 0.05 -6.4 -0.006 40.96 0.000036 0.0384 766.75 -10.75 390.0625
66 696 0.07 -57.4 0.14 3294.76 0.000196 -0.8036 722.25 -26.25 689.0625
67 787 0.065 33.6 0.009 1128.96 0.000081 0.3024 733.375 53.625 2875.641
Yi Xi
yi xi yi xi yixi yˆi ei ei
2 2 2 2
=7,534 =0.56
28 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
Table 2.2
Log Y-
0
58
8 17 0.9 1.23 -0.317 -0.399 0.1004 0.1596 0.1286 0.9163 -0.0132 . 000176
59 12 27 1.079 1.431 -141 -0.1986 0.0198 0.0394 0.02797 1.069 0.0101 0.000103
60 15 36 1.176 1.556 -0.044 -0.0737 0.00193 0.00543 0.00323 1.164 0.0121 0.000146
61 18 46 1.255 1.663 0.0352 0.0327 0.001244 0.00107 0.00115 1.245 0.01037 0.000108
62 22 57 1.342 1.756 0.1224 0.1258 0.0149 0.0158 0.0154 1.316 0.0267 0.000716
63 23 67 1.361 1.826 0.1417 0.196 0.02 0.0384 0.0278 1.369 -0.00729 0.0000531
64 26 81 1.415 1.908 0.1949 0.278 0.038 0.0776 0.0543 1.431 -0.01668 0.000278
log X log xi 2
log yi 2
log Yi log Ŷ ei 2
log Yi =11.37
=0.1965 =0.3374 logx =8.51 =0.00158
8.53 Y X =0.2564
=1.22 =1.624
29 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
2.7. Properties of the least square Estimates
xi
̂ kiYi Where ki
xi 2
30 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
1
We know that , x & ki are constant then ̂ is a linear function of the dependent variable Yi
n
Thus both ˆ & ˆ are expressed as linear function of the Y's.
2) Unbiased ness:- The bias of the estimation is defined as the difference between its expected value & the true
parameter.
Bias =E ( ˆ )
If the estimation is unbiased its bias is zero
i.e. E ( ˆ ) we proved this in the previous page equation number 2.42
Again bias =E (ˆ ) if the estimation is unbiased its bias is zero E (ˆ ) Also we proved this & you can refer
equation number 2.46
3) The minimum variance property (best estimator)
The property of minimum variance is the main reason for the popularity of the OLS method. Best in this sense means
definitely superior. One should know that when we say OLS estimator is best estimator it will have a minimum variance
as compared to any estimators obtained using other econometric methods such as 2SLS, 3SLS.
Maximum Likely hood estimators etc.
31 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
Probability
Distribution A Distribution B
x6 x2 x3 x4 x5 x8 x7 x1
Given the distribution A & B. then if the true population were B, then the probability that we would have obtained the
sample shown would be quite small. But if the true population were A then the probability that we would have drawn the
sample would be substantially large. Select the observation from population A as the most likely to have yielded the
observed data.We define the maximum likelihood estimator of as the value of ˆ which would most likely generate the
observed sample observations Y1, Y2, Y3 --- Yn. Then if Yi is normally distributed & each of Y's is drawn independently
then the maximum-likelihood estimation maximizes. P(Y1) P(Y2) . . . P(Yn)Where each P represents a probability
associated with the normal distribution.P(Y1) P(Y2)--- P(Yn) is often referred to as the likelihood function. The likelihood
function depends up on not only on the sample values but also in the unknown parameters of the problems (ˆ & ˆ ) . In
describing the likelihood function we often think of the unknown parameters as varying while the Y's (dependent
variables) are fixed.This seems reasonable because finding the maximum likelihood estimation involves a search over
alternative parameter estimators which would be most likely to generate the given sample. For this reason the likelihood
function must be interpreted differently from the joint probability distribution. In the latter case the Y's are allowed to vary
& the underlying parameters are fixed & the reverse is true in case of maximum likelihood. Now we are in a position to
search for the maximum likelihood estimators of the parameters of the two variable regression models.
Yi xi ui 2.1
We know that Yi N ( xi , 2 )
I.e Y is normally distributed with mean ( xi ) & variance 2
Assume that all the assumptions of least squares & further assume that the disturbance term has normal distribution. Will
the estimators ( ) different from the least square estimators? Will such estimators possess the desirable properties?
In our model Yi Xi Ui sample consists 'n' observations in Y&X. Then we will have a mean value
of (1 , 1 , x1 ) , ( 2 , 2 , x2 ) , ( 3 , 3 , x3 ) ... ( n , n , xn ) but a common variance of 2 . Why we will have a different
mean but a constant variance? The reason is simple that a random variable X assumes different value with a
probability of density function of f(x) but fixed values of Yi. The joint probability density function can be written as a
product of n individual density function
32 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
F[Y1 , Y2 , Y3 ...Yn / 1 1 x1 , 2 ] = f Y1 / 1 1 x1 , 2 f Y2 / 2 2 x2 , 2
f [Y3 / 3 3 x3 , ] f Y4 / 41 4 x4 , ... f Yn / n n xn ,
2 2 2
2.2
This probability distribution may be written as follows
F Y 2 2 1
Exp Yi xi ... f y n / n n xn , 2 ---------------2.3
1
2 2
2
Where exp. denotes exponential function. Then the likelihood function
L[Y1 , Y2 ...Yn , , , 2 ] = PY1 PY2 ) P(Yn 1) P(Yn
Let the likelihood function is represented by L
1 Y1 1 1 xi
2
1 Y2 2 2 x2 2
L 2 2 2
1
ex
2
2 2
1
exp
2 2
1 Y 3 31 x3
2
1 Y 4 4 x4 2
* 2 2
1
2 exp 3
2 2 1
2
exp 4
2 2
1 Yn n n x n n
1
2
. . . 2 exp
2
2.4
2
Sum it over i.e. from 1 up to n.
1
1 Y i1 ixi
2
n
2
L 2 ) exp i 2
n
2.5
2
If you represent the N product factor by
1 Yi iXi
1 2
L N 2 exp
^ 2
-----------------------------2.6
2
i 1 2
The value of Y1,Y2 ...Yn are given but the value of & , 2 are not known then this function can be
called likelihood function and denoted by Lf ( , & 2 )
1 Yi ixi
2
1
Lf ( , , ) N (2 ) exp
2
------------------2.7
2 2
2
And the equation can be written as
1 Yi ixi
2
1
Lf ( , , ) N 2
exp -------------------2.8
2 2 2
Take log of L
n n
ln Lf log 2 log 2 1 2
2 Yi i xi 2.9
2
2 2
Differentiate w.r.t. , &
2
setting these with equal to zero.
ln Lf 1
2 2
(Yi Xi )(1) 0 2.10
ln Lf 1
2 2
(Yi Xi )2 Xi 0 2.11
33 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)
ln Lf n 1
2
2 2
2 4
(Yi Xi )2 Xi 0 2.12
Equation number 3.10 will be equal to
(Yi Xi )2 Xi 0 2.13
Since 1 2 0
2
Equation no 3.11 will be again equal to
(Yi Xi )( Xi ) 0 2.14
Since 1 2 0
2
Substitute equation 2.13 in to equation 3.14 and we will have
~
Yi n~ xi -------------------------------------2.15
~ ~
Yi Xi N X Xi 2 2.16
The two equations again will give us the same normal equation OLS. From equation number 3.12 we can to
obtain the value of 2
n 1
Yi Xi ) 2 =0
2 2
2 4
1
Since 4 0 we left with
2
n
(Yi Xi ) 2 0
2 2
n
(Yi Xi ) 2 2 2
2 (Yi Xi ) 2 n
1
2 (Yi Xi ) 2
n
1
2
n
(Yi ˆ ˆXi ) 2
The ML estimation is different from OLS estimator of OLS. The variance was
ei 2
in OLS but it is
1
ui 2
n2 n
in case of ML.Thus the variance of ML is biased estimator of 2 but it is unbiased in the case of OLS. But as the
sample size increases the ML variance converges to the true population variance.
34 | P a g e Jimma University; Department of Economics BY Alemu A.and Getache K.( Phd candidates)