Professional Documents
Culture Documents
Ktee 309.1 Ecometrics Report
Ktee 309.1 Ecometrics Report
ECONOMETRIC REPORT
Topic: Factors that affect AQI
Hanoi 10/2019 1
Factors that affect AQI
Table of Contents
I. Introduction .......................................................................................................................... 3
II. Literature review ................................................................................................................ 3
1. Question of interest ............................................................................................................ 3
2. Procedure and program used .............................................................................................. 4
III. Economic model................................................................................................................. 5
1. Specifying the object for modeling .................................................................................... 5
2. Defining the target for modeling by the choice of the variables to analyze, denote {𝑥𝑖} . 5
3. Embedding that target in a general unrestricted model (GUM) ......................................... 5
IV. Econometric model ............................................................................................................ 6
V. Data collection ..................................................................................................................... 7
1. Data overview .................................................................................................................... 7
2. Data description.................................................................................................................. 7
VI. Estimation of econometric model ..................................................................................... 8
1. Checking the correlation among variables ......................................................................... 8
2. Regression run .................................................................................................................. 10
VII. Diagnosing the model problem ..................................................................................... 13
1. Normality ........................................................................................................................ 13
2. Multicolinearity ................................................................................................................ 14
3. Heteroskedasticity ......................................................................................................................... 14
VIII. Hypothesis postulated ............................................................................................................... 17
IX. Result analysis & Policy implication ........................................................................................... 18
X. Conclusion....................................................................................................................................... 19
XI.References ...................................................................................................................................... 20
2
Factors that affect AQI
I. Introduction
As much as Economy is a meaningful science that determines the social
development in general and national growth in particular, Econometrics is the
use of statistical techniques to understand those issues and test theories. Without
evidence, economic theories are abstract and might have no bearing on reality
(even if they are completely rigorous). Econometrics is a set of tools we can use
to confront theory with real-world data.
Given the data set, our group, which includes five members: Tran Ha
Trang, Khuat Duc Hung, Phan Van Phuc, Nguyen Hoang Linh, and Pham
Phuong Thao, follows the methodology of econometric comprising eight steps
to analyze the data. Note that because of the lack of information on the data set,
all inferences of abbreviations and others are based on assumptions and self-
research. As a result, we hope to have shown clearly our logic and reasoning of
analysis.
To the extent of purpose and resources, there are still deficiencies in this
report, but we look forward to providing readers with a decent view of the
overall of the data set given and the knowledge that we have gained through Dr.
Dinh Thanh Binh’s Econometrics course.
Vietnamese people have been familiar to the term “AQI” recently but we only
know it as an index to measure the air quality, very few of us really understand
it. And the government has said that the AQI of Hanoi is not objective enough
to classify Hanoi as the most polluted city in the world. This declaration has
raised a question among people “What AQI truly is?”. And now people are
horribly confusing with the ranking and the government’s declaration.
For all those reasons, our group choose to analyze the AQI of cities in China to
find out what affects the AQI and help others to understand more about AQI.
We are going to run the Regression model and test out all the necessarily to
truly understand the factors of AQI. Since we are dealing with the dataset with
322 observations, the result would be objective enough to count on.
4
Factors that affect AQI
5
Factors that affect AQI
where:
• 𝛽0 is the intercept of the regression model
• 𝛽𝑖 is the slope coefficient of the independent variable xi
• 𝑢 is the disturbance of the regression model
• 𝛽 ̂0 is the estimator of 𝛽0
• 𝛽̂𝑖 is the estimator of 𝛽𝑖
• 𝑢̂ is the residual (the estimator of 𝑢)
From this model, this report is interested in explaining AQI in terms of
each of the eight independent variables:
(𝑎𝑙𝑡𝑖, 𝑐𝑜𝑎𝑠𝑡𝑎𝑙, 𝑝𝑟𝑒𝑐, 𝑡𝑒𝑚𝑝, 𝑔𝑟𝑒𝑒𝑛, 𝑔𝑑𝑝, 𝑝𝑜𝑝𝑢, 𝑖𝑛𝑐𝑖).
6
Factors that affect AQI
V. Data collection
1. Data overview
This set of data is a secondary one, collected from a given source.
Data source: https://www.kaggle.com/maxwellnee/china-aqi-test
This data is conducted in 2015 and is a set of 322 observations which are
322 cities in China. It shows their API in 2015 and also the correlative factors,
including factors that we have mentioned above in our model.
The structure of Economic data: cross-sectional data.
2. Data description
To get statistic indicators of the variables, in Stata, the following
command is used:
sum aqi alti coastal prec temp green gdp popu inci
where:
• Obs is the number of observations.
• Std. Dev is the standard deviation of the variable.
• Min is the minimum value of the variable.
• Max is the maximum value of the variable.
7
Factors that affect AQI
. corr aqi alti coastal prec temp green gdp popu inci
(obs=323)
300
200
200
AQI
AQI
100
100
0
0
0 1000 2000 3000 4000 5000 0 .2 .4 .6 .8 1
Altitude (m) Coastal (0 for non-coastal and 1 for coastal)
300
300
200
200
AQI
AQI
100
100
0
300
200
200
AQI
AQI
100
100
0
9
Factors that affect AQI
2 2
We have 𝑅𝑤𝑖𝑡ℎ𝑜𝑢𝑡 𝑖𝑛𝑐𝑖 > 𝑅𝑤𝑖𝑡ℎ𝑜𝑢𝑡 𝑔𝑑𝑝 so we will drop inci out of the
model and keep gdp.
2. Regression run
Having checked the required condition of correlation among variables,
the regression model is ready to run. In Stata, this is done by using the
command:
reg aqi alti coastal prec temp green gdp popu
--------------------------------------------------------------------------
10
Factors that affect AQI
11
Factors that affect AQI
➢ Other indicators:
❖ Adjusted coefficient of determination adj R-squared = 0.3444
❖ Total Sum of Squares TSS = 598504.099
❖ Explained Sum of Squares ESS = 214653.49
❖ Residual Sum of Squares RSS = 383850.609
❖ The degree of freedom of Model Dfm = 7
❖ The degree of freedom of residual Dfr = 315
12
Factors that affect AQI
.005
We can also test normality using Skewness Kurtosis test for normality,
using the command:
sktest resid
. sktest resid
Skewness/Kurtosis tests for Normality
------- joint ------
Variable | Obs Pr(Skewness) Pr(Kurtosis) adj chi2(2) Prob>chi2
---------+---------------------------------------------------------------
resid | 323 0.0000 0.0001 33.59 0.0000
13
Factors that affect AQI
. vif
The value of VIF here is lower than 10, indicating that Multicollinearity
is not too worrisome a problem for this set of data.
3. Heteroskedasticity
Heteroskedasticity indicates that the variance of the error term is not
constant, which makes the least squares results no longer efficient and t tests
and F tests results may be misleading. The problem of Heteroskedasticity can be
detected by plotting the residuals against each of the regressors, most popularly
the White’s test. It can be remedied by respecifying the model – look for other
14
Factors that affect AQI
missing variables. In Stata, the imtest white command is used, which stands
for information matric test.
Exhibit 9 shows the result.
Exhibit 9: Heteroskedasticity test
. imtest, white
White's test for Ho: homoskedasticity
against Ha: unrestricted heteroskedasticity
chi2(34) = 82.54
Prob > chi2 = 0.0000
0
-50
-100
15
Factors that affect AQI
From the graph, we can see that there is an increase in the variability,
which means this set of data has Heteroskedasticity problem.
To fix the problem, robust standard errors are used to relax the
assumption that errors are both independent and identically distributed. In Stata,
regression is rerun with the robust option, using the command:
reg aqi alti coastal prec temp green gdp popu, robust
Note that comparing the results with the earlier regression, none of the
coefficient estimates changed, but the standard errors and hence the t values are
different, which gives reasonably more accurate p values.
16
Factors that affect AQI
17
Factors that affect AQI
18
Factors that affect AQI
X. Conclusion
This report is completed on the dedicated contribution of each member
and the knowledge from our study in Econometrics. This also provides us with a
good opportunity to practice what we have learned and to get a deeper
understanding of data analysis and relevant testing. From this useful application,
we hope that our work can somehow suggest you a closer look about AQI and
truly understand it.
Again, due to the limitation of understanding and resources, our report
may contain misinterpretations. We hope that PhD. Dinh Thi Thanh Binh and
readers can give us constructive comments on the report so that we would
improve ourselves and do better in the future.
Sincerely.
19
Factors that affect AQI
XI.References
1. https://www.kaggle.com/maxwellnee/china-aqi-test
2. Basic Econometrics (Fifth Edition), by Damodar N.Gujarati, Dawn C.Porter
3. Introductory Econometrics A Modern Approach (Fifth Edition), by Jeffrey
M.Wooldridge
20