Professional Documents
Culture Documents
Covid19 has breaked out in 2019 and its consequences effect on the whole
economy. Many factors can contribute to COVID infection, including people who
frequently go out without wearing a mask (face covering), do not maintain social
distance, frequently go out to work, do not use hand sanitizer, or age (older people are
more likely to get Covid positive than young people). Therefore, I would like to check
whether the Covid infection rate depends on mask, employment status and age using
in an econometric model. In this model, the infection rate is dependence variable, mask,
employment status and age are independent variables.
The economic model for infection rate can be defined by the dependent variable is
infection rate and independent variable are masks, employmentstatus and age. Each variable
with assigned probabilities and estimates, intercepts (β1 ), slope (β2 , β3 , β4 ) the random
variable (u𝑖 ) known as error term or residual.
- The infection rate is β1 when variabke masks, employmentstatus and age equal to
zero. β1 knowns as intercept and parameter.
- When employmentstatus and age are constant, if masks increase by 1 unit, infection
rate will increase by β2 unit. β2 knowns as slope and parameter.
- When masks and age are constant, if employmentstatus increase by 1 unit, the
infection rate will increase by β3 unit. β3 knowns as slope and parameter.
- When masks and employmentstatus are constant, if age increase by 1 unit, the
infection rate will increase by β4 unit. β4 knowns as slope and parameter.
Table 1 show the list of all variable taken from Professor Md Manir Hossain Mollah.
Responser have to answer for all the question including how many time you get Covid
positive? Did you wear mask when you go out ( yes =1, no=0) ? How many job do you
have? What is your age?
Table 1: The list of all variable
2.3 Methodology
Correlation coefficient is calculated using the correlation command, “cor infecrate masks
employmentstatus age” to find out the linear relationship between the variables. The
correlation between the variables can be evaluated as listed below:
- The correlation coefficient between infecrate and masks is 0.1977 , that is closer to
1, means that there is a very strong positive relationship between both the variables.
- The correlation coefficient between infecrate and age is 0.047 , that is close to 1
means that there is a positive relationship between both variables.
- The positive sign evidently implies that the trend line in the scattered diagram will be
upward sloping and that the variables would change in the same direction.
- The correlation between the same variables, such as covidp and covidp,
employmentstatus and employmentstatus, or age and age, is exactly 1.0000, which
means that there is a perfect positive correlation between the variables.
- There is no correlation between the variables that is exactly 0, indicating that there is
no weak relationship between the variables.
3.2 Regression analysis
- Interpretation of β : When masks, employmentstatus and age are constant (i.e. mask=
employment status = age=0), infection rate on average is 1.237793.
The individual hypothesis test or t-test is used to compare the means of two groups to
determine whether a process or treatment actually has an effect on the population of
interest, or whether two groups are different from one another. It is given that the model
has a 95% arbitrary confidence interval, 5% level of significance, with degrees of
freedom is 35 (df=n-k=39-4=35). So, the t-critical or t-tabulated value for the model is
± 2 with P(-t <t<t ) = 0.95.
The null 0.025 −0.025 hypothesis is failed to reject if the calculated t value falls in the
acceptance region, and the null hypothesis is rejected if it falls in the rejection region
(critical region) . The graph below is represent the acceptance and rejection (critical)
regions on a t-distribution curve:
Fig 1 : T- distribution graph
0.5027887−0
t 𝑐𝑎𝑙𝑐𝑢𝑙𝑎𝑡𝑒 = 1.11
0.4518876
0.0237498−0
t 𝑐𝑎𝑙𝑐𝑢𝑙𝑎𝑡𝑒 = = 0.21
0.1130416
0.0013307−0
t 𝑐𝑎𝑙𝑐𝑢𝑙𝑎𝑡𝑒 = = 0.19
0.0070337
F𝑐𝑎𝑙𝑐𝑢𝑙𝑎𝑡𝑒 < F𝑐𝑟𝑖𝑡𝑖𝑐𝑎𝑙 ( 0.05 < 2.87) indicated that the Null hypothesis is not rejected
and the model is not significant.
The table above shows that R-square is 0.0411 which mean that 4.11% of the variation
of infection rate can be explained by the independent variables or regression ( or masks,
employmentstatus and age). The R-squared value increases with more and more independent
variables, but degrees of freedom goes down. The Adjusted R-square is -0.0411.
The table show that 95% confidence interval for β2 is -0.4145918 to 1.420169, β3 is
-0.257369 to 0.2532364, and β4 is -0.0129486 to 0.01561, means that if 100 confidence
intervals are 4 constructed, 95 of them will contain β2 , β3 , β4 and 5 of them will not.
With the significance level 0.05 and degree of freedom 3, therefore F0.05,3 = 7.815
The table 9 show that χ2 𝑐𝑎𝑙𝑐𝑢𝑙𝑎𝑡𝑒 = 4.2 , which is less than the critical value 7.185.
Therefore, we cannot reject the null pypothesis mean that U1 is normally distributed.
A Chow test is simply a test of whether the coefficients estimated over one group of the data
are equal to the coefficients estimated over another.
To know if there is any discrepancy between people wear mask and without wear
mask have different that might significantly affect theinfection rate, the dummy
variable “mask” can be integrated in the model (marks = 1 if people wear mask and
masks = 0 if people do not wear mask).
= β1 + β2 + (β3 + β4 ) employmentstatus
= β1 + β3 𝑒𝑚𝑝𝑙𝑜𝑦𝑚𝑒𝑛𝑡𝑠𝑡𝑎𝑡𝑢𝑠
The slope and intercept for masks and without masks in graph 4 consists of straight
“pooled” regression lines which implies that it does not observe any marginal effect
within the time series of the sample collected.
3.2.8 Heteroscedasticity Diagnostic Tests
Econometrics model:
After finding the regression , we find û and û 2 and run regression û 2 with
other independent variables.
Because F𝑐𝑎𝑙𝑐𝑢𝑙𝑎𝑡𝑒 < F𝑐𝑟𝑖𝑡𝑖𝑐𝑎𝑙 (0.28< 2.87), we cannot reject the null hypothesis mean that
there is no heteroscedasticity in the model.
White test
White test is a statistical test that establishes whether the variance of the errors in a
regression model is constant, using to check homoskedasticity.
Econometrics model
Axiliaxry model :
Û 2 = ⍺1 + ⍺2 ∗ 𝑚𝑎𝑠𝑘𝑠 + ⍺3 ∗ 𝑒𝑚𝑝𝑙𝑜𝑦𝑚𝑒𝑛𝑡𝑠𝑡𝑎𝑡𝑢𝑠 + ⍺4 ∗ 𝑎𝑔𝑒 + ⍺5 ∗
(𝑚𝑎𝑠𝑘𝑠)2 + ⍺6 (𝑒𝑚𝑝𝑙𝑜𝑦𝑚𝑒𝑛𝑡𝑠𝑡𝑎𝑡𝑢𝑠 )2 + (𝑎𝑔𝑒)2 ⍺7 + ⍺8 ∗ (𝑚𝑎𝑠𝑘𝑠 ∗
𝑒𝑚𝑝𝑙𝑜𝑦𝑚𝑒𝑛𝑡𝑠𝑡𝑎𝑡𝑢𝑠 ∗ 𝑎𝑔𝑒) …………. Model (2)
Alternative hypothesis (H𝐴 ): At least one of parameter is not equal to zero. There is
heterosceddasticity in model (1)
As we can see on the table F𝑐𝑎𝑙𝑐𝑢𝑙𝑎𝑡𝑒𝑑 = 0.72 < F0.05,5,33 = 2.5026 therefore, there is
no heterosceddasticity in model (1)
4. Conclusion