Professional Documents
Culture Documents
Instructions:
1) You have to work alone. You are welcome to use lecture notes and any other supplementary
materials;
2) We will have the oral part of the exam where you have to explain and defend your answers;
3) For the exam, you have to find your dataset from the exam_data.xlsx file. We will refer to it as “your
dataset”.This data is different for everybody. The student who uses another person’s dataset will receive
an automatic zero grade.
4) Please submit your final answer in pdf format. I prefer typing. If you use handwriting make sure that it
is legible. Otherwise, your final mark will be affected.
5) I encourage you to draw graphs while solving questions.
31/07/2020
2
Hypothesis Testing Regarding 1 Population [60 percent,
each question carries 4 percent weight]
Test of Hypothesis about population mean when standard deviation is unknown
1) Assume that your dataset is a sample from the population, and each number refers to a daily
expenditure of a given student. Now, I am interested in the population average i.e. mean
expenditure of ADA students. However, the problem is I do not have the full population data.
What is your estimator and estimate of the population mean?
sigma 1.633
confidence
level 0.95
alpha 0.05
n 30
5.13333
x-hat 3
1.95996
z_alpha/2 4
0.58435
E 1
5.71768
Upper limit 4
4.54898
Lower limit 3
2) Build 90%, 95% and 99% confidence intervals for the population mean. Note that we do not
know the standard deviation. Remember of “Chinese product” joke.
3) How many observations do I need to have the margin of error equal to 2 at 95% confidence
level?
We see that the margin of error tells how many percentage points your results will differ from the real
population value. For example, a 95% confidence interval with a 2 percent margin of error means that
your statistic will be within 2 percentage points of the real population value 95% of the time.
N= 7
4) Take the average of the first three numbers in your dataset, let’s call it ‘c’. Test the hypothesis
that the true mean of expenditures is equal to ‘c’. Choose 5% level of significance and perform
the test using critical value approach. Moreover, you have to perform the same test in left-
tailed, right-tailed and two-tailed version but keep 𝛼=0.05 in all cases.
𝛼=0.05
Z = (x-bar - µ)/ σ
So Z calculated is less than z value we say that the average expenditure for 3 datasets is not equal to
true mean and accept the null hypothesis
5) Perform the same hypothesis testing (right,left and two-tailed version) using p-value approach.
Define p-value and find an approximate p value using the appropriate table. Again keep 𝛼=
0.05.
A p value is used in hypothesis testing to help you support or reject the null hypothesis.
Z = -1.8960
C = 0.95
Poor students
4
2
4
4
4
3
3
4
3
4
4
sigma 0.69
confidence level 0.95
alpha 0.05
n 11
x-hat 3.56
z_alpha/2 1.959963985
E 0.029547569
Upper limit 3.589547569
Lower limit -0.029547569
2) Build 90%, 95% and 99% confidence intervals for the population proportion.
90%
Z = 0.1229
95%
Z = 1.3009
99%
Z = 1.53
3) How many observations do I need to have the margin of error equal to 2 at 95% confidence
level?
N <= 7
4) Take the average of the first three numbers in your dataset and divide that by 10, and let’s call it
‘c’. Test the hypothesis that the true population proportion is equal to ‘c’. Choose 5% level of
significance and perform the test using critical value approach. Moreover, you have to perform
the same test in left-tailed, right-tailed and two-tailed version but keep 𝛼=0.05 in all cases.
4
2
4
c 3.333333333
sigma 0.333333333
5) Perform the same hypothesis testing (right,left and two-tailed version) using p-value approach. Define
p-value and find an approximate p value using the appropriate table. Again keep 𝛼= 0.05.
A p value is used in hypothesis testing to help you support or reject the null hypothesis.
Z = 3.1757
C = 0.95
Populaton1 Population2
7 5
8 4
7 8
5 4
4 4
2 3
5 8
5 3
5 4
5 3
5 4
5 5
6 7
6 4
5 8
Average
5.33 4.93
Standard deviation
1.35 1.81
Variance
1.82 3.26
Hence we accept null hypothesis that variances of two populations are not equal
2) What is the test statistic for testing the hypothesis about the equality of two population
variances? What is the distribution of the test statistic? Try to draw it and show some
characteristics of the distribution.
Assume these are the results of a die rolling experiment such that the value 1 appeared 6 times, the
value of 2 appeared 5 times and so on. Let me show it in a table format:
χ2 for α =0.05 is
χ2=Σ (O−E)2 / E
7
8
7
5
4
2
The critical value approach used is The level of significance which is selected in using the given α =0.05
that dictates the critical value.
sum[ (O-E)^2/E] 0.08 0.02 1.33 1.02 0.08 0.33 2.88 Chi squre value
Obtained chi square is 2.88 less than chi square using table so we accept the null hypothesis that the die
is fair that is been used
3) Define what is a p-value. Use p-value approach to test the same hypothesis. You are allowed to
use approximations based on the appropriate tables.
We state the Central Limit Theorem.as that if suppose we have a population with mean μ and
standard deviation σ and take sufficiently large random samples from the population with
replacement , then the distribution of the sample means will be approximately normally
distributed.
In the case of the law of large numbers states that the sample mean of independent and
identically distributed observations converges to a certain value.