Professional Documents
Culture Documents
Hypothesis Testing
In brief
Interval Estimation Hypothesis Testing
• You collect sample(s) • You collect sample(s)
• Make a guess about the range of • You want make (more) precise statements
population parameters – lowest and highest about the parameter values
values
• Can μ > 50?
• μ ε [μL, μH]. What are μL and μH?
• Is μ1 > μ2?
• μ1 – μ2 ε [a, b]. What are a and b?
• Talk about probabilities of certain μ
• Talk about range of μ
Statistical Hypothesis - I
• An assertion or conjecture
• About distribution on one or more random variables
• μ = 7, μ 9,
• A hypothesis, if true, might completely specify the distribution, then it is
simple hypothesis. (μ = 7)
• If not, composite hypothesis (μ 9)
Statistical Hypothesis - II
• Importance of alternative hypothesis
• X ≈ N(μ, 1). Sample.
• To test this hyp, we observe a sample, and based on this, we have to decide
whether or not to accept H0.
• We define a region “critical region” with the proviso that the hypothesis is to
be rejected if the value is in the critical region.
Significance levels and errors
• In our example, variance =1, and sd =1.
• SE of mean =
• 95% CI =>
• Reject the null ( when sample average differs from 1 by more than 1.96
divided by sq. root of the sample size.
• Type I error: rejecting null when it is supported by data.
• Type II errors: fail to reject the null when it is false.
Significance levels
• We are not determining is H0 is ”true” but only if its validity is consistent with
the resultant data.
• Thus, H0 is rejected if the resultant data are unlikely when H0 is true.
• Specify , and then require the test to have the property that whenever H0 is
true, its probability of being rejected is never greater than .
• Value of is the level of significance of the test.
• It is usually set in advance, common values: 0.1; 0.05; 0.01
Basic Method
• Suppose
• To develop a test of , at the level of significance is to start by determining a
point estimate of say d(X).
• The hypothesis is rejected if d(X) is “far away” from the region .
• To determine how “far away” it needs to be for us to reject , we need to
determine the probability dist of d(X) when is true.
• This will give us the critical region to make the test have the required
significance level
Die example
• 600 rolls of the die
• H0 : die is fair
Errors Possible
To test H0, set , and then require the test to have the probability of Type I error
occurring can never be greater than
To be more precise, this is what we mean by α and β
Critical Value
• If we desire that the test has significance level then we must determine the
critical value c that will make the type I error =
• We can determine whether or not to accept the null hypothesis by computing,
first, the value of the test statistic,
• And second, the probability that a unit normal would (in absolute value) exceed
that quantity.
• This probability is called the p-value of the test – gives the critical significance
level.
• We will fail to reject H0 if the significance level is less than the p-value, and
reject if it is greater than or equal.
Type I and Type II errors again
• : probability of committing a Type 1 error.
• : probability of committing a Type II error
• As decreases (level of significance increases), Type I error decreases
• As decreases, probability of Type II error increases.
• Delicate balance!
Central Idea
Type I Error: reject H0 when it is correct.
• 5 cans of tuna filled by machines. The quality assurance manager wishes to test the variability
of two machines.
• Machine 1: n=25; mean: 5.0492
• Machine 2: n=22; mean: 4.9808
• Variance 1: 0.1130
• Variance 2: 0.0137 oz.
• Question: is this difference due to sampling error or is it statistically significant?
• Use F test to compare variances.
REGRESSION- ECOTRIX
• In this course we have learnt to measure effects. Students understand more of statistics concepts in evening classes
then morning classes. But does “evening” open up students’ brains? Does “evening” or “moon light” improve
students’ comprehension skills?
• Causal Effects – Next leap in data comprehension!
• Regress Wage Education:
– Does education affect wage?
– Wagei = α + β * Educationi + εi
• Regress Wage Education Gender:
– For a given gender, does education affect wage?
– For a given education, does gender affect wage?
– Wagei = α + β1 * Educationi + β2 * Genderi + εi