Professional Documents
Culture Documents
Example problem
• Agricultural extension service wants to determine whether the mean yield per
acre of soybean, due to the introduction of a new variety in a district has
increased during the current year over the mean yield of the last year. Last
year’s mean yield, μ1, was 1100 kg per hectare. The current year’s mean yield
μ2, is unknown.
• To estimate μ2 , samples are collected from 50 farmers of each of the 14 talukas
across the district. The mean and standard deviation are calculated for this
sample set. This mean is = 1150, how do we know μ2 is greater than μ1 ?
Last year’s mean yield, μ1, was 1100 kg per hectare. To estimate current year’s mean yield μ2 , samples are collected.
Sample mean is = 1150, how do we know μ2 is greater than μ1 ?
μ2 ?
= 1150
z =
μ2
= 1150
Probability of finding = 1150 when . If it is less than a certain value, we can say that being greater than 1100 is plausible
Given that σ =120, what is the probability of = 1150 or greater?
Hence if we consider a significance level of 1%, then it is plausible that is greater than 1100
This is the sort of question that hypothesis tests are designed to address.
Hypothesis Testing
• Null Hypothesis – Nothing has changed, or the sample mean is opposed to the Null Hypothesis because of random
variation. i.e. it has occurred due to the small likelihood that it may.
• Alternate Hypothesis – The sample mean accurately reflects the underlying population. Hence if the sample mean is
greater than the old µ, it is because the new µ is greater.
Null Hypothesis H0 : μ ≤ 1100
Alternate Hypothesis H1 : μ > 1100
• Compute the test statistic for the observed value (), when H0 is
true
X ~ N(1100, ) z=
𝜎𝑋
• Identify the rejection region for the test statistic (All values with
disagreement equal to and greater than the observed value)
• Probability of finding the test statistic in the rejection region is
the p-value p-value
• If the p-value is greater than the significance level then H0 is
plausible
= 1150
Problem
A certain type of automobile engine emits a mean of 100 mg/s of oxides of nitrogen (NOx ). A modification to the engine
design has been proposed that may reduce NOx emissions. The new design will be put into production if it can be
demonstrated that its mean emission rate is less than 100 mg/s. A sample of 50 modified engines are built and tested. The
sample mean NOx emission is 92 mg/s, and the sample standard deviation is 21 mg/s.
Therefore, if H0 is true, the probability of a result as extreme as or more extreme than that observed is only
2.04 %. The evidence against H0 is pretty strong. It would be prudent to reject H0 and to recalibrate the scale.
One-tailed and two-tailed tests
• Whenever p-value is less than a particular threshold, the result is said to be “statistically significant”
at the 100α% level.
• The null hypothesis is rejected at the 100α% level.
Problem:
A hypothesis test is performed of the null hypothesis H0 : μ = 0. The p-value turns out to be 0.03. Is the
result statistically significant at the 10% level? The 5% level? The 1% level? Is the null hypothesis
rejected at the 10% level? The 5% level? The 1% level?
Significance level (terminology)
• Whenever p-value is less than a particular threshold, the result is said to be
“statistically significant” at the 100α % level.
• The null hypothesis is rejected at the 100α % level.
• Reporting the p-value rather than just the significance level gives more
information about the strength of the evidence against the null hypothesis and
allows each reader to decide for himself or herself whether to reject
• The equality always goes with the Null hypothesis because we want to test H0 with the value
closest to H1 . Without the equality we would not have a Null distribution with any value closest
to H1.
• The P-value is not the probability that H0 is true. H0 Is true or not true – since it is a phenomena
from the real world. It is the probability of finding an of a certain value given H0.
Other notes
• Note that the method we have just described uses the Central Limit Theorem. It
follows that for this method to be valid, the sample size must be reasonably
large, say 30 or more.
• For a large enough sample we may equate the sample standard deviation with
the population standard deviation.
i.e. s = ( )
• The calculation of the p-value was done by computing a z-score. For this reason,
the z-score is called a test statistic. A test that uses a z-score as a test statistic is
called a z test.
• What we covered is a ‘large sample test’. When large enough samples are not
available (< 30), the z test is not used.
• Important to select H0 correctly:
• Specifications for a water pipe call for a mean breaking strength μ of more
than 2000 lb per linear foot. Engineers will perform a hypothesis test to
decide whether or not to use a certain kind of pipe. They will select a random
sample of 1 ft sections of pipe, measure their breaking strengths.
• Assume the engineers test H0: μ ≥ 2000 versus H1 : μ < 2000. Will the
engineers decide to use the pipe if H0 is rejected? What if H0 is not rejected?
End of slide deck