Professional Documents
Culture Documents
i
Statistics and Probability – Senior High School Core Subject
Alternative Delivery Mode
Quarter 4 – Module 2: Formulating Appropriate Null and Alternative Hypotheses on a
Population Mean
Republic Act 8293, section 176 states that no copyright shall subsist in any work of
the Government of the Philippines. However, prior approval of the government agency or office
wherein the work is created shall be necessary for the exploitation of such work for a profit.
Such agency or office may, among other things, impose as a condition the payment of
royalties.
Borrowed materials (i.e. songs, stories, poems, pictures, photos, brand names,
trademarks, etc.) included in this book are owned by their respective copyright holders. Every
effort has been exerted to locate and seek permission to use these materials from their
respective copyright owners. The publisher and authors do not represent nor claim ownership
over them.
ii
INTRODUCTION
This module for Statistics and Probability – Senior High School Core subject was made
through the efforts of the development team. The contents of this module are based on the
most essential learning competencies formulated by the Department of Education.
In this time of global health crisis, the department assures the continuity of the learning
process through several learning modalities – distance learning, online learning, blended
learning, and face to face learning (if the situation permits). While it is true that not all learners
have access to the technologies to enable them to engage in distance and online learning, it
is through the collaborative efforts of educators to address this concern that this module is
created. The intention is to provide a self-paced, comprehensive, and penetrable learning
resource that addresses the most essential expected learning competencies in Statistics and
Probability.
In this series of modules for Senior High School Statistics and Probability, the learners will be
exposed to the types of random variables and their probability distributions, normal probability
distributions, sampling and sampling distributions, estimation of population mean and
proportion, the test of hypotheses, correlation, and regression analysis.
This is an effort to address the learning needs of the students in this time of a pandemic. It is
encouraged that teachers and other education stakeholders give feedback, comments, and
recommendation. We value your feedback.
In the previous module, you learned about the parameters used in hypothesis
testing. You were able to identify the parameters to be tested in given real-life problems. You
also learned how to translate the parameter into mathematical symbols as the first step in
hypothesis testing.
1
What I know
PRE-ASSESSMENT
Before you start studying this module, take this simple test first to find out
how much you already know about the topics in this module.
Directions: Read each item below and choose the correct answer.
2. A vacuum cleaner consumes less than 46 kwh per year. What hypothesis test can
you use in this claim?
A. left-tailed C. null hypothesis
B. right- tailed D. alternative hypothesis
2
8. “The introduction of modern computers affects the performance of the students.”
What kind of hypothesis is it?
A. null B. mean C. alternative D. standard Deviation
9. Consider this statement: “New cars are expected to last an average of at least
three (3) years before needing major service.” Which of the following is the null
hypothesis?
A. 𝐻𝑜: 𝜇 ≤ 3 B. 𝐻𝑜: 𝜇 < 3 C. 𝐻𝑜: 𝜇 > 3 D. 𝐻𝑜: 𝜇 ≥ 3
10. Which is the correct null hypothesis of the claim below? “Students take an
average of less than five (5) years to graduate from college.”
A. 𝐻𝑜: 𝜇 = 5 B. 𝐻𝑜: 𝜇 < 5 C. 𝐻𝑜: 𝜇 ≠ 5 D. 𝐻𝑎: 𝜇 < 5
11. In driver’s test, an average of 300 drivers pass on their first try. We want to test if
more than an average of 300 passes on the first try. Which inequality symbols
is correct (=, ≠, ≥, <, ≤, >) for the null and alternative hypotheses - 𝐻𝑜: 𝜇 __ 300
𝐻𝑎: 𝜇 __ 300?
A. <, > B. =, ≠ C. ≤, ≥ D. = , >
3
LESSON 2.1
Formulating Appropriate Null and Alternative
Hypotheses on a Population Mean
What’s in
REVIEW
ACTIVITY 2.1. No More No Less!
Directions: Which of the given equality/inequality expressions describes each
situation? Select the best answer and write the letter of your choice on a separate
sheet of paper.
1.The survey shows that the number of students (n) who have parents with a house
of their own is less than 20.
A. 𝑛<20 B. 𝑛>20 C. 𝑛≤20 D. 𝑛≥20
2. Mother gives me at most P200 allowance (n) in a week.
A. 𝑛≥200 B. 𝑛≤200 C. 𝑛>200 D. 𝑛<200
3. Larry is an industrious appliance salesman. His average sales (n) in a week is at
least P10, 000.
A. 𝑛<10,000 B. 𝑛> 10,000 C. 𝑛≤10,000 D. 𝑛≥10,000
4. A son’s savings (n) is greater than P1,500.
A. 𝑛=1,500 B. 𝑛≠ 1,500 C. 𝑛> 1,500 D. 𝑛≥ 1,500
5. Marco’s salary (n) is equal to P20, 000.
A. 𝑛= 20,000 B. 𝑛≠20,000 C. 𝑛≤ 20,000 D. 𝑛<20,000
Guide Questions:
1. How did you find the previous activity? Was it easy or difficult?
2. What previously learned principle did you apply in the activity?
3. Were you able to determine the correct expression that correspond to each
situation? Elaborate.
4. Do you think you will apply these activities in formulating null and alternative
hypotheses?
4
What’s new
ACTIVITY 2.2: Differentiate It!
Directions: Examine the pictures below then answer the guide questions that follow.
“Effect of a Fertilizer on Plant Growth”
Figure 1 Figure 2
Guide Questions:
1. What have you observed between the two figures?
2. Do you think the fertilizer has an effect to the plant?
3. What do you think are the variables shown in the pictures?
4. Is there any relationship among the variables in Figure 1 and Figure 2?
5. How does these pictures relate to hypothesis?
What is it
DISCUSSION
-It states that there is no difference -It states that the population parameter
between population parameters (such as has some statistical significance
mean, standard deviation, and so on) (smaller, greater, or different than) with
and the hypothesized value. the hypothesized value.
- The null hypothesis is often an initial - The alternative hypothesis is what you
claim that is based on previous analyses might believe to be true or hope to prove
or specialized knowledge. true.
5
To state the null and alternative hypotheses correctly:
The claim used the word “less than” which as seen in the table above,
corresponds to the symbol (<). Therefore, the answer is n<20.
Illustrative Examples:
1. The average weight of all Grade 11 students in Senior High School is 169cm.
Is this claim t
Solution: First, identify the parameter which is the mean height of all Grade 11
students. Since it is a population mean, use the notation 𝝁. The claim in this example
is that the average weight is 169 cm which translates to 𝝁=𝟏𝟔𝟗 and is considered as
6
null hypothesis. To formulate the alternative hypothesis, write the
complement/opposite of the null hypothesis which is the average weight is not equal
to 169 cm.
𝑯𝒐: The average weight of all Grade 11 students is 169 cm. / 𝑯𝑶:𝝁=𝟏𝟔𝟗 (claim)
𝑯𝒂: The average weight of all Grade 11 students is not 169 cm./ 𝑯𝒂:𝝁≠𝟏𝟔𝟗
Solution: In this hypothesis, the parameter is the average. Therefore, you will use the
symbol μ. The claim is above ₱15,000 can be written as μ > ₱15,000 and greater
than falls at alternative hypothesis, 𝑯𝒂:𝝁>₱𝟏𝟓,𝟎𝟎𝟎. Since you have already
formulated the alternative, the null hypothesis will be 𝑯𝒐:𝝁≤₱𝟏𝟓,𝟎𝟎𝟎 as complement
of >. You can also write your null hypothesis as 𝑯𝒐:𝝁=₱𝟏𝟓,𝟎𝟎𝟎.
𝑯𝒐:𝝁≤₱𝟏𝟓,𝟎𝟎𝟎or 𝑯𝒐:𝝁=₱𝟏𝟓,𝟎𝟎𝟎
𝑯𝒂:𝝁>₱𝟏𝟓,𝟎𝟎𝟎 (claim)
Solution: The parameter used in this example is average (μ) and the claim is that
average is at most 240 minutes. The word ‘at most’ has the notation of (≤) which
means that claim is at the null hypothesis. The null hypothesis would be 𝑯𝟎:μ≤𝟐𝟒𝟎.
To formulate the alternative, use the notation (>) as the complement of (≤). Therefore,
alternative hypothesis is 𝑯𝒂:μ>𝟐𝟒𝟎.
𝑯𝟎 : μ≤𝟐𝟒𝟎 or 𝑯𝟎 : μ=𝟐𝟒𝟎 (claim)
𝑯𝒂:μ>𝟐𝟒𝟎
The alternative hypothesis can take another form depending on the value of the
parameter. The parameter may increase, decrease, or changed from the null value.
An alternative hypothesis predicts not only the difference of sample mean from the
population mean but also how it would be different in a specific direction - lower or
higher. This test is called a directional or one-tailed test because the rejection region
is entirely within one tail of the distribution.
On the other hand, some hypotheses predict only that one value will be different
from another, without additionally predicting which will be higher. The test of such a
hypothesis is nondirectional or two-tailed because an extreme test statistic in either
tail of the distribution (positive or negative) will lead to the rejection of the null
hypothesis of no difference.
7
One-Tailed Two-Tailed
The table below shows the null and alternative hypotheses stated together with the
directional test.
Two-Tailed Test Right-Tailed Test Left-tailed Test
Null 𝐻0 : 𝜇 = 𝜇0 or 𝐻0 : 𝜇 = 𝜇0 or
Hypothesis 𝐻0 : 𝜇 = 𝜇0 𝐻0 : 𝜇 ≤ 𝜇0 𝐻0 : 𝜇 ≥ 𝜇0
Alternative
Hypothesis 𝐻𝑎 : 𝜇 ≠ 𝜇0 𝐻𝑎 : 𝜇 > 𝜇0 𝐻𝑎 : 𝜇 < 𝜇0
Examples:
The parameter is the population mean =85. You can write the hypotheses into
symbols: 𝐻0 ∶𝜇=85 and 𝐻𝑎∶𝜇≠85. The phrase ‘affects performance’ has no clue of the
direction of the study, so it implies either increase or decrease in performance. This
tells you that the test is two-tailed test.
𝑯𝟎 ∶𝝁=𝟖𝟓 and 𝑯𝒂∶𝝁≠𝟖𝟓 (two-tailed test)
2. A piggery owner believes that using organic feeds on his pigs will yield
greater income. His average income from the previous year was ₱120, 000. State
the hypothesis and identify the directional test.
In this example, the null hypothesis is 𝑯𝟎 ∶𝝁=𝟏𝟐𝟎,𝟎𝟎𝟎 . You may notice that the
hypothesis used the phrase ‘greater income’ that is associated with greater than.
Therefore, 𝑯𝒂∶𝝁>𝟏𝟐𝟎,𝟎𝟎𝟎. This hypothesis uses inequality symbol (>) so it is one-
tailed test and it uses greater than which specifically called for the right-tailed test.
𝑯𝟎 ∶𝝁=𝟏𝟐𝟎,𝟎𝟎𝟎 and 𝑯𝒂∶𝝁>𝟏𝟐𝟎,𝟎𝟎𝟎 (right-tailed test)
3. The average waiting time of all costumers in a restaurant before being served
is less than 20 minutes. Determine the hypotheses and the directional test.
8
You may notice that the hypothesis used the phrase ‘less than’ which denotes
that the alternative hypothesis is 𝑯𝒂∶𝝁<𝟐𝟎. This hypothesis uses inequality symbol (<)
so it is one-tailed test and it used less than which specifically called for the left-tailed
test. In this example, the null hypothesis is 𝑯𝟎 ∶𝝁≥𝟐𝟎.
𝑯𝟎 ∶𝝁≥𝟐𝟎 and 𝑯𝒂∶𝝁<𝟐𝟎 (left-tailed test)
WHAT’S MORE
ACTIVITY 2.2.
Directions: Write the null hypothesis and alternative hypothesis in notations for each
given situation.
1. Mrs. Dela Cruz claims that her students scored an average of 91 in their
Mathematics quiz. The master teacher wants to know whether the teacher’s claim is
acceptable or not.
𝐻𝑜: _________________________________________________
𝐻𝑎: _________________________________________________
2. A car manufacturer claims that the mean selling price of all cars manufactured is
only ₱150,000. A consumer agency wants to test whether the mean selling price of all
the cars manufactured exceeds ₱150, 000.
𝐻𝑜: _________________________________________________
𝐻𝑎: _________________________________________________
3. A manufacturer of soft drinks claims that all labeled 1.5-liter bottles contain an
average of 1.49 liters of soft drinks. A retailer wishes to test whether the mean amount
of soft drinks in labeled 1.5-liter bottle is less than 1.49 liters.
𝐻𝑜: _________________________________________________
𝐻𝑎: _________________________________________________
4. A bus company in Manila claims that the mean waiting time for a bus during rush
hour is less than 12 minutes. A random sample of 30 waiting times has a mean of 15
minutes with a standard deviation of 4.8 minutes.
𝐻𝑜: _________________________________________________
𝐻𝑎: _________________________________________________
5. The average power consumption of air conditioner is at most 2,700 watts as claimed
by the owner. A survey made by an electric power company found out that the mean
consumption is 3,000 with standard deviation of 225.
𝐻𝑜: _________________________________________________
𝐻𝑎: _________________________________________________
9
ACTIVITY 2.4 One-Tailed or Two-Tailed!
1. A used car dealer says that the mean price car in the Philippines is at least
₱350,000.
2. PAG-ASA reported that the mean annual rainfall in the Philippines is at most
4,064mm.
4. The mean age of students in a university in the previous years was 27 years old.
An instructor thinks the mean age for students is older than 27. She randomly surveys
56 students and finds that the sample mean is 29 with a standard deviation of 2.
5. The mean work week for engineers in a new company is believed to be about 40
hours. A newly hired engineer hopes that it is shorter. She asks 10 engineering friends
for the lengths of their mean work weeks. Based on the results, should she count on
the mean work week to be shorter than 40 hours?
LESSON 2.2
Identifying Appropriate Test-Statistics
What is in
10
Follow-up Questions:
1. In the items above, how did you differentiate the statements describing standard
normal distribution from those involving t-distribution?
2. Were you able to answer them easily? If not, which item/s did you find difficult to
answer?
3. Were you able to differentiate the statements characterizing normal distribution from
those describing t-distribution?
What’s new
Examples:
1. A Grade 11 researcher reported that the average allowance of Senior High School
students was more than ₱100. A sample of 40 students had mean allowance of ₱120.
At 𝛼=0.01 test, it was the claimed that the students had allowance of more than ₱
100.The standard deviation of the population is ₱50.
𝜇 = 𝟏𝟎𝟎 𝑥̅ = 𝟏𝟐𝟎 𝑛 = 𝟒𝟎 𝜎 = 𝟓𝟎
2. According to a cell phone company, the average price of cellular phone in the
Philippines is ₱12,999. However, in a sample of 20 costumers randomly asked about
the price of their cellular phone, data collected showed an average of ₱9,999 and
standard deviation of ₱7,999. Using 𝛼=0.05 level of significance, is there enough
evidence proving that the average price of cellular phone is less than ₱12,999?
𝜇 = 𝟏𝟐, 𝟗𝟗𝟗 𝑥̅ = 𝟗, 𝟗𝟗𝟗 𝑛 = 𝟐𝟎 𝑠 = 𝟕, 𝟗𝟗𝟗
What is it
DISCUSSION
11
Sample standard deviation (𝒔) is a statistic which means that this measure
of variability is calculated from only some of the individuals in a population.
Population variance (𝝈𝟐 ), in the same sense, indicates how the population
data points are spread out. It is the average of the distances from each data
point in the population to the mean, squared.
Since we already defined important things in identifying the test statistics in hypothesis
testing, let us now determine those concepts when given a problem. Let’s use the
example in Activity 2.
Example:
μ = ₱100 the average allowance of the population (Senior High School students)
𝐧=𝟒𝟎 the number of students taken from all Senior High School students
𝒙̅̅ = ₱120 the mean allowance of the sample
𝛔 = ₱50 the standard deviation of the population
Now you already know how to get the data needed in choosing test statistics.
This time, you will determine what test statistic is appropriate in computing test value
in the hypothesis testing.
A test statistic is a random variable that is calculated from sample data and
used in a hypothesis test. You can use test statistics to determine whether to reject or
accept the null hypothesis. The test statistic compares your data with what is expected
under the null hypothesis.
To identify the test statistic, you must consider whether the population standard
deviation/variance is known or unknown. If the population standard deviation σ is
known, then the mean has a normal distribution. Use z-test. If the population standard
deviation σ is unknown, then the mean has a t- distribution. Use t-test. Instead of the
population standard deviation, use the sample standard deviation.
z-test
In a z-test, the sample is assumed to be normally distributed. A z-score is
calculated with population parameters such as “population mean” and “population
standard deviation”. It is used to validate a hypothesis that the sample drawn belongs
to the same population. When the variance is known and either the distribution is
normal or sample size is large, use a z-test statistic.
t-test
Like a z-test, a t-test also assumes a normal distribution of the sample. A t-test
is used when the population variance or standard deviation are not known. When the
variance is unknown and a sample size is less than 30, use a t-test statistic assuming
that the population is normal or approximately normal.
Central Limit Theorem
12
In Central Limit Theorem, if the population is normally distributed or the
sample size is large and the true population mean 𝜇 = 𝜇0 , then z has a standard
normal distribution.
When population standard deviation σ is not known, we may still use z-score
by replacing the population standard deviation σ by its estimate, sample standard
deviation s. Since the sample is large the resulting test statistic still has a distribution
that is approximately standard normal.
Historically, this was very useful, as most statisticians before did not have
access to the t-table of quantities for very large number of degrees of freedom. But
with modern computers today, using t-test with a very large sample size is not a
problem at all.
However, since you will be using a t-table with only limited number of degrees
of freedom, you will use z-test when the sample size is large even though the
population standard deviation is unknown.
When sample sizes are small, the Central Limit Theorem does not apply.
You must then impose stricter assumptions on the population to give
statistical validity to the test procedure. One common assumption is that the
population from which the sample is taken has a normal probability distribution
to begin with. Under such circumstances, if the population standard deviation is
𝑥 −𝜇0
known, then the test statistic 𝜎 still has the standard normal distribution.
√𝑛
WHAT’S MORE
Activity 3. Which is Which?
Directions: Identify the appropriate test statistic to be used in each problem. Write z-
test or t-test on a separate sheet of paper.
___________1. A sample of n=25 is selected from a normal population, 𝑥̅̅=56 and
s=12.
___________2. Based on the report of the school nurse, the average height of Grade
11 students has increased. Five years ago, the average height of Grade 11 students
was 170cm with standard deviation of 38cm. She took a random sample of 150
students and derived the average height of 165cm.
___________3. Knowing from a previous study that the average of athletes is 80, an
athletic adviser asked how his soccer players are academically doing as compared to
other student athletes. After an initiative to help improve the average of student
athletes, the adviser randomly selected 15 soccer players and found 85 as the average
with standard deviation of 1.25.
___________4. The CEO of a battery manufacturing company claimed that their
batteries would last an average of 280 hours under normal use. A researcher randomly
selected 20 batteries from the production line and tested them. The tested batteries
had a mean life span of 250 hours with a standard deviation of 40 hours. Do we have
enough evidence to suggest that the claim of an average of 280 hours is false?
___________5. It was known that the number of tickets purchased by students at the
ticket window for the volleyball match of two popular universities followed a distribution
that has mean of 500 and standard deviation of 8.9. Suppose that a few hours before
the start of one of these matches, there are 100 eager students standing in line to
purchase tickets. If there are 250 tickets remaining, what is the probability that all 100
students will be able to purchase the tickets they want?
13
WHAT I CAN DO
APPLICATION
Read, Analyze, and Answer!
Directions: Answer the following.
1. In a sample of 𝑛 = 12 selected from a normal population, 𝑥̅ = 50, 𝑠 = 10, and null
hypothesis is 𝑯𝟎 : 𝜇 = 45.
a. What is the number of degrees of freedom?
b. What is the test statistic to be used?
POST - ASSESSMENT
Instructions: Choose the letter of the best answer. Write the chosen letter on a
separate sheet of paper.
1. If the variance is known, what test statistic is appropriate?
A. t-test C. two-tailed test
B. z-test D. one-tailed test
2. One-sample t-statistic is used instead of one-sample z-statistic when
___________________.
A. μ is known. C. μ is unknown
B. σ is known. D. σ is unknown
3. Based on the Central Limit Theorem, when the sample (n) is extremely large and
the variance is unknown, what is the statistical test to be used?
A. t-test C. two-tailed test
B. z-test D. one- tailed test
4. Which of the following is NOT a consideration in using z-test/statistic?
A. Variance is known.
B. Sample standard deviation is known.
C. The population mean is less than 30.
D. Population standard deviation is known.
5. What appropriate tool is applicable if the population is normal, sample standard
deviation is known, and sample is less than 30?
A. t-test C. normal test
B. z-test D. Central Limit Theorem
6. Which is the correct alternative hypothesis for one-tailed test?
A. 𝜇 = 25 B. 𝜇 ≠ 25 C. 𝜇 ≥ 25 D. 𝜇 < 25
14
8. What kind of hypothesis is illustrated below?
The mean score of all Grade 11 students is higher than 75.
A. one-tailed test C. null hypothesis
B. two-tailed test D. alternative hypothesis
9. When the value of parameter has significant difference with the hypothesized value,
then it is called ________________.
A. one-tailed test C. null hypothesis
B. two-tailed test D. alternative hypothesis
10. Which directional test is illustrated in the given problem below?
In a recent survey, the average amount of money students have in their wallet is
₱200.00 with standard deviation of 45. A teacher feels that the average amount is
lower. She surveyed 80 randomly selected students and found that the average
amount is ₱35.
A. left-tailed C. alternative
B. two-tailed D. right tailed
11. Which of the following symbols is NOT needed when t-test is used in computing
values?
A. 𝑛 B. μ C. 𝜎 D. 𝑠
12. If in a sample n=16 selected from a normal population, 𝑥̅̅=56 and 𝑠=12, what
statistical test is applicable to be used?
A. f-test C. z-test
B. t-test D. Central Limit Theorem
13. Based on Central Limit Theorem, the z-test for single sample may be used when
all the following conditions are TRUE except _________________.
A. Sample size is less than 30.
B. Data are normally distributed.
C. Population standard deviation is known.
D. Population standard deviation is unknown.
14. What is the sample standard deviation if a simple random sample of 220 students
is drawn from a population of 2,740 college students? Among the sampled students,
the average IQ score is 115 with standard deviation of 10.
A. 10 B. 115 C. 220 D. 2,740
15. The supervisor of a certain company claimed that the mean workday of his workers
is 8.3 hours per day. A sample of 20 workers was taken and it was found out that the
mean workday is 8 hours with standard deviation of 1 hour. At 0.01 level of
significance, is the mean workday less than 8.3 hours?
What test statistic is to be used in the given problem?
A. z-test C. right-tailed test
B. t-test D. left-tailed test
15