You are on page 1of 12

11

STATISTICS AND PROBABILITY

QUARTER 4 – MODULE 2

Formulating Appropriate Null and Alternative


Hypotheses on a Population Mean

1
What I know
PRE-ASSESSMENT
Before you start studying this module, take this simple test first to find out
how much you already know about the topics in this module.

Directions: Read each item below and choose the correct answer.

1. In formulating the alternative hypothesis, what mathematical symbol is applicable


to use in the statement, “The average score of Grade 11 (ABM) in Business
Statistics is 75.”?
A. < B. > C. = D. ≠

2. A vacuum cleaner consumes less than 46 kwh per year. What hypothesis test can
you use in this claim?
A. left-tailed C. null hypothesis
B. right- tailed D. alternative hypothesis

3. Which of the following steps is not included in formulating hypothesis?


A. Identify the claim to be tested.
B. Translate the claim into mathematical symbols/notations.
C. Use the data about sample then compute the test statistic.
D. Formulate first the null hypothesis and then the alternative hypothesis.
4. The sign of the alternative hypothesis in a left-tailed test is always ________
A. Equal C. less than
B. not equal D. greater than
5. A scientist invented a substance that increases the life of an automobile battery. If
the mean lifetime of the battery is 24 months, then what are his hypotheses?
A. 𝐻𝑜: 𝜇 = 24, 𝐻𝑎: 𝜇 ≠ 24 C. 𝐻𝑜: 𝜇 = 24, 𝐻𝑎: 𝜇 ≤ 24
B. 𝐻𝑜: 𝜇 = 24, 𝐻𝑎: 𝜇 > 24 D. 𝐻𝑜: 𝑝 = 24, 𝐻𝑎: 𝑝 > 24
6. A researcher reports that the average salary of an accountant is more than
₱40,000. A sample of 30 accountants has a mean salary of ₱42,500. At a = 0.05
test, it is found out that an accountant earns more than ₱40,000 a month. The
standard deviation of the population is ₱3,000. What is the alternative hypothesis?
A. The average salary of an accountant is equal to ₱40,000.
B. The average salary of an accountant is greater than ₱40,000.
C. The average salary of an accountant is less than or equal to ₱42,500.
D. The average salary of an accountant is greater than or equal to ₱42,500.
7. What kind of hypothesis is illustrated in statement below?
“There is no significant difference between the average weekly allowances of
morning and afternoon students in Mabunga Integrated High School.”
A. one-tailed test C. null hypothesis
B. two-tailed test D. alternative hypothesis

2
8. “The introduction of modern computers affects the performance of the students.”
What kind of hypothesis is it?
A. null B. mean C. alternative D. standard Deviation
9. Consider this statement: “New cars are expected to last an average of at least
three (3) years before needing major service.” Which of the following is the null
hypothesis?
A. 𝐻𝑜: 𝜇 ≤ 3 B. 𝐻𝑜: 𝜇 < 3 C. 𝐻𝑜: 𝜇 > 3 D. 𝐻𝑜: 𝜇 ≥ 3
10. Which is the correct null hypothesis of the claim below? “Students take an
average of less than five (5) years to graduate from college.”
A. 𝐻𝑜: 𝜇 = 5 B. 𝐻𝑜: 𝜇 < 5 C. 𝐻𝑜: 𝜇 ≠ 5 D. 𝐻𝑎: 𝜇 < 5
11. In driver’s test, an average of 300 drivers pass on their first try. We want to test if
more than an average of 300 passes on the first try. Which inequality symbols
is correct (=, ≠, ≥, <, ≤, >) for the null and alternative hypotheses - 𝐻𝑜: 𝜇 __ 300
𝐻𝑎: 𝜇 __ 300?
A. <, > B. =, ≠ C. ≤, ≥ D. = , >

12. Which of these is a correct alternative hypothesis for a two‐tailed test?


A. 𝐻𝑎: 𝜇 ≠ 7 B. 𝐻𝑎: 𝜇 = 7 C. 𝐻𝑎: 𝜇 > 7 D. 𝐻𝑎: 𝜇 < 7
13. In a commercial, a new diet program would like to claim that their methods result
in a mean weight loss of more than 22kgs in two (2) weeks. To determine if this
is a valid claim, they hire an agency that then selects 25 people to be placed on
this diet. What is the test of hypothesis?
A. null C. one tailed- test
B. alternative D. two tailed- test
14. A researcher estimated that the average height of a building in the Philippines is
at least 150 meters. A random sample of 15 buildings is selected and has the
mean of 168 meters. What are the null and alternative hypotheses?
A. 𝐻𝑜: 𝜇 > 150, 𝐻𝑎: 𝜇 ≤ 150 C. 𝐻𝑜: 𝜇 = 150, 𝐻𝑎: 𝜇 ≥ 150
B. 𝐻𝑜: 𝜇 = 150, 𝐻𝑎: 𝜇 ≠ 150 D. 𝐻𝑜: 𝜇 ≥ 150, 𝐻𝑎: 𝜇 < 150
15. A survey reported that teenagers spend an average at most four (4) hours each
day on social media. The organization thinks that, currently, the mean is higher.
Fifteen (15) randomly chosen teenagers were asked how many hours per day do
they spend on social media. The sample mean was 4.5 hours with a sample
standard deviation of 2.0. What is the test of hypothesis?
A. left-tailed test C. hypothesis test
B. two-tailed test D. right-tailed test

3
LESSON 2.1
Formulating Appropriate Null and Alternative Hypotheses on a
Population Mean

What is it
DISCUSSION

A statistical hypothesis is a statement about a parameter and deals with


evaluating the value of parameter.
In statistical hypothesis testing, there are always two hypotheses: the null and
alternative hypotheses. Below is a comparison between the two.

Null Hypothesis (𝑯𝟎 ) Alternative Hypothesis (𝑯𝒂 )

-It states that there is no difference -It states that the population parameter
between population parameters (such as has some statistical significance
mean, standard deviation, and so on) (smaller, greater, or different than) with
and the hypothesized value. the hypothesized value.

- There is no observed effect. - There is an observed effect.

- The null hypothesis is often an initial - The alternative hypothesis is what you
claim that is based on previous analyses might believe to be true or hope to prove
or specialized knowledge. true.

To state the null and alternative hypotheses correctly:

1. Identify the parameter in a given problem.


2. Identify the claim to be tested that may show up in null or alternative hypothesis.
3. Translate the claim into mathematical symbols/notations.
4. Formulate first the null hypothesis (𝐻𝑜) then alternative hypothesis (𝐻𝑎) based on
the three different ways in writing hypothesis as illustrated below:

𝑯𝒐: μ=𝒌 𝑯𝒐: μ≤𝒌 𝑯𝒐: μ≥𝒌


𝑯𝒂: μ≠𝒌 𝑯𝒂: μ>𝒌 𝑯𝒂: μ<𝒌

4
Hypothesis-Testing Common Phrases

= is equal to ≠ is not equal to


is the same as is not the same
is exactly the same as is different from
has not changed from has changed from
> is increased < is decreased
is greater than is less than
is higher than is lower than
is above is below
is bigger than is smaller than
is longer than is decreased or reduced from
is more than is not more than
≥ is at least ≤ is at most
is not less than is not more than
is greater than or equal to is less than or equal to

Let us take an example from your previous activity.


“The survey shows that the number of students (n) who have parents with
a house of their own is less than 20.”
The claim used the word “less than” which as seen in the table above,
corresponds to the symbol (<). Therefore, the answer is n<20.
Illustrative Examples:

1. The average weight of all Grade 11 students in Senior High School is 169cm.
Is this claim t
Solution: First, identify the parameter which is the mean height of all Grade 11
students. Since it is a population mean, use the notation 𝝁. The claim in this example
is that the average weight is 169 cm which translates to 𝝁=𝟏𝟔𝟗 and is considered as
null hypothesis. To formulate the alternative hypothesis, write the
complement/opposite of the null hypothesis which is the average weight is not equal
to 169 cm.
𝑯𝒐: The average weight of all Grade 11 students is 169 cm. / 𝑯𝑶:𝝁=𝟏𝟔𝟗 (claim)
𝑯𝒂: The average weight of all Grade 11 students is not 169 cm./ 𝑯𝒂:𝝁≠𝟏𝟔𝟗

2. The average price per square meter of residential lot in an exclusive


subdivision is above ₱15,000. A buyer wants to test the agent’s claim.

Solution: In this hypothesis, the parameter is the average. Therefore, you will use the
symbol μ. The claim is above ₱15,000 can be written as μ > ₱15,000 and greater
than falls at alternative hypothesis, 𝑯𝒂:𝝁>₱𝟏𝟓,𝟎𝟎𝟎. Since you have already
formulated the alternative, the null hypothesis will be 𝑯𝒐:𝝁≤₱𝟏𝟓,𝟎𝟎𝟎 as complement
of >. You can also write your null hypothesis as 𝑯𝒐:𝝁=₱𝟏𝟓,𝟎𝟎𝟎.
𝑯𝒐:𝝁≤₱𝟏𝟓,𝟎𝟎𝟎or 𝑯𝒐:𝝁=₱𝟏𝟓,𝟎𝟎𝟎
𝑯𝒂:𝝁>₱𝟏𝟓,𝟎𝟎𝟎 (claim)

5
One-Tailed and Two-Tailed Test

The alternative hypothesis can take another form depending on the value of the
parameter. The parameter may increase, decrease, or changed from the null value.
An alternative hypothesis predicts not only the difference of sample mean from the
population mean but also how it would be different in a specific direction - lower or
higher. This test is called a directional or one-tailed test because the rejection region
is entirely within one tail of the distribution.
On the other hand, some hypotheses predict only that one value will be different
from another, without additionally predicting which will be higher. The test of such a
hypothesis is nondirectional or two-tailed because an extreme test statistic in either
tail of the distribution (positive or negative) will lead to the rejection of the null
hypothesis of no difference.

One-Tailed Two-Tailed

▪ Alternative hypothesis contains ▪ Alternative contains the


the greater than (>) or less than inequality (≠) symbol.
(<) symbols
▪ It has no direction.
▪ It is directional (either right-tailed
or left-tailed)

The table below shows the null and alternative hypotheses stated together with the
directional test.
Two-Tailed Test Right-Tailed Test Left-tailed Test
Null 𝐻0 : 𝜇 = 𝜇0 or 𝐻0 : 𝜇 = 𝜇0 or
Hypothesis 𝐻0 : 𝜇 = 𝜇0 𝐻0 : 𝜇 ≤ 𝜇0 𝐻0 : 𝜇 ≥ 𝜇0

Alternative
Hypothesis 𝐻𝑎 : 𝜇 ≠ 𝜇0 𝐻𝑎 : 𝜇 > 𝜇0 𝐻𝑎 : 𝜇 < 𝜇0

Examples:

Determine the hypotheses and the hypothesis test.

1. Teacher A wants to know if mathematical games affect the performance of the


students in learning Mathematics. A class of 45 students was used in the study.
The mean score was 90 and the standard deviation was 3. A previous study
revealed that 𝝁=𝟖𝟓 and the standard deviation 𝝈=𝟓.

The parameter is the population mean =85. You can write the hypotheses into
symbols: 𝐻0 ∶𝜇=85 and 𝐻𝑎∶𝜇≠85. The phrase ‘affects performance’ has no clue of the

6
direction of the study, so it implies either increase or decrease in performance. This
tells you that the test is two-tailed test.
𝑯𝟎 ∶𝝁=𝟖𝟓 and 𝑯𝒂∶𝝁≠𝟖𝟓 (two-tailed test)

2. A piggery owner believes that using organic feeds on his pigs will yield
greater income. His average income from the previous year was ₱120, 000. State
the hypothesis and identify the directional test.

In this example, the null hypothesis is 𝑯𝟎 ∶𝝁=𝟏𝟐𝟎,𝟎𝟎𝟎 . You may notice that the
hypothesis used the phrase ‘greater income’ that is associated with greater than.
Therefore, 𝑯𝒂∶𝝁>𝟏𝟐𝟎,𝟎𝟎𝟎. This hypothesis uses inequality symbol (>) so it is one-
tailed test and it uses greater than which specifically called for the right-tailed test.
𝑯𝟎 ∶𝝁=𝟏𝟐𝟎,𝟎𝟎𝟎 and 𝑯𝒂∶𝝁>𝟏𝟐𝟎,𝟎𝟎𝟎 (right-tailed test)

3. The average waiting time of all costumers in a restaurant before being served
is less than 20 minutes. Determine the hypotheses and the directional test.
You may notice that the hypothesis used the phrase ‘less than’ which denotes
that the alternative hypothesis is 𝑯𝒂∶𝝁<𝟐𝟎. This hypothesis uses inequality symbol (<)
so it is one-tailed test and it used less than which specifically called for the left-tailed
test. In this example, the null hypothesis is 𝑯𝟎 ∶𝝁≥𝟐𝟎.
𝑯𝟎 ∶𝝁≥𝟐𝟎 and 𝑯𝒂∶𝝁<𝟐𝟎 (left-tailed test)

WHAT’S MORE
ACTIVITY 2.2.

ACTIVITY 2.3 Let’s Hypothesized!

Directions: Write the null hypothesis and alternative hypothesis in notations for each
given situation.

1. Mrs. Dela Cruz claims that her students scored an average of 91 in their
Mathematics quiz. The master teacher wants to know whether the teacher’s claim is
acceptable or not.

𝐻𝑜: _________________________________________________
𝐻𝑎: _________________________________________________
2. A car manufacturer claims that the mean selling price of all cars manufactured is
only ₱150,000. A consumer agency wants to test whether the mean selling price of all
the cars manufactured exceeds ₱150, 000.

𝐻𝑜: _________________________________________________
𝐻𝑎: _________________________________________________
3. A manufacturer of soft drinks claims that all labeled 1.5-liter bottles contain an
average of 1.49 liters of soft drinks. A retailer wishes to test whether the mean amount
of soft drinks in labeled 1.5-liter bottle is less than 1.49 liters.

𝐻𝑜: _________________________________________________
𝐻𝑎: _________________________________________________

7
4. A bus company in Manila claims that the mean waiting time for a bus during rush
hour is less than 12 minutes. A random sample of 30 waiting times has a mean of 15
minutes with a standard deviation of 4.8 minutes.

𝐻𝑜: _________________________________________________
𝐻𝑎: _________________________________________________
5. The average power consumption of air conditioner is at most 2,700 watts as claimed
by the owner. A survey made by an electric power company found out that the mean
consumption is 3,000 with standard deviation of 225.

𝐻𝑜: _________________________________________________
𝐻𝑎: _________________________________________________

LESSON 2.2
Identifying Appropriate Test-Stastic

What is in

Activity 1: Is It Zee or Tee?

Directions: Write the letter “z” if the statement is a characteristic of standard


normal distribution and “t” if the given characteristic describes t-distribution

1. It is best applied if you have a limited sample size (n < 30) as


long as the variables are approximately normally distributed.
2. It is also applicable if you do not know the populations’
standard deviation.
3. This is the best to use in a statistical test if the population
standard deviation is known.
4. It is always used for normal distribution.
5. This test is often applied in large samples (n > 30).

What is it
DISCUSSION

Before we move forward to the different test statistics, it is important to define


the following terms:
• A population includes all of the elements from a set of data.
• A sample consists of one or more observations drawn from the population.
• Sample mean (𝒙̅) is the mean of sample values collected.

8
• Population mean (𝝁) is the mean of all the values in the population. If the
sample is randomly selected and sample size is large, then the sample mean
would be a good estimate of the population mean.
• Population standard deviation (𝝈) is a parameter which is a measure of
variability with fixed value calculated from every individual in the population.
• Sample standard deviation (𝒔) is a statistic which means that this measure
of variability is calculated from only some of the individuals in a population.
• Population variance (𝝈𝟐 ), in the same sense, indicates how the population
data points are spread out. It is the average of the distances from each data
point in the population to the mean, squared.

Since we already defined important things in identifying the test statistics in hypothesis
testing, let us now determine those concepts when given a problem. Let’s use the
example in Activity 2.

Example:

A Grade 11 researcher reported that the average allowance of Senior High


School students was ₱100. A sample of 40 students has mean allowance of ₱120.
At 𝛼=0.01 test, it was the claimed that the students had allowance of more than ₱
100.The standard deviation of the population is ₱50.

μ = ₱100 the average allowance of the population (Senior High School students)
𝐧=𝟒𝟎 the number of students taken from all Senior High School students
𝒙̅̅ = ₱120 the mean allowance of the sample
𝛔 = ₱50 the standard deviation of the population

Now you already know how to get the data needed in choosing test statistics.
This time, you will determine what test statistic is appropriate in computing test value
in the hypothesis testing.
A test statistic is a random variable that is calculated from sample data and
used in a hypothesis test. You can use test statistics to determine whether to reject or
accept the null hypothesis. The test statistic compares your data with what is expected
under the null hypothesis.
To identify the test statistic, you must consider whether the population standard
deviation/variance is known or unknown. If the population standard deviation σ is
known, then the mean has a normal distribution. Use z-test. If the population standard
deviation σ is unknown, then the mean has a t- distribution. Use t-test. Instead of the
population standard deviation, use the sample standard deviation.
z-test
In a z-test, the sample is assumed to be normally distributed. A z-score is
calculated with population parameters such as “population mean” and “population
standard deviation”. It is used to validate a hypothesis that the sample drawn belongs
to the same population. When the variance is known and either the distribution is
normal or sample size is large, use a z-test statistic.
t-test

9
Like a z-test, a t-test also assumes a normal distribution of the sample. A t-test
is used when the population variance or standard deviation are not known. When the
variance is unknown and a sample size is less than 30, use a t-test statistic assuming
that the population is normal or approximately normal.
Central Limit Theorem
In Central Limit Theorem, if the population is normally distributed or the
sample size is large and the true population mean 𝜇 = 𝜇0 , then z has a standard
normal distribution.
When population standard deviation σ is not known, we may still use z-score
by replacing the population standard deviation σ by its estimate, sample standard
deviation s. Since the sample is large the resulting test statistic still has a distribution
that is approximately standard normal.
Historically, this was very useful, as most statisticians before did not have
access to the t-table of quantities for very large number of degrees of freedom. But
with modern computers today, using t-test with a very large sample size is not a
problem at all.
However, since you will be using a t-table with only limited number of degrees
of freedom, you will use z-test when the sample size is large even though the
population standard deviation is unknown.
When sample sizes are small, the Central Limit Theorem does not apply. You
must then impose stricter assumptions on the population to give statistical validity to
the test procedure. One common assumption is that the population from which the
sample is taken has a normal probability distribution to begin with. Under such
circumstances, if the population standard deviation is known, then the test statistic
𝑥−𝜇0
𝜎 still has the standard normal distribution.
√𝑛

WHAT’S MORE

Activity 3. Which is Which?


Directions: Identify the appropriate test statistic to be used in each problem. Write z-
test or t-test on a separate sheet of paper.
___________1. A sample of n=25 is selected from a normal population, 𝑥̅=56 and
s=12.
___________2. Based on the report of the school nurse, the average height of Grade
11 students has increased. Five years ago, the average height of Grade 11 students
was 170cm with standard deviation of 38cm. She took a random sample of 150
students and derived the average height of 165cm.
___________3. Knowing from a previous study that the average of athletes is 80, an
athletic adviser asked how his soccer players are academically doing as compared to
other student athletes. After an initiative to help improve the average of student
athletes, the adviser randomly selected 15 soccer players and found 85 as the average
with standard deviation of 1.25.
___________4. The CEO of a battery manufacturing company claimed that their
batteries would last an average of 280 hours under normal use. A researcher randomly
selected 20 batteries from the production line and tested them. The tested batteries
had a mean life span of 250 hours with a standard deviation of 40 hours. Do we have
enough evidence to suggest that the claim of an average of 280 hours is false?

10
___________5. It was known that the number of tickets purchased by students at the
ticket window for the volleyball match of two popular universities followed a distribution
that has mean of 500 and standard deviation of 8.9. Suppose that a few hours before
the start of one of these matches, there are 100 eager students standing in line to
purchase tickets. If there are 250 tickets remaining, what is the probability that all 100
students will be able to purchase the tickets they want?

POST - ASSESSMENT

Instructions: Choose the letter of the best answer. Write the chosen letter on a
separate sheet of paper.

1. If the variance is known, what test statistic is appropriate?


A. t-test C. two-tailed test
B. z-test D. one-tailed test
2. One-sample t-statistic is used instead of one-sample z-statistic when
___________________.
A. μ is known. C. μ is unknown
B. σ is known. D. σ is unknown
3. Based on the Central Limit Theorem, when the sample (n) is extremely large and
the variance is unknown, what is the statistical test to be used?
A. t-test C. two-tailed test
B. z-test D. one- tailed test
4. Which of the following is NOT a consideration in using z-test/statistic?
A. Variance is known.
B. Sample standard deviation is known.
C. The population mean is less than 30.
D. Population standard deviation is known.
5. What appropriate tool is applicable if the population is normal, sample standard
deviation is known, and sample is less than 30?
A. t-test C. normal test
B. z-test D. Central Limit Theorem
6. Which is the correct alternative hypothesis for one-tailed test?
A. 𝜇 = 25 B. 𝜇 ≠ 25 C. 𝜇 ≥ 25 D. 𝜇 < 25

7. What kind of hypothesis is illustrated below?


The mean score of all Grade 11 students is higher than 75.
A. one-tailed test C. null hypothesis
B. two-tailed test D. alternative hypothesis
8. What kind of hypothesis is illustrated below?
The mean score of all Grade 11 students is higher than 75.
A. one-tailed test C. null hypothesis
B. two-tailed test D. alternative hypothesis

11
9. When the value of parameter has significant difference with the hypothesized value,
then it is called ________________.
A. one-tailed test C. null hypothesis
B. two-tailed test D. alternative hypothesis
10. Which directional test is illustrated in the given problem below?
In a recent survey, the average amount of money students have in their wallet is
₱200.00 with standard deviation of 45. A teacher feels that the average amount is
lower. She surveyed 80 randomly selected students and found that the average
amount is ₱35.
A. left-tailed C. alternative
B. two-tailed D. right tailed
11. Which of the following symbols is NOT needed when t-test is used in computing
values?
A. 𝑛 B. μ C. 𝜎 D. 𝑠
12. If in a sample n=16 selected from a normal population, 𝑥̅=56 and 𝑠=12, what
statistical test is applicable to be used?
A. f-test C. z-test
B. t-test D. Central Limit Theorem
13. Based on Central Limit Theorem, the z-test for single sample may be used when
all the following conditions are TRUE except _________________.
A. Sample size is less than 30.
B. Data are normally distributed.
C. Population standard deviation is known.
D. Population standard deviation is unknown.
14. What is the sample standard deviation if a simple random sample of 220 students
is drawn from a population of 2,740 college students? Among the sampled students,
the average IQ score is 115 with standard deviation of 10.
A. 10 B. 115 C. 220 D. 2,740
15. The supervisor of a certain company claimed that the mean workday of his workers
is 8.3 hours per day. A sample of 20 workers was taken and it was found out that the
mean workday is 8 hours with standard deviation of 1 hour. At 0.01 level of
significance, is the mean workday less than 8.3 hours?
What test statistic is to be used in the given problem?
A. z-test C. right-tailed test
B. t-test D. left-tailed test

12

You might also like