You are on page 1of 392

Statistics

and
Probability
Statistics and
Probability
Illustrating a Random Variable
(Discrete and Continuous)
Statistics and Probability
Illustrating a Random Variable (Discrete and Continuous)
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module

Writers: Roland V. Magsino


Editors: Celestina M. Alba and Nestor Sandoval
Reviewers: Irlo L. Dumo, Nenita N. De Leon and Ma. Theresa B. Yanto
Illustrators: Dhojie N. Yanto
Layout Artist: Edna E. Eclavea
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph
Introductory Message
For the facilitator:

Welcome to the Grade 11 Statistics and Probability Alternative Delivery Mode (ADM) Module.

This module was collaboratively designed, developed and reviewed by educators both from public and private institutions to assist
you, the teacher or facilitator in helping the learners meet the standards set by the K to 12 Curriculum while overcoming their
personal, social, and economic constraints in schooling.

This learning resource hopes to engage the learners into guided and independent learning activities at their own pace and time.
Furthermore, this also aims to help learners acquire the needed 21 st century skills while taking into consideration their needs and
circumstances.

As a facilitator you are expected to orient the learners on how to use this module. You also need to keep track of the learners' progress
while allowing them to manage their own learning. Furthermore, you are expected to encourage and assist the learners as they do
the tasks included in the module.

For the learner:

Welcome to the Grade 11 Statistics and Probability Alternative Delivery Mode (ADM) Module.

The hands are one of the greatest assets of the human body. No other beings in the world has hands that can grasp, hold, move , and
manipulate objects like human hands. Through our hands, we may learn, create and accomplish. Hence, the hand in this learning
resource signifies that you as a learner is capable and empowered to successfully achieve the relevant and essential competencies
and skills at your own pace and time. Your academic success lies in your own hands!

This module was designed to provide you with fun and meaningful opportunities for guided and independent learning at your own
pace and time. You will be enabled to process the contents of the learning resource while being an active learner.

This module has the following parts and corresponding icons:

What I Need to Know


This will give you an idea of the skills or competencies you are expected to learn in
the module.

What’s In
This is a brief drill or review to help you link the current lesson with the previous
one.
What’s New
In this portion, the new lesson will be introduced to you in various ways such as a
story, a song, a poem, a problem opener, an activity or a situation.
What is It
This section provides a brief discussion of the lesson. This aims to help you discover
and understand new concepts and skills.
What’s More
This comprises activities for independent practice to solidify your understanding and
skills of the topic. You may check the answers to the exercises using the Answer Key
at the end of the module.
What I Have Learned
This includes questions or blank sentence/ paragraph to be filled in to process what
you learned from the lesson.
What I Can Do
This section provides an activity which will help you transfer your new knowledge or
skill into real life situations or concerns.
Assessment
This is a task which aims to evaluate your level of mastery in achieving the learning
competency.
Additional Activities
In this portion, another activity will be given to you to enrich your knowledge or skill
of the lesson learned. This also tends retention of learned concepts.
Answer Key
This contains answers to all activities in the module.

At the end of this module you will also find:

References This is a list of all sources used in developing this module.

1
Week

1
What I Need to Know
You have studied probability and frequency distributions in statistics in the
previous level. This module was designed and written collaboratively to help you in
illustrating random variables (discrete and continuous) which are essential in
solving real life problems.
The scope of this module permits it to be used in many different learning
situations. The language used recognizes the diverse vocabulary level of students.
The lesson is arranged to follow the standard sequence of the course.
The module consists of only one lesson entitled illustrating random variables
(discrete and continuous).
After going through this module, you are expected to:
1. define random variable; and
2. illustrate random variables (discrete and continues).

What I Know
Before studying this module, take this test to determine what you already know
about the topic covered.
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. Which of the following is NOT a discrete variable?
a. Number of white marbles in the box
b. Number of students present in the classroom
c. The weight of a box of soft drinks labeled “8 ounces.”
d. The number of arrivals customers in the restaurants between 7:00 a. m to
5:00 p.m.
2. Which of the following is an example of discrete variable?
a. Distance travelled between tricycles
b. Height of the students in a certain class
c. Number of red marbles in the basket
d. Weight of student
3. A variable that can be discrete or continuous is called
a. Random sample
b. Random notation
c. Random variable
d. Random elimination
4. Which of the following is a variable whose value is obtained by measuring?
a. Continuous
B. Discrete
C. Interval
D. Nominal
5. A variable where the information or data can take infinitely many values is
a. Quantitative variable
b. Discrete variable
c. Qualitative variable
d. Continuous variable

2
6. This term can best describe a variable that can be counted
a. Continuous
b. Discrete
c. Interval
d. Ratio
7. A set of numerical values assigned to a sample space is called
a. Random experiment
b. Random sample
c. Random variable
d. None of the above
8. A variable whose value could be a finite and countable number is a
a. Continuous variable
b. Discrete variable
c. Qualitative variable
d. Poison variable
9. Which of the following statement describe a continuous random variable?
a. The average distance traveled by a jeep in a week
b. The number of students present in a Class Anthurium
c. The number of motorcycle owned by a randomly selected household
d. The number of girls taller than 5 feet in a random sample of 10 girls
10. Which of the following is discrete random variable?
a. Jerwin is 165 cm tall
b. Jerwin has three sisters
c. Jerwin weighs 68 kilograms
d. Jerwin ran 500 meters in two minutes
11. Which of the following variable is discrete random variable?
A. The amount of unleaded gasoline in a Suzuki car
B. The temperature of a cup of coffee served at a restaurant.
C. The number of boys in a randomly selected three-child family
D. The average amount spent on water bill every month of December by a
randomly selected household in Quezon Province.
12. Which of the following is NOT a discrete random variable?
A. Number of refrigerator sell each day
B. Height of dragon fruit as measured each day
C. Number of students late in going to school each day
D. Number of people went to the doctor from Monday to Friday
13. You decided to conduct a survey of families with five children. You are
interested in counting the number of girls (out of five children) in each family. Is
this a random variable?
A. Maybe
B. Cannot be determined
C. Yes, it is a random variable
D. No, it is not a random variable
14. Which of the following statement DOES NOT describe a continuous random
variable?
A. Height of students in a certain class
B. The average weight of chicken each day
C. The number of towns belong to Quezon Province
D. The distance travelled by a delivery van in an hour

3
15. Which of the following is NOT a continuous random variable?
A. The height of the airplane’s flight
B. The amount of liquid on a container
C. The number of COVID 19 cases each day
D. The length of time for the check up in the hospital

Lesson 1 Illustrating a Random Variable


(Discrete and Continuous)
This module will assist you with understanding the way toward illustrating
random variables (discrete and continuous). Let’s proceed and appreciate learning.
What’s In
In the study of basic probability, you have discovered that an experiment is
any movement that should be possible more than once under comparative condition.
The arrangement of every possible outcomes of an experiment is what we called a
sample space. You have additionally figured out how to mathematically list down the
conceivable outcome of a given experiment. In tossing a coin, for example, the
potential results are turning up a head or a tail.
For you to begin let us all understand that probability distributions can be
illustrated or classified as discrete probability distributions or as continuous
probability distributions, depending on whether they define probabilities associated
with discrete variables and continuous variables.
A variable X whose value depends on the outcome of a random process is
called a random variable. A random variable is a variable whose value is a numerical
outcome of a random phenomenon.
A random variable is denoted with a capital letter. The probability distribution
of a random variable X tells what the possible values of X are and how probabilities
are assigned to those values.
A random variable can be discrete or continuous
What’s New
Tossing a coin

As you can see in one- peso coin, it has Dr. Jose P. Rizal on one side, which
we will call it as Head (H), and the other side is the Tail (T). Toss your one-peso coin
three times and record in your notebook the results of the three tosses. In order to
write the result easily, use letter H for the heads and letter T for the tails.
If the results of your three tosses are heads, tails, heads, then you will write
HTH on your notebook.
Example 1: How many heads when we toss 3 coins?
Continue tossing your coin and record the time. If possible, use mobile phone
timer and record up to the last minutes.

4
Let say in a minute, how many times the heads and tails appeared. Then,
record all the possible answers on your notebook.
Write all eight possible outcomes. You can do this systematically so that you
do not get confused later on.
In this instance, there might be 0 heads, 1 Head, 2 Heads or 3 Heads.
Thus, the sample space is equal to 0, 1, 2, 3
Then this time the results or outcomes are NOT entirely equally likely.
The three coins land in eight possible ways:
X = Number of Head

Looking at the table we see just


1 case of Three Head, but 3 cases
of Two Heads, 3 cases of One
Head, and 1 case of Zero Heads.
So:

 P(X=3) = 1/8
 P(X=2) = 3/8
 P(X=1) = 3/8
 P(X=0) = 1/8

That particular example is a discrete variable. A discrete variable is a variable,


which can only view a countable amount of values. Thus, a discrete random variable
X has possible values 𝑥1 , 𝑥2 , 𝑥3 .....

In Graphical Form:

We can use the probability distribution


to answer questions about variable x. In
symbols, we want to find P(X ≥1). We could
add probabilities to find the answer:

P(X≥1) = P(X=1) + P(X=2) +P(X=3)

1 3 3 7
= + + =
8 8 8 8

P(X≥1) =1 – P(X < 1) = 1 - P(X = 0)

= 1 – 1/8 = 7/8

Example 2;

For a fair coin tosses twice, the probability of each of the possible values for Number
of Heads can be tabulated as shown:

5
Let x is equal to the number of heads observed. x is what we called random variable.

Number of
0 1 2
Heads

2/4
Probability 1/4 1/4
or 1/2

 P( X=2) = 1/4
 P( X=1) = 2/4
 P( X=0) = 1/4
This is again an example of a discrete variable. Thus, a discrete random
variable X has possible values x1, x2 , x3, .....

In Graphical Form:

We can use the probability distribution to


answer questions about variable x. In symbols, we
want to find P(X ≥1). We could add probabilities to
find the answer:

P(X≥1) = P(X=1) + P(X=2)

1 2 3
= + =
4 4 4

P(X≥1) =1 – P(X < 1)

= 1 – 1/4 = 3/4

While to understand the concept of


continuous variable, below are the examples
 height of students in class
 weight of 10 statistics books
 Time it takes to get to school
 distance travelled between classes
A continuous variable is a value that is being acquired by measuring.

6
What is It
To make you understand better
the previous activities, another
illustration is shown below.

1. How many outcomes are there


in tossing 2 coins? 3 coins? 4
coins?

A random variable is a numerical


quantity that is assigned to the
outcome of an experiment. We
use capital letters to represent a
random variable.
Continuous Data can acquire some value within a range (like for example a
person's height)

What’s More
This comprises activities for independent practice to solidify your
understanding and skills of the topic.

A Random Variable is a capacity that connects a real number with every


component in the sample space. It is a variable whose qualities are controlled by
chance. In this manner, a Random Variable is a numerical amount that is derived
from the results of an arbitrary trial or experiment. The word “random” is used often
in everyday life.
Types of Random Variables:
At that point, recognize the two types of arbitrary factors. These are the
discrete and continuous random variables.
Discrete Random Variables are variables can take on a finite number of
distinct values. Examples are number of heads acquired while flipping a coin three
times, the number of kin an individual has, the number of students present in a
study hall at a given time, and so forth.
You can change the experiment to just flipping a coin twice to make things
simpler. Here, the outcomes will be only four: HH, HT, TH, and TT. In addition, the
possible values of X are 0, 1, and 2.
Continuous Random Variables, then again, are random variables that take
an interminably uncountable number of potential values, regularly measurable
amounts. Examples are the height or weight of an individual, the time an individual
takes for an individual to wash, time, temperature, item thickness, length, age, etc.
Now, let us try to look to some examples of random variables from the table below.

7
Table 1. Examples of Random Variables

Types of
Number X or the Random
Experiment Random
Variable X
Variable

1. Record the number of hours an


The number of hours an specific
specific student use their mobile
student use their mobile from Discrete
from 8:00 am to 5:00 pm for the
8:00 am to 5:00 pm
past three nights
2. Buying two trays of egg in the
The weight of eggs in kilograms Continuous
market
3. Recording of the gender of family
The number of boys among the
members in a family with three
children Discrete
children
4. Students will prepare for a quiz in
How much time spends reviewing
Mathematics Continuous
for this quiz
Numbers appeared in a pair of
5. Rolling a pair of dice Discrete
dice
What I Have Learned
A. Complete the following statements by writing the correct word.
1. A variable whose value is obtained by counting data is called__________
2. A variable whose value is obtained by measuring is called_____________
3. A variable that can be discrete or continuous is ______________________
4. Time it takes to get to school is an example of ________________________
5. Number of heads in flipping coins is an example of ___________________
B. Complete the table below.

Experiment Number X or the Types of Random


Random Variable X Variable
1. Number of rings before the phone is
answered
2. Teacher ask the students to finish the
test after an hour
3. Number of complaints per day
4. Height of the tallest building in Lucena
City

5. Number of Mobile phones in a


household

8
What I Can Do
Answer the following.
Classify whether the given experiment implies a discrete random variable or a
continuous random variable. Write D if discrete and C if continuous.
_____ 1. The temperature of a solution in the laboratory
_____ 2. Collecting data about the heights of students in a public school
_____ 3. Recording the distance travelled by the bus
_____ 4. Surveying about the number of cases due to Covid - 19 pandemic
in Quezon Province
_____ 5. Number of promoted students at the end of school year

Assessment
Multiple Choice. Choose the letter of the best answer. Write your chosen letter on a
separate sheet of paper.
1. A variable where the information or data can take infinitely many values is
A. Continuous variable
B. Discrete variable
C. Quantitative
D. Qualitative variable
2. Which of the following statement describe a continuous random variable?
A. The number of students present in a Class Temperance
B. The average distance travelled by a tricycle in a month
C. The number of motorcycle owned by a randomly selected household
D. The number of girls taller than 5 feet in a random sample of 6 girls
3. A variable that can be discrete or continuous is called
A. Random sample
B. Random variable
C. Random notation
D. Random elimination
4. Which of the following is a variable whose value is obtained by measuring?
A. Continuous
B. Discrete
C. Interval
D. Normal
5. Which of the following is NOT a discrete variable?
A. Number of book per student
B. Number of green marbles in the box
C. The number of arrivals of customers in the clinic between 8:00 a. m to 4:00
p.m.
D. The weight of a box of soft drinks labeled 12 ounces.
6. Which of the following is an example of discrete variable?
A. Distance travelled between cars
B. Height of the students in a section Prudence
C. Number of blue marbles in the box
D. Weight of potatoes in the basket

9
7. A set of numerical values assigned to a sample space is called
A. Random experiment
B. Random sample
C. Random variable
D. None of the above
8. A variable whose value could be a finite and countable number is a
A. Continuous variable
B. Discrete variable
C. Qualitative variable
D. Quantitative variable
9. This term can best describe a variable that can be counted
A. Continuous
B. Discrete
C. Interval
D. Ratio
10. Which of the following is NOT a discrete random variable?
A. Height of eggplant as measured each day
B. Number of refrigerator sell each day
C. Number of late comers in going to school each day
D. Number of people went to the Rizal Park from Monday to Friday
11. Which of the following is discrete random variable?
A. Jose has four sisters
B. Jose is 163 cm tall
C. Jose weighs 68 kilograms
D. Jose ran 300 meters in one and a half minutes
12. Which of the following is NOT a continuous random variable?
A. The height of the airplane’s flight
B. The amount of liquid on a container
C. The length of time for the check up in the hospital
D. The number of clients of a certain Insurance Company each day
13. Which of the following variable is discrete random variable?
A. The amount of unleaded gasoline in a Suzuki car
B. The temperature of a cup of coffee served at a coffee shop.
C. The number of boys in a randomly selected two-child family
D. The average amount spent on electric bill every month of May by a randomly
selected household in Quezon Province.
14. You decided to conduct a survey of families with three children. You are interested
in counting the number of girl in each family. Is this a random variable?
A. Maybe
B. Cannot be determined
C. Yes, it is a random variable
D. No, it is not a random variable
15. Which of the following statement DOES NOT describe a continuous random
variable?
A. Height of students in a certain class
B. The average weight of chicken each day
C. The number of streets at barangay Tahimik
D. The distance travelled by a delivery van in an hour

10
Additional Activities
Hondagua National High School-Senior High School would like to conduct election
for the Accountancy Business and Management (ABM) officers. Complete the table
for the possible outcomes from a sample of four voters and identify also the value of
random variable of the number of “yes” votes.

Value of Random
Event Voter # 1 Voter #2 Voter #3 Voter #4 Variables
(Number of Yes votes)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16

11
Statistics and
Probability
Distinguishing Between a
Discrete and a Continuous
Random Variable
Statistics and Probability
Distinguishing Between a Discrete and a Continuous Random Variable
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module

Writers: Reynaldo L. Nanson


Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Nenita N. De Leon and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

12
Week

1
What I Need to Know
This module was designed and written with you in mind. It is here to help you
distinguish between a discrete and a continuous random variable. The scope of this
module permits it to be used in many different learning situations. The language
used recognizes the diverse vocabulary level of students. The lesson is arranged to
follow the standard sequence of the course. But the order in which you read them
can be changed to correspond with the textbook you are now using.

The module has one lesson:

Lesson 1 – Distinguishing between a discrete and a continuous random variable.


After going through this module, you are expected to Distinguish random variables
as discrete or continuous.

What I Know
Before studying this module, take this pre-test to find out how much you already
know about the topic covered. Write the chosen letter on a separate sheet of paper.

1. How is the variable, as described in Algebra, similar to a random variable?


A. Both variables can take on assigned numerical values
B. Both variables correspond to assigned specific probability
C. Both variables take on assigned intervals instead of numbers
D. Both variables can only take on specific numerical values
assigned to them
2. A variable that can assume any value between two given points is called
A. Discrete random variable
B. Irregular random variable
C. Uncertain random variable
D. Continuous random variable
3. Discrete random variable is a numerical quantity derived from the outcome
of an experiment that can be
A. Counted
B. Manipulated
C. Measured
D. Traced
4. Continuous random variable is a numerical quantity derived from the
outcome of an experiment that can be
A. Counted
B. Manipulated
C. Measured
D. Traced
5. You decide to collect a bunch of bottles of soft drink and measure the volume
of soft drink in each bottle. Let X be the number of ml of soft drink in each
bottle. What type of variable is X?
A. X is a constant
B. X is a Place holder
C. X is a discrete random variable
D. X is a continuous random variable

13
6. Which of the following statement describe a discrete random variable?
A. The length of span of a 10 months baby
B. The average increase in height of a baby each year
C. The average increase in weight of a baby each year
D. The number of avocado produced by an avocado tree each year
7. Which of the following is discrete random variable?
A. Cardo’s height
B. Cardo’s weight
C. Cardo’s temperature
D. Cardo’s number of owned cars
8. Which of the following is continuous random variable?
A. The cat has 2 eyes
B. The cat has 2 kittens
C. The cat has 4 paws
D. The cat weights 5.4 kg
9. Which of the following statement describe a continuous random variable?
A. The e average life span of a dog
B. The number of face mask in a pack
C. The number of shoppers in a certain mall
D. The number of “covid-19” patients in a certain hospital
10. Which of these is NOT a discrete random variable?
A. The number of absent in a class
B. The number of death per year attributed to lung cancer
C. The average amount of electricity consumed per household per
month
D. The number of people who drive through a red light each day
during rush hour
11. Which of the following statement DOES NOT describe a discrete
random variable?
A. The number of siblings in your family
B. The height of your brother in centimeters
C. The number of affected individuals by “Covid- 19” pandemic
D. The number of qualified family in “Social Amelioration Program”
12. Which of the following is NOT a continuous random variable?
A. Amount of water in a pail
B. A person’ s height each year
C. A person’s weight on each birthday
D. Number of cars manufactured in a factory each day
13. Which of the following statement DOES NOT describe a continuous
random variable?
A. The amount of sugar in a cup of coffee
B. The amount of water in a cup of coffee
C. The average earnings per day in a cafeteria
D. The number of cups of coffee sold in a cafeteria during lunch
14. The distance that a cyclist rides each day is what sort of variable?
A. Continuous
B. Discrete
C. Ordinal
D. Qualitative
15. x is the time it takes a chef to cook a specific dish, is a continuous random
variable. Why do think the variable x in the statement becomes
continuous?
A. The variable x is continuous because it takes a range of values
B. The variable x is continuous because it takes an integral values.

14
C. The variable x is continuous because it takes a countable value
D. The variable x is continuous because it takes a specific numerical
values
Lesson 1 Distinguishing Between a Discrete
and a Continuous Random Variable
Every day we often read, heard or even used the word random. You may hear
someone say “We randomly decided to go out for shopping yesterday.” But is this
really a random event? No, this is a decision that was made on the basis of other
variables such as desire and the lack of satisfaction with other options such as TV
viewing.

The word random has a different meaning in the field of statistics. Something is
random when it differs by chance. For example, when a coin is tossed twice, the
possible outcomes that can occur are {HH, HT, TT, TH} where H represent head and
T represent tail, the observed outcomes on any one toss is random.

This module will help you understand the process of distinguishing between a
discrete and a continuous random variable.

What’s In
Remember that a variable is a quantity that may change within the context of a
mathematical problem or experiment. Typically, we use a single letter to represent a
variable. The letters x, y, and z are common generic symbols used for variables. In
this lesson, we shall discuss variables that are associated with probabilities, called
random variables.
To find out if you are ready to learn this new lesson, do the following.
List the sample space of the following experiments.
Experiment Sample Space
1. Flipping a coin
2. Rolling a die
3. Drawing a card at random from a deck of 6
identical card suits labeled 1 to 6
4. Spinning the wheel shown,
assuming that the
arrow will never fall on the lines
separating the 8 sectors

5. Rolling a die and tossing a coin simultaneously

What’s New
To understand how one distinguishes between a discrete and a continuous random
variable. Do the activity below.
Complete the following table. The first one is done for you.

15
Experiment Number X Possible Value of X
Two cards are drawn from a Sum of the numbers on 4, 5, 6, 7, 8, 9, 10,
deck. the cards 11, 12, 13, 14, 15,
16, 17, 18, 19, 20

(since, the least


number in the card is
2 and the highest is
10. therefore, the
smallest possible
sum of the numbers
on the cards when
two cards are drawn
is 4, while the
highest possible sum
of the numbers on
the cards is 20)
Roll a pair of dice Sum of the number of
dots on the top faces
Toss a fair coin repeatedly Number of tosses until
the coin lands head
Height of individuals Height of each member
of the family
(You must only use a meter
stick or ruler and avoid using a
carpenter’s rule because you
might accidentally get lose of it
and will end up wounding your
fingers or hand.)
This time, you need to reflect on the following questions because it will help you to
understand the appropriate way of classifying a random variable. Answer the
following questions:
1. How do you find the activity?
2. Were you able to complete the table? If yes, how? If no, why?
3. If a random variable takes on values on continuous scale, it is continuous
random variable. If a random variable takes on outcomes that are countable
then it is discrete random variable. Now, will you be able to classify those
experiments in activity 1 as discrete or continuous? If yes, how? If no, why?
4. In your own words, how will you differentiate between discrete and continuous
random variables?
What is It
In Algebra, “variable” means one thing - an unknown value, you’ll come across
different types of variables in statistics. Some of these are: categorical variable
(variable that can be put into categories like tooth paste brands); ordinal variable
(similar to categorical variable, but there is a clear order like socioeconomic status );
quantitative variable (a broad category that includes any variable that can be counted

16
or has a numerical values associated with it); qualitative variable (a broad category
that for any variable that can’t be counted) and there’s a lot more variable that you
may encounter in the field of statistics, but our focused in this module, is on random
variable.

A Random Variable is a function that associates a real number with each element
in the sample space. It is a variable whose values are determined by chance. Thus,
in simple words, a Random Variable is a numerical quantity that is derived from the
outcomes of a random experiment.

In the experiment of tossing a coin, the number of times the coin turns up a head is
an example of random variables. Below are some examples of random variables.

a. Suppose two dice are rolled. The sum of the two numbers that face up is
an example of a random variable. This variable
make take on integers
from 2 to 12.
b. Suppose the spinner shown below is spun.
An example of a random variable for this experiment
is the number of times that the spinner stops at
number 3.
There are two types of random variables, the
discrete and continuous random variables

Types of Random Variables

Discrete Continuous
A discrete random variable A continuous random
is a random variable whose variable is a random
values represented by variable that takes values
count data. This happens on a continuous scale. This
when the set of possible hap pens when the set of
outcomes from an possible outcomes from an
experiment can be counted experiment cannot be
such as can be counted counted but can be
such as the number of measured such as height
household in a barangay, of Grade 11 students.
thus making the values of Continuous random
the random variable variables represent
countable of the random measured data like height,
variable countable just like weight and temperature.
whole numbers 0, 1, 2, 3.

17
Figure 1. Diagram of the types of Random Variable
Now, let us try to take a look to some examples of random variables from the table
below.
Table 1. Examples of Random Variables
Experiment Number X or the Random Possible Values of
Variable X Random Variable X
1. Flipping a pair of Number of heads in 4 flips of a
0, 1, 2, 3, 4
coin coin
2. Flipping a pair of Number of tosses until the coin
1, 2, 3, 4, …
coin repeatedly lands tails
3. Set of Integers Integers from 2 to 6 2, 3, 4, 5, 6
4. Set of real numbers Real numbers from 2 to 6 2<x<6
5. Life span of your Time until your iPhone battery
0<x< 
iPhone battery become defective

From the example in the table above, Examples 1, 2, and 3 are all discrete while
example 4 and 5 are continuous
Example 6. Brand Z is a company that manufactures batteries. They record
the number of defective batteries that are contained in each box
before it is sent out for delivery. If X is the number of defective
batteries per box, then what type of random variable is X?
Solution. X is a discrete random variable, and X can be a set {0, 1, 2, …, n}
where n is the total number of batteries in each box.
Example 7. Suppose Brand Z in example number 6 wants to check the
average life span of their batteries. If Y is the average number of
days that each battery lasts, then what can you conclude about
the variable Y?
Solution. We can conclude from above, that Y is a continuous random
variable. It may be assigned time intervals that corresponds to
each battery’s life span.
Because of the way the discrete random variable is defined, we can say that the range
of values that can be assigned to it is confined to the set of whole numbers. That is,
a discrete random variable may NOT take on non-integers.
What’s More
Independent Activity 1.
Look Back and Reflect
1. How do you determine the values of a random variable?
2. How do you know whether a random variable is continuous or discrete?
What is the difference between the two types of random variables?
Independent Assessment 1.
Classify the following random variables as discrete or continuous.
1. the speed of a tricycle.
2. the number of female students.

18
3. the time needed to finish the module.
4. the amount of sugar in a cup of coffee.
5. The number of defective mobile phones produced by a manufacturer.
Independent Activity 2.
Determine whether each of the following experiments/situations involve a discrete
random variable or a continuous random variable.
1. choosing an even number less than 150
2. gathering information about the average monthly electric consumption in a
certain household
3. tallying the number of family in a certain barangay that have one child
Independent Assessment 2.
Identify whether the given experiment involves a discrete random variable or a
continuous random variable. Write your answer on a separate answer sheet.
1. getting the temperature of patient admitted in a hospital
2. collecting data about the weights of students in a certain school
3. the number of patient admitted in the hospital due to COVID-19
4. the number of no work no pay in a certain municipality caused by pandemic
5. the number of families that are members of 4P’s in Quezon
6. the number of Balikbayan OFW arrive in the Philippines
7. the number of text messages received by a particular individual in a day
8. the number of possible outcomes in rolling a die
9. the amount of liquid in a 12 - ounce can of soda
10. the number of fouls committed by a basketball team during the games

What I Have Learned


Give your answer in the following statements.
1. A well - defined function from a sample space to the set of real numbers is called
a _______________________. It is commonly denoted by capital letters, such as X, Y
and Z. The specific values of a random variable is denoted by small letters such as
x, y, and z.
2. A random variable is a _______________________if its set of possible outcomes is
countable. Mostly, discrete random variable represents count data, such as the
number of enrolled students in a particular school.
3. A random variable is _______________________if it takes on values on a continuous
scale. Often, continuous random variables represent measured data, such as
heights, weights temperatures and lifespan.

19
What I Can Do
This time, it’s your turn to shine. Answer the following questions.
1. Eight students were asked to solve the Rubik’s cube. The time it took for them to
complete the puzzle is summarized in the table below.
Time (in min) Less than
1 min 1 - 2 min 2 - 3 min Greater than 3 min
Number of children who completed the Rubik’s cube 2 2 3 1
Classify what type of random variable is shown.
2. Suppose a Meter-man Apprentice was told by his superior to measure the voltage
of a certain electric outlet. The voltage reading ranges from 118 V < x < 122 V. Does
the scenario represent a discrete or a continuous random variable?

Assessment
Let us test how far you understood the lesson on random variables. Answer the test
below. Write your answers on a separate sheet of paper. I believe that with your effort
and determination, you can do it once again!

1. A set of numerical values assigned to a sample space is called


A. Random sample
B. Random variable
C. Random Process
A. Random experiment
2. Which of the following best describe variable that can be counted?
A. Categorical
B. Continuous
C. Discrete
D. Ordinal
3. Which of the following best describe variable that can be measured?
A. Categorical
B. Continuous
C. Discrete
D. Ordinal
4. You decided to order a pizza but you have to choose the type of crust and the
toppings. If there are only 6 possible combinations of ordering a pizza, from
which of the following should you choose?
A. Crust: thin or deep dish
Toppings: cheese or pepperoni
B. Crust: thin or deep dish
Toppings: cheese, bacon or pepperoni
C. Crust: thin or deep dish
Toppings: cheese, bacon, sausage or pepperoni
D. Crust: thin or deep dish
Toppings: cheese, bacon, sausage, pepperoni or hotdog
5. A drawer has 2 red ties, 3 black ties and 5 blue ties. A tie is picked at
random. What is the total number of possible outcomes?
A. 2
B. 3

20
C. 5
D. 10
6. Which of the following statement describe a continuous random variable?
A. The number of students present in a class
B. The average distance traveled by a jeep in a week
C. The number of correct guesses on a multiple choice test
D. The number of women taller than 68 inches in a random sample of 5
Women
7. Which of the following is a continuous random variable?
A. Height of sunflower as measured each day
B. Number of gadgets a business sell per day
C. Number of people attending mass every Sunday
D. Number of people eating at the restaurant everyday
8. Which of the following is discrete random variable?
A. Hipolito weighs 65 kg
B. Hipolito is 160 cm tall
C. Hipolito has two brother
D. Hipolito ran 100 meters in 10.2 seconds
9. Which of the following variable is discrete random variable?
A. Lifetime of a AAA battery
B. The amount of gasoline in a car
C. Number of goals of a football team
D. The time it takes to commute to word
10. Which of the following is NOT a discrete random variable?
A. Height of rose as measured each day
B. Number of appliances a business sell each day
C. Number of people went to church every Sunday
D. Number of students late in going to school each day
11. Which of the following statement DOES NOT describe a discrete random
variable?
A. The number of children who do household chores everyday
B. The average amount of milk consumption of a baby each day
C. The number of students who excelled in mathematics in a certain
grade level
D. The number of children who love to pray a rosary every 6 o’clock in
the afternoon
12. Which of the following statement DOES NOT describe a continuous random
variable?
A. The distance traveled by a truck in an hour
B. The average height of a coconut tree each day
C. The number of provinces belong to Region IV-A
D. The intensity of an earthquake that happens last month
13. Which of the following is NOT a continuous random variable?
A. The amount of water on a jug
B. The length of time attending a “Webinar”
C. The number of OFW who traveled abroad each day
D. The height of the tallest mountain in the Philippine

21
14. You decide to collect a bunch of cans of soda and measure the volume of
soda in each can. Let x be the number of soda in each can. What type of
variable is x?
A. x is a constant
B. x is a place holder
C. x is a discrete random variable
D. x is a continuous random variable
15. You decided to conduct a survey of families with two children. You are
interested in counting the number of girls (out of 2 children) in each
family. Is this a random variable?
A. Yes, it is a random variable
B. No, it is not a random variable
C. Maybe
D. Cannot be determine
Additional Activities
Answer the following.
Identify whether the given experiment involves a discrete random variable or a
continuous random variable. Write D if discrete and C if continuous.
_____ 1. Getting the distance travelled by a car
_____ 2. Collecting data about the weights of students in a certain school
_____ 3. The temperature in Quezon at noon times
_____ 4. Picking a multiple of 3 less than 200
_____ 5. Tallying the number of PUI (Person Under Investigation) due to
Covid-19 pandemic.

22
Statistics and
Probability
Finding Possible Values of a
Random Variable
Statistics and Probability
Finding Possible Values of a Random Variable
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module


Writers: Reynaldo L. Nanson
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Nenita N. De Leon and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

23
Week

1
What I Need to Know
This module was designed and written with you in mind. It is here to help you master
the process of finding the possible values of a random variables. The scope of this
module permits it to be used in many different learning situations. The language
used recognizes the diverse vocabulary level of students. The lesson is arranged to
follow the standard sequence of the course. But the order in which you read them
can be changed to correspond with the textbook you are now using.
The module has one lesson:
Lesson 1 – Finding possible values of a random variable.
After going through this module, you are expected to find possible values of random
variables.
What I Know
Before studying this module, take this pre-test to find out how much you already
know about the topic covered. Write the chosen letter on a separate sheet of paper.
1. A quantity resulting from an experiment by chance that can assume
different values is called ________________.
A. Ran dom sample
B. Random variable
C. Random process
D. Random experiment
2. Which of the following is NOT a true statement?
A. Random variables can only have one value.
B. The value of a random variable could be zero.
C. The probability of all the value of a random variable could be zero.
D. The sum of all the probabilities in a probability distribution is
always equal to one.
3. If a coin is tossed, what are the possible values of the random variable for the
number of heads?
A. 0, 1
B. 0, 1, 2
C. 1, 2, 3
D. 0, 1, 2, 3
For numbers 4 - 9. Suppose you tossed two coins.
4. What are the sample spaces for the experiment above?
A. HH, TT
B. HH, HT, TT
C. HH, TH, TT
D. HH, HT, TT, TH
5. How many possible outcomes will occur from the experiment above?
A. 2
B. 3
C. 4
D. 5
6. What are the possible values of the random variable for the number of
heads?
A. 0, 1
B. 0, 1, 2

24
C. 1, 2, 3
D. 0, 1, 2, 3
7. What are the possible values of the random variable for the number of tails?
A. 0, 1
B. 0, 1, 2
C. 1, 2, 3
D. 0, 1, 2, 3
8. Which of the following is NOT a possible value of the random variable for the
number of heads?
A. 1
B. 2
C. 3
D. 4
9. Which of the following is NOT a possible value of the random variable for the
number of tails?
A. 4
B. 3
C. 2
D. 1
For numbers 10 – 11. You decide to conduct a survey of families with two
children. You are interested in counting the number of girls (out of 2 children) in
each family.
10. Is the above experiment “counting the number of girls (out of 2 children in
each family” a random variable?
A. Yes, it is a random variable.
B. No, it is not a random variable.
C. Maybe.
D. It cannot be determined.
11. If the experiment above shows a random variable, then what are the possible
values of it?
A. Its value can be 1 or 2.
B. Its value can be 0, 1, or 2.
C. Its value can be 2 or 4.
D. None, since it is not a random variable.
12. In a local community, a couple was asked the questions, “Are you satisfied
with the work of the current president?” If the husband and the wife both said
“yes”, the response is written as YY. If the husband said yes and the wife said
“no”, the response is YN. Let X be the number of “yes” responses, what are the
possible values of the random variables?
A. 0, 1, 2
B. 1, 2, 3
C. 2, 3, 4
D. 1, 1, 2
For numbers 13 - 15. Suppose two laptops are tested. Let D represent the
defective laptop and N represent the non-defective laptop.

25
13. What are the possible sample spaces for the experiment?
A. DD, NN
B. DD, DN, NN
C. DD, ND, NN
D. DD, DN, NN, ND
14. How many possible outcomes will occur from the experiment?
A. 2
B. 3
C. 4
D. 5
15. If we let X be the random variable representing the number of defective
laptops. What are the possible values of the random variable?
A. 0, 1
B. 0, 1, 2
C. 1, 2, 3
D. 0, 1, 2, 3
Well, what do you think about the pre-test? Did you do it well? Compare your
answers with those in the Answer Key on the last part of this module to find out your
level of performance in this test. If all your answers are correct, then it’s excellent!
This shows that you already know much about the topic. You may SKIP this module
and PROCEED to the next module.

If you got a low score or missed a point, it’s ok. This module was made to help
you understand important concepts about random variables that you can apply in
your daily life. If you study this module carefully, you will learn the answers to all
the items in the test and a lot more!

Are you ready? Let’s begin your journey.


Lesson 1 Finding Possible Values of a Random
Variable
We first learned about the definition of variables in the introduction of Algebra, and
we know from our Algebra lessons that a variable is a placeholder for real number
values that can be assigned to it. Some examples of variables include X for the
number of tails or Y for the number of computers or Z for running time of movies.

If three coins are tossed, what numbers can be assigned for the frequency of heads
that will occur? If three cards are drawn from a deck, what number can be assigned
for the frequency of face cards that will occur? These questions may be answered
using random variables which you will learn in this module.

What’s In
Recall that a variable is an attribute that can assume different values. We use letters
to denote or represent a variable. In this lesson, we shall discuss variables that are
resulted from an experiment by chance, called random variables.

26
To find out if you are ready to learn this new lesson, try to determine whether each
of the following experiments/situations involved discrete or a continuous random
variable.
1. Choosing an even number less than 100.
2. Tallying the number of households in a subdivision that has a least one
vehicle.
3. Gathering information about the average monthly income of a
household.
4. Teacher applicants in the Division of Quezon.
5. Recording the number of patients who recovered from COVID-19.
What’s New
To understand how one finds possible values of random variables, do the activity
below.

Suppose two iPhone are tested at random. We want to find out the number of
defective iPhone. Thus, to each outcome in the sample space we shall assign a value.
These are 0, 1, or 2. If there is no defective iPhone, we assign the number 0; if there
is 1 defective iPhone, we assign the number 1; and 2, if there are two defective iPhone.
The number of defective iPhone is a random variable. The possible values of this
random variable are 0, 1, and 2.
Read and analyze the given situation below.

Illustration
Let D represent the defective iPhone and N for the non-defective iPhone. If we let Y
be the random variable representing the number of defective iPhone, show the values
of the random variable Y. Complete the table below to show the values of the random
variable.
Value of the Random Variable Y
Possible Outcomes
(number of defective iPhone)

The completed table should look like this.

Value of the Random Variable Y (number


Possible Outcomes
of defective iPhone)

DD 2

NN 0

DN 1

ND 1

27
Now, it's your turn.

Suppose three laptops are tested at random. We want to find out the number of
non-defective laptops. Thus, to each outcome in the sample space we shall assign a
value. These are 0, 1, 2, or 3. If there is no defective laptop, we assign the number
0; if there is 1 non-defective laptop, we assign the number 1; if there are two non-
defective laptops, we assign the number 2 and 3, if there are three non-
defective laptops. The number of non-defective laptops is a random variable. Let D
represents the defective laptop and N represents the non-defective laptop. If we let
X be the random variable representing the number of non-defective laptops, show
the values of the random variable X. Complete the table below to show the values of
the random variable.

Value of the Random Variable X


Possible Outcomes
(number of non-defective laptops)

This time, you need to reflect on the following questions because it will help you to
understand the appropriate way in finding the possible values of a random variable.
Answer the following questions:
1. How do you find the activity?
2. Were you able to complete the table? If yes, how? If no, why?
3. Will you be able to find the values of a random variable? If yes,
how? If no, why?
4. In your own words, how will you describe a random variable?
5. How do you find the possible values of a random variable?
What is It
A random variable is a numerical quantity that is derived from the outcomes of
random experiments.
The random variable in the activity above is a discrete random variable because the
set of possible outcomes is countable. The possible values of random variable Y
(number of defective iPhone) are 0, 1, and 2, while the possible values of random
variable X (number of non-defective laptops) are 0, 1, 2, and 3.
Example 1. Suppose two coins are tossed, let Z be the random variable representing
the number of heads that occur. Find the values of the random variable Z.

28
Steps Solution
1. Determine the sample space. The sample space for this experiment is:
Let H represent head and T for S = {TT, TH, HH, HT}
tail.
2. Count the number of heads in
each outcome in the sample Possible Outcomes Value of the
space and assign this number Random variable Z
to this outcome. (Number of heads)
TT 0
TH 1
HH 2
HT 1

So, the possible values of the random variable Z are 0, 1, and 2.


Example 2. In a box are 2 balls - one white and one yellow. Two balls are picked one
at a time with replacement. Let X be the random variable representing the number
of white balls. Find the values of the random variable X.

Steps Solution
1. Determine the sample space. Let The sample space for this experiment is:
W represent the white ball and Y S = {WW, YY, WY, YW}
for the yellow ball.
2. Count the number of white balls
in each outcome in the sample Value of the
space and assign this number to Possible random variable X
this outcome. Outcomes (Number of white
balls)
WW 2
YY 0
WY 1
YW 1

.
So, the possible values of the random variable X are 0, 1, and 2.
What’s More
Independent Activity 1.
Understanding Possible Values of Random Variables.
Answer the following questions.
1. How do you find the values of a random variable?
2. How is this variable, as described in Algebra, similar to a random variable?
How do they differ?
Independent Assessment 1
Two cards are drawn from a deck. How many possible values can each of the
following variables take?
1. sum of the numbers on the cards
2. number of times both cards are black
3. Number of times both cards are 7s
4. Number of times the first card is six and the second card is red

29
5. Number of times the first card is face card and the second card is not a
face card
Independent Activity 2. Getting Balls from a Box
Two balls are picked in succession without replacement - 4 white balls and 5 green
balls. Let Y be the random variable representing the number of green balls. Find the
values of the random variable Y. Complete the table below.
Value of Random Variable Y
Possible Outcomes (number of green balls)

Independent Assessment 2
Four coins are toss. Let Y be the random variable representing the number of
tails that occur. Find the values of the random variable Y.
Value of Random Variable Y
Possible Outcomes
(number of tails)

What I Have Learned


Give your answer in the following statements.
1. A ______________________ is a set of possible values from a random experiment.
For example, in tossing a coin, we can either get head or tail, such tossing a coin is
an experiment where we can give values for such event.
2. A ______________________ is one that may take on only a countable number of
distinct values such as 0,1,2,3,4, etc. This variable is usually (but not necessarily)
counts. It is a random variable that can take only a finite number of distinct values.
3. To find the values of a random variable, we have to follow these steps: first, list all
______________________ in sample space; second, find the ______________________ for
each simple event; third, list the possible values for a random variable X and identify
the value for each simple event and finally; find all simple events for which X = k, for
each possible value k.

30
What I Can Do
This time, it is your turn to shine. Answer the problem below.
The households of a local community were surveyed about the number of occupants
who are working. It was found out that 25 households have one occupant working,
18 have two occupants working, 12 have three occupants working, and 5 have four
occupants working. Let X be the number of occupants working from a randomly
selected household. What are the possible values for the random variable?

Assessment
Multiple Choice. Choose the letter of the best answer. Write the chosen letter on a
separate sheet of paper. .
1.A set of numerical values assigned to a sample space is called
A. Random sample C. Random variable
B. Random process D. Random experiment
2.Which of the following is a true statement?
A. Random variables can only have one value.
B. The value of a random variable could not be zero.
C. The probability of all the value of a random variable could be zero.
D. The sum of all the probabilities in a probability distribution is not
equal to one.
3.If a coin is tossed, what are the possible values of the random variable for
the number of tails?
A. 0, 1 C. 1, 2, 3
B. 0, 1, 2 D. 0, 1, 2, 3
For numbers 4 - 9. Suppose you tossed three coins.
4. What are the sample spaces for the experiment above?
A. HHH, TTT
B. TTT, HHH, TTH, THT, HTH
C. TTT, HHH, HHT, THT, HTH
D. TTT, TTH, THT, HTT, HHT, HTH, THH, HHH
5. How many possible outcomes will occur from the experiment above?
A. 2 C. 6
B. 4 D. 8
6.What are the possible values of the random variable for the number of
heads?
A. 0, 1 C. 1, 2, 3
B. 0, 1, 2 D. 0, 1, 2, 3

31
7. What are the possible values of the random variable for the number of tails?
A. 0, 1 C. 1, 2, 3
B. 0, 1, 2 D. 0, 1, 2, 3
8. Which of the following is NOT a possible value of the random variable for
the number of heads?
A. 1 C. 3
B. 2 D. 4
9. Which of the following is NOT a possible value of the random variable for
the number of tails?
A. 1 C. 3
B. 2 D. 4
For numbers 10 - 11. You decide to conduct a survey of families with two children.
You are interested in counting the number of boys (out of 2 children) in each family
10. Is the above experiment “counting the number of boys (out of 2
children) in each family” a random variable?
A. Yes, it is a random variable. C. Maybe.
B. No, it is not a random variable. D. It cannot be determined.
11.If the statement above shows a random variable, then what are the possible
values?
A. Its value can be 1 or 2.
B. Its value can be 0, 1, or 2.
C. Its value can be 2 or 4.
D. None, since it is not a random variable.
For numbers 12 - 13. Suppose two balls are drawn in succession without
replacement from an urn containing 6 blue balls and 5 red balls.
12.How many possible outcomes will occur from the experiment?
A. 2 C. 6
B. 4 D. 8
13. Let Z be the random variable representing the number of blue balls.
What are the possible values of the random variable?
A. 0, 1 C. 1, 2, 3
B. 0, 1, 2 D. 0, 1, 2, 3
For numbers 14 - 15. Suppose three laptops are tested. Let D represent the defective
laptop and N for the non-defective laptop.

32
14. How many possible outcomes will occur from the experiment?
A. 3 C. 8
B. 4 D. 9
15. If we let X be the random variable representing the number of
non-defective laptops. What are the possible values of the random variable?
A. 0, 1 C. 1, 2, 3
B. 0, 1, 2 D. 0, 1, 2, 3

Additional Activities
Answer the following.
An umbrella - check girl returns 2 umbrellas at random to 2 customers who had
previously checked in, namely Andrea and Althea. If Andrea’s umbrella was black
and Althea’s umbrella was Yellow:
1. list the elements of the sample space, S - all possible orders in which the
umbrellas can be returned, and;
2. list all possible values of the random variable X, the number of correct
matches.

33
Statistics and
Probability
Illustrating a Probability
Distribution for a Discrete
Random Variable and its
Properties
Statistics and Probability
Illustrating a Probability Distribution for a Discrete Random Variable and its Properties
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module


Writers: Melba S. Remojo
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Nenita N. De Leon and Ma. Theresa B. Yanto
Illustrator: Kelvin A. Ramintas and Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

34
Week

1
What I Need to Know
This module was designed to give you the appropriate ways of illustrating a
probability distribution for a discrete random variable and its properties. After
determining the values of the random variable in your previous lesson, there is a
need to illustrate the probability distribution. This will also help you analyze real-life
situated problems statistically in terms of relevant questions for you to better
understand them.
In this module, you will also learn how to determine if the distribution represents a
probability distribution or not. The application of the properties of probability
distribution will help you arrive at the correct conclusion regarding probabilities.
This module will also help you to improve your computation skills. The scope of this
module permits it to be used in many different learning situations. The language
used recognizes the diverse vocabulary level of students and the lessons are arranged
to follow the standard sequence of the course.

After going through this module, you are expected to illustrate a probability
distribution for a discrete random variable and its properties.

What I Know
Let us see how far you will go and how much you know about this pretest. Choose
the letter of the best answer. Write the chosen letter on a separate sheet of paper.
1. What is the sum of the probabilities of all values of the random variable?
A. ∑ 𝑃(𝑋))= 0
B. ∑ 𝑃(𝑋)) = 1/10
C. ∑ 𝑃(𝑋)) = 1
D. ∑ 𝑃(𝑋)) = 10
Refer to the given problem in answering numbers 2-3
2. If two balls are drawn in succession without replacement from an urn
containing 5 red balls and 6 blue balls. If the value of the random variable X
represents the number of blue balls, what is the probability of getting two
blue balls?
A. 1/8
B. 1/4
C. 1/3
D. 1/2
3. Which probability distribution represents the given problem?

A. X 0 1 2
P(X) 1/2 1/4 1/4

B. X 0 1 2
P(X) 1/4 1/2 1/4

C. X 0 1 2 3
P(X) 1/8 3/8 3/8 1/8

D. X 0 1 2 3
P(X) 3/8 1/8 3/8 1/8

35
4. What must be the value of the probability of each random variable?
A. P(X) ≥ 0
B. -1≤ P(X) ≤ 1
C. 0≤ P(X) ≤ 1
D. P(X) ≤ 1
5. Which of the following values is not the value of the probability of the
random variable?
A. 1.01
B. 0.5
C. 0.15
D. 0.05
6. Which of the following values can serve as the values of a probability
distribution?
A. P(1) = 0.42, P2) = 0.31, P(3)= 0.37
B. P(1) = 9/14, P2) = 4/14, P(3)= 1/14
C. P(1) = 0.08, P2) = 0.12, P(3)= 1.03
D. P(1) = 10/33, P2) = 12/33, P(3)= 10/33
𝑥
7. If P(X)= , what are the possible values of X for it to be considered as a
6
probability distribution?
A.0,2,3
B. 1,1,2
C. 2,3,4
D. 1,2,3
8. Which formula gives the probability distribution as shown by the
table?
X 2 3 6
P(X) 1/2 1/3 1/6
1
A. P(X) =
𝑋
𝑋
B. P(X) =
6
6
C. P(X) =
𝑋
1
D. P(X) =
6
9. The following table shows the probability distribution of a discrete
random variable X. Find the value of n.

X 2 4 6
P(X) 0.40 0.15 n
A. 0.15
B. 0.25
C. 0.45
D. 0.6
10. Which of the following table represents probability distribution?

A. X 0 1 2 3
P(X) 0.11 0.15 0.42 0.44

B. X 1 3 5 7

P(X) 1/3 1/2 1/3 1/3

36
C. X 1 2 3 4
P(X) 0.32 0.28 0.28 0.12

D. X 0 2 4 6
P(X) 1/5 1/5 1/5 1/5
11. If the values of the random variable X are 0,1,2 and 3 where X
represents the number of heads in tossing a coin thrice, what is the
probability that three heads will come up?
A. 1/2
B. 3/8
C. 1/4
D. 1/8
𝑋
12. What are the values of P(x) if P(x)= , for x=0,1, and 2?
3
A. 0, 0.33, 0.67
B. 0, 1,2
C. 0, 0.45, 0.55
D. 0.2, 0.3, 0.5
13. Which of the following values can serve as the probability distribution of a
random variable X?
1
A.P(X)= for x =1,2, 3…..9
6
3+𝑋
B. P(X)= for x = 1, 2, 3, 4
3−𝑋
1
C.P(X)= for x = 1,2,3……8
8
3
D.PX) = for x = 1,2,3
5𝑥
14. The number of absences from June to March of a student based on
his class adviser’s record is shown below.

Month No. of Absences Month No. of Absences


June 0 November 2
July 2 December 1
August 1 January 4
September 3 February 1
October 2 March 0
If X be the random variable representing the number of absences. Which
table represents the probability distribution?

A. X 0 1 2 3
P(X) 2/10 2/10 3/10 3/10

B. X 0 1 2 3
P(X) 1/10 3/10 3/10 3/10

C. X 0 1 2 3 4
P(X) 1/10 1/10 3/10 3/10 1/5

D. X 0 1 2 3 4
P(X) 1/5 3/10 3/10 1/10 1/10

37
15. The probabilities that a customer buys 5,8,9,12 and 15 items in a
grocery store are 0.06, 0.14, 0.32, 0.28, and 0.20 respectively. Which
probability distribution represents the given problem?

A. X 5 8 9 12 15
P(X) 0.06 0.14 0.20 0.28 0.32

B. X 5 8 9 12 15
P(X) 0.32 0.28 0.20 0.14 0.06

C. X 5 8 9 12 15
P(X) 0.06 0.20 0.14 0.32 0.28

D. X 5 8 9 12 15
P(X) 0.06 0.14 0.32 0.28 0.20
Lesson 1 Illustrating a Probability
Distribution for a Discrete Random
Variable and its Properties
In this module, the concept of the probability distribution for a discrete
random variable will be introduced. Some decisions are made by assigning
probabilities to all possible outcomes related to the situation. When we draw
a conclusion from an experiment like tossing a coin, tossing dice, or from
other situations, it requires the use of random variable and probability
distribution. You will also learn how to determine if the given distribution
represents a probability distribution.

What’s In
In your previous lesson, you have learned how to find the possible values of a random
variable. In this module, you will learn how to illustrate a probability distribution of
a discrete random variable. To find out if you are ready to learn in this new lesson,
answer the following questions.
A. Determine whether the statement is True or False. If the answer is false, you
can modify the statement to make it true.
Statement True False
1. There are 4 outcomes if you tossed two coins.
2. If you tossed three coins where X be the random variable
representing the number of tails that occur. The possible
values of the random variable X are 0, 1 and 2
3 1 5
3.The sum of + =
4 2 8
4. The sum of 0.25 + 06 +0.36 +0.28=0.95
𝑋+1 1
5.If P(x)= , the value of P(1) is
6 2
3 3
6. If P(x)= , the value of P(4) is
𝑥−2 2

38
Solution:
1. True.
2. False: The possible values of the random variable X are 0,1,2 and 3.
3 1 5
3. False: + =
4 2 4
4. True
𝑋+1 1
5. False: If P(x)= , the value of P(1) is
6 3
6. True

B. Recall how to determine the values of the random variable by answering the given
problem.
Find the values of the random variable Y representing the number of green balls
when 2 balls are drawn in succession without replacement from a jar containing 4
red balls and 5 green balls.

Solution:
a. Determine the sample space. Let R represent the red ball and G represent
the green ball. The sample size for this experiment is: { RR, RG, GR, GG }.
b. Count the number of green balls in each outcome in the sample space and
assign the number to this outcome.

Value of the Random Variable Y


Possible Outcome (No. of Green Balls)
RR 0
RG 1
GR 1
GG 2

The values of the random variable Y are ____________.


What’s New
Activity
Number of Defective Computers

Read and analyze the situation given below:

In a computer laboratory, the teacher wants to find out if there is a defective


computer. Supposed three computers were tested at random, she asks one of her
Computer System Servicing students to list all the possible outcomes, such that D
represents the defective computer and N represents the non-defective computer. Let
X be the random variable for the number of defective computers. Then, illustrate the
probability distribution of the random variable X.

39
Based on the above problem, observe, analyze, and answer the following questions:
a. List the sample space in the given experiment. How many outcomes are
possible?
b. Construct a table showing the number of defective computers in each
outcome and assign this number to this outcome. What is the value of the
random variable X?
c. Illustrate a probability distribution. What is the probability value P(X) to
each value of the random variable?
d. What is the sum of the probabilities of all values of the random
variable?
e. What do you notice about the probability of each value of the random
variable?

What is It
To solve the problem above, you have to consider first the steps in determining the
values of the random variable that you have learned from your previous lesson.

a. Let D represent the defective computer and N for the non-defective


computer.
The sample space is:
S= {NND, NDN, DNN, DND, DDN, NDD, DDD, NNN} and there are 8 possible
outcomes
b. Count the number of defective computers in each outcome in the sample
space and assign this number to this outcome. For instance, if you list NND,
the number of defective computers is 1.

Value of the Random Variable X


Possible Outcomes
(number of defective computers)
NND 1
NDN 1
DNN 1
DND 2
DDN 2
NDD 2
DDD 3
NNN 0

There are four possible values of the random variable X representing the number of
defective computers. The possible values that X can take are 0, 1, 2, and 3.

c. Each of these numbers corresponds to an event in the sample space S of


equally likely outcomes for this experiment. Since the value of the random
variable X represents the number of defective computers, X = 0 to (NNN),
X = 1 to (NND, NDN, DNN), X=2 to (DND, DDN, NDD) and X= 3 to (DDD).

If each of the outcomes is equally likely to occur, then the probability is:

number of outcomes in the event


P (E)=
number of outcomes in the sample space

Assign probability values P(X) to each value of the random variable. Since the
number of outcomes is 8, the probability that 0 defective computer will come out

40
1 1 3
is or P (0) = , the probability that 1 defective computer will come out is or
8 8 8
1 3
P(1)= is , the probability that 2 defective computers will come out is or P(2)=
8 8
3 1 3
and the probability that 3 defective computers will come out is or P(3)= .
8 8 8

Illustrating this in a table would give as:

Number of Defective Probability P(X)


Computer X
0 1
or 0.125
8
1 3
or 0.375
8
2 3
or 0.375
8
3 1
or 0.125
8

You can also construct the table in this form:

X 0 1 2 3
P(X) 1 3 3 1
8 8 8 8

d. by adding all the probabilities


1 3 3 1 1+3+3+1 8
+ + + = = =1
8 8 8 8 8 8
you can also use decimals in determining the sum of the probabilities
0.125 + 0.375 + 0.375 + 0.125 = 1
If you add all the probabilities, the sum is equal to 1.
e. From the given activity, you can see that the values of the probability range
from 0 to 1.
The probability distribution of a discrete random variable X is a list of the possible
values of X and the corresponding probabilities of the values. It specifies the
probability associated with each possible value of the random variable. The
distribution functions of discrete random variables are concentrated as a mass for a
particular value, and generally known as Probability Mass Function.

Properties of discrete probability distribution

1. The probability of each value of the random variable must be


between or equal to 0 and 1. In symbol 0 < P(X) < 1.
2. The sum of all the probabilities of all values of the random variable
must be equal to 1. In symbol, we write it as 𝚺P(X) = 1

Ex.1.Determine if the distribution below is a discrete probability distribution:

X 1 5 7 8 9

PX) 1 1 1 1 1
3 3 3 3 3

41
The distribution must satisfy that each probability value P(X) must be
a.) from 0 to 1 and b) the sum of all the values of the probabilities
must be equal to 1.
By adding all the values of P(X)
1 1 1 1 1 𝟓
Σ P(X)= + + + + =
3 3 3 3 3 𝟑
1
The probability of each value of the random variable has the same value which is
3
and this value lies between 0 and 1 but the sum of its probabilities is not equal to 1.
Hence, this is not a probability distribution because ΣP(X) ≠ 1
Ex. 2. Determine if the distribution below is a discrete probability distribution
You can also express the values of the probabilities in decimal form.

X 1 3 5 7

P(X) 0.35 0.25 0.28 0.12


By adding all the values of P(X)
Σ P(X)= 0.35 + 0.25 + 0.28 + 0.12 =1
The probability of each value of the random variable lies between 0 and 1 and the
sum of its probabilities is equal to 1 or ΣP(X) =1. Therefore, this is a probability
distribution.

Ex. 3. Determine whether the given values can serve as the values of a
probability distribution.

a. P(1) = 0.05, P(2)=1.01, P(3)= 0.2


The probability of each value of the random variable does not lie between 0 and 1
because P(2)=1.01. Therefore, this is not a probability distribution.

3 7 1
b. P(1) = , P(2)= , P(3)=
20 20 2

3 7 1 3 7 10 20
+ + = + + = =1
20 20 2 20 20 20 20
The probability of each value of the random variable lies between 0 and 1 and the
sum of its probabilities is equal to 1 or ΣP(X) =1. Therefore, this is a probability
distribution.

Ex. 4. Determine whether the following can serve as the probability distribution of a
random variable X.
1
a. P(X)= for x= 1,2,3,…….9
7
1
This means that the value of P(1) to P(9)=
7
Illustrating this in a table would give as:
X 1 2 3 4 5 6 7 8 9
P(X) 1 1 1 1 1 1 1 1 1
7 7 7 7 7 7 7 7 7
1
The probability of each value of the random variable has the same value which is
7
9
and this value lies between 0 and 1 but the sum of its probabilities is which is not
7
equal to 1. Hence, this is not a probability distribution because ΣP(X) ≠ 1

42
12
b. P(X)= for x= 1,2,3,
25𝑥
Evaluate P(X), given the value of x :

12 12 12
P(1)= = = = 0.48
25𝑥 25(1) 25
12 12 12
P(2)= = = = 0.24
25𝑥 25(2) 50
12 12 12
P(3)= = = =0.16
25𝑥 25(3) 75
12 12 12
P(4)= = = = 0.12
25𝑥 25(4) 100
Illustrating this in a table would give as:
X 1 2 3 4
P(X) 0.48 0.24 0.16 0.12

The probability of each value of the random variable lies between 0 and 1 and the
sum of its probabilities is equal to 1 or ΣP(X) =1. Therefore, this is a probability
distribution.
What’s More
Independent Activity 1
A. Determine whether the distribution represents a probability distribution or not.
Explain your answer.

1 X 1 5 7 8
1 1 1 1
P(X) 4 8 4 8

2 X 0 2 3 4 6
1 1 1 1 1
P(X) 6 6 6 3 6

3 X 1 3 5 7
P(X) 0.35 0.25 0.22 0.12

4. P(1)=0.42, P(2)=0.31, P(3)=0.37


𝟏𝟐 𝟖 𝟑
5. P(1)= , P(2)= , P(3)=
𝟑𝟓 𝟑𝟓 𝟕

B. Determine whether the following can serve as the probability distribution of a


random variable X.
1
1. P(X)= for x= 1,2,3,…….7
7
𝑥−2
2. P(X)= for x= 1,2,3,4,5
3

Independent Activity II
Number of Girls
Let X be a random variable giving the number of girls in a randomly selected three-
child family. Assuming that boys and girls are equally likely to be selected, illustrate
the probability distribution of a random variable X.

43
Independent Activity III
Rolling Two Dice

Using the sample space for rolling two dice, illustrates a probability distribution for
the random variable X representing the sum of the numbers that appear.

Independent Activity IV
Face Mask
Julia is a boutique shop owner in her town. Due to COVID-19 pandemic, wearing a
face mask of a person is required for their safety. Since there are limited stocks
available, she decided to start another business by making a face mask. She started
selling face masks from day 1 to day 10. The data she collected is shown in the table
below.

Illustrate a probability distribution of a random variable X showing the number of


face mask sold per day and its corresponding probabilities.

Day Number of Face Mask (X)


1 25
2 20
3 15
4 14
5 15
6 10
7 12
8 10
9 15
10 14

What I Have Learned


Give your answer in the following statements.
1. It is the list of possible values of a random variable X and the corresponding
probabilities of the values. _______________________________.
2. What is the sum of all the probabilities of all values of the random variable?
______________________________________________________________.
3. What should be the probability of each value of the random variable? Write
your answer in symbol. __________________________________________.
4. How will you determine if the given distribution is a probability distribution?
__________________________________________________________.
5. Enumerate the steps on how to illustrate the probability distribution of a
discrete random variable by completing the statements given below.
Determine first the _____________space in the given experiment. Then,
find the possible values of the _____________________________. Illustrate the
probability distribution by assigning ____________________________ to each
value of the random variable.

44
What I Can Do
COVID -19 Recoveries
The ongoing pandemic of corona virus disease 2019, a novel infectious disease
caused by severe acute respiratory syndrome was spread in the Philippines on
January 30, 2020. Thousands of people in the country tested positive in this virus,
and some of them recovered from the disease. The following table shows the number
of recoveries from April 24-May 3, 2020.

Date Number of Recoveries X


April 24 40
April 25 30
April 26 70
April 27 70
April 28 43
April 29 48
April 30 20
May 1 41
May 2 40
May 3 90
a. Illustrate the probability distribution if X is the random variable that
represents the number of recoveries.
b. What safety precautions will you undertake to help avoid the spread of
the COVID-19 pandemic?

Assessment
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. What must be the value of the probability of each random variable?
A. P(X) ≥ 0
B. 0≤ P(X) ≤ 1
C. -1≤ P(X) ≤ 1
D. P(X) ≤ 1
2. What is the sum of the probabilities of all values of the random
variable?
A. ∑ 𝑃(𝑋))= 0
B. ∑ 𝑃(𝑋)) = 1/10
C. ∑ 𝑃(𝑋)) = 1
D. ∑ 𝑃(𝑋)) = 10
Refer to the given problem in answering numbers 3-4
3. If two balls are drawn in succession without replacement from an
urn containing 4 red balls and 5 blue balls. If the value of the
random variable X represents the number of blue balls. What is the
probability of getting two blue balls?
1
A.
2
1
B.
3
1
C.
4
1
D.
8

45
4. Which probability distribution represents the given problem?

A X 0 1 2
1 1 1
P(X) 2 4 4

B X 0 1 2
1 1 1
P(X) 4 2 4

C X 0 1 2 3
1 3 3 1
P(X) 8 8 8 8

D X 0 1 2 3
3 1 3 1
P(X) 8 8 8 8

5. Which of the following values is not the value of the probability of the
random variable?
A. 1.01
B.1
C.0.50
D. 0.05
6. Which of the following table represents probability distribution?

A. X 0 1 2 3
P(X) 0.11 0.15 0.42 0.44

B. X 1 3 5 7
1 1 1 1
P(X)
3 2 3 3

C. X 1 2 3 4
P(X) 0.32 0.28 0.28 0.12

D. X 0 2 4 6
1 1 1 1
P(X) 5 5
5 5

7. Which of the following values can serve as the values of a probability


distribution?
A. P(1) = 0.42, P2) = 0.31, P(3 )= 0.37
B. P(1) = 9/14, P2) = 4/14, P(3) = 1/14
C. P(1) = 0.08, P2) = 0.12, P(3 )= 1.03
D. P(1) = 10/33, P2) = 12/33, P(3) = 10/33

46
𝑥
8. If P(X)= , what are the possible values of X for it to be a probability
6
distribution?
A.0,2,3
B. 1,1,2
C. 2,3,4
D. 1,2,3
9. Which formula gives the probability distribution shown by the
table?
X 2 3 6
P(X) 1/2 1/3 1/6

1
A. P(X) =
𝑋
𝑋
B. P(X) =
6
6
C. P(X) =
𝑋
1
D. P(X) =
6
10. The following table shows the probability distribution of a
discrete random variable X. Find the value of n.

X 2 4 6
P(X) 0.4 0.25 n

A. 0.15
B. 0.25
C. 0.28
D. 0.35
11. The probabilities that a customer buys 5,8,9,12 and 15 items in a grocery
store are 0.06, 0.14, 0.32, 0.28, and 0.20 respectively. Which probability
distribution represents the given problem?

A. X 5 8 9 12 15
P(X) 0.06 0.14 0.2 0.28 0.32

B. X 5 8 9 12 15
P(X) 0.32 0.28 0.2 0.14 0.06

C. X 5 8 9 12 15
P(X) 0.06 0.14 0.32 0.28 0.20

D. X 5 8 9 12 15
P(X) 0.06 0.14 0.2 0.28 0.32

12. What are the possible values of each random variable X if it


represents the number of heads in tossing a coin thrice?
A. X = {0,1,2,3}
B. X = {0, 1,2}
C. X = {1, 2, 3}
D. X = {1,2,}

47
𝑋
13. What are the values of P(x) if P(x)= , for x=0,1, and 2
3
A.0,1,2
B.0,0.33, 0.67
C. 0,0.45, 0.55
D. 0.2, 0.3, 0.5
14. Which of the following can serve as the probability distribution of a
random variable X?
1
A. P(X) = for x =1,2, 3…...8
8
3+𝑋
B. P(X) = for x = 1, 2, 3, 4
3−𝑋
1
C. P(X) = for x = 1,2,3……9
6
3
D. P(X) = for x = 1,2,3
5𝑥
15. The number of absences from June to March of a student based on his class
adviser’s record is shown below. If X be the random variable representing the
number of absences.

Month No. of Absences Month No. of Absences


June 0 November 2
July 2 December 1
August 1 January 4
September 3 February 1
October 2 March 4

Which table represents the probability distribution?

A. X 0 1 2 3
2 2 3 3
P(X) 10 10 10 10

B. X 0 1 2 3
1 3 3 3
P(X) 10 10 10 10

C. X 0 1 2 3 4
1 3 3 1 1
P(X) 10 10 10 10 5

D. X 0 1 2 3 4
1 3 3 1 1
P(X) 5 10 10 10 10

48
Additional Activities
1. Tossing Four Coins
Suppose that a coin is to be tossed four times, and let X represent the number of
tails that occur. Illustrate a probability distribution for a random variable X.
2.Spin and Add
A spinner has 3 equal sections that are labeled 1,2,
and 3. If the spinner spins twice, find the sum of the
3 1
two numbers it lands on. Let X be the random
variable that represents the sum of the two numbers.
a. List all sample spaces and identify how many 2
they are.
b. Let X be the random variable that represents
the sum of two numbers. What are the possible values of X?
c. Illustrate a probability distribution for a random variable X.

49
Statistics and
Probability
Computing Probability
Corresponding to a Given
Random Variable
Statistics and Probability
Computing Probability Corresponding to a Given Random Variable
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module


Writers: Melba S. Remojo
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Nenita N. De Leon and Ma. Theresa B. Yanto
Illustrator: Kelvin A. Ramintas and Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

50
Week

2
What I Need to Know
This module was designed and written with you in mind to help you on computing
probabilities corresponding to a given random variable. The probability of certain
events occurring or not can be important to be applied in real-life problems. You can
use probability to make judgments when you are unsure what the outcome will be.
This module will also help you to improve your computation skills. You can apply
this not only in experiments like tossing a coin, playing cards, and throwing a die
but also in different situations like in business or financial matters wherein a
statistical process is required. This will also help you in decision making regarding
future financial situations and on business decision practices.

After going through this module, you are expected to:


1. illustrate a probability distribution for a discrete random variable; and
computes probabilities corresponding to a given random variable.

What I Know
Let us see how far you will go and how much you know about this pretest. Choose
the letter of the best answer. Write the chosen letter on a separate sheet of paper.

1. The given table represents a probability distribution. What is P(2) + P(3)?


X 1 2 3 4
P(X) 1 1 1 1
3 6 6 3

1 1 1 2
A. B. C. D.
6 3 2 3
Refer to the given table in answering numbers 2-3.
In the experiment of tossing a coin twice, the following table shows the probability
distribution. What is P (X≤ 2)?

X 0 1 2
P(X) 1 1 1
4 2 4

2. What is P (X≤ 2)?


3 1 1
A. 1 B. C. D.
4 2 4
3. What is P(X≥1)?
3 1 1
A.1 B. C. D.
4 2 4
Refer to the given table in answering numbers 4-5.

Number of Tails (X) 0 1 2 3


P(X) 0.125 0.375 0.375 0.125

4. Find P (X ≥ 1)
A. 0.125 B. 0.5 C. 0.625 D. 0.875
5. Find P (X ≤ 1)

A. 0.125 B. 0.5 C. 0.625 D. 0.875

51
Refer to the given problem in answering numbers 6-8.

6. The random variable Y representing the number of nuts in a chocolate


bar has the following probability distribution. Find P (odd)

Y 0 1 2 3 4
P(Y) 1 3 3 2 1
10 10 10 10 10
3 2 1 4
A. B. C. D.
10 5 2 5
7. Find P (1 ≤ Y ≤ 4)
9 4 1 3
A. B. C. D.
10 5 2 10
8. Find P (Y> 1)
1 3 7 9
A. B. C. D.
5 5 10 10
Refer to the given problem in answering numbers 9-10.
Two jars both contain the numbers 2, 4, 6, and 8. The following table shows the
probability distribution of the sum when one number from each jar is taken at a
time with replacement.

X 4 6 8 10 12 14 16
P(X) 1 1 3 1 3 1 1
16 8 16 4 16 8 16

9. Find P(4) + P(6)


9 7 5 3
A. B. C. D.
16 16 16 16
10. Find the probability that X assumes a value greater than 10.
3 7 9 5
A. B. C. D.
8 16 16 8
11. Find the probability that X assumes a value between 6 and 14.
3 7 9 5
A. B. C. D.
8 16 16 8
Refer to the given problem in answering numbers 12-13.
The given table shows the probability distribution of a random variable Z which
represents the percentage of male students per section among Grade 11 senior high
school students.
Z 8 9 11 14 15 18
P(Z) 8% 8% 25% 17% 25% 17%

12. Find P(Z≥11)


A. 84% B.59% C. 51% D.43 %
13. Find P(8< Z <18)
A. 43% B.59% C. 75% D.83 %
Refer to the given problem in answering numbers 14-15.
The daily demand for copies of a magazine at a variety store has the probability
distribution as follows.

Number of Copies X Probability P(X)


0 0.10
1 0.25
2 0.30
3 0.16

52
4 0.05
5 0.14

14. What is the probability that two or more copies will be demanded on
a particular day?
A. 0.30 B. 0.35 C. 0.65 D. 0.75
15. What is the probability that the demand will be at least one but not
more than four?
A.0.76 B. 0.71 C.0.51 D. 0.35

Lesson 1 Computing Probability


Corresponding to a Given Random
Variable
You have learned that the likelihood of winning in a lotto draw, number of winnings
in a gamble, number of heads that come out in a toss of a coin, and other game of
chance can be estimated using probability. But do you know that it is not only for
the game of chance? We also use this in business, economics, engineering, and other
real-life situations. A data needs to determine the probabilities to make decisions
and draw a conclusion. Most of the time, you won't perform actual
probability problems, but you'll use probability to make a judgment and determine
the best course of action.

What’s In
In your previous lesson, you have learned how to illustrate a probability distribution
of a discrete random variable. To understand this lesson, your knowledge of getting
the probability of an event is very important as you have learned from your previous
lesson in Mathematics. To find out if you are ready to learn in this new lesson,
determine the probability of the following.
Find the probability of the following events.

Event (E) Probability P (E)


a. Getting an odd number in a single roll of a die
b. Getting an ace when a card is drawn from a
deck
c. Getting a number greater than 2 in a single
roll of a die
d. Getting a red queen when a card is drawn
from a deck
e. Getting doubles when two dice are rolled
f. Getting a sum of 5 or a sum of 9 when two
dice are rolled
The probability for each event will be:
a. The event of getting an odd number has three outcomes: 1, 3, or 5. Since there
3 1
are 6 faces in a die, then the probability of getting an odd number is or .
6 2

53
b. The event of getting an ace when a card is drawn from a deck is 4. Since there
4
are 52 faces in a deck of cards, then the probability of getting an ace is or
52
1
.
13
c. The event of getting a number greater than 2 in a single roll of a die has four
outcomes: 3, 4, 5, and 6. Since there are 6 faces in a die, then the probability
4 2
of getting a number greater than 2 is or .
6 3
d. The event of getting a red queen when a card is drawn from a deck is 2. Since
there are 52 faces in a deck of cards, then the probability of getting a red
2 1
queen is or .
52 26
e. The event of getting doubles when two dice are rolled has 6 outcomes:
{(1,1),(2,2),(3,3),(4,4),(5,5),(6,6)}.Since there are 36 outcomes in rolling two
6 1
dice, then the probability of getting doubles is or .
36 6
f. The event of getting a sum of 5 when two dice are rolled has 4 outcomes:
4
{(1,4), (4,1),(2,3),(3,2)}.Hence, the probability is .The event of getting a sum
36
of 9 when two dice are rolled has 4 outcomes: {(4,5),(5,4),(3,6),(6,3)}. Hence,
4
the probability is .
36
The probability of getting a sum of 5 or a sum of 9 when two dice are rolled is P(sum
number of pairs with sums of 5 number of pairs with sums of 9
of 5 or sum of 9)= +
total number of outcomes total number of outcomes

4 4
= +
36 36

8 2
= or
36 9

What’s New
Golden’s bakery is known for its famous Filipino delicacies. Among these foods which
is a native delicious food called “kakanin” is a “leche puto”. The bakeshop owner
recorded the number of boxes of “leche puto” that were delivered each day. The
number of boxes delivered for 10 days is shown below.

Number of
Day
Boxes(X)
1 35
2 37
3 50
4 45
5 37
6 45
7 40
a. What is the probability that 40 or more boxes will
8 42
be delivered on a day?
9 45
b. What is the probability that the number of boxes
delivered will be at least 37 but not more than 50? 10 42

c. What is the probability that at most 40 boxes will be delivered on a particular


day?
d. Find P (X≤ 45)
e. Find P (40) + P (50)

54
What is It
In the previous module, you have learned how to illustrate a probability distribution
for a discrete random variable and its properties. Some experiments like tossing a
coin, throwing a die, and other activities consist of one or more outcomes. If each of
the outcomes is equally likely to occur, then the probability is:

number of outcomes in the event


P (E)=
number of outcomes in the sample space

To answer the given question, you have to construct first the probability distribution.
Let X the value of the random variable represented by the number of boxes of “leche
puto”. The probability distribution is shown below.
Number of Boxes X Probability P(X)
35 1
10
37 1
5
40 1
10
42 1
5
45 3
10
50 1
10
a. The probability that 40 or more boxes will be sold in a particular day means
P (X≥ 40).
This means that you have to add P(X=40), P(X=42), P (X= 45), and (X= 50).
P (X≥ 40) = P(40) + P(42) + P(45) + P(50)
then, substitute its corresponding probability
1 1 3 1
= + + +
10 5 10 10
1 2 3 1
= + + +
10 10 10 10
7
= or 0.7
10

b. The probability that the number of boxes delivered will be at least 37 but not
more than 50 means P (37≤ X< 50).
Hence, the values included are P(37), P(40), P42) and P(45), then substitute
its corresponding probability
P (37≤ X< 50) = P(37) + P(40) +P(42) + P(45),
1 1 1 3
= + + +
5 10 5 10
2 1 2 3
= + + +
10 10 10 10
8
=
10

55
4
= or 0.8
5

c. The probability that at most 40 boxes will be delivered in a particular


day means P (X≤ 40) so the values of X are P(40), P(37) and P(35)
P (X≤ 40) = P(40) + P(37) + P(35)
1 1 1
= + +
10 5 10
1 2 1
= + +
10 10 10
4
=
10
2
= or 0.40
5

d. Find P (X≤ 45) = P(45) + P(42) + P(40) + P(37) +P(35)

3 1 1 1 1
= + + + +
10 5 10 5 10

3 2 1 2 1
= + + + +
10 10 10 10 10

9
= or 0.90
10

Other solution:

P (X≤ 45) = 1-P(50)

1
= 1-
10

9
=
10

1 1
e. Find P (40) + P (50) = +
10 10

2
=
10

1
= or 0.10
5

What’s More
Independent Activity I
Number of Cell Phones
The number of cell phones sold per day at Gadgets Zone has the following probability
distribution.

Number of Cell Phones Probability P(X)


Sold X
0 0.15
1 0.10
2 0.25

56
3 0.30
4 0.20

Find:
1.P(X≥ 2)
2.P(x≤ 3)
3. P(1≤ 𝑋 ≤ 4)
4.P(2) + P(4)
Independent Activity II
Number of Absences

The monthly absences of a student based on his class adviser’s record are presented
in the probability distribution below.

X 0 1 2 3 4
1 3 3 1 1
P(X) 10 10 10 10 5

a. What is the probability that the number of absences is less than 3?


b. What is the probability that the number of absences is at least 2?
c. What is the probability that the number of absences is greater than 1 but
less than 4?

Independent Activity III


Bayanihan To Heal as One Act
In a certain barangay, the DSWD conducted a survey among the ages of senior
citizens who will receive cash assistance or the Social Amelioration Program (SAP).
This program is a cash emergency subsidy program for Filipino families whose lives
are greatly affected by the Enhance Community Quarantine (ECQ). It is mandated
by the new law, the “Bayanihan To Heal As One Act”, which was signed by the
President on March 25, 2020. The given data shows the probability distribution
among the ages of senior citizens.

Age X Probability P(X)


60 0.16
61 0.10
62 0.10
63 0.07
65 0.13
67 0.10
68 0.07
70 0.07
72 0.10
73 0.07
80 0.03
a. What is the probability that at least 65 years old senior citizen will receive
the SAP?
b. What is the probability that at most 70 years old senior citizen will receive
the SAP?

57
c. What is the probability that at least 60 years old but less than 70 years old
senior citizen will receive the SAP
What I Have Learned
Answer the following statements.
1. Specify the formula on how to compute the probability of a discrete
random variable.
__________________________________________________________________
2. What important skills did you develop in getting the probability?
___________________________________________________________________
3. Cite examples of an experiments or activities wherein we can apply the
computation of probabilities of a random variable.
___________________________________________________________________
4. In your own words, define experiment, outcomes and probabilities.
You can cite your own example.
___________________________________________________________________
What I Can Do
A pair of fair dice is rolled. Let X the random variable representing the sum of the
numbers that appear.

a. Construct the probability distribution of X for a pair of


dice.
b. Find P(X≥ 8)
c. Find P(X≤ 7)
d. Find the probability that X takes an even
value.
e. Find P(3 ≤X≤ 10)
Assessment
Multiple Choice. Choose the letter of the best answer. Write the chosen letter on a
separate sheet of paper.
1. The given table represents a probability distribution. What is P(1) +
P(4)?
X 1 2 3 4
P(X) 1 1 1 1
3 6 6 3

1 1
A. C.
6 2
1 2
B. D.
3 3
Refer to the given table in answering numbers 2-3.
In the experiment of tossing a coin twice, the following table shows the
probability distribution. What is P (X≤ 2)?

X 0 1 2
P(X) 1 1 1
4 2 4

58
2. What is P (X≤ 2)?
1 3
A. C.
4 4
1
B. D. 1
2
3. What is P(0) + P(2)?
1
A. 1 C.
2
3 1
B. D.
4 4
Refer to the given table in answering numbers 4-5.

Number of Tails (X) 0 1 2 3

P(X) 0.125 0.375 0.375 0.125

4. Find P (X ≤ 1)
A. 0.125 C. 0.625
B. 0.5 D. 0.875
5. Find P (X ≥ 1)
A. 0.125 C. 0.625
B. 0.5 D. 0.875
Refer to the given problem in answering numbers 6-8.
6. The random variable Y representing the number of nuts in a chocolate bar has
the following probability distribution. Find P (odd)
Y 0 1 2 3 4

P(Y) 1 3 3 2 1
10 10 10 10 10

3 1
A. C.
10 2
2 4
B. D.
5 5
7. Find P (0 ≤ Y ≤ 3 )
9 1
A. C.
10 2
4 3
B. D.
5 10
8. Find P (Y< 3 )
1 7
A. C.
5 10
3 9
B. D.
5 10
Refer to the given problem in answering numbers 9-10.
Two jars both contain the numbers 2, 4, 6, and 8. The following table shows the
probability distribution of the sum when one number from each jar is taken at a
time with replacement.

X 4 6 8 10 12 14 16
P(X) 1 1 3 1 3 1 1
16 8 16 4 16 8 16
9. Find P(6) + P(8)
9 5
A. C.
16 16
7 3
B. D.
16 16

59
10. Find the probability that X assumes a value greater than 12.
3 9
A. C.
8 16
3 5
B. D.
16 8
11. Find the probability that X assumes a value between 6 and 14.
3 9
A. C.
8 16
5 5
B. D.
16 8
Refer to the given problem in answering numbers 12-13.
The given table shows the probability distribution of a random variable Z which
represents the percentage of male students per section among Grade 11 senior high
school students.

Z 8 9 11 14 15 18
P(Z) 8% 8% 25% 17% 25% 17%
12. Find P(Z≥9)
A. 92% C. 59%
B.84% D.43 %
13. Find P(8< Z <15)
A. 84% C. 59 %
B.73 % D.50 %
Refer to the given problem in answering numbers 14-15.
The daily demand for copies of a magazine at a variety store has the probability
distribution as follows.

Number of Copies X Probability P(X)


0 0.10
1 0.25
2 0.30
3 0.16
4 0.05
5 0.14
14. What is the probability that three or more copies will be demanded
on a particular day?
A. 0.30 C. 0.65
B. 0.35 D. 0.75
15. What is the probability that the demand will be at least one but not
more than five?
A.0.76 C.0.51
B. 0.71 D. 0.43
Additional Activities
Analyze and solve the following problems:
1. Box A and B contain numbers 1,2 3, and 4. The following is the probability
distribution of the sum when one number from each box is taken at a time
with replacement.
X 2 3 4 5 6 7 8
P(X) 1 1 3 1 3 1 1
16 8 16 4 16 8 16
a. Find P(even)
b. Find P (X≥ 5)
c. Find the probability that X assumes a value of less than 4.
d. Find the probability that X assumes a value greater than 6.
e. Find the probability that X assumes a value between 3 and 8.

60
2. The probabilities that a customer buys 5,6,7, 8, and 9 items in a
convenience store has the following probability distribution.
X 5 6 7 8 9
P(X) 0.25 0.12 0.28 0.15 0.20
a. Find P(X≤ 8)
b. Find P(5≤ X < 8)
c. What is the probability that a customer will buy at least 6 items?
d. What is the probability that a customer will buy at most 7
items?

61
Statistics and
Probability
Computing Probability
Corresponding to a Given
Random Variable
Statistics and Probability
Computing Probability Corresponding to a Given Random Variable
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module


Writer: Annabele L. Laraquel
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Nenita N. De Leon and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Edna E. Eclavea
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

62
Week

2
What I need to know?
This module was created and written with you in mind on how to illustrate
the mean and variance of a discrete random variable. This will also help you analyze
real-life situated problems statistically in terms of relevant questions for you to better
understand them. Your adept at analysis will help you appreciate the richness, and
beauty of Statistics which will motivate you to apply to similar events and create
statistical measures of your own. From this module, you will also learn how to use
the given illustration to determine the mean, variance, and standard deviation of the
discrete random variables.

Your patience in analyzing figures and illustrations offered here in the module
will help you upgrade your good pattern recognition skills as it tackles appropriate
culture-based situated problems. Your ability to analyze, reason–out, and make a
judgment or even decision out of statistical measures will also be practiced here. The
extent of this module permits it to be used in many different learning situations. The
language used recognizes the diverse vocabulary level of students. The lessons are
arranged to follow the standard sequence of the course. But the order in which you
read them can be changed to correspond with the textbook you are now using.
The module focused on illustrating the mean and variance of a discrete
random variable.
After going through this module, you are expected to:
1. learn the important concepts of mean and variance of a discrete random
variable; and
2. illustrate the mean and variance of a discrete random variable.
What know
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. Which of the following terms is considered as a measure of the `central location'
of a random variable?
A. Probability Value
B. Mean Value
C. Numerical Value
D. Variance
2. Which of the following notations is equivalent to the mean of the probability
distribution?
A. 𝜎
B. ∞
C. 𝜇
D. None of the Above
3. Which of the following notations is equivalent to the expected value of the
probability distribution?
A. 𝐸(𝑋)
B. 𝜀
C. 𝜎
D. 𝑃(𝑥)
4. Which of the following statements is TRUE about the standard deviation of a
discrete random variable?
A. It shows the spread out or dispersion of discrete random variables.
B. It is obtained by multiplying the x values and their corresponding probability.
C. It deals with the average or center of location of the probability distribution.

63
D. It is the summation of the product of the square of the difference of x and its
probability.

5. Use the illustration below. What is the mean of the distribution?


A. 4
B. 6
C. 7
D. 8

1 2 3 4 5 6 7 8 9 10

6. Which of the following statements is TRUE?


A. The values further away from 𝜇𝑥 have a small probability.
B. The values further away from 𝜇𝑥 have a large probability.
C. The values closer from 𝜇𝑥 don’t have equal probability.
D. The values closer from 𝜇𝑥 have equal probability.
7. Which term can be displayed the variability or the dispersions of the random
variables.
A. Probability Value
B. Expected Value
C. Mean Value
D. Variance
8. What formula is used to find the variance of a discrete random variable?

A. σ_x^2=∑▒〖x∙p(x)〗; for all possible values of x


B. 𝜎𝑥2 = ∑(𝑥 + 𝜇)2 ∙ 𝑝(𝑥); for all possible values of x
C. 𝜎𝑥2 = ∑(𝑥 − 𝜇)2 ∙ 𝑝(𝑥); for all possible values of x
D. 𝜎𝑥2 = ∑(𝑃(𝑥) + 𝜇)2 ∙ 𝑥 ; for all possible values of x

9. Which formula is appropriate to use in finding the mean of a discrete random


variable?

A. E(x) = μx = ∑ x ∙ p (x)
B. E(x) = μx = ∑ x + p (x)
C. E(x) = μx = ∑ x − p (x)
D. E(x) = μx = ∑ x ∙ p (x)2
For items 10 -11, refer to the diagram below.

1 2 3 4 5 6 7 8 9 10

64
10. What is the value of μx ?
A. 4
B. 5
C. 6
D. 7
11 How dispersed are the scores from the mean?
A. 4
B. 5
C. 6
D. 7
For items 12 -13, refer to the diagram below.

0
1 2 3 4 5 6 7 8 9 10 11 12 13 14

12. What is the value of μx ?


A. 5
B. 7
C. 9
D. 11
13. How dispersed are the scores from the mean?
A. 4
B. 6
C. 7
D. 8
For items 14 -15, refer to the diagram below.

12 24 36 48 60 72

14. What is the value of μx ?


A. 24
B. 36
C. 48
D. 60
15. How dispersed are the elements from the mean?
A. 12
B. 24
C. 36
D. 48

65
Lesson 1 Illustrating Mean and Variance of
Discrete Random Variable
As claimed by some research, every individual spent an average time of ten
hours on their mobile phone. What does it mean? What is the average value that he
or she spent in using a mobile phone? Does it imply that everyday every person
spends ten hours of his day dealing with a gadget? Or does it imply that an individual
goes through ten hours every day on a cell phone? How is the time spent by different
persons vary from one another? These questions lead to another idea in probability
and statistics. This is the mean and the variance of the discrete random variable.
This module is all about illustrating and determining the average of a discrete
random variable. Likewise, obtaining how the element is being spread away from the
mean. The term average is the mean or the expected value or the expectation in
probability and statistics. Once we have calculated the probability distribution for a
random variable, we can illustrate its expected value. The mean of a random variable
shows the location or the central tendency of the random variable.

What is In
Let’s have this first for a review.
Suppose you tossed a fair coin twice. Let X be the number of heads that are observed.
a. Construct the probability distribution of X.
b. Find the probability that at least one head is observed.

Solution:
a. The possible values that X can take are 0, 1, and 2. Each of these numbers
corresponds to an event in the sample space S = {HH, HT, TH, TT} of equally likely
outcomes for this experiment:
For X = 0 the corresponding sample space will be {TT};
For X = 1 the corresponding sample space will be {HT, TH} and
For X = 2 to {HH}.

The probability of each of these events, hence of the corresponding value of X,


can be found simply by counting, as it shows in the table of probability distribution
below:

X 0 1 2

P(X) 0.25 0.5 0.25

b. At least one head is the event X ≥ 1, which is the union of the mutually exclusive
events X = 1 and X = 2. Thus
P(X≥1) = P(1) + P(2) = 0.50 + 0.25 = 0.75
Therefore, the probability that at least one head is 0.75

66
What’s New
Let’s Roll a Die!
Read and analyze the situation given below:
During Town Fiesta, people used to go to Carnival that most folks call it
“Perya”. Mang Ben used to play “Beto–beto” hoping that he would win.
While he is thinking about what possible outcomes in every roll would be, he
is always hoping that his bet is right.
Let us help Mang Ben to know the probabilities of each outcome by filling the
table below.

X 1 2 3 4 5 6

P(X) 1/6 1/6 1/6 1/6 1/6 1/6

Based on the data above, observe, analyze, and answer the following
questions:

1. Is the probability of x lies between 0 and 1?


________________________________________________________________________
2. What is the sum of all probabilities of X?
________________________________________________________________________
3. Is there a negative probability? Is it possible to have a negative
probability?
________________________________________________________________________
4. How will you illustrate the average or mean of the probabilities of discrete
random variable?
________________________________________________________________________
5. Can you show a process where you can illustrate the variance and the standard
deviation of the probability?
________________________________________________________________________
Answer:

1. Yes, the probability of X lies between 0


and 1.
2. The sum of all probabilities of X is
exactly 1.
3. No negative probabilities because it is
impossible to have it based on the
characteristic of the probability of
discrete random variables.
4. The mean is 3.5
If you are wondering how we came up
with the answer for questions 4 and 5,
you will understand everything as we go
on in our discussion

67
What is It
If you roll a fair die several times, what will be the average outcome? Imagine
rolling it 6000 times. You would expect to roll about 1000 ones, 1000 twos, and so
on: about 1000 occurrences of each possible outcome. What would be the average
value of the outcomes obtained? Approximately, the mean or expected value would
be;
(1000 ∗ 1) + (1000 ∗ 2) + ⋯ + (1000 ∗ 6) 21 000
= = 3.5
6000 6000
It only implies that the weighted average of the six possible outcomes 1, 2,…,
6 with weights provided by the relative frequencies. Remember that 3.5 isn’t a value
that we can observe if we will do the experiment.
By similarity with information and relative frequencies, we can characterize
the mean of a discrete random variable by utilizing probabilities from its dispersion,
as follows.
1. The mean is considered as a measure of the `central location' of a random
variable. It is the weighted average of the values that random variable X can take,
with weights provided by the probability distribution.
2. The Expected Value or Mean Value of a discrete random variable x is can be
computed by first multiplying each possible x value by the probability of observing
that value and then adding the resulting quantities. Symbolically,
𝑬(𝑿) = 𝝁𝒙 = 𝑋1 ∗ 𝑃(𝑋1 ) + 𝑋2 ∗ 𝑃(𝑋2 ) + ⋯ + 𝑋𝑛 ∗ 𝑃(𝑋𝑛 ) = ∑ 𝑿 ∗ 𝑷(𝑿)
Example1: Suppose a random variable X that has the discrete uniform distribution
with possible values 1, 2,…, n. The mean or expected value is presented by
𝑛 𝑛
1 1 1 𝑛(𝑛 + 1) 𝑛 + 1
𝐸(𝑋) = 𝜇𝑥 = ∑ 𝑥 ∗ ( ) = ∑ 𝑥 = ∗ =
𝑛 𝑛 𝑛 2 2
𝑥=1 𝑥=1
Since the possible values are 1, 2,…, n when rolling a die a fair of die the mean
6+1 7
is = = 𝟑. 𝟓. This 3.5 is the expectation only. That’s the reason why in question 4,
2 2
we come up to 3.5 as its mean value.
So, doing the experiment which is rolling a single die several times and getting
average or mean of the total possible outcomes, we, therefore, conclude that the
average tends to be close to 3.5. This also implies that the more rolls we do, the closer
the average will be to 3.5.
Example 2.

0.1669500

0.1669000

P(X) 0.1668500

0.1668000

0.1667500

0.1667000

0.1666500

0.1666000
1 2 3 4 5 6 X

68
From the figure above, it shows the probability distribution for the possible
outcome in rolling a fair die. Notice that they have equal probabilities or chances to
occur. This distribution is symmetric and the mean is 3.5 which found in the middle
of the distribution, it is in the axis of symmetry.
We have noticed that the mean of a random variable X is a measure of the
central location of the distribution of X. If we are summarizing highlights of the
distribution of X, it is evident that location is not the only significant aspect
If the example above complicated you, a simpler one is provided for you.

Example 3. Suppose you want to find the average number of apples sold in any five
grocery stores as indicated by the figure below.

Analyzing the illustration above, then the mean or expected value is 11. If you
are wondering why this happens you may also follow with these processes:
1. Separate 8 elements on the left side of the mean then get the average of those a
(3, 4, 5, 6, 7, 8, 9, 10) which is equal to 6.5
2. Do the same on the right side of mean (12, 13, 14, 15, 16, 17, 18, and 19) which
is equal to 15.5.
3. Get the average value of the numbers 6.5, 11, and 15.5. It will be equal to 11 also
As a whole, we can generate a physical representation and interpretation of
the discrete random variable X with its probability P(X). Assume that the x-axis is an
unbounded see-saw in both directions, and we put weights corresponds to
probability P(X) at each associated value(x) of random variable X. Therefore, the mean
(𝝁𝒙 ) is the point where the see-saw balance. In other words, it is the center of weights
of the observation.
In the off chance, that value of X close to its mean 𝜇𝑥 are likely and values
further away from 𝜇𝑥 have small probability, at that point, the distribution of X will
be firmly concentrated around 𝜇𝑥 . For this situation, the spread of the distribution
of X is little. On the contrary, if the expected value of X some distance from its mean
𝜇𝑥 are likely, the spread of the distribution of X will be sizeable.
These concepts lead the way to the most important measure of spread,
the variance, and a closely related measure of variability, the standard deviation.
You probably have met the concepts of variance and standard deviation when
summarizing data in your Junior High School Mathematics. These were the sample
variance and the sample standard deviation. The difference here is that we are
referring to properties of the distribution of a discrete random variable.
The variance of a random variable displays the variability or the dispersions
of the random variables. It shows the distance of a random variable from its mean.
The Variance of a Discrete Random Variable X, denoted by 𝝈𝟐𝒙 is computed
by first subtracting the mean from each possible x value to obtain the deviations,
then squaring each deviation and multiplying the result by the probability of the
corresponding x value, and then finally adding these quantities.
Definition: If X is a random variable with mean 𝐸(𝑋) = µ𝑥 , then the variance of X is
defined by,

𝑽𝒂𝒓(𝑿) = 𝝈𝟐 𝒙 = ∑ (𝒙 − 𝝁𝒙 )𝟐 ∙ 𝑷(𝒙)
𝒂𝒍𝒍 𝒑𝒐𝒔𝒔𝒊𝒃𝒍𝒆
𝒗𝒂𝒍𝒖𝒆𝒔 𝒐𝒇 𝒙

69
Remember that if the value of the variance is small, then the values of the random
variable are close to the mean.
The Standard Deviation of x, denoted by 𝝈𝒙 , is the square root of the
Variance. In symbol,

𝜎𝑥 = √𝑽𝒂𝒓(𝑿) or 𝝈𝒙 = √𝝈𝒙 𝟐
Example 4. Let us consider the same histogram above. Illustrate the variance and
standard deviation.
To be able to determine the element spread away from the mean, we can do
this by adding this procedure in the illustration given.

0.1669500 3 3

0.1669000

P(X) 7
0.1668500 4 10

0.1668000

0.1667500

0.1667000

0.1666500

0.1666000
1 2 3 4 5 6
X

Since we are comparing two elements this time (1 and 3 on the left side of the
mean and 4 and 6 in the right., the 𝜇𝑥 3.5 will be doubled which became 7. Adding
1 and 3 elements gives us 4, similarly on the right side adding 4 and 6 gives us 10.
Get the average value of 4 and 10 which resulted in 7 which is our new 𝜇𝑥 . Subtract
7 by 4 which is 3, likewise subtract 10 by 7 which resulted in 3. This implies that
the distance of the element from the mean in either direction is 3 which describes
the spread of the elements in the observation.
And this is how we come up with the answer in question under What’s More.
Example 5. Consider this figure.
3

0
1 2 3 4 5 6 7 8 9 10

70
a. Illustrate the mean.
b. Compare the variance and standard deviation.
Solution:
a. Since the data is symmetric along the x-axis, so the mean 𝜇𝑥 is equal to
6. (You may verify the 𝜇𝑥 value using the 3 steps provided in the previous
example)
b. To illustrate the variance and standard deviation without having
computation, study the diagram below.
5 5
7 7
7 12 17

2 5 7 10

0
1 2 3 4 5 6 7 8 9 10
Figure A. 𝜇𝑥
Since we are comparing two elements this time (2 and 5 on the left side of the
mean and 7 and 10 in the right, the 𝜇𝑥 is doubled which became 12. Adding 2 and 5
elements gives us 7, similarly on the right side adding 7 and 10 gives us 17. Get the
average value of 7 and 17 which resulted in 12 which is our new 𝜇𝑥 . Subtract 12 by
7 which is 5, likewise subtract 17 by 12 which also resulted in 5. This implies that
the distance of the element from the mean in either direction is 5 which describes
the spread of the elements in the observation.
Example 6. Compare the variance and standard deviation of this distribution.
4 4

16
8 12
4

3 9
3

2 5 7

0
1 2 3 4 5 6 7 8 9 10
Figure B 𝜇𝑥

71
Following the same procedure just like in example 4, for figure A, the 𝜇𝑥 is 6.
Now let’s compare 3 and 5 on the left side of the mean and 7 and 9 in the right.
Adding 3 and 5 which is 8 and on the right side add 7 and 9 which results in 16. The
average of 8 and 16 is 12 which happens to be our new mean. Subtract 8 from 12
which is 4, similarly subtract 12 from 1 wherein the result is also 4. This shows that
the distance of the element from the mean in either direction is 4 which expresses
the spread of the elements in the observation.
Example 7. Analyze and observe the figure below
6 6

6 12 18

2
2 4 8 10
1

0
1 2 3 4 5 6 7 8 9 10

Figure C 𝜇𝑥

Again, since we are comparing two elements this time (2 and 4 on the left side
of the mean and 8 and 10 in the right, the 𝜇𝑥 is doubled which became 12. Adding 2
and 4 elements give us 6, similarly, on the right side adding 8 and 10 gives us 17.
Get the average of 6 and 18 which also results in 12 which is our new 𝜇𝑥 . Subtract
12 by 6 which is 6, likewise subtract 18 by 12 which also results in 6. This suggests
that the distance of the element from the mean in either direction is 6 which
describes the spread of the elements in the observation.
This time let us compare the variance and standard deviation of the three
given illustrations. Because Figure A has a spread value of 5, Figure B has 4 and
Figure C has spread value of 6 in either direction then we can have this comparison;
𝜎𝐵 < 𝜎𝐴 < 𝜎𝐶
Which illustrations show less spread out? Then we can easily answer it by
simply observing the figure we generated. Figure B is less dispersion since it spread
by 4 in either direction. Moreover, which illustrations shows the farthest spread out?
Looking back to the three illustrations we have, then the farthest spread out is Figure
C since the element dispersed 6 away from the mean.
In general, unlike the mean, there is no simple direct interpretation of the
variance or standard deviation. The variance is undifferentiated to the moment of
inertia in physics, but that is not necessarily widely understood by students. What
is important to understand is that, in relative terms:
 a small value of standard deviation (or variance) means that the dispersion of
the random variable is narrowly concentrated around the mean.

72
 a large value of standard deviation (or variance) means that the distribution
is spread out, with some possibility of observing values at some distance
from the mean.

What’s More
Independent Activity: Study and analyze.

5 10 15 20 25 30 35 40

1. Figure above shows the number of polo shirt sold by 5 different RTW boutiques.
a. Illustrate the mean. (2 points)
b. Illustrate the variance and standard deviation. (5 points)

2. From the figure below,


a. Illustrate the mean or expected value. (2 points)
b. Illustrate the variance and standard deviation. (5 points)
3

0
1 2 3 4 5 6 7 8 9 10 11 12 13 14

2. Compare the variance and standard deviation of the two figures provided.

0
1 2 3 4 5 6 7 8 9 10 11 12 13 14

Figure A

73
3

0
1 2 3 4 5 6 7 8 9 10 11 12 13 14

Figure B

What I Have Learned


1. The mean a measure of the `central location' of a random variable. It is the
weighted average of the values that random variable X can take, with weights
provided by the probability distribution.
2. The Expected Value or Mean Value of a discrete random variable x is can be
computed by first multiplying each possible x value by the probability of
observing that value and then adding the resulting quantities. In symbol,
𝑬(𝑿) = 𝝁𝒙 = 𝑋1 ∗ 𝑃(𝑋1 ) + 𝑋2 ∗ 𝑃(𝑋2 ) + ⋯ + 𝑋𝑛 ∗ 𝑃(𝑋𝑛 ) = ∑ 𝑿 ∗ 𝑷(𝑿)
3. The Variance of a Discrete Random Variable X, denoted by σ_x^2 is computed
by first subtracting the mean from each possible x value to obtain the
deviations, then squaring each deviation and multiplying the result by the
probability of the corresponding x value, and then finally adding these
quantities.
4. The formula in determining the variance of a discrete random variable is,
𝑽𝒂𝒓(𝑿) = 𝝈𝟐 𝒙 = ∑ (𝒙 − 𝝁𝒙 )𝟐 ∙ 𝑷(𝒙)
𝒂𝒍𝒍 𝒑𝒐𝒔𝒔𝒊𝒃𝒍𝒆
𝒗𝒂𝒍𝒖𝒆𝒔 𝒐𝒇 𝒙
5. Mean, variance, and standard deviation can be illustrated by looking pattern
and analyzing given illustrations and diagrams.
6. A small value of standard deviation (or variance) means that the dispersion of
the random variable is narrowly concentrated around the mean.
A large value of standard deviation (or variance) means that the distribution is spread
out, with some possibility of observing values at some distance from the mean

What I Can Do
Family Budget
Things to do:

1. Create a Table of Expenses of your family in a week. (Ask help from


your Mother)
2. List the number of expenses for each day.
3. Create a graphical representation of your data gathered.
4. Base on the graphical representation you made, illustrate for the
following:

74
a. mean or expected value; and
b. variance and standard deviation.

5. Share your output to the Class Group Chat through Image or Video
Presentation.

TASK CRITERIA
Accuracy of the Data Gathered 50%
Clarity and content of the visual 25%
representation
Originality and creativity of the 25%
Presentation
TOTAL 100%

Assessment
Multiple Choice. Choose the letter of the best answer. Write the chosen letter on a
separate sheet of paper.
1. Which of the following use the formula of (𝑥) = ∑ 𝑥 ∙ 𝑝 (𝑥) ?
A. Probability Distribution
B. Variance of Discrete Probability Distribution
C. Standard Deviation of Discrete Probability Distribution
D. Mean or Expected Value of Discrete Probability Distribution
2. Among the notations below, which is equivalent to 𝐸(𝑥)?
A. 𝜎𝑥2
B. 𝜎𝑥
C. 𝜇𝑧
D. 𝑃 (𝑋)
3. Which of the following statements best describe the expected value of a discrete
random variable?
A. It is the simple average of all possible outcomes.
B. It is the geometric average of all possible outcomes.
C. It is the weighted average over all possible outcomes.
D. It is the complex average of all possible outcomes in the distribution.
4. To which of the following concepts refer this statement “the sum of the product of
each value of a discrete random variable and its corresponding probability”?
A. Probability Distribution
B. Variance of Discrete Probability Distribution
C. Standard Deviation of Discrete Probability Distribution
D. Mean or Expected Value of Discrete Probability Distribution
5. Use the illustration below. What is the mean of the distribution?

10 20 30 40 50 60 70
A. 20
B. 30
C. 40
D. 50

75
6. Which of the following statement is TRUE?
A. A small value of standard deviation (or variance) indicates that the distribution
of the random variable is concentrated narrowly around the mean
B. A negative value of standard deviation (or variance) indicates that the
distribution of the random variable is concentrated narrowly around the mean.
C. The values closer from 𝜇𝑥 have equal probability.
D. The values closer from 𝜇𝑥 have equal probability.
7. Which term is calculated by summing the product of the square of the difference
between the value of the random variable and the expected value, and the
associated probability of the value of the random variable, taken over all of the
values of the random variable, and finally taking the square root?
A. Probability Distribution
B. Variance of Discrete Probability Distribution
C. Standard Deviation of Discrete Probability Distribution
D. Mean or Expected Value of Discrete Probability Distribution
8. What another notation can be used for a variance?
A. E(X)
B. P(X)
C. 𝑃 (𝑋)
D. Var(X)
9. If the variance of a probability distribution is 2.6 grams, then what proper way to
do to get the standard deviation?
A. √2.6
B. (2.6)2
C. 1.61 + 2.6
D. 1.61 – 2.6
For items 10 -11, refer to the diagram below.

0
1 2 3 4 5 6 7 8 9 10 11 12 13 14

10. What is the value of 𝜇𝑥 ?


A. 4
B. 5
C. 6
D. 7
11. How dispersed are the scores from the mean?
A. 5
B. 6
C. 7
D. 8

76
For items 12 -13, refer to the diagram below.

3 6 9 12 15 18 21
12. What is the value of 𝜇𝑥 ?
A. 6
B. 9
C. 12
D. 15
13. How dispersed are the elements from the mean?
A. 9
B. 12
C. 15
D. 18
For items 14 -15, refer to the diagram below.

0
1 2 3 4 5 6 7 8 9 10 11 12 13 14

14. What is the value of μx ?


A. 5
B. 7
C. 9
D. 11
15. How dispersed are the scores from the mean?
A.5
B. 7
C. 9
D. 11
Additional Activities
Study and analyze the figures below.

3 3

2 2

1 1

0 0
1 2 3 4 5 6 7 8 9 10 1 2 3 4 5 6 7 8 9 10

77
Figure A Figure B

0
1 2 3 4 5 6 7 8 9 10

Figure C
A. Compare the means of the three figures.
B. Show the relationship between the variance and standard deviation among
three given figures.

78
Statistics and
Probability
Calculating Mean and Variance
of a Discrete Random Variable
Statistics and Probability
Calculating Mean and Variance of a Discrete Random Variable
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module


Writers: Annabele L. Laraquel
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Nenita N. De Leon and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

79
Week

2
What I Need to Know
This module was created and written with you in mind on how to calculate the mean
and variance of a discrete random variable. This will also help you analyze real-life
situated problems statistically in terms of relevant questions for you to better
understand them. Your adept at analysis will help you appreciate the richness, and
beauty of Statistics which will motivate you to apply to similar events and create
statistical measures of your own. From this module, you will also learn how to
determine the value of the mean, variance, and standard deviation of the discrete
random variables, and the purpose of the author for a better understanding of the
story.

Your patience in solving problems here in the module will help you upgrade your
computational skills as it tackles appropriate culture-based situated problems. Your
ability to explain, reason–out, and make a judgment or even decision out of statistical
measures will also be practiced here. The extent of this module permits it to be used
in many different learning situations. The language used recognizes the diverse
vocabulary level of students. The lessons are arranged to follow the standard
sequence of the course. But the order in which you read them can be changed to
correspond with the textbook you are now using.

The module focused on calculating the mean and variance of a discrete random
variable.

After going through this module, you are expected to:


1. apply the important concepts of mean and variance of a discrete random
variable; and
2. calculate the mean and variance of a discrete random variable.

What I Know
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. Which of the following is NOT included in the process of calculating the
mean of the discrete random variable X?
A. Identify the correct probabilities for each x value.
B. Multiply each x value by its probability.
C. Get the summation of the product.
D. Get the square root of the product.
2. Which of the following is TRUE about the value of the mean of a discrete
random variable?
A. Mean Value is always equal to 1
B. Mean Value cannot be negative.
C. Men value is equal to the expected value
D. Mean, Variance, and Standard Deviation are equal.
3. To determine the expected value of the discrete random variable which
processes should be done?
A. Get the squared sum of the difference of each value of a discrete
random variable its probability.
B. Get the summation of the difference of each value of a discrete random
variable and its probability.

80
C. Get the summation of the product of each value of a discrete random
variable and its probability.
D. Get the square root of the summation of the product each value of a
discrete random variable and its probability.
4. What can we generate if we take the squares of standard deviation?
A. Expected Value
B. Mean Value
C. Probability value
D. Variance
5. In tossing a coin, what are the possible values of the random variable X?
A. 0 B. 0,1 C. 1, 2 D. 0, 1, 2
6. If the variance of a probability distribution is 2.6 grams, what is the standard
deviation?
A. 1.61 B. 1.16 C. 1.06 D. 1.01
For items 7-10, refer to the scenario provided below.

Juan is not a very smart student. When he tries to answer a multiple-choice


question, he used to guess everything. Suppose, he is taking a quiz that has two
multiple choice questions on it and that each question has four possible answers
only one of which is correct. Let X represent the number of correct answers on
the quiz.
7. What are the possible values of the random variable X?
A. 0 B. 0, 1 C. 1, 2 D. 0, 1, 2
8. Which probability distribution is an appropriate presentation of the given
scenario?
A. x 0 1 2
P(X) 0.5625 0.375 0.0625

B. x 0 1 2
P(X) 0.0625 0.5625 0.375
C. x 0 1 2
P(X) 0.375 0.0625 0.0625

D. x 0 1 2
P(X) 0.5625 0.0625 0.375

9. What is the expected value of the discrete probability distribution?


A. 0.5 B. 0.4 C. 0.05 D. 0.04
10. What is the probability that Juan gets at least 1 correct answer?
A. 0.0625 B. 0.375 C. 0.4375 D. 0.5625
For items 11-12, refer to the scenario and table provided below.
In a recent Barangay Basketball League, each player went to free throws 2 times.
The number of free throws made by each player is described by the following
probability distribution.

Number of free throws, Probability, P(X)


x
0 0.20

1 0.45

81
2 0.35

11. What is the mean of the probability distribution?


A. 1.00 B. 1.15 C. 2.00 D. 2.25
12. What is the probability that both free throws will be out of the basket?
A. 0.20 B. 0.45 C. 0.35 D. 1.00
For items 13-15, refer to the scenario and table provided below
The number of qualified voters living in the household on a randomly selected
Subdivision block is described by the following probability distribution.

Number of qualified voter/s, x 1 2 3 4

Probability, P(X) 0.25 0.50 0.15 0.10

13. What is the mean of the probability distribution?


A. 1.0 B. 1.8 C. 2.0 D. 2.1
14. What are the variance and standard deviation respectively of the probability
distribution?
A. 0.99 and 0.50
B. 0.89 and 0.62
C. 0.79 and 0.89
D. 0.80 and .088
15. What is the probability that less than 3 votes will be in any household?
A. 0.25 B. 0.50 C. 0.75 D. 1.00
Lesson 1 Calculating Mean and Variance of a
Discrete Random Variable
If you took an examination, do you sometimes think of where would be your score
lie? Would it be below or above the average score? Afterward, you would like also to
know how far you are from the average score. Finally, you would ask yourself what
is the meaning of those locations from the center or mean of the probability
distribution? If you are asking these questions, this learning module is designed for
you.
In this module, we can calculate and solve the average of a discrete random variable.
The term average is the mean or the expected value or the expectation in probability
and statistics. Once we have calculated the probability distribution for a random
variable, we can calculate its expected value. The mean of a random variable shows
the location or the central tendency of the random variable.
In this module, you will understand the process on how to accurately determine the
mean value and variance including the standard deviation of a discrete random
variable. You may find this module as a piece of good fortune in acquiring brand new
learning.

What’s In
Let’s have this first for a review.
The number of COVID -19 patients seen in the ER in any given hour is a random
variable represented by X. The probability distribution for X is presented below.

82
P(X)
0.5
COVID-19 Patients
0.4

0.3

0.2

0.1

0 X
15 16 17 18 19

1. Find the probability that in a given hour:


a. exactly 17 patients arrive;
b. at least 16 patients arrive; and
c. at most 18 patients arrive.
2. What is the average number of COVID-19 patients who arrived in the ER in
an hour?
3. Illustrate the mean and standard deviation of the data gathered.
Solution:
1, a. 𝑃(𝑋 = 17) = 0.20
b. 𝑃(𝑋 ≥ 16) = 0.2 + 0.2 + 0.1 + 0.1) = 0.6
c. 𝑃(𝑋 ≤ 18) = 0.4 + 0.2 + 0.2 + 0.1) = 0.9
2. 𝜇𝑥 =17
3. It shows the hat element spread 3 away from the mean which
implies its variability.

3 3

31 34 37

P(X)
0.45 15
0.4
16
0.35
0.3 18 19
0.25
0.2
0.15
0.1
0.05
0 X
15 16 17 18 19

𝜇𝑥

83
What’s New
Time to Check
Read and analyze the situation given below:
Mr. Umali, a Mathematics teacher, regularly gives a formative assessment composed
of 5 multiple-choice items. After the assessment, he used to check the probability
distribution of the correct responses, and the data is presented below:

Test Item 𝑿 Probability 𝑷(𝑿)


0 0.03
1 0.05
2 0.12
3 0.30
4 0.28
5 0.22
Based on the data above, observe, analyze and answer the following questions:
1. What is the summation of the probabilities of the given random variable?
_________________________________
2. Why is it that the probability of random variable x less than 1 but
nonnegative value?
3. What is the average or mean of the given probability distribution?
_______________________________________________________________.
4. What are the values of the variance and the standard deviation of the
probability distribution? _______________________________________.
Answer:
1. The summation of the probabilities of the given random variable is equal to
2. The probability of the discrete random variable X lies between 0 and 1, so it
is impossible to have a negative probability.
3. 𝐸(𝑋) = 𝜇𝑥 = 3.41
4. 𝑉𝑎𝑟(𝑋) = 𝜎 2 𝑥 = 1.5819
If you are wondering how we came up with the answer for questions 4 and 5, you
will understand everything as we go on in our discussion.

What is It
In the previous module, mean or expected value characteristics had been presented,
even variance and standard deviation of a discrete random variable.
Expected Value or Mean Value is the sum of the products of each possible value of
a random variable and that value’s probability. In symbol,

𝑬(𝑿) = 𝝁𝒙 = 𝑋1 ꞏ 𝑃(𝑋1 ) + 𝑋2 ꞏ 𝑃(𝑋2 ) + ꞏꞏꞏ + 𝑋𝑛 ꞏ 𝑃(𝑋𝑛 ) = ∑ 𝑿 ꞏ 𝑷(𝑿)

So, in what way can we use the concept of expected value to calculate the mean and
variance of a probability distribution?
Using the previously given data, multiply each x value by its probability and add the
results to get the mean (𝜇 x ) or expected value(𝑬(𝑿)).

84
Test Item, Probability, 𝑿 ∗ 𝑷(𝑿)
𝑿 𝑷(𝑿)

0 0.03 0
1 0.05 0.05
2 0.12 0.24
3 0.30 0.9
4 0.28 1.12
5 0.22 1.1

𝐸(𝑋) = 𝜇𝑥 = ∑ 𝑥 ∙ 𝑃(𝑋)

= [(0)(0.03) + (1)(0.05) + (2)(0.12) + (3)(0.30) + (4)(0.28) + (5)(0.22)]


𝑬(𝑿) = 𝝁𝒙 = 𝟑. 𝟒𝟏
So, the answer to question 3 can now be stated. It indicates that the expected value
or mean of the random variable is 3.41.
Now that you have calculated the mean or the average marks in the five tests of
Mathematics. It will be convenient this time to determine the difference in the score
of each item from the mean or expected value. This difference in marks shows the
variability of the possible values of the random variable.
Let’s recall the formula in finding the variance of a discrete random variable x,

𝑽𝒂𝒓(𝑿) = 𝝈𝟐 𝒙 = ∑ (𝒙 − 𝝁𝒙 )𝟐 ∙ 𝑷(𝒙)
𝒂𝒍𝒍 𝒑𝒐𝒔𝒔𝒊𝒃𝒍𝒆
𝒗𝒂𝒍𝒖𝒆𝒔 𝒐𝒇 𝒙

Remember that if the value of the variance is small, then the values of the random
variable are close to the mean.
The Standard Deviation of x, denoted by 𝝈𝒙 , is the square root of the Variance. In
symbol,

𝜎𝑥 = √𝑽𝒂𝒓(𝑿) or 𝝈𝒙 = √𝝈𝒙 𝟐
Again, let’s use the same data above, variance and standard deviation of the given
random variable can be presented in this manner. The formula for 𝑉𝑎𝑟(𝑋) says to
take a weighted average of the squared distance to the mean. By squaring, remember
that we are averaging only non-negative values so that the spread to the right of the
mean won’t cancel that to the left. By using expected value, we are weighting high
probability values more than low probability values. So, the table will now look like
this.

Test Item, Probability 𝒙 ∙ 𝑷(𝑿) 𝒙− 𝝁 (𝒙 − 𝝁)𝟐 (𝒙 − 𝝁)𝟐 ∙ 𝑷(𝑿)


𝒙 𝑷(𝑿)
0 0.03 0 -3.41 11.6281 0.3488
1 0.05 0.05 -2.41 5.8081 0.2904
2 0.12 0.24 -1.41 1.9881 0.2386
3 0.30 0.90 -0.41 0.1681 0.0504
4 0.28 1.12 0.59 0.3481 0.0975
5 0.22 1.10 1.59 2.5281 0.5562

85
𝑬(𝑿) ∑(𝒙 − 𝝁)𝟐 ∙ 𝑷(𝑿)
= 𝝁𝒙
= 𝟑. 𝟒𝟏 = 𝟏. 𝟓𝟖𝟏𝟗

We can also do this process.

𝑽𝒂𝒓(𝑿) = 𝝈𝟐 𝒙 = ∑(𝒙 − 𝝁)𝟐 ∙ 𝑷(𝑿)

(0 − 3.41)2 (. 03) + (1 − 3.41)2 (. 05) + (2 − 3.41)2 (0.12) + (3 − 3.41)2 (0.30)


=[ ]
+(4 − 3.41)2 (0.28) + (5 − 3.41)2 (0.22)
𝑽𝒂𝒓(𝑿) = 𝝈𝟐 𝒙 = 𝟏. 𝟓𝟖𝟏𝟗

The Variance is 1.5819, and the Standard Deviation is √𝟏. 𝟓𝟖𝟏𝟗 , and it is
equivalent to 𝝈𝒙 = 1.26
Therefore, the correct answers for question 4 are 1.5819 and 1.26 respectively.
For uniformity of the answers, expected value or mean and standard deviation are
both expressed into two decimal places while variance is up to four decimal places
to lessen rounding error.
Take note on units:
1. 𝜎𝑥 has the same units as X.
2. 𝑉𝑎𝑟(𝑋) or 𝝈𝟐 𝒙 has the same units as the square of X. So, if X is in
meters, then 𝑉𝑎𝑟(𝑋) is in meters squared.

Since 𝜎𝑥 and X have the same units, the standard deviation is a natural
measure of spread. In some ways, the standard deviation is the more tangible of the
two measures, since it is in the same units as X.

Let us take this example,

Suppose that a coin is tossed twice so that the sample space is S = {𝐻𝐻, 𝑇𝐻,
𝐻𝑇, 𝑇𝑇}. Let X represent the “number of heads that can come up”, Based on the
prepared discrete probability distributions of the random variable X below, calculate
the mean, variance, and standard deviation.

Outcome or Sample HH HT TH TT
Point
𝒙 2 1 1 0

Applying the concepts from the activity given above, let us complete the table below.

𝒙 𝑷(𝑿) 𝒙 ∙ 𝑷(𝑿) 𝒙− 𝝁 (𝒙 − 𝝁)𝟐 (𝒙 − 𝝁)𝟐 ∙ 𝑷(𝑿)


0 ¼ or 0.25 0 -1 1 0.25
1 ½ or 0.5 0.5 0 0 0
2 ¼ or 0.25 0.5 1 1 0.25

𝝁𝒙 = ∑ 𝒙 ∙ 𝑷(𝑿) = 𝟏 ∑(𝒙 − 𝝁)𝟐 ∙ 𝑷(𝑿) = 𝟎. 𝟓𝟎


The expected value or mean is 1.
The Variance is 0.50, and
The Standard Deviation is √𝟎. 𝟓0, and it is equivalent to 𝝈 = 0.71.

86
We have discovered that the Expected Value E(X) is only the average or mean (µ𝑥 ) of
random variable X. It's occasionally called a "weighted average" because more
frequent values of X are weighted more highly in the average. It is additionally how
we anticipate that X should behave on-average as time goes on. The mean of a
random variable X is a measurement of the central location of the distribution of X.
If we are summarizing features of the distribution of X, it is clear that location is not
the only relevant feature. The second most important component is the spread of the
distribution.
Again, the variance of a discrete random variable X is defined by

where the sum is taken over all values of x for which P(x) > 0 but less than 1. So, the
variance of X is the weighted average of the squared deviations from the mean μ,
where the weights are given by the probability function P(x) of X.

The standard deviation of X is defined to be the square root of the variance of X.


That is,

Because of this definition, the variance of X is often denoted by σ2x.

In another way around, the standard deviation is the more tangible of the two
measures, since it is in the same units as X. For example, if X is a random variable
measuring length in cm, then the standard deviation is in centimeter(𝑐𝑚), while the
variance is in 𝑐𝑚2 .

Remember that with respect on units:


1. σ has the same units as X.
2. Var(X) has the same units as the square of X. So, if X is in meters, then
Var(X) is in meters squared. Because σ and X have the same units, the
standard deviation is a natural measure of spread.

Always remember that the variance cannot be negative, because it is an average of


squared quantities. This is appropriate, as a negative spread for distribution does
not make sense. Hence, 𝝈𝟐 𝒙 ≥ 𝟎 and 𝝈𝒙 (𝑿) ≥ 𝟎.
What’s More
Independent Activity 1: Study and analyze
The number of shoes sold per day at a retail store is shown in the table below.
Illustrate the mean, variance, and standard deviation of this distribution.
X 19 20 21 22 23

P(X) 0.4 0.2 0.2 0.1 0.1

87
Independent Assessment 1: Fill me in and solve
Write all the necessary formula and show the complete solution.
Formula to be used:
a. Mean ________________________
b. Variance_____________________
c. Standard Deviation___________
Solution:

Independent Activity 2: Study and analyze

The number of patients seen in the Emergency Room in any given hour is a random
variable represented by x. The probability distribution for x is:

X 10 11 12 13 14

P(X) 0.4 0.2 0.2 0.1 0.1

Independent Assessment 2: Fill me in and solve


Write all the necessary formula and show the complete solution.
Formula to be used:
a. Mean __________________________
b. Variance_______________________
c. Standard Deviation_____________
Solution:
Independent Activity 3: Study and analyze
Suppose that a coin is to be tossed four times, and let X represent “the number of
TAILS that can come up”. Find the mean, variance, and standard deviation of this
distribution.

X 0 1 2 3 4

P(X) 1 4 1 6 3 4 1 1
𝑜𝑟 𝑜𝑟 𝑜𝑟
16 16 4 16 8 16 4 16
Independent Assessment 3: Fill me in and solve
Write all the necessary formula and show the complete solution.
Formula to be used:
a. Mean ________________________
b. Variance_____________________
c. Standard Deviation___________
Solution:
What I Have Learned
1. The formula for calculating Mean or Expected value is _______________________
2. I have discovered that the Expected Value E(X) is only the_______________ of
random variable X. It's occasionally called a _______________ because more
frequent values of X are weighted more highly in the average.
3. The variance of a discrete random variable X is defined _____________________
4. The standard deviation of X is defined to be the square root of the variance of
X. In symbol it expressed into ________________________________

88
5. The variance cannot be ___________, because it is an average of squared
quantities. Hence, 𝝈𝟐 𝒙 ≥ 𝟎 and 𝝈𝒙 (𝑿) ≥ 𝟎.
What I can do
LET’S DO FAMILY PLANNING!
Things to do:
1. Create a short script as if you are with your dream partner in life and you both
talking about Family Planning.
2. During your planning, specify in your decision how many girls (G) and boys
(B) in the family you both want to have. Consider also your random variable
X stand for.
3. After your planning, construct a probability distribution of whatever decision
you had.
4. Based on the probability distribution table, illustrates the following:
a. Mean or Expected Value;
b. Variance; and
c. Standard Deviation.
5. Share your output to the Class Group Chat through Image or Video
Presentation.

TASK CRITERIA
Accuracy of the solution 50%
Clarity and content of the script 25%
Originality and creativity of the 25%
Presentation
TOTAL 100%

Assessment
Multiple Choice. Choose the letter of the best answer. Write the chosen letter on a
separate sheet of paper.
1. What formula is appropriate to use in calculating the expected value?
A. 𝐸(𝑥) = ∑ 𝑥 ∙ 𝑝 (𝑥)
B. 𝐸(𝑥) = ∑ 𝑥 − 𝑝 (𝑥)
C. 𝐸(𝑥) = ∑ 𝑥 ∙ 𝑝 (𝑥)2
D. 𝐸(𝑥) = ∑ 𝑥 /𝑝 (𝑥)
2. Which among the list of formulas should be used to solve for the variance of
discrete random variables of the given data above?
A. 𝜎𝑥2 = ∑(𝑥 + 𝜇)2 ∙ 𝑃(𝑥); for all possible values of x
B. 𝜎𝑥2 = ∑(𝑥 − 𝜇)2 ∙ 𝑃(𝑥); for all possible values of x
C. 𝜎𝑥2 = ∑ 𝑥 ∙ 𝑃(𝑥); for all possible values of x
D. 𝜎𝑥2 = ∑(𝑃(𝑥) + 𝜇)2 ∙ 𝑥 ; for all possible values of x
3. Which of the following statements does not describes the value of the
standard deviation?
A. A small standard deviation (or variance) means that the distribution
of the random variable is narrowly concentrated around the mean.

89
B. A large value of standard deviation (or variance) indicates that the
distribution is spread out, with some chance of observing values at
some distance from the mean.
C. Standard deviation is obtained by getting the square root of the
variance.
D. It is obtained by squaring the variance.
4. What can we generate if we take the summation of the product of each value
assigned to the random variable and its corresponding probability?
A. Expected Value
B. Probability value
C. Standard Deviation
D. Variance
5. Suppose that a coin is to be tossed four times, and let X represent “the
number of TAILS that can come up”, what are the possible values of the
random variable X?
A. 0 B. 0, 1 C. 1, 2,3 D. 0, 1, 2 ,3, 4
6. Which of the following is the correct value of standard deviation (𝜎) of a
discrete random variable X if the variance is 2.5754?
A. 1.61 B. 1.60 C. 1.06 D. 1.01
For items 7-9, refer to the scenario and table provided below.
Let X be a random variable defining number of students getting 95 above grade.

X 0 1 2 3
P(X) 0.2 0.1 0.4 0.3
7.
7. What is the expected value of X from the given table?
A. 1.3 B. 1.8 C. 1.9 D. 2.3
8. What is the variance (𝜎𝑥 ) of the given probability distribution?
2

A. 1.16 B. 1.18 C. 1.20 D. 1.28

9. Which of the following value is appropriate for deviation (𝜎𝑥 ) of the given
probability distribution?
A 1.02 B. 1.04 C. 1.06 D. 1.08
10. A Grade 11 – HUMMS researcher surveyed the households in Brgy. Quipot,
Tiaong Quezon. The random variable X represents the number of college
graduates in the households. The probability distribution of X is shown
below.

X 0 1 2
P(X) 0.25 0.5 0.25
Find the values of variance and standard deviation.
A. 0.3 and 0.51 respectively
B. 0.4 and 0.61 respectively
C. 0.5 and 0.71 respectively
D. 0.6 and 0.81 respectively
11. In the 50 items test, Miss Santos, a Mathematics teacher claimed that most
of the students’ scores lie closer to 40. In this situation, which of the following
terms parallel to the score of 40?
A. Variance
B. Standard Deviation
C. Expected Value or Mean
D. Median

90
For items 12-15, refer to the scenario and table provided below.
In her Flower Shop, Vera recorded the probability distribution for the number of
flower arrangements that she delivered each day.

X 7 8 9 10 11
P(X) 0.20 0.20 0.30 0.20 0.10

12. What is the probability that she made at least 9 flower arrangements?
A. 0.60 B. 0.50 C. 0.40 D. 0.30
13. What is the mean or expected value of the given probability distribution?
A. 8.5 B. 8.6 C. 8.7 D. 8.8
14. Which of the following is the appropriate value for the variance?
A. 1.55 B. 1.56 C. 1.57 D. 1.58
15. What value corresponds to the standard deviation?
A. 1.25 B. 1.26 C. 1.27 D. 1.28
Additional Activities
1. Willie works in an automotive tire factory. The number X of sound but
blemished tires that he produces on a random day has the probability
distribution.

x 2 3 4 5
P(X) 0.48 0.35 0.10 0.07

a. Find the probability that Leomar will produce more than three blemished
tires.
b. Find the probability that Leomar will produce at most four blemished
tires.
c. Calculate the mean, variance, and standard deviation of a discrete random
variable.

2. The Land Bank of the Philippines Manager claimed that each saving account
customer has several credit cards. The following distribution showing the
number of credits cards people own.

x 0 1 2 3 4
P(X) 0.18 0.44 0.27 0.08 0.03

Show the complete table of values in calculating the Mean, Variance, and
Standard Deviation.

91
Statistics and
Probability
Interpreting the Mean and the
Variance of a Discrete Random
Variable
Statistics and Probability
Interpreting the Mean and the Variance of a Discrete Random Variable
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module


Writer: Elizabeth D. Lalunio
Editors: Celestina M. Alba and Nestor Sandoval
Reviewers: Irlo L. Dumo, Nenita N. De Leon and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Edna E. Eclavea
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

92
Week

3
What I Need to Know
This module was designed and written with you in mind. It is here to help you
understand why we need to find the mean and variance of a discrete random variable
and how to interpret it. The language used recognizes the diverse vocabulary level of
students. The lessons are arranged to follow the standard sequence of the course.
But the order in which you read them can be changed to correspond with the
textbook you are now using.

This module has only one lesson which focuses on interpreting the mean and
variance of a discrete random variable. Your knowledge and skill on how to use the
formula for the mean, variance, and standard deviation that you learned from
previous modules are pre-requisite skills. Accurate computations will yield accurate
interpretations. In case that there will be mistakes in the calculations of mean,
variance, and standard deviation, interpretation of the data will be affected. And if
we interpret the result based on the miscalculations, our tendency to make a not so
wise decision is higher. So, to be successful in this module, always review your
computations before you make interpretations. Expect also that some of the practice
exercises and problems included here need your patience and determination because
it requires long computations. It will teach you the virtues of being patient and
striving for excellence. Bear in mind that you need to review first your solutions to
arrive with accurate interpretations. So, are you now ready? Let us start your journey
today.

After going through this module, you are expected to:


1. interpret the mean and variance of a discrete random variable; and
2. recall and apply the formulas for the mean, variance, and standard deviation
of a discrete random variable.

What I Know
Before studying this module, take this test to determine what you already know
about the topic covered.
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. What do you call a random variable with possible values that form a finite or
countable set?
A. continuous C. finite
B. discrete D. infinite
2. What term is used to describe the average value of a discrete random variable
over numerous trials of an experiment?
A. mean C. standard deviation
B. probability D. variance
3. Which of the following represents the amount of spread, dispersion, or variability
of the items in a distribution?
A. mean or expected value
B. median and mode
C. outcomes and probability distribution
D. variance and standard deviation
4. Which of the following is also equal to the square root of the variance?
A. mean C. probability

93
B. median D. standard deviation
5. How would you interpret a very small variance or standard deviation but not equal
to zero?
A. The values of the random variables are farther from the mean.
B. The values of the random variables are nearer to the mean.
C. The values of the random variables are equal to the mean.
D. The values of the random variables have no relationship with the mean.
6. Which of the following shows most likely the largest possible variance or
variability?
A. number of girls in a randomly selected three-child family
B. number of newborn babies per minute.
C. number of Oreo cookies inside a 133 grams pack from different
branch of 711 stores
D. number of patients who are positive with COVID-19 in different
hospitals
7. Which of the following data shows most likely the smallest possible variance or
variability?
A. number of books in different branch of National Bookstore
B. number of books inside a pack bag of grade 11 students
C. number of books inside the library of different universities
D. number of books inside the library of different households
8. What formula is described by 𝜎 2 = ∑[𝑥 2 𝑃(𝑥)] − 𝑢2 ?
A. the mean of a discrete random variable
B. the variance of a discrete random variable
C. the standard deviation of a discrete random variable
D. the expected value of a discrete random variable
For numbers 9-12, refer to the probability distribution of rolling a single
unfair die.
𝑥 1 2 3 4 5 6
𝑃(𝑥) 0.1 0.1 0.1 0.5 0.1 0.1
9. What is the mean of the probability distribution?
A. 2.5 B. 3.7 C. 4.1 D. 5.7
10. What is the variance of the probability distribution?
A. 1.81 B. 2.34 C. 3.70 D. 4.26
11. What is the standard deviation of the probability distribution?
A. 1.07 B. 1.35 C. 1.92 D. 2.06
12. How would you interpret the mean value that you get in item number 9?
A. The mean value is the difference between each probable value of the
outcome when you roll the unfair die in numerous trials.
B. The mean value is the higher probable value of the outcome when you roll
the unfair die in numerous trials.
C. The mean value is the lowest probable value of the outcome when you
roll the unfair die in numerous trials.
D. The mean value is the closest value to the most probable value of the
outcome when you roll the unfair die in numerous trials.
13. Which of the following is NOT a property of the variance?
A. The variance is not equal to the standard deviation.
B. A small variance means that the distribution of the random variable is
narrowly concentrated around the mean.
C. A large variance means that the distribution is spread out, with some chance
of observing values at some distance from the mean.
D. The value of the variance is less than zero.

94
For numbers 14-15. The mean of the probability distribution below is equal to 18.2
with a variance of 5.86 and a standard deviation of 2.42.

Number of cellphones sold per


day in a retail store (𝑥) 15 18 19 20 22

Probability (𝑃(𝑥) 0.30 0.20 0.20 0.15 0.15


14. How would you interpret the mean value of 18.2?
A. The least number of cellphones that will be sold in a day is 18 pieces.
B. The highest number of cellphones that will be sold in a day is 18 pieces.
C. The average number of cellphones that will be sold in a day is 18 pieces.
D. No interpretation can be made about the mean value of 18.2.

15. How would you interpret the values of variance and standard deviation?

A. It gives the difference between the highest number of cellphones and the least
number of cellphones that can be sold.
B. It is the average number of cellphones that can be sold in a day.
C. It describes how the data or the number of cellphones sold in a day varies.
D. No interpretation can be made about the variance and standard deviation.
Lesson 1 Interpreting the Mean and Variance
of a Probability Distributions
Decision-making is an important skill that each individual should acquire. It is a
must that you should be wise when making decisions. In real life, we always
encounter situations where we have to choose the best option to arrive with the right
decision. Thinking critically and logically will help you to decide on things wisely.
Considering that you might become a CEO of a company, a professional, a father or
a mother, surely this lesson will help you to decide on future financial situations.
Interpreting the mean and variance of probability distributions will give you the idea
on how to weigh things, for you to choose the right option.

What’s In
What’s the formula?
The following formulas were introduced to you in the previous lessons, write the
correct formula in each box to complete this task.
Formula 1: 𝜇𝑥 = ∑[𝑥𝑖 ∙ 𝑃(𝑥)]
Formula 2: 𝜎 2 = ∑[𝑥 2 ∙ 𝑃(𝑥)] − 𝑢2
Formula 3: 𝜎 2 = ∑(𝑥 − 𝑢)2 𝑃(𝑥)

Formula 4: 𝜎 = √[𝑥 2 ∙ 𝑃(𝑥)] − 𝑢2


Situation 1: I am thinking of finding the average number of heads that would appear
if I will toss two coins together twice. Which formula should I use?

95
Situation 2: I went to the grocery store and I noticed that there are five different kinds
of brown sugar at different prices, each brand has different probabilities that they
will be sold. What formula should I use if I want to determine how the prices vary
from the average price of the product that would be sold to the customers repeatedly?

Solution:
Situation 1: The correct formula is 𝜇𝑥 = ∑[𝑥𝑖 ∙ 𝑃(𝑥)], because it is the formula to be
used if you want to find the mean of a discrete random variable.

Situation 2: The correct formula is 𝜎 = √[𝑥 2 ∙ 𝑃(𝑥)] − 𝑢2 , because it is the formula to


be used if you want to find the standard deviation of a discrete random variable.
Even though formulas 2 and 3 (formulas to find the variance) measures also the
variability of the values assumed by the random variable X (prices of brown sugar) in
repeated trials (sold to the customers repeatedly). The units of standard deviation
match those with X because standard deviation is a square root of the variance.

What’s New
An Unfair Die
Suppose an unfair die is rolled and let X be the random variable representing the
number of dots that would appear with a probability distribution below.

Outcome 1 2 3 4 5 6
(𝑥)
Probability 0.1 0.1 0.1 0.5 0.1 0.1
P(𝑥)
Questions:
1. What will be the average number of dots that would appear?
2. How does the assumed value of the outcome vary from the average number
of dots that would appear?
3. Will you join in a game of chance using an unfair die?
What Is It
Recall that in your previous lesson, you learn how to calculate the mean, variance,
and standard deviation of a discrete random variable. So, with those skills, you will
be able to answer the questions in the previous activity.
First, let us find the mean of the probability distribution using the formula.

𝜇𝑥 = ∑[𝑥𝑖 ∙ 𝑃(𝑥)]

𝜇𝑥 = 1(0.1) + 2(0.1) + 3(0.1) + 4(0.5) + 5(0.1) + 6(0.1)


𝜇𝑥 = 3.7
Therefore, the mean of the probability distribution is 3.7. Since the mean is equal to
3.7, we can say that the average number of dots that would appear is a number near

96
to 3.7 which is 4. If we will repeat the experiment over and over, there is a higher
possibility the 4 dots will always appear since the average number of dots that would
appear is 4.
To determine the variability of the assumed values from the mean, use the formulas
for finding variance and standard deviation. But first, construct a table like the one
below:

𝑥 𝑃(𝑥) 𝑥 ∙ 𝑃(𝑥) 𝑥2 𝑥 2 ∙ 𝑃(𝑥)


1 0.1 0.1 1 0.1
2 0.1 0.2 2 0.4
3 0.1 0.3 9 0.9
4 0.5 2 16 8
5 0.1 0.5 25 2.5
6 0.1 0.6 36 2.6

∑[𝑥 2 ∙ 𝑃(𝑥)] = 15.5

Then use the formulas, thus

𝜎 2 = ∑[𝑥 2 ∙ 𝑃(𝑥)] − 𝑢2 = 15.5 − (3.7)2 = 1.81

𝜎 = √𝜎 2 = √1.81 = 1.35
Therefore, the variance of the random variable X (the number of dots appeared) is
equal to 1.81 while the standard deviation is equal to 1.35. Take note that, small
variance or standard deviation means that the assumed values or data points tend
to be very close to the mean, while higher variance or standard deviation means that
the assumed values or data points are spread out from the mean. Specifically, the
variance and standard deviation measures or describes how far a set of data
(assumed values of random variables) is spread out. Since the value of the standard
deviation is 1.35, we can say that the assumed values of each outcome are somewhat
close to the mean for about 1.35 units from the mean.
Knowing this information, you may now decide whether you will join a game using
an unfair die. Make sure you will think more critically and logically before you make
a decision and remember being fair and just is a virtue.
Bear in mind that the mean of a discrete random variable is just the average value
of a random variable over repeated trials of experiment and in the next lesson it will
be discussed further and its application to the real-life situation.
To understand more the concept of the variance and standard deviation, the following
are examples that will give you a deeper idea about the concept of variance and
standard deviation.
The first data in the following examples is somewhat expected to have a higher
variance and standard deviation or variability compare to the other data.
Example 1
The population size of 100 different countries around the world compare to the
population of 10 countries belong to the first world countries

97
Due to a large number of samples (100 different countries) and other factors such as
land area, economic status, etc., you may say that most likely you will get a higher
variance and standard deviation compared to the population of 10 countries belong
to the first world countries
Example 2
The number of students in different universities compares to the number of students
in a class in a particular school.
Considering different universities with different size and course offerings, you may
say that the difference between their enrolments will be much higher than the
difference of each class size in a particular school
Example 3
The number of people who love to play basketball in different barangays to the
number of basketball players from different teams who play in the first-quarter game.
Basketball is a famous game and almost every guy loves to play this game. Imagine
that you will gather the number of people who love to play from different barangays,
it is expected that you will gather different results compared to the number of
basketball players from different teams who play in the first-quarter game. Almost
everybody is aware that there are only five players from each team to play a game,
therefore if you will compute the variance of the data from different teams you will
get a value of zero. The mean will be equal to 5 because each team who play the game
will should have 5 players only, at the same time if you will compute the variance of
distribution with equal inputs you will get a value of zero.
The mean and variance of a discrete random variable are easy to understand if you
know the concepts behind these two measurements. As you go on with the different
modules in Statistics and Probability, mean and variance will always be included in
the discussion. The mean, variance, and standard deviation are important in
statistics because they serve as the basis for other types of statistical calculations or
statistical tests.

What’s More
In the following activities, read each situation carefully to solve each problem.
Write your answer on a separate sheet of paper.
Practice Activity 1
The number of cellular phones sold per day at the E-Cell Retail Store with the
corresponding probabilities is shown in the table below. Compute the mean,
variance, and standard deviation and interpret the result.
Number of cellphones sold
per day in a retail store (𝑥) 15 18 19 20 22

Probability (𝑃(𝑥) 0.30 0.20 0.20 0.15 0.15

Solution:
Complete the statement:

𝜇𝑥 = ∑[𝑥𝑖 ∙ 𝑃(𝑥)] = 15(0,30) + 18(0.20) + 19(0.20) + 20(0.15) + 22(0.15) = _______

98
The mean is equal to ____________ therefore, it means that the average number
of cellular phones of sold per day is ________________.
To find the variance complete the table below:

𝑥 𝑃(𝑥) 𝑥 ∙ 𝑃(𝑥) 𝑥2 𝑥 2 ∙ 𝑃(𝑥)


15 0.30 4.5
18 0.20 3.6
19 0.20 3.8
20 0.15 3
22 0.15 3.3

∑[𝑥 2 ∙ 𝑃(𝑥)] = ___________

𝜎 2 = ∑[𝑥 2 ∙ 𝑃(𝑥)] − 𝑢2 = _______ − ________ = ___________

𝜎 = √𝜎 2 = __________ = ___________
Therefore, the variance of a probability distribution is equal to ___________, while
the standard deviation is equal to _________________.
Independent Activity 1
Beth’s Bread and Pastry Shop determines the number of cupcakes sold per day with
its corresponding probabilities. Find the mean, variance, and standard deviation of
the probability distribution below. If Beth, the owner of the shop is claiming that the
average number of cupcakes sold in a day is 150 pieces, do you think it is a believable
claim?
Number of cupcakes
sold per day (𝑥) 90 120 135 150 160 175

Probability 𝑃(𝑥) 0.15 0.10 0.20 0.20 0.20 0.15


Practice Activity 2
Analyze the following pairs of data and identify which of the following will most likely
yield to a higher variance and higher standard deviation. Put a check mark on the
appropriate box.

1. Number of students from different grade levels.

Number of boys in a family with three children.

2. Number of fish inside the aquarium from different households.

Number of fish inside a can from the different local brands of sardines.

3. Number of COVID-19 patients from different hospitals.

Number of family members with fever in a barangay.

4. Number of M & M’s peanuts inside a 1.69 0z bag from different retail
stores.

99
Number of kernels in corn of different sizes.

5. Number of players in a group playing the game “the boat is sinking.”

Number of passengers in different luxury cruise ships.


Independent Activity 2
Give examples of at least two pairs of data that will most likely yield different variance
and standard deviation. Identify the one with a higher possible value of variance and
explain why.
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
______________________________________________________

What I Have Learned


Complete the following statements by writing the correct word or words.
1. The mean of a discrete random variable is interpreted as the ______________ value
of a random variable over repeated trials of an experiment.
2. The variance and standard deviation of a discrete random variable measured or
described the ____________________ of the assumed values of the random variable
to the mean.
3. A small variance or standard deviation means that the assumed values or data
points tend to be ___________________to the mean.
4. A higher variance or standard deviation means that the assumed values or data
points are __________________ from the mean
What I Can do
Oh, what a game!
Things to do:
Device a game of chance using a fair die with corresponding winnings and losses.
For example, in rolling a die you may assign even outcomes as winning outcomes
and the odd outcomes as the losses or think of your own plan. Let X the amount to
be won by a player or the amount he will lose when he plays your game. Compute
for the mean or the average amount a player may win or lose, the variance and the
standard deviation, and interpret the results. Don’t forget that each face of a die has
1
a probability of . Be prepared to describe your game and demonstrate it in our
6
class next meeting or you may prepare a video of your demonstration to be
submitted online.
Rubrics for the task:

4 3 2 1
Categories Excellent Satisfactory Developing Beginning

Organization of The The The The


Game Plan mechanics of mechanics of mechanics of mechanics of
the game is the game is the game is the game is
clear and clear and somewhat not not clear.

100
enjoyable to enjoyable to clear and
play with a play with no shows
twist. It also twist. It also concepts of
shows shows mean and
concepts of concepts of variance
mean and mean and
variance. variance.

Accuracy of the The There is one There is one There are


Solutions computations mistake in the mistake in more
are accurate computations the mistakes with
with a table of with a table of computations no table of
probability probability with no table probability
distributions. distributions. of probability distributions.
distributions.

Demonstration The The The The


demonstration demonstration demonstratio demonstrator
accurately shows some n did not is not
shows how to missing work well prepared to
play the game. details on how because the demonstrate
to play the audience how to play
game. cannot follow the game.
the rules of
the game.

Assessment
Multiple Choice. Choose the letter of the best answer. Write the chosen letter on a
separate sheet of paper.
1. Which of the following is an example of a discrete random variable?
A. weight of newborn babies
B. body temperature of COVID-19 patients
C. number of heads that will come out if you toss a coin twice
D. height of basketball players
2. Which of the following best describe the mean of a discrete random variable?
A. It is the lowest assumed value of a discrete random variable.
B. It is the highest assumed value of a discrete random variable.
C. It is the average value of a discrete random variable over numerous trials of
an experiment.
D. It is the amount of spread, dispersion, or variability of the assumed value of a
discrete random variable.
3. Which of the following best describe the variance and standard deviation of a
probability?
A. It is the lowest assumed value of a discrete random variable.
B. It is the highest assumed value of a discrete random variable.
C. It is the average value of a discrete random variable over numerous trials of
an experiment.
D. It is the amount of spread, dispersion, or variability of the assumed value of a
discrete random variable.

101
4. Which of the following best describe the standard deviation of a probability
distribution?
A. It is twice the variance.
B. It is the product of the mean and the variance.
C. It is the ratio of the mean and the variance.
D. It is the square root of the variance.
5. How would you interpret a very small variance or standard deviation?
A. The values of the random variables are equal to the mean.
B. The values of the random variables are closer to the mean.
C. The values of the random variables are farther from the mean.
D. The values of the random variables have no relationship with the mean.
6. Which of the following data show most likely the largest possible variance
or variability?
A. number of pieces of French fries in a regular pack from different orders of
customers at McDonalds
B. number of boys in families of three-children
C. number of customers per hour who went shopping at SM Super Malls
D. number of heads that will appear if two coins are tossed together
repeatedly
7. Which of the following data show most likely the smallest possible
variance or variability?
A. the number of passengers in a tricycle per destinations
B. the number of applicants in the different job opening
C. the number of families who own a private vehicle in different cities in NCR
D. the number of adults who use public restrooms in Metro Manila
8. What formula is described by 𝜎 = √∑[𝑥 2 𝑃(𝑥)] − 𝑢2 ?
A. the mean of a discrete random variable
B. the variance of a discrete random variable
C. the standard deviation of a discrete random variable
D. the expected value of a discrete random variable
For numbers 9 -12, refer to the probability distribution of the number of books
borrowed from a school library in a day and its corresponding probabilities.
𝑥 20 25 30 35 40 45
𝑃(𝑥) 0.1 0.1 0.4 0.2 0.1 0.1
9. What is the mean of the probability distribution?
A. 25 B. 29 C. 30 D. 32
10. How would you interpret the mean value that you get from item number 8?
A. It is the least number of books borrowed from the school library in a day.
B. It is the largest number of books borrowed from the school library in a day.
C. It is the average number of books borrowed from the school library in a day.
D. It is the difference between the largest and the least number of books borrowed
from the school library in a day.
11. What is the variance of the probability distribution?
A. 38 B. 40 C. 43 D. 46
12. What is the standard deviation of the probability distribution?
A. 6.16 B. 6.32 C. 6.56 D. 6.78
13. Which of the following is NOT a property of the variance?
A. A small variance means that the distribution of the random variable is
narrowly concentrated around the mean.
B. A large variance means that the distribution is spread out, with some chance
of observing values at some distance from the mean.
C. The variance is a value that is always positive.
D. The variance is a value that is always negative.

102
For numbers 14-15. The mean of the probability distribution below is equal to
37.05 with a variance of 36.75 and a standard deviation of 6.06.

Number of ice
candy sold per
30 32 36 40 42 45
day in a retail
store (𝑥)
Probability (𝑃(𝑥) 0.30 0.10 0.15 0.10 0.10 0.25
14. How would you interpret the mean value of 37.05?
A. The least number of ice candy that will be sold in a day is 37 pieces.
B. The highest number of ice candy that will be sold in a day is 37 pieces.
C. The average number of ice candy that will be sold in a day is 37 pieces.
D. No interpretation can be made about the mean value of 37.05.
15. If you are the owner of the retail store, how many ice candies will you
prepare to ensure that you can supply the demands of your customers
every day?
A. 10 pieces and below C. 21-30 pieces
B. 11-20 pieces D. 30 pieces and above
Additional Activities
This activity is design for you to gain a deeper understanding of the mean,
variance, and standard deviation. It will not just help you to practice your skill but
it will give you a chance to compare the results with theoretical explanations.
Try it and enjoy it!
Things to do:
Roll a die 50 times. Record the result and construct a probability distribution to
find the mean, variance, and standard deviation. Compare the results with
theoretical results:
Take Note of This:
The probability distribution of rolling a die is given below, this will serve as your
theoretical result.
Outcome (𝑥) 1 2 3 4 5 6
Probability of the 1 1 1 1 1 1
Outcome
𝑃(𝑥) 6 6 6 6 6 6

103
Statistics and
Probability
Solving Problems Involving Mean
and Variance of Probability
Distributions
Statistics and Probability
Solving Problems Involving Mean and Variance of Probability Distributions
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module


Writer: Elizabeth D. Lalunio
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Nenita N. De Leon and Ma. Theresa B. Yanto
Illustrator: Kelvin A. Ramintas and Dhojie N. Yanto
Layout Artist: Edna E. Eclavea
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

104
Week

3
What I Need to Know
This module was intended and written focusing on learners like you. It is here to help
you apply the mean or expected value of the discrete random variable or the average
value when the experiment is repeated many times to real-life situations such as
taking a chance in a raffle, making a bet in games of chance and considering an
insurance policy. It is hoped that this module will help you to make decisions
whether or not to participate in games of chances like lotteries, raffle draws, and
gambling.
In this module, you will learn how to compute the probability or the chance of
winning and losing. Your knowledge and skill on how to use the formula for the mean
and variance that you learned from previous modules are important, for you to solve
real-life problems involving the mean and variance of probability distributions. Good
luck and happy learning.

After going through this module, you are expected to:


1. calculate the mean and the variance of a discrete random variable;
2. solve real-life problems involving mean and variance of probability
distributions; and
3. interpret the expected value of a discrete random variable.

What I Know
Before studying this module, take this test to determine what you already know
about the topic covered.
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.

1. Which of the following represents the mean value of a discrete random variable?
A. Expected value C. Standard deviation
B. Median D. Variance
For numbers 2-3, refer to the table below:
𝑥 2 4 6 8
𝑃(𝑥) 0.12 0.29 0.24 0.35
2. Which of the following is the correct formula to find the expected value of the
probability distribution?
A. 𝐸(𝑥) = ∑[𝑥 ∙ 𝑃(𝑥)] C. 𝐸(𝑥) = ∑[𝑥 2 ∙ 𝑃(𝑥)] − 𝑢2
B. 𝐸(𝑥) = ∑[𝑥 2 ∙ 𝑃(𝑥)] D. 𝐸(𝑥) = √∑[𝑥 2 ∙ 𝑃(𝑥)] − 𝑢2
3. What is the mean or the expected value of the given probability distribution?
A. 4.85 B. 5.64 C. 6.32 D. 7.12
For numbers 4-6, refer to the following:
You buy one ₱500.00 raffle ticket for a prize of new Toyota Wigo car valued at ₱675,
000.00. Two thousand tickets are sold.
4. What is the probability that you will win the prize?
1 1 1
A. 1 B. C. D.
500 2000 675000
5. If X denotes the net gain from the purchase of a randomly selected ticket.
Which of the following tables of values shows the probability distribution
of X?
x 675,000 -500

105
A. P(x) 1 1999 C. x 674,500 -500
2000 2000 P(x) 1 1999
2000 2000
x 675,000 500 x 674,500 500
B. V P(x) 1 1999 D. P(x) 1 1999
2000 2000 2000 2000

6. What is the mean or the expected value for those who will buy the tickets?
A. -162.50 B. -162.25 C. 162.25 D. 162.50
For numbers 7-8, refer to the following:
One thousand tickets are sold for ₱10.00 each. One ticket will win ₱2,000.00, two
tickets will win ₱1,000.00 each and four tickets will win ₱500.00 each.
7. What is the probability of winning any amount in the purchase of two tickets?
1 2 5 7
A. 𝐵. C. D.
1000 1000 1000 1000

8. What is the probability of losing in the purchase of two tickets?


2 500 998 999
A. 1000 𝐵.
1000
C.
1000
D.
1000
For numbers 9-11, refer to the following:
A roulette wheel in a fiesta carnival has the numbers 1 through 50. If you bet ₱5.00,
you will have a chance to win a kitchen utensil worth ₱200.00.
9. If X denotes the net gain from taking a chance to bet on a randomly selected
number. Which of the following tables of values shows the probability distribution
of X?
A. x 195 -5 C.
x 50 -5
P(x) 1 49 P(x) 1 199
50 50 200 200
B. x 5 -5 D. x 200 5
P(x) 1 4 P(x) 1 5
5 5 200 200
10. What is the mean or the expected value for those who take a bet?
A. – 1.8 B. -1 C. -0.95 D. -0.5
11. How much the organizer will earn if 100 games will be played such that each
number has a bet of ₱5.00?
A. ₱2,000.00 B. ₱3,000.00 C. ₱4,000.00 D. ₱5,000.00
For items 12-13, refer to the following:
Good Hands is a car insurance company who offers to pay ₱600,000.00 if a car is
destroyed beyond repair due to an accident. This insurance policy cost ₱25,000.00 per
year and the probability that the company will need to pay the amount of insurance is
0.003.
12. What is the probability that the company will NOT pay any amount to its client
within a year?
A. 0.003 B. 0.250 C. 0.500 D. 0.997
13. What is the expected value of the insurance to its buyer?
A. Each policyholder expects to lose ₱20,000.00 every year.
B. Each policyholder expects to lose ₱22,500.00 every year.
C. Each policyholder expects to lose ₱23,200.00 every year.
D. Each policyholder expects to lose ₱25,000.00 every year

106
For items 14 to 15, refer to the following:
A life insurance company will sell a ₱500,000.00 five-year term life insurance policy
exclusive for police enforcers for a premium of ₱1,000.00. The chance that a police
enforcer will survive in 5 years is 99.95%.
14. What is the probability that a police enforcer will NOT survive in five-
years?
A. 0.05
B. 0.005
C. 0.005
D. 0.0005
15. What is the expected value to the company of a single policy?
A. ₱550.00
B. ₱650.00
C. ₱750.00
D. ₱850.00
Lesson 1 Solving Problems Involving Mean
and Variance of Probability
Distributions
Are you interested in buying lotto tickets? How about making a bet in a
roulette wheel or taking a chance in raffle tickets? Do you want to know how much
gain will you have if you will win in games of chance or how much money will be
wasted if you lost on a bet? Are you planning to invest money in a life insurance
company? If you had thought of the following questions this lesson is for you.
What’s In
For you to begin let us recall first the previous lessons which are very
important for you to succeed. From your last module, you learned that the expected
value or mean value of a discrete random variable, denoted by x, is computed by
first multiplying each possible x value by the probability of observing that value and
then adding the resulting quantities.
Symbolically,
𝐄(𝐱) = 𝐮𝐱 = ∑[𝐱 𝐢 ∙ 𝐏(𝐱 𝐢 )].
where:
𝑥𝑖 is the value of the random variable for the outcome 𝑖;
𝑃(𝑥𝑖 ) is the probability that a random variable will have an outcome 𝑖;
𝑢𝑥 is the mean of a random variable 𝑥; and
𝐸(𝑥) is the expected value of random variable 𝑥.

Also, you already learned the equation for computing the variance of a discrete
random variable. To find the variance, remember the formula:
𝜎 2 = ∑(𝑥 − 𝜇)2 ∙ 𝑃(𝑥)

which can be simplified as


𝜎 2 = [∑ 𝑥 2 ∙ 𝑃(𝑥)] − 𝑢2
where:
𝑥 2 is the square value of a discrete random variable 𝑥;
𝑃(𝑥) is the probability of an outcome of a discrete random variable;
𝑢2 is the square of the mean of a discrete random variable; and

107
𝜎 2 is the variance of a discrete random variable

Moreover, the standard deviation 𝝈, of a discrete random variable X is the square


root of its variance, hence the formula is:

𝝈 = √∑(𝒙 − 𝝁)𝟐 ∙ 𝑷(𝒙) or 𝝈 = √[∑ 𝒙𝟐 ∙ 𝑷(𝒙)] − 𝒖𝟐

The variance and standard deviation of a discrete random variable X may be


interpreted as measures of variability of the values assumed by the random variable
in repeated trials of the experiment.

Before you proceed to the next lesson, study the example below, and reflect whether
you are now ready to proceed to the new lesson.
Example
The table below shows the probability distribution of the number of girls in a family
of three children in Barangay Maligaya. Calculate the mean and variance of the
random variable with the given probability distribution.
Outcome or GBB,BGB, GGB, GBG,
BBB GGG
Sample Point BBG BGG
𝑥 0 1 2 3
𝑃(𝑥) 1/8 3/8 3/8 1/8
Solution:
Finding the mean of the discrete random variable:
𝐸(𝑥) = 𝑢𝑥 = ∑[𝑥𝑖 ∙ 𝑃(𝑥𝑖 )]
1 3 3 1
𝐸(𝑥) = 𝑢𝑥 = [(0 ∙ ) + (1 ∙ ) + (2 ∙ ) + (3 ∙ )]
8 8 8 8
3 6 3
𝐸(𝑥) = 𝑢𝑥 = [(0) + ( ) + ( ) + ( )]
8 8 8
12 3
𝐸(𝑥) = 𝑢𝑥 = = = 1.5
8 2
Therefore, the mean or the expected value of the discrete random variable is
1.5.
To find the variance and standard deviation of the discrete random variable you need
to prepare a table like the one below.

𝑥 𝑃(𝑥) 𝑥2 𝑥 2 ∙ 𝑃(𝑥)
0 1⁄ 0 0
8
1 3⁄8 1 3⁄8
2 3⁄ 4 12⁄ 𝑜𝑟 3⁄
8 8 2
3 1⁄ 9 9⁄
8 8
From the table, the variance and standard deviation can be computed using:
3 12 9 24
∑ 𝑥 2 ∙ 𝑃(𝑥) = 0 + + + = =3
8 8 8 8
𝜎 2 = [∑ 𝑥 2 ∙ 𝑃(𝑥)] − 𝑢2 = 3 − (1.5)2 = 3 − 2.25 = 0.75
𝜎 = √𝜎 2 = √0.75 = 0.87
Therefore, the variance of the discrete random variable is 0.75, while the
standard deviation is 0.87.

108
What’s New
GAME OF CHANCE

Read the conversation below and answer the questions that follow.

Oh no! I do understand but if


you want to take a chance I
Our television set is no have here 5 raffle tickets out of
Hey Romulo I longer functional and I 1000 released tickets which
Ok, so what’s
have a problem. your problem need to buy a new one. cost ₱100.00 each. If you will
my friend? But the problem is I don’t be lucky enough, you’ll get a
have enough budget for 32-inch LED Television worth
that. ₱15,000.00.

That’s a good deal and the prize


is so exciting! But I need to check
my budget before buying a ticket.
Thanks for informing me. I’ll just It will be 10 days from now.
call you when I am ready to buy. You still have time to decide,
By the way, when is the raffle just call me if you already Thank you You’re welcome Cardo!
date? have your decision. Romulo!

109
Questions:

1. If you are Cardo, would you buy a raffle ticket? Why?


_______________________________________________________________
2. If Cardo decided to buy five tickets, what is the probability that he would
win the prize if 1000 tickets were sold? ____________________________
What is the probability that Cardo will lose the bet? __________________
3. How much money will Cardo gain if he wins the prize?________________
4. How much money will be wasted if he will not win the prize and he buy
one ticket?___________________________________________________________
5. What if 1000 tickets were purchased by different individuals, what is the
expected value of buying one ticket?___________________________________
6. How would you describe Romulo as a friend?___________________________
What Is It
To answer question 1 in the previous activity, you have to understand first the
consequences of buying a ticket, will it give you advantages and disadvantages? Can
you afford to spend extra money to buy a ticket?
Look at this way, if you will buy a ticket you will have a chance to win the
prize worth ₱15,000.00 if you are lucky enough it is good for you. But on the other
hand, what if you are not lucky, then you will lose ₱500.00 for buying the five tickets.
Considering this amount (₱500.00), you are aware that you can buy more essential
needs like foods, school and medical supplies, and other miscellaneous needs with
this amount rather than using it to buy tickets. However, if you allocated extra money
for other unexpected expenses, buying a ticket is not bad at all. The point of making
the right decision is not difficult if you know how to consider what is more important.
If you are Cardo, you need to understand the given details on the situation for
you to make the right decision. If you will win the prize, your net gain is ₱15,000.00
less ₱500.00 (amount spend for buying 5 tickets) or ₱14,500.00. The probability that
5
you will win the prize is five out of one thousand tickets or . But if you decide to
1000
buy one ticket only, then your net gain is ₱15,000.00 less ₱100.00 (amount spend for
buying 1 ticket) or ₱14,900.00 and the probability that you will win the prize is one
1
out of one thousand tickets or . The more tickets you buy, the more chances of
1000
winning. However, your chance of losing is higher than your chance of winning
because it is equal to 1 less than the probability of winning if you buy five tickets the
995
probability that you will not win the prize is or 0.995 while if you buy one ticket
1000
999
only, the probability is or 0.999.
1000

What if 1000 tickets were purchased by different individuals, what is the


expected value of buying one ticket? To answer this question, you need to apply the
formula for the mean or expected value of the discrete probability distributions which
is equal to 𝐄(𝐱) = 𝐮𝐱 = ∑[𝐱 𝐢 ∙ 𝐏(𝐱 𝐢 )]. The expected value is also defined as the average
value of a random variable over numerous trials of an experiment. The table below
is the probability distribution of the given situation.

𝑥 14, 900 -100


𝑃(𝑥) 1 999
𝑜𝑟 0.001 𝑜𝑟 0.999
1000 1000

110
Using the formula of expected value,
𝐄(𝐱) = (𝟏𝟒, 𝟗𝟎𝟎)(𝟎. 𝟎𝟎𝟏) + (−𝟏𝟎𝟎)(𝟎. 𝟗𝟗𝟗)
𝐄(𝐱) = −𝟖𝟓
The negative value means that one loses money on the average. In particular,
someone who buy tickets, and although has a chance to win losses ₱85.00 per ticket
purchased.
Having this knowledge, you can now make a wise decision, but remember
important things should be prioritized, if you can afford to buy tickets without
sacrificing your essential needs it is okay to take a chance sometimes. You should
also consider saving money for future use, because not every day you have enough
funds, having extra money would be a great help in times of need.
Meanwhile, having a friend like Romulo who are ready to help you or to advise
you in time of need is a precious one. He is a kind of friend who does not insist on
what he wants but let his friend decide on his own after giving a piece of advice. You
are blessed if you will find a true friend that you can rely on. Having a true friend is
wealth.
What’s More
In the following activities, read each situation carefully to solve each problem. Write
your answer on a separate sheet of paper.
Practice Activity 1

Raffle Tickets Problem

Senior citizens of a particular barangay organize a Christmas raffle bonanza.


One thousand raffle tickets are sold for ₱50.00 each. Each one has an equal chance
of winning. The first prize is a 32-inch LED TV worth ₱10,000.00, the second prize
is an electric oven worth ₱5,000.00 and the third prize is a grocery pack worth
₱2,500.00. Let X denote the net gain from the purchase of one ticket.

a. Complete the table below showing the probability distribution of X.

𝑥 9,950 4,950 2,450 -50


𝑃(𝑥)

b. Find the probability of winning any amount in the purchase of one ticket.
Let 𝑃(𝑊) the probability of winning any amount

𝑃(𝑊) = _____ + _____ + _____


𝑃(𝑊) = _____
c. Find the expected value of X, then interpret.

The expected value is equal to the product of net gain and the probability
of winning (in this case you have three chances of winning, you may win

111
the first, second, or third prize) plus the product of net loss and the
probability of losing.

𝐄(𝐱) = _______________

Interpretation:__________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________________________________

Independent Assessment 1

Raffle for a Cause


A raffle is held to raise funds for a charity event. A raffle ticket costs ₱150.00
and there are two thousand tickets to be sold. Only one ticket holder will win a prize
of ₱150, 000. 00. Construct the probability distribution of the discrete random
variable, compute for the expected value, and interpret the result.

Practice Activity 2

Game of Chance Problem (A Roulette Wheel)


A roulette wheel in a fiesta carnival has the numbers 1 through 30. If you bet
₱5.00 you will have a chance to win a prepaid load worth ₱100.00. Let X denote the
net gain for a bet.

a. Complete the table below showing the probability distribution of X.

𝑥 95 -5
𝑃(𝑥)

b. Find the expectation if you play a bet and interpret the result.

𝐄(𝐱) = _______________

Interpretation:__________________________________________________________
________________________________________________________________________
______________________________________________________

c. How much the organizer will earn if 100 games will be played such that each
number has a bet of ₱5.00? ____________________________
(Hint: Compute for the net gain of the organizer per game and multiply it
by 100)
Independent Assessment 2

Colored Wheel
A roulette wheel is divided into eight colors blue, red, yellow, green, orange,
violet, black, and white. Each time roulette is spun, you can have a bet of ₱10.00. If
the outcome is either black, blue, or violet you will lose the bet. But if the outcome
is red or yellow, your money will be doubled (you will win additional ₱10.00 plus your
bet), and if it is green or orange you will win nothing and your bet will be given to
you. But if the result is white, your money will be tripled (you will win an additional

112
₱20.00 plus your bet. If X denotes the net gain of a player what is the expected value
of a ₱10.00 bet?
Practice Activity 3

Insurance Investment Problem


A life insurance company will sell a ₱250,000.00 one-year term life insurance
policy for members of armed forces of the Philippines for a premium of ₱500.00. Let
X denote the net gain from the insurance company. Based on the collected data of
the company, a member of the armed forces police has a 99.96% chance of surviving
within one year. Let X the net gain of an insurance company from a single
policyholder.

a. Complete the table below showing the probability distribution of X.

𝑥 500 -249,500
𝑃(𝑥)
b. Find the expectation if you play a bet and interpret the result.

𝐄(𝐱) = _______________

Interpretation:_____________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
Independent Assessment 3

Body Parts Insurance


An insurance company will sell a ₱500,000.00 one-year term leg insurance
policy for ramp models for a premium of ₱1,000.00. Find the expected value to the
company of a single policy if a model has a 99.97% chance of being uninjured in one
year?

Compare your answers with those found in the Answer Key on the last part of
this module. Did you get all items correct? Great! If you did, you learned the lesson
well. If you were not able to get the correct answer, review your computations.
What I Have Learned
A. Complete the following statements by writing the correct word or words and
formulas.
1. The mean of a discrete random variable X is also called the _________________.
2. The expected value is the _______________ value of a random variable over
numerous trials of an experiment.
3. The formula to find the mean or expected value of a discrete random variable is
___________________.
4. The _________________ and __________________ of a discrete random variable X
indicate the variability of X over a large number of experiments or trials.
B. In your own words, write the different steps on how you can solve real-life problems
involving mean or expected value.

_______________________________________________________________________________
_______________________________________________________________________________
_______________________________________________________________________________

113
What I Can Do
Raffle for a Cause!
Things to do:
Make a plan for a Christmas raffle bonanza in which the PWD (Person with
Disability) will be the beneficiaries. The plan should consist of the number of tickets
to be sold, the amount per ticket, and the number of winners and their
corresponding prizes. Let X be the random variable representing your net gain per
ticket, and compute the expected value of X and the net gain if all tickets will be
sold out. Also, using your creativity prepare a sample of your ticket. You may use
the sample guide to complete your task.

Name of Project

Brief Description

Number of tickets to be sold

Price of ticket to be sold

Winners and prizes:

Expected Value of X:

(Please show the probability


distributions and your solution)
Net Gain of the Project:

Sample Ticket:

Rubrics for the task:

Categories 4 3 2 1
Excellent Satisfactory Developing Beginning

Organization of The name of The name of The name of The name of


Plan the project is the project is the project is the project is
appropriate somewhat somewhat not
and with clear appropriate appropriate appropriate
descriptions and with clear and vague and with
of the project. vague

114
descriptions descriptions descriptions
of the project. of the project. of the project.

Accuracy of the The There is one There is one There are


Solutions computations mistake in the mistake in the more
are accurate computations computations mistakes with
with a table of with a table of with no table no table of
probability probability of probability probability
distributions. distributions. distributions. distributions.

Originality and Sample Ticket Sample Ticket Sample Ticket Sample Ticket
creativity is unique and is nice but not is just is just an
(Sample Ticket) shows unique and ordinary with ordinary and
creativity with with complete complete without
complete information. information. complete
information. information.

Assessment
Multiple Choice. Choose the letter of the best answer. Write the chosen letter on a
separate sheet of paper.
1. Which of the following represents the expected value of the discrete random
variable?
A. mean C. mode
B. median D. variance
For numbers 2-3, refer to the table below:
𝑥 1 2 3 4
𝑃(𝑥) 1 3 2 1
5 10 5 10
2. What is the expected value of the given probability distribution?
2 2
A. 10
C.
5

12 12
B. 10
D.
5

3. What of the following best described the expected value of a probability


distribution?
A. It measures the variability of the values assumed by the random variables.
B. It gives the difference between the highest and lowest values assumed by
the random variables
C. It is the square root of the variance of the probability distribution.
D. It is the average value of a random variable over numerous trials of an
experiment.
For numbers 4-6, refer to the following:
One thousand tickets are sold for ₱50.00 each. One ticket will win ₱10,000.00, two
tickets will win ₱5,000.00 each and three tickets will win ₱1,000.00 each. Let X
denote the net gain from the purchase of a randomly selected ticket.
4. What is the probability that you will win a prize if you buy two tickets?
2
A. 2 C.
1000

2 998
B. D.
500 1000

115
5. Which of the following tables of values shows the probability distribution of X?

A. x 10,000 5,000 1,000 -50


P(x) 1 2 3 994
1000 1000 1000 1000

B. x 10,000 5,000 1,000 -50


P(x) 1 2 3 1
999 998 997 999

C. x 9,950 4,950 950 -50


P(x) 1 2 3 994
1000 1000 1000 1000

D. x 9.950 4,950 1,000 -50


P(x) 1 2 3 1
999 998 997 999
6. What is the mean or the expected value of a person who buys a ticket?
A. - 25 C. 27
B. - 27 D. 25

For numbers 7-8, refer to the following:


You buy three ₱1,000.00 raffle tickets for a prize of a new 20-passenger Sarao
jeepney valued at ₱800, 000.00. Two thousand tickets are sold.

7. What is the probability of winning the prize in the purchase of three


tickets?
1 3
A. C.
2000 2000
2 4
B. D.
2000 2000
8. What is the probability that you will NOT win the prize? (refer to item 7)
3 997
A. C.
2000 2000
97 1997
B. D.
2000 2000
For numbers 9-11, refer to the following:
A roulette wheel in an amusement park has the numbers 1 through 60. If you bet
₱100.00 for a randomly chosen number from 1 to 60, you will have a chance to win
a cellular phone worth ₱5,000.00.
9. If X denotes the net gain from taking a chance to bet on a randomly
selected number. Which of the following tables of values shows the
probability distribution of X?
A. X 5000 -1000 C. x 60 -1
P(x) 1 59 P(x) 1 59
60 60 60 60
B. x 4900 -1000 D. x 100 -1
P(x) 1 59 P(x) 1 59
60 60 60 60

10. What is the mean or the expected value for those who take a bet?
A. – 10.12 C. -16.67
B. -15.40 D. -18.25

116
11. How much the organizer will earn if 100 games will be played?
A. ₱50,000.00 C. ₱150,000.00
B. ₱100, 000.00 D. ₱250,000.00

For numbers 12 -15, refer to the following:


A life insurance company will sell a 1 million three-year term life insurance policy
exclusive in a particular risk group for a premium of ₱2,000.00. Let X denote the
net gain to the company from the sale of one policy.
12. If a person in this risk group has a 99.95% chance of surviving within
three years, what is the probability that he will NOT survive within three
years?
A. 0.995 C. 0.005
B. 0.9995 D. 0.0005
13. How much is the net loss of the insurance company from a policyholder
if his family will avail of the insurance benefits?
A. ₱1,000,000.00 C. ₱998,000.00
B. ₱999, 000.00 D. ₱995,000.00
14. Which of the following tables of values shows the probability distribution of X, if
each policyholder has a 99.95% chance of surviving within three years?
A. x 2,000 -1,000,000 C. x 2000 -998,000
P(x) 0.9995 0.0005 P(x) 0.9995 0.0005
B. D.
x 1,000,000 2,000 x 2000 -998,000
P(x) 0.9995 0.0005 P(x) 0.9995 0.0005
15. Find the expected value to the company of a
single policy?
A. ₱1,500.00 C. ₱1,800.00
B. ₱1,690.00 D. ₱1,950.00
Additional Activities
If you want to try more, these activities are for you. It will help you to practice your
skill in solving real-life problems involving mean or expected value and to make wise
decisions on matters like taking a chance in lottery, starting your own business and
considering investment.
Study and analyze each situation to solve the problem.
1. Game of Chance (Lottery)
A lotto works by picking 6 numbers from 1- 42 (Combinations of numbers from 1-42
taken 6 at a time produce 5,245,786 number combinations). A ticket costs ₱24.00 to
play the lottery. If you win today, you would win 5 million pesos after taxes. If you
play the lottery today, what would be your expected winnings or losses?
2. Business Opportunity
The net gain for a day of Joy in a Cup Bread and Pastry Shop depends on the
number of cupcakes sold within a day. If X is the amount of income within a day for
five consecutive days. Compute the mean or expected earnings for a day and the
variance. The probability of X is shown below.

𝑥 5000 4500 4000 3500 3000


𝑃(𝑥) 0.20 0.16 0.31 0.17 0.16

117
3. Investment
A person may earn ₱100,000.00 by investing in the stocks of an international
company with a probability of 0.40 or lose ₱35,000.00 over the same period with a
probability of 0.60. Let X denote the net gain of a person who will invest in the
company, construct the probability distribution of X, and compute for the expected
value of a person who will invest in the same company. Interpret the result
How well did you perform in this lesson?
Are you now ready for the next lesson?
If your answer is Yes, I am HAPPY for you. Good Luck and happy learning on your
next journey. 😊

118
Statistics and
Probability
Illustrating a Normal Random
Variable and its Characteristics
Statistics and Probability
Illustrating a Normal Random Variable and its Characteristics
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module


Writers: Elson A. Valencia
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Nenita N. De Leon and Ma. Theresa B. Yanto
Illustrator: Kelvin A. Ramintas and Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

119
Week

3
What I Need to Know
This module was designed to help you as a learner to illustrate the normal random
variable and its characteristics. You can also learn how a normal curve used in our
daily life through the application of the concepts of an empirical rule to approximate
the percentage of the random variables in the distribution. It is easy to understand
the functions or uses of normal distribution because the examples and problems are
based on the situations that we experience nowadays
In a previous module, you have learned about the Random Variable and its
Classification It also discussed the Probability Distributions of Discrete Random
Variables. This time, you will focus on the Normal Probability Distribution, also
called Normal Curve which is used to determine the probabilities of a continuous
random variable. On the other hand, the knowledge, skills, and attitude you acquired
about the percentage, mean and standard deviation is very useful to attain the target
objectives of this module

After going through this module, you are expected to:


1. define a normal random variable;
2. illustrate a normal random variable and its characteristics; and
3. state and apply the concepts of the empirical rule or 68% -95%-
99.7% of the distribution.

What I Know
Let us see how far you will go and how much you know about this pretest. Please
answer all questions with your best and utmost honesty so that we can gauge your
learning in the target competency of this lesson. Let’s get started!

Direction: Read and analyze each item carefully. Choose the letter of the best answer.
Write the chosen letter on a separate sheet of paper.
1. The total area under the normal curve is _________.
A. -1 B. 0 C. 0.5 D. 1
2. The normal curve is bell-shaped.
A. False B. True C. Sometimes D. It depends
3. Which part of the normal curve is extended indefinitely in both directions along
the horizontal axis, approaching but never touching it?
A. center B. tail C. top D. spread
4. According to the property of a Normal Probability Distribution, the mean is equal
to what measures of central tendency?
A. median B. mode C. both A and B D. only A
5. Which of the following rules state that almost all data fall within the 1, 2, and 3
Standard Deviation of the Mean when the population is normally distributed?
A. Empirical rule
B. Lottery rule
C. Pascal’s triangle rule
D. Sampling rule
6. Under the curve, the area to the left of the mean is_________
A. 30% B. 47.72% C. 50% D. 68%

120
7. Using the empirical rule, the area under the normal distribution curve that lies
within three standard deviations from the mean is approximately 95%.
A. False
B. True
C. Sometimes
D. None of the Above
8. Which of the following statements is CORRECT about the shape of the normal
curve?
A. The shape of the normal curve is based on the given Mean and Standard
Deviation of the distribution.
B. The shape of the normal curve is based on the given Mean and Median of
the distribution.
C. The shape of the normal curve is based on the given area and standard
Deviation of the distribution.
D. All of the above
9. Which of the following graphs is the BEST example of the normal curve?
A. B. C. D.

10. Based on the empirical rule, the bell-shaped distribution will have
approximately 68% of the data within what number of Standard Deviations
from the mean?
A. 0 B. 1 C. 2 D. 3
11. If the average age of retirement for the population in the Philippines is 65 years,
with a standard deviation of 5 years, what is the approximate age range in
which 68% of people retire?
A. 60 – 70 years B. 55 – 65 years C. 55 – 60 years D. 60 -65 years
12. If the mean and the standard deviation of a continuous random variable that is
normally distributed are 10 and 3 respectively, find an interval that contains
68% of the distribution.
A. [7, 13] B. [4, 13] C. [7, 16] D. [3, 10]
13. According to the 68%-95%-99.7% rule or better known as an empirical rule, the
standard deviation and mean interval that covers approximately 95% of the
data from a frequency distribution is____
A. 2µ ± 2ơ B. µ ± ơ C. µ ± 2ơ D. µ ± 3ơ
14. The first quarterly assessment result of Grade 11- HUMSS students in their
Statistics and Probability subject had scored with a mean of 40 and a
standard deviation of 8. A histogram of the data shows that the distribution is
normal.
Use the Empirical Rule to find the percentage of scores between 24 and 32.
A. 13.5% B. 68% C. 81.5% D. 95%
15. Suppose the distribution of data about the recovered victims of COVID 19
daily has a mean of 145 and a standard deviation of 22. How many standard
deviations away from the mean is a value of 101?
A. It is one standard deviation above the mean.
B. It is one standard deviation below the mean.
C. It is two standard deviations above the mean.
D. It is two standard deviations below the mean.
How did you find this pretest? Did you encounter both familiar and unfamiliar
terms, symbols, and notations? Kindly compare your answer in the Answer Key on
the last part of this module.
If you obtain 100% or a perfect score, skip the module and immediately move
to the next module. While in the event you missed a point, please proceed with the

121
module as it will enrich your knowledge in the normal distribution, especially in
illustrating the normal random variables and its characteristics.
Lesson 1 Illustrating the Normal Random
Variable and its Characteristics
Did you take a National Achievement Test or any Placement Tests? How about
measuring your height and weight during the first week of your PE class to know
your Body Mass Index? Have you ever tried to determine the location or make an
inference about your grades, height, or weight in the distribution? If not, this lesson
will teach you how to appreciate the importance of knowing the answer to those
questions?
What’s In
Let’s have a review first. Read the following statements carefully and determine
whether it is TRUE or FALSE. Write the word TRUE if it is correct and FALSE if it is
incorrect, then modify the statement to make it true.

________________1. Probability is the value greater than or equal to zero but less
than or equal to one.
________________2. Discrete variables are the infinite numerical values like heights,
weights, distance and length
________________3. 34% is also equal 0.34.
________________4. Mean, mode and standard deviation are the measures of central
tendency.
________________5. Mean is equal to the summation of scores divided by the
number of cases.
What’s New
A. The distribution of the height (X) in centimeter (cm) of the 16 teachers of SCNHS
was presented below. Construct a histogram for the random variable (X).

X F
138 1
139 2
140 3
141 4
142 3
143 2
144 1

B. The following items show four pictures captured by the author during
his travel in the different places and constructed using GeoGebra
application which represents a common object. Analyze the group of
pictures carefully then identify the common object they
represent. Write your answer by filling in the empty boxes below
each item. A hint is also provided for you.

122
1. 2.

3.

4. 5.

Can you define or describe each word that you have obtained in this activity?

1. ____________________________________________________________

2. ____________________________________________________________

3. ____________________________________________________________

4. ____________________________________________________________

5. ____________________________________________________________
What is it
How do you find the activity? How can you describe the graph you have made in
Activity A? Are the terms or words you obtained in Activity B will describe the graphs
in Activity A?

In the activity above, the distribution of the height of the teachers is one of the
examples of the distribution of a normal random variable because the set of
numerical values is normally distributed. Height is also described as a continuous
random variable. You have also noticed that the mean of the distribution can be
found at the center which is 141 cm and the distribution from the center to the right
is a mirror of the distribution to the left as shown in the figure and table below. See
Figure 1.

123
Normal Probability Distribution is a probability distribution of continuous
random variables. It shows graphical representations of random variables obtained
through measurement like the height and weight of the students, the percentile
ranks of the A&E (Accreditation and Equivalency) Test result of the Alternative
Learning System students, or any data with infinite values. It is used to describe the
characteristics of populations and help us visualize the inferences we make about
the population. It also used to determine the probabilities and percentile of the
continuous random variables in the distribution. For example, your grades in
Mathematics is one of the scores in the distribution, you can predict the location of
that score in the distribution and interpret it with regards to the mean and standard
deviation.
Properties of Normal Curve
The graphical representation of the normal distribution is popularly known as
a normal curve. The normal curve is described clearly by the following characteristics
1. The normal curve is bell-shaped.
2. The curve is symmetrical about its center. This means that, if we draw a
segment from the peak of the curve down to the horizontal axis, the segment
divides the normal curve into two equal parts or areas.
3. The mean, median, and mode coincide at the center. This also means that in
a normal distribution, or a distribution described by a normal curve, the
mean, median, and mode are equal.
4. The width of the curve is determined by the standard deviation of the
distribution.
5. The tails of the curve are plotted in both directions and flatten out indefinitely
along the horizontal axis. The tails are thus asymptotic to the baseline. A
portion of the graph that is asymptotic to a reference axis or another graph is
called an asymptote, always approaching another but never touching it.
6. The total area under a normal curve is 1. This means that the normal curve
represents the probability, or the proportion, or the percentage associated with
specific sets of measurement values.

To visualize and understand the properties/characteristics of a normal curve,


please refer to figure 2, as shown below.

Fig. 2: Graph of a Normal Curve

A normally distributed random variable with a mean µ = 0 and standard


deviation ơ = 1 is called a standard normal variable. It is presented using standard
normal distribution where the center of the curve is zero, which is mean and added
one unit from the center to the right and subtract one unit from the center to the
left. This is shown below in figure 3.

124
Fig. 3: Standard Normal Distribution

The shape of a normal curve is based on the two given parameters, the mean
and the standard deviation of the distribution. When comparing two distributions
each described by the normal curve, the following are the three situations based on
the said parameters
a. When the means are not equal, but the standard deviations are equal. (µ 1
≠ µ2 ; ơ1 =ơ2 ), the curves have a similar shape but centered at different
points, as shown in figure 4.

Fig. 4: The Mean are Not Equal, but the Standard


Deviations are Equal.
b. When the means are equal, but the standard deviations are equal. (µ 1 = µ2
; ơ1 ≠ ơ2 ), the curves are centered at the same point but they have different
height and spreads. as shown in figure 5.

Fig. 5: The Mean are Equal, but the Standard


Deviations are Not Equal.

c. When the means are different and the standard deviations are also
different (µ1≠ µ2 ; ơ1 ≠ ơ2 ), the curves are centered at different points and
vary in shapes, as shown in figure 6.

125
Fig. 6: The Means are Different and the Standard
Deviations are also Different
EMPIRICAL RULE

You can easily understand the functions of normal probability distribution in


our real-life world using the empirical rule. Because this rule is used to roughly test
the distribution’s normality, if many data of a random variable fall outside the lower
and upper limits of the three-standard deviation, this means that the distribution is
not normal.
The empirical rule is better known as 68% - 95% - 99.70% rule. This rule
states that the data in the distribution lies within one (1), two (2), and three (3) of the
standard deviation from the mean are approximately 68%, 95%, and 99.70%,
respectively. Since the area of a normal curve is equal to 1 or 100% as stated on its
characteristics, there are only a few data which is 0.30% falls outside the 3-standard
deviation from the mean. For instance, the distribution of the grades of the Senior
High School students in Statistics and Probability for the Third Quarter is shown
below in Figure 7.

Fig.7. Distribution of the Grades of the SHS Students in Statistics and


Probability for the Third Quarter

Using the empirical rule, you can say that the distribution of the grades of the
Senior High School in Statistics and Probability are distributed normally because the
shape of the graph is bell-shaped and symmetric about its mean, which is 87. This
also indicates that most of the students’ grades fall close to the mean and fewer of
them are far below and/or above the mean
It can summarize the distribution in the following percentage:

126
 68% of data lies within 1 standard deviation from the mean have a
grade of 83 to 91
 95% of data lies within 2 standard deviations from the mean have a
grade of 79 to 95
 99.70% of data lies within 3 standard deviations from the mean have a
grade of 83 to 9
Illustrative Example 1:
The scores of the Senior High School students in their Statistics and
Probability quarterly examination are normally distributed with a mean of 35 and a
standard deviation of 5.
Answer the following questions:
a. What percent of the scores are between 30 to 40?
b. What scores fall within 95% of the distribution?
Solution:
Draw a standard normal curve and plot the mean at the center. Then, add the
standard deviation to the mean once and mark it to the right of the mean. Add twice
the standard deviation to the mean and put it to the right of the first sum. Then, add
thrice the standard deviation to the mean and mark it to the right of the second sum.
Do the same to the left. This time, instead of adding, subtract the standard deviation
from the mean. An illustration is provided below in figure 7:

Fig. 7: Distribution of Scores of Senior High School Students


Answer:
a. The scores 30 to 40 falls within the first standard deviation from the mean.
Therefore, the scores that fall between 30 and 40 is approximately 68% of
the distribution.
b. Since 95% of the distribution lies within 2 standard deviations from the
mean, then the scores corresponding to this area of the distribution are
scores from 25 up to 45.
Illustrative Example 2:
The district nurse of Candelaria East needs to measure the BMI (Body
Mass Index) of the Alternative Learning System students. She found out
that the heights of male students are normally distributed with a mean of
160 cm and a standard deviation of 7 cm. Find the percentage of male
students whose height is within 153 cm to 174 cm.

127
Fig. 8: Distribution of BMI of the Alternative Learning System Students

Solution: The mean of this problem is 160, it is subtracted by 7 to the left (e.g. 160
-7 = 153; 153-7 = 146; 146 – 7 = 139) and add 7 to the right. (e.g. 160 + 7 = 167; 167
+ 7 = 174 + 7 = 181).

As stated in figure 8, 153 cm falls at 1 standard deviation from the mean to


the left and the height of 174 cm falls at 2 standard deviations from the mean to the
right. Therefore, it covers the whole 68% and 13.5%. of the distribution and the sum
of it is 81.5%

ANSWER: 81.5% of the male students have a height between 153 cm to 174 cm.
What’s More
Activity 1.
A. True or False. Write the word TRUE if the statement is correct and FALSE
if the statement is incorrect
_______1. The total area of the normal curve is 1.
_______2. Normal probability distribution is used to determine the
probabilities of a discrete random variable.
_______3. The mean, median and mode in a normal distribution have
the same value.
_______4. The distance of the number in the horizontal line or the x-axis
should always proportional.
_______5 The normal probability distribution has a mean µ = 1 and
standard deviation ơ = 0.
Activity 2

A. Use Empirical rule to complete the following table. Write on the respective
column the range or interval of the scores based on the given parameters.

Mean Standard Deviation 68% 95% 99.7%


Ex 50 3 47 to 53 44 to 56 41 to 59
1 135 28
2 87 5.5
3 213 15
4 567 20
5 785 29
B. Illustrate the distribution in Activity 2.A through a diagram.

128
Activity 3.
Read and analyze the following problem to answer the question.
The government launched Enhanced Community Quarantine on March 15,
2020, as a preventive measure to fight the spread of COVID-19 in the country. Many
of us experienced difficulties in getting our daily needs like food or getting money to
pay for our bills and rentals. Our government gave financial support and food, but it
is not enough for the people. Fortunately, we are blessed because the spirit of love
and generosity is still present in the heart of our fellow countrymen. They donated
money, food, time, and other essential things.
Suppose you and your friends conducted a donation-drive in your barangay.
Each of the donors gave a certain amount with an average of Php 320.00 and a
standard deviation of Php 60.00. Assuming, it is normally distributed,
1. What percentage of the distribution that the amount falls between Php.
260.00 and Php 380.00?
2. How much of the donations fall in 95% of the distributions?
3. How much of the donations fall in 99.70% of the distributions?

What I Have Learned


1. A normal random variable is a set of a numerical value of continuous random
variables which is normally distributed.
2. Normal Distribution is a probability distribution of continuous random variables.
It also represents the probability, or the proportion, or the percentage associated
with specific sets of measurement values.
3. A normal curve has the following characteristics:
a. It should be bell-shaped and symmetric about its center.
b. Both sides of the tails of a normal curve do not touch the horizontal axis.
c. In the horizontal axis, use 0 at the center where the mean is located and
use 1, 2, and 3 to the right and -1, -2 and -3 to the left. But still,
remember there is an integral part of each number. However, it is not
limited to the -3 and 3, it is extended indefinitely
d. The distances between numbers in the horizontal line or the x-axis are
always equal.
4. Empirical rule. This rule states that the data in the distribution lies within the 1,
2, and 3 of the standard deviation of the mean.
5. Sixty-eight percent (68%) of data lies within 1 standard deviation from the mean.
6. Ninety-five percent (95%) of data lies within the 2-standard deviation from the
mean.
7. Ninety-nine and seven-tenths percent (99.70%) of data lies within the 3-standard
deviation from the mean.

What I Can Do
MAKE A SURVEY

Materials Needed
1. Cellphone` 4. Calculator
2. Computer/Laptop 5. Short Bond Paper
3. Pen/pencil
Instructions
1. Contact at least 20 of your classmates through SMS or messenger apps.

129
2. Ask them for the result of their pre-test (What I Know) or pre-assessment
of this module
3. Compute the mean and standard deviation of the collected data includes
your own score.
4. Use the empirical rule, to answer the following questions
a. What is the interval or range of scores falls within 68%?
b. What is the interval or range of scores falls within 95%?
c. What is the interval or range of scores falls within 99.7%?
d. What is the interval or range of scores falls within 81.5%?
e. In which interval away from the mean does your score belong?
5. Illustrate the distribution of the data you obtained from this survey through a
diagram and put a mark on the interval where your score located.
RUBRICS
Assessment
Direction: Read and analyze each item carefully. Choose the letter of the best answer.
Write the chosen letter on a separate sheet of paper.
1. What is half of the total area of the normal curve?
A. -1 B. 0 C. 0.5 D. 1
2. What is the shape of a Normal Probability Distributions?
A. bar B. bell C. circle D. line
3. Which part of a normal curve is asymptotic with the horizontal axis?
A. center B. curve C. tail D. top
4. Which of the following statements is the characteristic of a normal probability
distribution?
A. The three measures of central tendency such as mean, mode, and median
are equal.
B. The three measures of central tendency such as mean, mode, and median
are not equal
C. The two measures of central tendency such as mean and mode are equal
while the median is greater than zero
D. The two measures of central tendency such as mean and mode are equal
while the median is less than zero.
5. Which of the following statements describe the concepts of empirical rule?
A. It states that the data in the distribution lies within one (1), two (2), and
three (3) of the standard deviation from the mean are approximately 68%,
95% and 99.70%, respectively .
B. It states that the data in the distribution lies within 100%.
C. It states that the data in the distribution lies within one (1), two (2), and
three (3) of the standard deviation from the mean are exactly 68%, 95%
and 99.70%, respectively.
D. It states that the data is asymmetric.
6. Under the curve, the area to the right of the mean is_________
A. 30% B. 47.72% C. 50% D. 68%
7. Which of the following parameters is the basis of the shape of the normal
curve?
A. mean B. standard deviation C. Variance D. Both A and B
8. What is the equivalent of the mean of the following graph?

A. 0 B. 1 C. 2 D. 3

130
9. Using the empirical rule, the area under the normal distribution that lies within
three standard deviations of the mean is approximately_____.
A. 68% B. 95% C. 97% D. 99.7%
10. Based on the empirical rule, the bell-shaped distribution will have
approximately 68% of the data within what number of standard deviations of
the mean?
A. 0 B. 1 C. 2 D, 3
11. If the average age of retirement for the population in the Philippines is 65 years
and with a standard deviation of 5 years, what is the approximate age range in
which 95% of people retire?
A. 60 – 70 years B. 55 – 65 years C. 55 – 60 years D. 55- 75 years
12. If the mean and the standard deviation of a continuous random variable that is
normally distributed are 10 and 3, respectively, find an interval that contains
99.7% of the distribution.
A. [7, 16] B. [4, 16] C. [7, 16] D. [1, 19]
13. According to the 68%-95%-99.70% rule, the standard deviation and mean
interval that covers approximately 99.70% of the data from a frequency
distribution is____
A. 2µ ± 2ơ B. µ ± ơ C. µ ± 2ơ D. µ ± 3ơ
14. The first quarterly assessment result of Grade 11- STEM students in their
Statistics and Probability subject had scored with a mean of 40 and a standard
deviation of 8. A histogram of the data shows that the distribution is normal.
Use the Empirical Rule to find the percentage of scores between 24 and 48.
A. 13.5% B. 68% C. 81.5% D. 95%
15. Suppose that distribution of data about the number of deaths of COVID 19
positive has a mean of 45 and a standard deviation of 18. How many standard
deviations away from the mean is a value of 81?
A. It is one standard deviation above the mean
B. It is two standard deviations above the mean
C. It is one standard deviation below the mean
D. It is two standard deviations below the mean
Additional Activities
Read the following problems carefully. Apply the empirical rule and illustrate your
answer through a diagram.
1. IQ scores of the ALS students in the Division of Quezon are normally
distributed with a mean of 110 and a standard deviation of 10.
a. What percent of the distribution falls within the IQ scores of 100 to 130?
b. What percent of the distribution falls within the IQ scores of 90 to 140?
2. A normal distribution of data has a mean of 78 and a standard deviation of 9.
a. What scores fall within 68% of the distribution?
b. What scores fall within 95% of the distribution?

131
Statistics and
Probability
Identifying Regions Under
Normal Curve Corresponds to
Different Standard Normal
Values
Statistics and Probability
Identifying Regions Under Normal Curve Corresponds to Different Standard Normal Values
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module


Writers: Elson A. Valencia
Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Nenita N. De Leon and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Edna E. Eclavea
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

132
Week

4
What I Need to Know
This module was designed to easily understand how to identify regions under
the normal curve corresponding to different standard normal values. It includes the
steps/procedures on how to use the table of areas under the normal curve. Also, this
module will help you to accomplish the succeeding modules about normal
distributions and to solve real-life situations in predicting the probabilities,
percentage, and proportions of every random variable in a normal distribution.
In the previous module, you learned the characteristics of a normal curve.
These characteristics will be your guide to accurately identify the regions or areas
under the normal curve corresponding to different standard normal scores. You can
also use your mastery in performing addition and subtraction of decimals to solve
the problems.
After going through this module, you are expected to:
1. identify the regions under the normal curve of different standard normal
values;
2. use the z Table (Table of Areas under the Normal Curve) to find the
regions that correspond to z values; and
3. sketch the normal curve showing the required regions or areas.
What I Know
Let us see how far you will go and how much you know about this pretest.
Let’s get started.
Direction: Read and analyze each item carefully. Choose the letter of the best answer.
Write the chosen letter on a separate sheet of paper.
1. What is the sum of the area that corresponds to the right of 0 and to the left of
0?
A. 0.5 B. 1 C. 1.5 D. 2
2. Where can we find the area of a specific region under the curve?
A. Table of Areas of Geometric Figure
B. Table of Areas Under Normal Curve
C. Table of Contents
D. Periodic Table of Elements
3. Which of the following does NOT belong to the group?
A. to the right B. less than C. to the left D. below
4. Which of the following figures is the best illustration of the area to the right of z
=1?
A. C.

B D.

5. What is the area between z = - 1.23 and z = 2?


A. 0.0865 B. 0.4772 C. 0.8679 D. 0.8779

133
6. What will you do to find the area between the two z values?
A. Subtract the obtained value from the Z table to 1
B. Rewrite the value obtained from z table
C. Subtract the two obtained values from the z table.
D. Calculate the mean and standard deviation
7. Find the area above z= 2.56
A. 0.0052 B. 0.4948 C. 0.9946 D. 9948
8. Find the area of the shaded region of the given figure.

A. 0.1587 B. 0.3413 C. 0.3907 D. 08413


9. Find the z value of the following graph that corresponds to the given area.

A. – 1.56 B. -1.45 C. 1.45 D. 1.56


10. Find the z value of the following graph that corresponds to the given area

A. – 1.06 B. -1.09 C. 1.09 D. 2.09


11. How can you describe this graph?

A. The shaded region of the normal curve is below the z = 1


B. The shaded region of the normal curve is above the z=-1
C. The shaded region of the normal curve is between z=0 & z=1
D. The shaded region of the normal curve is between z=-1& z=1
12. Find the area under the standard normal curve between z = -2.5 and z = 1.5.
A. 0.0062 B. 0.9270 C. 0.9332 D. 0.9392
13. Find the area under the standard normal curve between z = 0 and z = 3.
A 0.4987 B. 0.5000 C. 0 .9987 D. 1.4987
14. Find the area under the normal curve from z =0.83 to z= 1.84.
A. 0.0007 B. 0.1704 C. 0.7967 D. 0.9671
15. Find the sum of the areas under the standard normal curve to the left of z = -1.35
and to the right of z = 2.26.
A. 0.0766 B.0.0995 C. 0.1004 D. 0.2009
How do you find this pretest? Did you encounter both familiar and unfamiliar
terms, symbols, and notations? Kindly compare your answer in the Answer Key on
the last part of this module
If you obtain 100% or a perfect score, skip the module and immediately move
to the next module. While in the event you missed a point, please proceed with the
module as it will enrich your knowledge in the normal distribution, especially in
identifying regions under normal curve corresponds to different standard normal
values.

134
Lesson 1 Identifying Regions under Normal
Curve Corresponds to Different
Standard Normal Values
In your previous lessons in Mathematics, you learned how to solve for the area
of certain geometric figures or shapes. This time, you are going to identify the areas
or regions of a curve without using a formula or even a density function of standard
normal random variables because the z – table or Table of Areas under the Normal
Curve is already given on the last page of this module.

What’s In
Let’s have this first for a review.
A. Perform the indicated operation manually. Then check your answer using
calculator or any calculating device
1. 0.9983 – 0.4531 = ____________ 6. 0.8413 + 0.1313 =______________
2. 1 - 0.3413 = ____________ 7. 0.0017 + 0.056 =_______________
3. 0.4772 - .0033 =______________ 8. 0.5 + 0.5000 =______________
4. 0.5 – 0.1537 =_____________ 9. 0.4490 + 0.352 =_______________
5. 1 – 0.2318 =_____________ 10. 0.1001 + 0.0009=______________
B. Investigate all parts of the diagram below if the characteristics of a
normal curve is present.

What’s New
A. The following numbers are the number found at the left side and upper part of
the table. Identify the number that lies on the intersection between the given set
of numbers. Use the Table of the Area Under Normal Curve indicated at the last
page of this module
1. -1.6 and 0.09 = ________
2. 1.0 and 0.07 = _________
3. 1.1 and 0.06 = _________
4. 0.5 and 0.08 = _________
5. 1.0 and 0.00 = _________
B. Perform the indicated operation. Write your answer inside the box connected to
the shaded region of the following normal curve.

135
What is It
In the previous module, we discussed the properties of the normal probability
distribution, and one of the things that you learned is that the total area under the
curve is equal to one. Since the standard normal distribution is defined as normally
distributed random variable having a mean of zero ( µ= 0 ) and standard deviation of
one (ơ = 1), the fifty (50) percent of the region under the curve is below the mean and
the other 50% is above the mean. Figure 1, as shown below illustrate the standard
normal curve.

Figure 1: Standard Normal Curve


You can easily identify the area of the regions under the normal curve by
using the Table of Areas under the Normal Curve which is also known as z-Table (see
attachment at the last page of this module). This table gives the area of any value of z
from -3.99 to 3.99. The value from this table will describe the area of the specific
region of the curve to the left of the given z-value.
How to use the Table of Areas under the Normal Curve?
Example: Find the area to the left of -1.69
Solution: First, split the given z-value into hundredths. We can find the whole
number and the tenths digit (-1.6) at the left side of the table while the hundredths
(0.09) located at the uppermost of the table. Then the intersection of these numbers
will be the area of the normal curve to the left of the z-value.

136
Illustration:

Answer: The area to the left of z= -1.69 is 0.0455.


STEPS ON HOW TO FIND THE AREA THAT CORRESPONDS TO Z –VALUE
1. Draw/sketch a normal curve and locate the given z-value on the normal curve
2. Shade the region of the curve according to the condition of z-value whether it
is below, above, or between.
3. Use the table of the area under the normal curve to find the corresponding
area.
4. Choose the appropriate operation based on step 2 and 3
4.1. When the z-value is to the left or any related terms (e.g. below, less
than) just write the value we obtained in step 3
4.2. When the z-value is to the right or any related terms (e.g. above, greater
than), subtract 1 by the obtained value in step 3
4.3. When the shaded region is in between of the two z-value, subtract the
biggest by the smallest value obtained in step 3
5. Label the shaded region and draw a conclusion
Example 1: Find the area that corresponds below z = -1.35
Solution:
Step 1. Draw/sketch a normal curve and locate the given z-value.

137
Step 2: Shade the region of the curve according to the condition of z-value
whether it is below, above, or between.

Step 3. Use the table of the area under the normal curve to find the
corresponding area

The intersection between -1.3 and 0.05 is 0.0885


Step 4. Choose the appropriate operation based on step 2 and 3
4.1 When the z-value is to the left or any related terms (e.g. below,
less than) just write the value we obtained in step 3
Since the shaded region of the curve is to the left of the z= -1.35
and the intersection between -1.3 and 0.05 is 0.0885. Therefore,
the area of the shaded region is 0.0885
Step 5. Label the shaded region and draw a conclusion.

Therefore, the area that corresponds below z = -1.35 is 0.0885.

138
Example 2: Find the area to the right of z = -1.35

Step 1:

Step 2:

Step 3. The intersection between -1.3 and 0.05 is 0.0885

Step 4. Since the shaded region is to the right of z-value, we will subtract
0.0885 from 1. The difference is 0.9915

Step 5

Therefore, the area to the right of z = -1.35 is 0.9915.


Example 3: Find the area between z = -1.30 and z = 2
Step 1:

Step 2:

139
Step 3. The value that corresponds to z= 1.3 is 0.0968 and z= 2 is
0.4772.
Step 4. Since the shaded region is within the two z-value, subtract 0.4772
by 0.0968. The difference is 0.8804
Step 5 Therefore, the area between z = -1.30 and z = 2 is 0.8804.

What’s More
Activity 1.
Identify the regions under the normal curve of the following standard normal
values using the z table (refer to the last page of this module).
1. z = - 3.25 ______________ 6. z = -1.33 _______________
2. z = 1.25 ______________ 7. z = -2.5 _______________
3. z = 0.56 ______________ 8. z = 0.29 _______________
4. z = 3 ______________ 9. z = - 0.13 _______________
5. z = 1.96 ______________ 10. z = - 1.96 _______________
Activity 2
Find the area or region that corresponds to each of the following z values. Sketch
and label each curve.
1. to the right of z = 2.18
2. below z= -1.40
3. above z= 0.57
4. between z = -2.34 and z = 1.57
5. between z = 0.32 and z = 2.42
What I Have Learned
1. The total area of the normal curve is 1
2. Z-table or Table of Areas Under Normal Curve contains the corresponding area
of the z value.
3. Steps on how to find the area that corresponds to z –value.
a. Draw/sketch a normal curve and locate the given z-value on the normal curve
b. Shade the region of the curve according to the condition of z-value whether it
is below, above, or between.
c. Use the table of the area under the normal curve to find the corresponding
area.
d. Choose the appropriate operation based on step 2 and 3.
e. Label the shaded region and draw a conclusion.
4. When the z-value is to the left or any related terms (e.g. below, less than) just
write the value we obtained from the z table.
5. When the z-value is to the right or any related terms (e.g. above, greater than),
subtract 1 by the obtained value from the z table.
6. When the shaded region is in between of the two z-value, subtract the biggest by
the smallest value obtained from the z table.

140
What I Can Do
For instance, one of your friends from another school asks your help about his
problem in Statistics and Probability. He needs to answer his assignment but
unfortunately, he left his module in the school. He wants to know how to identify
the regions under the normal curve corresponding to different standard normal
values. Your friend will send a picture of a normal curve with the corresponding
area, and he asks you favor to explain each picture on how to obtain the value of
the shaded part of the normal curves. Here is the screenshot of the pictures:

Write your message


___________________________________________________________________
___________________________________________________________________
___________________________________________________________________
___________________________________________________________________
Assessment
Direction: Read and analyze each item carefully. Choose the letter of the best answer.
Write the chosen letter on a separate sheet of paper.
1. How many percents is the total area that corresponds to the right of 0 and to the
left of 0?
A. 25% B. 50% C. 75% D. 100%
2. What is another term of z table?
A. Table of Areas of Geometric Figure
B. Table of Areas Under Normal Curve
C. Table of Contents
D. Periodic Table of Elements

141
3. Which of the following does NOT belong to the group?
A. to the right B. above C. to the left D. greater than
4. Which of the following figures is the best illustration of the area to the left of z =1?

A. B. C .

B. D

5. Find the area of the shaded region of the given figure.

A. 0.1587 B. 0.3413 C. 0.3907 D. 0.8413


6. What mathematical operation to use in getting values from the z table to find the
area between the two z values?
A. Add B. Divide C. Multiply D. Subtract
7. What is the area between z = 0.23 and z = 1.99?
A. 0.3858 B. 0.3588 C. 0.8239 D. 0.8583
8. Find the area above z= 2.14
A. 0.052 B. 0.0162 C. 0.0324 D. 0.9838
9. Find the z value of the following graph that corresponds to the given
area.

A. – 2.50 B. - 2.0 C. -1.56 D. - 1.50


10. Find the z value of the following graph that corresponds to the given area

A. – 2.24 B. -2.25 C. 2.24 D. 2.25

142
11. How will you describe this graph?

A. The shaded region of the normal curve is below the z = 1


B. The shaded region of the normal curve is above the z=-1
C. The shaded region of the normal curve is between z=0 and z=1
D. The shaded region of the normal curve is between z= -1 and z=1
12. Find the area under the standard normal curve between z = -1.5 and z = 2.52.
A. 0.0668 B. 0.9270 C. 0.9273 D. 0.9392
13. Find the area under the standard normal curve between z = -3 and z = 0.
A 0.4987 B. 0.5000 C. 0 .9987 D. 1.4987
14. Find the area under normal curve from z = - 2.83 to z= -0.82.
A. 0.2038 B. 0.2061 C. 0.2084 D. 0.2083
15. Find the sum of the areas under the standard normal curve to the left of z = -
3.14 and to the right of z = 2.62.
A. 0.0008 B.0.0036 C. 0.0044 D. 0.0052
Additional Activities
Do this additional activity in your activity notebook.
Find the area that corresponds to each of the following z values. Sketch and label
each curve.
1. to the right of z = 1.28
2. below z= -0.20
3. above z= 1.57
4. at least z =-127
5. at most z = 1.39
6. between z = -1.29 and z= - 0.88
7. greater than z= 0
8. to the left of z= 1
9. between z = -2.34 and z = 1.57
10. between z = 0.32 and z = 2.42

143
Statistics and
Probability
Converting a Normal Random
Variable to a Standard Normal
Variable and Vice-versa
Statistics and Probability
Converting a Normal Random Variable to a Standard Normal Variable and Vice-versa
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module

Writers: Dhojie N. Yanto, Elson A. Valencia, and Victor M. Disilio


Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

144
Week

4
What I Need to Know
This module was designed to help you as a learner to converting a normal random
variable to a standard normal variable and vice versa. In this lesson, your acquired
knowledge and skills in formula derivation and rounding off numbers is needed. You
will learn how to convert a normal random variable 𝑥 to standard normal variable 𝑧
and vice-versa.
In a previous module, you have learned about how to illustrate a normal random
variable and its characteristics. It also discussed how to identify regions under the
normal curve corresponding to different standard normal values. This time, you will
focus on converting Normal Random Variable to Standard Normal Random Variable.
A normal distribution may take any values. These values are called raw scores. Raw
scores came from different sources cannot be compared unless we converted them
to standard scores.
Meanwhile, the knowledge, skills, and attitude you acquired about the previous
modules of normal distribution such as – the mean, standard deviation, raw scores
(x) and z – scores – is very useful to attain the target objectives of this module

After going through this module, you are expected to:


1. distinguish a raw score (x) and the standard score (z);
2. express normal random variable (x) as standard normal
variable (z).
3. express standard normal variable (z) as normal random
variable (x).
4. apply the concepts of normal random and standard normal
variables in solving real-life problems.
What I Know
Let us see how far you will go and how much you know about this pretest. Please
answer all questions with all your best and utmost honesty so that we can clearly
gauge your learning of the target competency of this lesson. Let’s get started.
Direction: Read and analyze each item carefully. Choose the letter of the
best answer. Write the chosen letter on a separate sheet of paper.
1. Which of the following is the measure of the number of standard
deviations (σ) a particular data value is away from the mean (μ)?
A. Standardization
B. Random Normal Variable
C. Standard Normal Variable
D. Probability Distribution
2. Given: x = 35 ; µ = 15 ; σ = 4. Find the z score.
A. -5
B. 5
C. 10
D. 15
3. Given: z = - 2.25 ; µ = 60 ; σ = 8. Find the value of x
A. 18
B. 42
C. 78
D. 102
For questions number 4 and 5
Suppose the provincial health officer of Quezon reported that the average COVID 19
positive for 30 days is 8 patients with the standard deviation of 4 patients.

145
4. What is the z score of 15 patients?
A. -5.75
B. -1.75
C. 1.75
D. 5.75
5. What is the z score of 4 patients?
A. – 1
B. -0.5
C. 0.5
D. 1
6. The mean score of the students on pre-assessment is 10 and the
standard deviation is 2. What was Elson’s raw score if his z – score
is 3?
A. 4
B. 12
C. 16
D. 20
7. Some DepEd public and private schools administered National
Achievement Test for Grade 6 and Grade 10. Suppose the test for
Grade 10 has a mean score of 110 and a standard deviation of 15. If
Juan, a Grade 10 student, has obtain a z-score of 1.27, what was
his score on the test?
A. 91
B. 92
C. 129
D. 130
For questions number 8-10
The Enhance Community Quarantine (ECQ) on some places was lifted by our
President Rodrigo Roa Duterte last May 15, 2020. Suppose that after the said ECQ,
one of the dieticians of well-known company conducted the survey about the effect
of home quarantine on the health of the Filipino people. He found out that average
weight for women is 60 kg with a standard deviation of 4 kg.
8.If the weights are normally distributed, what is the z-score of a woman with a
weight of 70.
A. -2.5
B. -1.5
C. 1.5
D. 2.5
9. What is the z score of a woman with a weight of 50 kg?
A. -2.5
B. -1.5
C. 1.5
D. 2.5
10. How many kilograms will be corresponds to the z score of 0.5 of the
weight of woman?
A. 56kg
B. 58kg
C. 60 kg
D. 62 kg

146
11. If the scores in a Probability and Statistics test are normally
distributed with a mean of 25 and standard deviation of 3.5. What is the
z-score for a score of 30?
A. 1.43
B. 1.06
C. 0.72
D. -1.43
12. Gracia, an ALS student, obtained a z-score of 2.35 in an achievement
test that is normally distributed with a mean grade of 83 and standard
deviation of 5. What was Gracia’s grade in the achievement test?
A. 71.25
B. 80.65
C. 85.35
D. 94.75
13. In a normal distribution with a mean of 69 and standard deviation of 15,
what does a z-score of -2 indicates?
A. It indicates that the score is equal to 39.
B. It indicates that the score is above the mean.
C. It indicates that the score is 39 less than the mean
D. It indicates that the difference between the score and the
mean is 39.
14. Francis Leo and Romy who are best friends took a college entrance
examination. The mean score and standard deviation of the
examination are 115 and 12, respectively. If Francis Leo and
Romy obtained z-scores of 1.5 and -0.95 respectively, what was
their scores in the examination?
A. 132 and 104
B. 133 and 104
C. 132.75 and 104
D. 133.75 and 103.6
15. The velocities of cars in express way are normally distributed with
a mean of 80 km/hr and standard deviation of 8 km/hr. Jessie’s
velocity has a z-score of 2.5, what was his velocity?
A. 100 km/hr
B. 105 km/hr
C. 110 km/hr
D. 115 km/hr
How do you find this pre-test? Did you encounter both familiar and unfamiliar
terms, symbols and notation? Kindly compare your answer in the Answer Key on the
last part of this module.

If you obtain 100% or a perfect score, skip the module and immediately move to the
next module. While in the event you missed a point, please proceed on the module
as it will enrich your knowledge in normal distribution, especially in illustrating the
normal random variables and its characteristics
Lesson 1 Converting a Normal Random
Variable to a Standard Normal
Variable and Vice-versa
Did you take your National Career Assessment Examination (NCAE) on your Grade
9? How about the National Achievement Test on your Grade 6 and 10? Have you ever
wondered why there are corresponding percentiles in the test result such as 86, 95

147
or even 99+? If not, this lesson will teach you on how to appreciate the importance
of knowing the answer in those questions?
What’s In
Let’s review. Solve for x in terms of another variable.
1. ax = b
2. x + y =1
3. y = mx + b
a+ b
4. =1
x
x− μ
5. z =
σ
Answers:
𝑦− 𝑏
1. x = b/a 2. x = 1-y 3. x = 4. x = a + b 5. x = zσ + μ
𝑚

What’s New
Fill Me Up
Complete the table below by filling in the correct value of the required variable. Use
𝑥− 𝜇
this formula 𝑧 = and solve.
𝜎

z x 𝜇 𝜎

___ 23 18 4

-1.17 ___ 125 30

___ 515 475 100

0.565 ___ 15.5 5.5

This time, you need to reflect on the following questions. Answer the following
questions:
1. How did you find the activity?
2. How were you able to find the value of z? x?
3. Can you derive the formula in finding the value of x?
4. Do you think it is possible to solve for the μ and σ given the value of x and z?
What is It
Was the activity fun and interesting? How did you manage to derive the
formula? Are you successful in completing the table?
The activity allows you to manipulate the given data and the missing value
through derivation. In this case, you convert the raw score (x) into the standard score
𝑥− 𝜇
(z) using the formula 𝑧 = . You substitute the given values and uses properties
𝜎
of equality and algebraic rules to obtain the needed data
In solving real-life problems involving the normal curve, it is particularly
important to convert a random variable x to a standard normal variable or z-score.
This procedure is known as “standardizing” or “standardization” of a random
variable, where a standardized value is called a z-score. A z-score is a measure of the
number of standard deviations (σ) a particular data value is away from the mean (μ).

148
Suppose your score on a test in Probability and Statistics is 39 and the scores
are normally distributed with a mean of 33 and standard deviation of 3, then your
score is exactly 2 standard deviations above the mean. If you scored 30, then it is
exactly 1 standard deviation below the mean. All values that are above the mean
have positive z-scores and all values that are below the mean have negative z-scores.
If you obtained a z-score of -3, this means that your score is 3 standard deviations
below the mean, that is, 33 – 3(3) = 33 – 9 = 24.
Given any value x from a normal distribution with mean μ and standard
deviation σ, to convert x to a z-score (standard normal score), you need to;
(a) Subtract the mean μ from x.
(b) Divide this quantity, 𝑥 – 𝜇, by the standard deviation σ.
The formula used in converting a random variable x to a standard normal
variable z is:
𝑥− 𝜇
𝑧 =
𝜎

where: 𝑧 – standard normal score or z – score


𝑥 – any data value in a normal distribution
𝜇 - mean
𝜎 - standard deviation

Example 1.
Suppose IQ scores are normally distributed with a mean of 100 and standard
deviation of 10. If your IQ is 85, what is your z-score? (Round off your answer to the
nearest hundredths)
Solution:
𝑥− 𝜇
The z-score can be computed using the formula; 𝑧 =
𝜎

85 − 100 −15
𝑧 = = = −𝟏. 𝟓𝟎
10 10
Example 2.
On a nationwide placement test that is normally distributed, the mean was
125 and standard deviation was 15. If you scored 149, what was your z-score?
(Round off your answer to the nearest hundredths)
Solution:
𝑥− 𝜇
Using the formula 𝑧 =
𝜎
149 − 125
𝑧 =
15
24
= = 1.60
15
Example 3.
The heights of teachers in Sta. Catalina National High School are normally
distributed with a mean of 150 cm and standard deviation of 15 cm. The height of
Sir Victor has a z-score of 3.25. What is the actual height of Sir Victor? (Round off
your answer to the nearest hundredths)
Solution:
In this example, z-score is given while the normal random variable is
unknown. To solve for the normal random variable x, multiply the z-score (z) by the
standard deviation σ, then add the mean μ.
Therefore;
𝑥 = 𝑧𝜎 + 𝜇
= 3.25(15 cm) + 150 cm
= 48.75 cm + 150 cm

149
𝑥 = 198.75 𝑐𝑚
Example 4.
The time it takes for a cell to divide is normally distributed with an
average of 60 minutes and standard deviation of 5 minutes. How long will it take for
a given cell to divide if its “mitosis” has a z-score of -1.35?
Solution:
𝑥 = 𝑧𝜎 + 𝜇
= -1.35(5 min) + 60 min
= -6.75 min + 60 min
𝒙 = 𝟓𝟑. 𝟐𝟓 𝒎𝒊𝒏𝒖𝒕𝒆𝒔
Example 5.
Suppose you have the population values 40 and 70 and that their
corresponding z – scores are -1 and 2, respectively.
1. Is it possible to determine the population’s mean and standard deviation?
2. If so, what are these values?
Solution:
To begin, let us first identify the given. Let
𝑥1 = 40 𝑥2 = 70
𝑧1 = −2 𝑥2 = 1

Use these given and substitute to the formula (12-2), we have


𝑥1 = 𝑧1 𝜎 + 𝜇 𝑥2 = 𝑧2 𝜎 + 𝜇
40 = −2𝜎 + 𝜇 70 = 1𝜎 + 𝜇
Note: In this example, we do not have 𝜎1 and 𝜎2 , and 𝜇1 and 𝜇2 since we are describing
a single population. Thus, 𝜎1 = 𝜎2 = 𝜎 and 𝜇1 = 𝜇2 = 𝜇.
Furthermore, we need to review our algebra specifically on systems of linear
equation. So,
40 = −2𝜎 + 𝜇 equation 1
70 = 1𝜎 + 𝜇 equation 2
Subtract equation 1 from equation 2, we have
40 = −2𝜎 + 𝜇 equation 1
70 = 1𝜎 + 𝜇 equation 2
30 = 1𝜎 − (−2𝜎)
30 = 3𝜎 the difference
1 1 1
(30 = 3𝜎) multiply by
3 3 3
10 = 𝜎 simplify
The standard deviation of the population is 10. To obtain the mean, substitute it to
either equation 1 or 2.
40 = −2𝜎 + 𝜇 ⇒ 40 = −2(10) + 𝜇 ⇒ 60 = 𝜇
70 = 1𝜎 + 𝜇 ⇒ 70 = 1(10) + 𝜇 ⇒ 60 = 𝜇
Both agrees that the mean of the population is 60.
Therefore,
1. It is possible to determine the population’s mean and standard deviation
given only the raw score (x) and its corresponding standard score (z).
2. The mean and standard deviation of the population is 60 and 10,
respectively.

150
What’s More
Independent Activity
Find Me
Supply the needed information by solving the missing value.
1. x = 25, µ= 40, σ = 6, z = ?
2. z = -2.25, µ= 218, σ = 36, x = ?
3. x = 39, µ= 30, σ = 3, z = ?
4. x = 125, µ= 105, σ = 8, z = ?
5. z = 1.11, µ= 145, σ = 20, x =?
Independent Assessment
Solve Me
Solve the following problems.
1. The mean number of hours a Filipino worker spends on the computer is
3.1 hours per workday. Assume the standard deviation is 0.5 hour and is
normally distributed, how long does a worker spend on the computer if his
z-score is 1.2?
2. Each month, a Filipino household generates an average of 28 pounds of
newspaper for garbage or recycling. Assume the standard deviation is 2
pounds. Determine the z-score of a household that generates 22 pounds
of newspaper.
3. The Candelaria Automobile Association reports that the average time it
takes to respond to an emergency call is 30 minutes. Assume the variable
is normally distributed and the standard deviation is 4.5 minutes. How
long will a call be responded if it has a z-score of 0.75?
4. The average monthly salary for newly – hired teachers is P21,945. If the
distribution is approximately normal with a standard deviation of P3250.
How much will a teacher earn in a month if his salary has a z-score of
1.15?
What I Have Learned
Complete the statements below by filling in what you have learned from this module.
I believe you can do this without going back to the module. In the event you are
unsure with your response. Please reread the module. You can seek help from your
facilitator or classmates if there is something you need to clarify. Your ability to
comprehend and honesty in answering this part is very much appreciated. Happy
learning!
1. A __________ is a measure of the number of standard deviations (𝜎) a particular
data value is away from the mean (𝜇)
2. _________________________________ is equal to the quotient between the
difference of normal random variable (x) or raw score and mean and standard
deviation. It is written as:
𝑥− 𝜇
z=
𝜎
3. __________________________________ is equal to the product of z score and
standard deviation plus the mean. It is written as:
𝑥 = 𝑧𝜎 + 𝜇
What Can I Do
Read and analyze the following problem carefully

Victor, a Grade 11 student of Valencia National High School is one of the candidates
for the scholarship program in one of the well-known companies in their

151
municipality. He was already submitted the different requirements like ITR of his
parents and G10 Report Card. The HR of the said company make a list of the five
priority subjects and he instructed his clerk to compute the mean and standard
deviation. Assuming that you are the assigned clerk, kindly complete the following
table:

SUBJECTS Grade
Filipino 85
English 83
Math 90
Social Science 88
Science 88
Mean
SD
Based on the above data, express each grades of the respective subjects into z-
scores. Show your complete solution and write your final answer on the following
table.
SUBJECTS Grade z- scores
Filipino 85
English 83
Math 90
Social Science 88
Science 88

Assessment
Direction: Read and analyze each item carefully. Choose the letter of the best answer.
Write the chosen letter on a separate sheet of paper.
1. What is another term used for standard normal score?
A. z- score
B. mean score
C. x score
D. t score
2. Given: x =125; µ = 105; 𝜎 = 16. Find the z score.
A. -1.25
B. -1
C. 1
D. 1.25
3. Given: z = 2.25; µ = 70; 𝜎 = 9. Find the value of x
A. -49.75
B. 35.11
C. 90.25
D. 148.5
For questions number 4 and 5
Suppose the provincial health officer of Batangas reported that the average COVID19
positive for 40 days is 10 patients with the standard deviation of 6 patients.
4. What is the z score of 18 patients?
A. 1.33
B. 1.34
C. 4.66
D. 4.67

152
5. What is the z score of 4 patients?
A. – 1
B. -0.5
C. 0.5
D. 1
6. If Elson's z-score on post-assessment is -3, the mean is 10 and the standard
deviation is 2, what was his raw score?
A. 4
B. 12
C. 16
D. 20
7. Some DepEd public and private schools administered National Achievement
Test for Grade 6 and Grade 10. The test for Grade 10 has a mean score of and
a standard deviation of 10. If Juan, a Grade 10 student has z-score of 1.25,
what was his
score on the test?
A. 127
B. 128
C. 129
D. 130

For questions number 8 - 10


The Enhance Community Quarantine of some places was lifted by our
President Rodrigo Roa Duterte last May 15, 2020. Suppose that after the said ECQ,
one of the dieticians of well-known company conducted the survey about the effect
of home quarantine on the health of the Filipino people. He found out that average
weight for women is 70 kg with a standard deviation of 6 kg.

8. If the weights are normally distributed, what is the z-score of a with a


weight of 63
A. -1.17
B. -1.16
C. 1.16
D. 1.17
9. What is the z score of a woman with a weight of 85 kg?
A. -2.5
B. -1.5
C. 1.5
D. 2.5
10. How many kilograms will be corresponds to the z score of 1.5 of the weight
of woman?
A. 76kg
B. 79 kg
C. 82 kg
D. 85 kg
11. If the scores in a Probability and Statistics test are normally distributed
with a mean of 31 and standard deviation of 4.5. What is the z-score for a score
of 40?
A. 1
B. 2
C. 2.5
D. 3

153
12. Gracia, an ALS student, obtained a z-score of 1.35 in an achievement test
that is normally distributed with a mean grade of 85 and standard deviation of
5. What was Gracia’s grade in the
achievement test?
A. 71.25
B. 78.25
C. 91.75
D. 94.65
13. In a normal distribution with a mean of 15 and standard deviation of 3,
what does a z-score of -2 indicate?
A. It indicates that the score is equal to 9.
B. It indicates that the score is below the mean.
C. It indicates that the score is 9 less than the mean.
D. It indicates that the difference between the score and the mean is 6.
14. Ronnie and Loida who are best friends took a college entrance
examination. The mean score and standard deviation of the examination are
110 and 15, respectively. If Ronnie and Loida obtained z-scores of 1.25 and -
0.75 respectively, what was their scores in the examination?
A. 129 and 99
B. 128.75 and 88.75
C. 130 and 99
D. 140.75 and 100.75
15. The velocities of cars in express way are normally distributed with a Mean
of 90 km/hr and standard deviation of 10 km/hr. Jessie’s velocity has a z-
score of 2.5, what was his velocity?
A. 100 km/hr
B. 105 km/hr
C. 110 km/hr
D. 115 km/hr
Additional Activities
Read the following problems carefully. Apply what you have learned in the module.
The Candelaria Automobile Association reports that the average time it takes to
respond to an emergency call is 30 minutes. Assume the variable is normally
distributed and the standard deviation is 4.5 minutes.
a. How long will a call be responded if it has a z-score of 0.75?
b. How long will a call be responded if it has a z-score of – 1.83?

154
Statistics and
Probability
Computing Probabilities and
Percentiles Using the Standard
Normal Table
Statistics and Probability
Computing Probabilities and Percentiles Using the Standard Normal Table
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module

Writers: Dhojie N. Yanto, Elson A. Valencia and Victor M. Disilio


Editors: Celestina M. Alba and Nestor N.Sandoval,
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Edna E. Eclavea
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

155
Week

4
What I Need to Know
This module was designed to help you as a learner to compute for probabilities
and percentiles using the standard normal table. In this lesson, you will learn how
to apply the z-score in finding probabilities and percentiles by simply looking at the
Table of Standard Normal Distribution. Moreover, this module will enable you to
solve real-life problems on probabilities and percentiles. It is easy to understand the
uses of normal distribution because the examples and problems provided are based
on real-life situations.
In the previous modules, you have learned about identifying regions under the
normal curve corresponding to different standard normal values and converts a
normal random variable to a standard normal variable and vice versa. This time, you
will focus on computing for probabilities and percentiles using the standard normal
table. A standard normal table or unit normal table or z – table, is a mathematical
table composed of a list of z – values and probabilities that arrange cumulatively.
Meanwhile, the knowledge, skills, and attitude you acquired about the normal
distribution, mean, standard deviation, raw score (x) and standard scores (z) is
extremely useful to achieve our target objectives on this module

After going through this module, you are expected to:


1. utilize the use of the Table of Standard Normal Distribution;
2. compute probabilities and percentiles related to a given 𝑧-score or
normal random variable 𝑥; and
3. sketch the graph of the normal distribution.

What I Know
Let us see how far you will go and how much you know about this pre-test. Please
answer all questions with all your best and utmost honesty so that we can gauge
your learning of the target competency of this lesson. Let’s get started!

Direction: Read and analyze each item carefully. Choose the letter of the best answer.
Write the chosen letter on a separate sheet of paper.
1. Which of the following notations indicate the probability of a z value from the left?
A. 𝑃(𝑍 > 𝑧) B. 𝑃(𝑍 < 𝑧) C. 𝑃(𝑋 < 𝑥) D. 𝑃(𝑋 > 𝑥)
2. What does P(a<Z<b) notation mean?
A. Probability of z that is in the left of a and b
B. Probability of z that is in the right of a and b.
C. Probability of z that is in the opposite of a and b.
D. Probability of z that is in between two other z values a and b.
3. What is the probability of z value indicated by 𝑃(𝑍 < −1.78)?
A. 0.0375 B. 0.3075 C. 0.9625 D. 0.9633
4. Find the probability value of 𝑃(𝑍 > −1.00).
A. 15.87% B. 34.13% C. 84.13% D. 90.13%
5. Compute the probability value of 𝑃(−1.35 < 𝑍 < 2.75)
A. 90.85% B. 99.70% C. 0.9085 D. Both A and C
6. Find the z-scores that bound the middle 92.50% of the area under the standard
normal curve.
A. -1.44, 144 B. -1.44, -1.78 C. 1.44, 1.78 D. -1.78, 1.78
7. Which of the following graph is the correct illustration of the notation 𝑃(𝑎 < 𝑧 <
𝑏)?

A. B. C. D.

156
8. The average score on Statistics and Probability First Quarterly Assessment is 35
points with a standard deviation of 5. What is the probability that Beth’s score is
less than 45 points?
A. -2 B. 2 C. 97.72% D. 2.28%
9. Given the normal distribution with a mean of 35 and a standard deviation of 3,
which interval of the variable defines an area under the curve corresponding to a
probability of approximately 68.26 %?
A. (35,38) B. (32, 38) C. (29,38) D. (29,41)

For items 10 to12, use the situation below.


Eight hundred fifty businessmen accepted the Mayaman Challenge of one of
the Philanthropic Filipinos who are willing to give relief goods for our fellow Filipinos
affected by the recent pandemic. They will be contributing an average of 120 sacks
of rice with a standard deviation of 25 sacks of rice.
10. What is the probability that the businessman will contribute less than 90 sacks
of rice?
A. 11.51% B. 76.99% C. 88.49% D. 89.45%
11. What is the probability that the businessman will contribute more than150
sacks of rice?
A. 11.51% B. 76.99% C. 88.49% D. 89.45%
12. How many businessmen will contribute a sack of rice between 100 sacks
and 145 sacks?
A. 534 B.535 C. 715 D. 716
13. The ages of ALS students enrolled at Sta. Catalina National High School are
normally distributed with a mean of 19 years and a standard deviation of 3.5
years. If a student is selected at random, what is the probability that his age is
under 22 years?
A. 11.67% B. 19.49% C. 80.51% D. 90.57%
14. Teaching the value of productive work, ABM students launched the “Go Negosyo”
project. If their weekly sales on this project is normally distributed with mean of
P3,200 and standard deviation of P500. What percent of their weekly sales are
over P4,000?
A. 94.52% B. 77.34% C. 22.57% D. 5.48%

15. In problem number 14, how many percents of their weekly sales are less than
P3,000?
A. 11.61% B.34.46% C. 65.54% D 88.39%

How do you find this pre-test? Did you encounter both familiar and unfamiliar
terms, symbols, and notations? Kindly compare your answer in the Answer Key on
the last part of this module.
If you obtain 100% or a perfect score, skip the module and immediately move
to the next module. While in the event you missed a point, please proceed with the
module as it will enrich your knowledge in normal distribution, especially in
illustrating the normal random variables and its characteristics.

Lesson 1 Computing Probabilities and


Percentile Using the Standard
Normal Table
Have you tried comparing your previous performance to your present
performance? Or comparing your performance in different subjects like Math and

157
English? Which of the two subjects are you better - Math with a score of 30 whose
mean is 25 and a standard deviation of 5 or English with a score of 32 whose mean
is 27 and a standard deviation of 6?
These are simple questions we thought do not require statistics, but it is. In
this lesson, this module will guide you on how to compute probabilities and
percentile using the z - table.

What’s In
Let’s review. Supply the needed information by writing on the response area.
Questions Responses here
1. Convert 0.4783 into percent. _____________________
2. What is the equivalent of 38.45% in decimal? _____________________
3. What is the product of 450 and 0.2356? _____________________
(Round off your answer into nearest whole number)
4. What is the 25% of 100? _____________________
5. What is the total area of a normal curve? _____________________

What’s New
Find the area of the shaded region under the normal curve and make a brief and
concise interpretation of the following graph.

1. 2.

3. 4.

5. 6.

158
What is It
How did you find the activity? Was it fun and challenging at the same time?
How did you determine the area of the shaded region? What are the strategies you
commonly used?
A normal distribution curve can be used as a probability distribution curve
for normally distributed variables. The area under the standard normal distribution
curve can also be thought of as a probability. That is, if it's possible to select any 𝑧
value at random, the probability of choosing one, say, below 1.45 would be the same
as the area under the curve at the left of 1.45. In this case, the area is 0.9265.
Therefore, the probability of randomly selecting a 𝑧 value below of 1.45 is 0.9265 or
92.65%. The problems involving probabilities and percentiles are solved in the same
manner as finding the areas under a normal curve.
In finding probabilities, the following notations will be used:
(a) 𝑃(𝑍 < 𝑧) – probability at the left of 𝑧
(b) 𝑃(𝑍 > 𝑧) = 1 – 𝑃(𝑍 < 𝑧) – probability at the right of 𝑧
(c) 𝑃(𝑎 < 𝑧 < 𝑏) – the probability of 𝑧 that is in between two other 𝑧 values,
say 𝑎 and 𝑏
(d) 𝑃(𝑧 < 𝑎 ) ∪ 𝑃(𝑧 > 𝑏 ) – the probability of 𝑧 is in the opposite direction of two
values, say 𝑎 and 𝑏
(e) 𝑃(𝑋 < 𝑥) – probability at the left of a normal random variable 𝑥
(f) 𝑃(𝑋 > 𝑥) – probability at the right of a normal random variable 𝑥
(g) 𝑃(𝑎 < 𝑋 < 𝑏) – the probability of a normal random variable 𝑋 that is in
between two other normal random variables, say 𝑎 and 𝑏.
(h) 𝑃(𝑋 < 𝑎 ) ∪ 𝑃(𝑋 > 𝑏 ) – the probability of 𝑋 is in the opposite direction of
two values, say 𝑎 and 𝑏
Example 1.
Find the probabilities for each of the following.
(a) 𝑃(𝑍 < 1.32)
(b) 𝑃(𝑍 < −1.05)
(c) 𝑃(−0.75 < 𝑍 < 1.56)
(d) 𝑃(𝑍 > −0.88)
Solution:
(a) The probability 𝑃(𝑍 < 1.32) means to find the area under the normal curve
at the left of 𝑧 = 1.32. From the table, find the intersection on row 1.3 and
column 0.02. Thus, the probability is 0.9066 or 90.66%.

(b) The probability 𝑃(𝑍 < −1.05) is the same as the area under the normal
curve at the left of 𝑧 = −1.05. From the table, the area is 0.1469. Thus,
the probability is 0.1469 or 14.69%.

159
(c) The probability 𝑃(−0.75 < 𝑍 < 1.56) is the same as the area under the
normal curve between 𝑧 values -0.75 and 1.56. To get the area, subtract
the area of 𝑧 = −0.75 from the area of 𝑧 = 1.56. Thus, 𝑃(−0.75 < 𝑍 <
1.56) = 𝑃(𝑍 < 1.56)– 𝑃(𝑍 < −0.75) = 0.9406 – 0.2266 = 𝟎. 𝟕𝟏𝟒𝟎 𝑜𝑟 𝟕𝟏. 𝟒𝟎%.

(d) The probability 𝑃(𝑍 > −0.88) is the same as the area under the normal
curve at the right of 𝑧 = −0.88. To get the area, subtract the area of 𝑧 =
−0.88 from 1. Thus, 𝑃(𝑍 > −0.88) = 1 – 𝑃(𝑍 < −0.88) = 1 – 0.1894 =
𝟎. 𝟖𝟏𝟎𝟔 𝑜𝑟 𝟖𝟏. 𝟎𝟔%.

Example 2.
Let 𝑋 be a normal random variable with mean 𝜇 = 15 and standard deviation
𝜎 = 3. Find the probabilities of the following:
(a) 𝑃(𝑋 < 19)
(b) 𝑃(10 < 𝑋 < 19)
Solution:
(a) To find the probability, convert the normal random variable 19 to a z-score.
𝑥− 𝜇 19− 15 4
Thus, 𝑃(𝑋 < 19) = 𝑃(𝑍 < ) = 𝑃(𝑍 < ) = 𝑃(𝑍 < ) = 𝑃(𝑍 < 1.33).
𝜎 3 3
From the table, the probability at the left of 𝑧 = 1.33 is equal to 0.9082 or
90.82%.
(b) To find the probability, convert the normal random variables 10 and 19 to
10−15 19−15
z values. Thus, 𝑃(10 < 𝑋 < 19) = 𝑃( <𝑍 < ) = 𝑃(−1.67 < 𝑍 <
3 3
1.33) = 𝑃(𝑍 < 1.33) – 𝑃(𝑍 < −1.67). From the table, we have
0.9082 – 0.0475 = 𝟎. 𝟖𝟔𝟎𝟕 or 𝟖𝟔. 𝟎𝟕%.
These probabilities can also be applied to real-life problems.
Example 3.
The mean number of hours a Filipino worker spends on the computer is 3.1
hours per workday. Suppose the spending time is normally distributed with a
standard deviation of 0.5 hour, find the percentage of workers who spend less than
3.5 hours on the computer assuming that the variable is normally distributed.
Solution:
The percentage of workers who spend less than 3.5 hours is the same as the
area under the normal curve at the left of a normal random variable of 3.5 hours.
3.5−3.1
Convert 3.5 to a z value. That is, z = = 0.80. From the table, the area at the left
0.5
of z = 0.8 is 0.7881. Thus, the percentage of workers who spend less than 3.5 hours
is 78.81%.

160
Example 4.
Each month, a Filipino household generates an average of 28 pounds of
newspaper for garbage or recycling. Assume that the distribution is normal with a
standard deviation of 2 pounds. If a household is selected at random, find the
probability of its generating
a. between 27 and 31 pounds per month.
b. more than 30.2 pounds per month.
Solution:
(a) The probability of generating between 27 and 31 pounds is 𝑃(27 < 𝑋 <
27−28 31−28
31). This is equal to 𝑃( < 𝑍 < ) = 𝑃(−0.5 < 𝑍 < 1.50) = 𝑃(𝑍 <
2 2
1.50) – 𝑃(𝑍 < −0.5) = 0.9332 – 0.3085 = 0.6247. Thus, the probability of
generating between 27 and 31 pounds of newspaper per month is 0.6247
or 62.47%.

(b) The probability of generating more than 30.2 pounds per month is
equivalent to 𝑃(𝑋 > 30.2) which is equal to 1 – 𝑃(𝑋 < 30.2) since it is right-
30.2−28
tailed.1 – 𝑃(𝑋 < 30.2) = 1 – 𝑃 (𝑍 < ) = 1 – 𝑃(𝑍 < 1.10) = 1 – 0.8643
2
= 0.1357. Thus, the probability of generating more than 30.2 pounds of
newspaper per month is 0.1357 or 13.57%.

Example 5.
The Mauban Automobile Association reports that the average time it takes to
respond to an emergency call is 35 minutes. Assume the variable is approximately
normal and the standard deviation is 5.5 minutes. If 200 calls are randomly selected,
approximately how many were responded in less than 25 minutes?
Solution:
The number of calls responded in less than 25 minutes is equal to the
percentage of 200 calls which were responded in less than 25 minutes. First, find

161
25−35
𝑃(𝑋 < 25). 𝑃(𝑋 < 25) = 𝑃(𝑍 < ) = 𝑃(𝑍 < −1.82). From the table, this is equal
5.5
to 0.0344. Thus, there are (0.0344)(200) = 6.88 or approximately 7 calls were
responded in less than 25 minutes.

Example 6.
To help students improve their reading, Mauban south district decides to
implement a reading program. It is to be administered to the bottom 5% of the
students in the district, based on the scores of a reading achievement test which are
normally distributed. If the average score for the students in the district is 122.6 and
the standard deviation is 18, find the cut-off score that will make a student eligible
for the program.
Solution:
The problem is asking for a
normal random variable that
corresponds to a z value and
occupying an area of 5% or 0.05
from the left under the normal
curve. To find this normal random
variable which is the cut-off score
that will make a student eligible
for the reading program, look for
the 𝑧 value on the table
corresponding to an area of 0.05
and that is equal to −1.645. Then, convert this 𝑧 value to a normal random variable
𝑥, that is, 𝑥 = 𝑧𝜎 + 𝜇 = (−1.645)(18) + 122.6 = −29.61 + 122.6 = 92.99 ≈ 93.
Therefore, a student will be eligible in the reading program if he scored 93 and below.

Example 7.
To pass in the accreditation and
equivalency (A&E) test, ALS students must
score in the top 15% in general ability tests.
The test has a mean of 200 and a standard
deviation of 20. Find the lowest possible
score to pass the test assuming the test
scores are normally distributed.
Solution:
The lowest possible score is the
normal random variable corresponding to a
𝑧 value occupying an area of 0.15 from the
right of the normal curve. To solve for the
required variable, given that P(Z > z) = 0.15
NOTE: Remember 𝑃(𝑍 < 𝑧) + 𝑃(𝑍 > 𝑧) = 1
𝑃(𝑍 > 𝑧) = 0.15
1 – 𝑃(𝑍 > 𝑧) = 1 – 0.15
1 – 𝑃(𝑍 > 𝑧) = 1 – 0.15 since it is right tailed
= 0.85
from the table, look for the z value corresponding to this area.
Thus, 𝑧 = 1.03 then convert this to a normal random variable x
𝑥 = 𝑧𝜎 + 𝜇 = (1.03)(20) + 200 = 220.6 ≈ 221

Therefore, the lowest possible score to pass the test is 221.

162
What’s More
Independent Practice
Find Me
A. Use the Table of Standard Normal distribution to determine corresponding
probabilities of the following:
1. 𝑃(𝑍 < −1.45)
2. 𝑃(0.5 < 𝑍 < 2.33)
3. 𝑃(𝑍 > 1.78)
B. 𝑋 is a normally distributed random variable with a mean of 60 and a
standard deviation of 8. Find the probabilities indicated by using the table.
4. 𝑃(𝑋 < 52)
5. 𝑃(48 < 𝑋 < 64)
6. 𝑃(𝑋 > 57)
Independent Assessment
Solve Me
Solve the following problems by supplying the needed information.
The average number of calories in a 1.8-ounce chocolate bar is 230. Suppose
that the distribution of calories is approximately normal with a standard deviation of
10.
1. Find the probability that a randomly selected chocolate bar will have less
than 200 calories.
2. Find the probability that a randomly selected chocolate bar will have
greater than 195 calories.
3. What is the percentage that a cholate bar is randomly selected between
200 calories and 250 calories?
What I Have Learned
Complete the statements below by choosing the provided options in what you
have learned from this module. I believe you can do this without going back to the
module. In the event, you are unsure of your response, please reread the module.
You can seek help from your facilitator or classmates if there is something you need
to clarify. Your ability to comprehend and honesty in answering this part is very
much appreciated. Happy learning!
A. Underline the appropriate response to make each statement true.
1. 𝑷(𝒁 < 𝒛) – Probability at the (left, right) of 𝑧.
2. 𝑷(𝒁 > 𝒛) = 1 – 𝑃(𝑍 < 𝑧) – Probability at the (left, right) of z.
3. 𝑷(𝒂 < 𝒛 < 𝒃) – Probability of z that is in (between, opposite direction) two other
𝑧 values 𝑎 and 𝑏.
4. 𝑷(𝑿 < 𝒙) – Probability at the (left, right) of a normal random variable 𝑥.
5. 𝑷(𝑿 > 𝒙) – Probability at the (left, right) of a normal random variable 𝑥.
6. 𝑷(𝒂 < 𝑿 < 𝒃) – Probability of a normal random variable 𝑋 that is in (between,
opposite direction) two other normal random variables 𝑎 and 𝑏.
B. Arrange the statements below step by step to compute for the probabilities and
percentiles using the Standard Normal Table. Write your responses to the provided
table.
Convert the raw score (x) to a standard score (z).
Look for the probabilities on the Standard Normal Table.
Sketch the graph of the normal curve.
Step 1
Step 2
Step 3

163
What Can I Do
Read and analyze the following problem carefully
A. Suppose your section will participate in the field demonstration at the Sports
festival in your school. This field demonstration must be consisting of 20 learners
per section. Your adviser assigned you to organize a team that is willing to join in
the said event. He also instructed you to collect information about your
classmates’ height because he will need to buy a costume. Since your community
is under the General Community Quarantine because of the recent pandemic,
you need to contact your classmates through SMS or messenger to get their
heights and write it on the table below.

Name Height (cm) Name Height (cm)


1 11
2 12
3 13
4 14
5 15
6 16
7 17
8 18
9 19
10 20
B. Answer the following questions based on your collected data.
1. What is the mean? standard deviation?
2. Suppose the above data is normally distributed. What is the probability
that the height of your classmates picked randomly is
a. less than 155 cm?
b. more than 143 cm?
c. between 140cm and 160 cm?
3. How many participants in your team have heights between 140 cm to
170 cm?

Assessment
Direction: Read and analyze each item carefully. Choose the letter of the best answer.
Write the chosen letter on a separate sheet of paper.
1. Which of the following notations indicate the probability of a z value from the
right?
A. P(Z > z) B. P(Z < z) C. P(X < x) D. P(X > x)
2. What does P(a<Z<b) notation mean?
A. Probability of x that is in the right of a and b.
B. Probability of x that is in the opposite of a and b.
C. Probability of X that is in between two other z values a and b.
D. Probability of a normal random variable X that is in between two other
normal random variables a and b
3. What is the probability of z value indicated by P(Z > -1.78)?
A. 0.0375 B. 0.3075 C. 0.9625 D. 0.9633
4. Find the probability value of P(Z< -1.0)
A. 15.87% B. 34.13% C. 84.13% D. 90.13%
5. Compute the probability value of P(1.35< Z< 2.75)
A. 8.55% B. 85.5% C. 85.85% D. 90.85%

164
6. Find the z-scores that bound the middle 85.02% of the area under the
standard normal curve.
A. (-1.44, 1.44) B. (-1.44, -1.78) C. (1.44, 1.78) D. (-1.78, 1.78)
7. Which of the following graph is the correct illustration of the notation
𝑃(𝑍 < 𝑎) ∪ 𝑃(𝑍 > 𝑏)?

A. B. C. D.
8. The average score on Statistics and Probability Summative Test is 40 points
with a standard deviation of 4. What is the probability that Beth’s score is
25 points?
A. -3.75% B. -0.009% C. 0.009% D. 3.75%
9. A set of data is normally distributed with a mean of 40 and a standard deviation
of which interval of the variable defines an area under the curve corresponding
to a probability of approximately 95.44 %?
A. (37,40) B. (34,43) C. (34,46) D. (37,46)

For item numbers 10-12


Nine hundred thirty-two businessmen accepted the Mayaman Challenge of
one of the Philanthropic Filipinos who are willing to give relief goods for our fellow
Filipinos affected by the recent pandemic. They will be contributing an average of
135 sacks of rice with a standard deviation of 15 sacks of rice.
10. What is the probability that the businessman will contribute less than 105 sacks
of rice?
A.98% B. 97.72% C. 2.28% D. 2.0%
11. What is the probability that the businessman will contribute more than
150 sacks of rice?
A. 15.87% B. 76.45% C. 80.78% D. 84.13%
12. How many businessmen will contribute a sack of rice between 105 sacks
to 145 sacks?
A. 534 B.653 C. 676 D. 726
13. The ages of Senior High students enrolled at Sta. Catalina National High School
are normally distributed with a mean of 17 years and a standard deviation of 2.5
years. If a student is selected at random, what is the probability that his age is
under 16 years?
A. 15.54% B. 34.46% C. 68.92% D. 84.56%
14. The weekly sales of ABM students on their “Go Business” project are normally
distributed with a mean of P4,250 and a standard deviation of P300. How many
percent of their weekly sales are over P4,700?
A. 6.68% B. 13.36% C. 86.63% D. 93.32%
15. In problem number 14, how many percent of the ABM students' weekly sales are
less than P4,700?
A. 6.68% B. 13.36% C. 86.63% D. 93.32%

165
Additional Activities
Read the following problems carefully. Supply the needed information. Round off
your answer to the nearest hundredths (two decimal places) for the standard scores
and nearest ten thousandths (four decimal places) for the area of probabilities.
1. Use the Table of Standard Normal distribution to find the probabilities of the
following:
a. P(Z < -1.51)
b. P(Z < 1.37)
c. P(Z > 1.78)
d. P(Z > 1.51)
e. P(1.15 < Z < 2.33)

2. To lessen the number of non-numerates in Sta. Catalina National High School,


the Mathematics department implemented Project MEAN (Masterful Elimination
of All Non-numerates). A student is required to undergo the program if he/she
scored below 20 in the numeracy test. If the scores of the students in the
numeracy test are normally distributed with a mean of 30.50 and a standard
deviation of 9.5, what percentage of the students needed to undergo the program?

166
167
168
Statistics and
Probability
Identifying the Different
Random Sampling Techniques
Statistics and Probability
Identifying the Different Random Sampling Techniques
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module

Writers: Maria Cristina O. Rosas and Coren R. Manalo


Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

169
Week

5
What I Need to Know
This module was designed and written with you in mind on how you can
identify the different random sampling techniques as applied in solving real-life
situated problems. It is here to help you realize the use and importance of selecting
the appropriate type of random sampling method that can yield significant results in
your future researches. This module was written and intended for senior high
students like you who are taking Statistics and Probability subject. Your adept at
analysis will help you appreciate the richness, and beauty of this subject which will
motivate you to apply to similar situations and develop your own statistical
procedures.

The module contains only one lesson which is illustrating random sampling.
It offers relevant problems or situations which help you to understand what is being
tackled in this module. Your ability to interpret, reason – out, and make a judgment
or even decision out of statistical measures will also be practiced here.

After going through this module, you are expected to:


1. define terms related to random sampling;
2. illustrate random sampling; and
3. identify different random sampling techniques.

What I Know
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. What do you call to the set of all people, objects, events, or ideas
researchers want to investigate?
A. data C. sample
B. population D. statistics
2. Which of the following can be considered as the best sample?
A. conveniently chosen C. selected at random
B. purposefully selected D. self-selected
3. From the given populations, which of the following sample is most
likely to be representatives of the population from which it is drawn?
A. population: senior high school students sample: grade 10 students
B. population: online shoppers sample: market vendors
C. population: users of social media sample: students
D. population: SAP beneficiaries sample: government officials
4. Which of the following is NOT a distinguishing feature of simple
random sampling?
A. Each element in the population has an equal probability of being
selected
B. Random numbers selected to decide which elements are included as
the sample.
C. Each element has a unique identification.
D. Samples are selected based on the subjective judgment of the researcher.
5. Which of the following is a type of random sampling method?
A. convenience C. purposive
B. non-probability D. stratified
6. Which of the following is NOT a type of random sampling method?
A. cluster C. simple
B. convenience D. systematic

170
7. A researcher was given a random list of all graduating senior high school
students. He decided to conduct a survey by asking every seventh student on
the list regarding the courses that they want to take in college. What type of
sampling technique he is planning to use?
A. cluster C. stratified
B. random D. systematic
8. You are a researcher and you wish to do a comparative study between two
hospital's success rates in fighting COVID-19. Instead of mixing patients in the
two hospitals, you simply divided your samples into two and randomly select
samples based on the hospital that an individual attended for treatment. What
type of sampling technique are you using?
A. cluster C. stratified
B. random D. systematic
9. You wish to determine the effects of implementing the COVID-19 community
quarantine in the Philippines. Among the variables, the socio-economic
status of your respondents is your primary concern. What sampling technique
would best select your sample for this study?
A. cluster C. stratified
B. purposive D. systematic
10. You go to the streets of your barangay to conduct a study on how SAP
beneficiaries spend their money during the COVID-19 community
quarantine. You ask every by-passer to participate in your research. What
type of sampling technique are you using?
A. convenience C. stratified
B. purposive D. systematic
For numbers 11-15, refer to the situation below:
The Philippine High School caters 5,000 senior high students. Mr. Manalo, the school
principal, wants to obtain information about their plans after graduation.
11. What is the target population in his study?
A. senior high students in his school
B. senior high students in his district
C. parents of senior high students in his school
D. parents of senior high students in the nation
12. Mr. Manalo writes each name of all senior high school students on small
pieces of paper. He rolls and put them in a box and then draws 500 names as
respondents in his study. What random sampling technique does he apply?
A. cluster C. stratified
B. simple D. systematic
13. The principal creates a list of all senior high students. He decides to include
every tenth student in his samples. Which random sampling technique does
he apply?
A. cluster C. stratified
B. simple D. systematic
14. Mr. Manalo groups the senior high students according to their grade level
and strand. He proportionately and randomly chooses sample students from
each group. Which random sampling technique does he apply?
A. cluster C. stratified
B. simple D. systematic
15. The principal groups the senior high school students according to what
barangay they live. He randomly chooses 5 barangays and then includes all the
students living in the randomly chosen barangays in the sample. Which
random sampling technique does he apply?
A. cluster C. stratified
B. simple D. systematic

171
Lesson 1 Random Sampling
In research, collecting data can either be done in the entire population or the
subset of this population called sample. If a researcher opts to use sample rather
than a population, he must take considerations on the number of samples and how
these samples can be chosen out of his target population.
A population includes all of its elements from a set of data. The size of the
population is the number of observations in the population. For example, if ABS-
CBN network has 11,000 employees having the required blood type in a certain
study, then we have a population of size 11,000.
Sample consists of one or more data drawn from the population. It is a subset,
or an incomplete set taken from a population of objects or observations. Taking
samples instead of the population is less time-consuming and cost-effective.
Although sampling has advantages, it can also be a source of bias and inaccuracy.
Random Sampling is a sampling method of choosing representatives from the
population wherein every sample has an equal chance of being selected. Accurate
data can be collected using random sampling techniques.

What’s In
Analyze the following scenarios then answer the questions that follow.

a. As of May 6, 2020, the Department of Health recorded 131,786 unique


individuals who got tested for the COVID-19 out of approximately 109,369,019
Filipinos.
1. Identify the population.
_____________________________________________________________________________
_____________________________________________________________________________
_____________________________________________
2. How many elements are there in the population?
____________________________________________________________________________
____________________________________________________________________________
_______________________________________________
3. Who are the samples?
____________________________________________________________________________
____________________________________________________________________________
_______________________________________________
4. How many samples are selected?
____________________________________________________________________________
____________________________________________________________________________
_______________________________________________

5. Did the situation illustrates random sampling? Why or why not?


____________________________________________________________________________
____________________________________________________________________________
_______________________________________________
b. Barangay Pagkakaisa has 400 family beneficiaries of the government’s Social
Amelioration Program (SAP). The Brgy. Captain decides to survey how these
recipients spend their money during community quarantine. He writes each of
the names of the beneficiaries in small pieces of paper. He rolls and put them in
a box then draws 50 names to answer the survey questionnaire.

172
6. Identify the population.
___________________________________________________________________________
___________________________________________________________________________
________________________________________________
7. How many elements are there in the population?
____________________________________________________________________________
____________________________________________________________________________
_______________________________________________
8. Who are the samples?
____________________________________________________________________________
____________________________________________________________________________
_______________________________________________
9. How many samples are selected?
____________________________________________________________________________
____________________________________________________________________________
_______________________________________________
10. Did the Brgy. Captain use random sampling in the situation? Why?
____________________________________________________________________________
____________________________________________________________________________
_______________________________________________

POPULATION SAMPLE

College
students in a
certain Nursing
University students

What’s New
You already knew that random sampling is a method of selecting a sample from a
population in such a way that every member of the population is equally likely to be
chosen as part of the sample. Do you know that we have four ways to randomly select
our sample? Yes, we have four types of random sampling which you will be able to
define and differentiate in the following activity.

I. Match the word in column A to its definition in column B.


Column A Column B

1. Simple a. methodical procedure

2. Systematic b. having few parts; not complex

3. Stratified c. to divide or arrange into classes

4. Cluster d. a number of similar things that occur


together

173
II. You’ve just familiarized yourself with the keywords related to random sampling.
Now you are ready to distinguish its different types. Use your understanding of the
previous activity to identify whether the following illustrates simple, systematic,
stratified or cluster random sampling.
1. A researcher writes the name of each student on a piece of paper, mixes the
papers in a bowl, and draws 7 pieces of paper.
2. A researcher selects every 7th student from a random list.
3. A researcher tells the class to count and then selects those students who count
a multiple of 7 numbers.
4. A researcher separates the list of boys and girls, then draws 7 names by
gender.
5. A researcher surveys all students from 3 randomly selected classes out of 7
classes.
What is It
Let us analyze the situations given above.
1. A researcher writes the name of each student on a piece of paper, mixes the
papers in a bowl, and draws 7 pieces of paper.
Situation 1 illustrates simple random sampling. The pieces of paper correspond
to each student as elements of the population. All of them have an equal
chance of being selected as a sample by randomly picking 7 pieces of paper in
a bowl.
2. A researcher selects every 7th students from a random list.
3. A researcher tells the class to count and then selects those students who
count a multiple of 7 numbers.
Situations 2 and 3 illustrate systematic random sampling because samples
are being selected based on the kth consistent intervals. Selecting every 7th student
on the random list of names creates an equal chance for all of the students.
The same thing happened in selecting students who count multiple of 7 or 7, 14,
21, and so on.
4. A researcher separates the list of boys and girls, then draws 7 names by
gender.
Situation 4 illustrates stratified random sampling because the students
were divided into two different strata or groups, boys and girls. With a proportional
number for each group, samples will then be selected at random from these two
groups.
5. A researcher surveys all students from 3 randomly selected classes out of 7
classes.
Situation 5 illustrates cluster sampling since all students are divided into
clusters or classes, then 3 classes were selected at random out of the 7 classes.
All of the students of these three classes comprised the samples of the study. Take
note that each cluster is mutually homogeneous yet internally heterogeneous.

There are different types of random sampling.


a. A Simple random sampling technique is the most basic random sampling
wherein each element in the population has an equal probability of being
selected. They are usually represented by a unique identification number that is
written on equal-sized and shaped papers and then selection of samples is
possible through the lottery method. Random numbers selected to decide
which elements are included as the sample. The drawing number of paper is based
on the desired number of samples.

174
Simple Random Sampling

JUAN DE LA CRUZ

ANDRES KAPULE

MARIA PAYAPA

JOSEFA MABUHAY

MARCELA MABINI

Systematic Random Sampling

b. Systematic random sampling is a random sampling that uses a list of all


the elements in the population and then elements are being selected
based on the kth consistent intervals. To get the kth interval, divide the
population size by the sample size.
c. Stratified random sampling is a random sampling wherein the
population is divided into different strata or divisions. The number of
samples will be proportionately picked in each stratum that is why all
strata are represented in the samples.
d. Cluster sampling is a random sampling wherein population is
divided into clusters or groups and then the clusters are randomly
selected. All elements of the clusters randomly selected are considered
the samples of the study.

175
Stratified Random Sampling

Cluster Sampling

The sampling techniques that involve random selection are called probability
sampling. Likewise, simple random, systematic, and stratified and cluster sampling
are all probability sampling techniques.
There are also sampling techniques that do not involve random selection of data.
They are called non-probability sampling. An example of this is convenience
sampling wherein the researcher gathers data from nearby sources of information
exerting minimal effort. Convenience is being used by persons giving questionnaires
on the streets to ask the passers-by.
Purposive sampling is also not considered a random sampling since the respondents
are being selected based on the goal of the studies of the researcher. If the study is
about the students who are children of OFW, the researcher will get samples who are
children of OFW. This excludes other students from being a sample.

176
What’s More
Independent Practice
Identify the random sampling technique used in each item.
1. You are given a list of all graduating students in your school. You decide to
survey every 10th student on the list and ask them the organization that they
belong.
2. You wish to make a comparison of the gender differences in Mathematics
performance. You divide the population into two groups, male and female, and
randomly pick respondents from each of the group.
3. You assign numbers to the members of the population and then use draw lots
to obtain your samples to answer your survey on the most popular festivals in
the country.
4. You randomly pick five out of fifteen barangays to conduct your survey in your
municipality or city about their best environment-friendly practices.
5. You write the names of each student in pieces of paper, shuffles, and then draw
eight names to answer a survey on their ethical media practices.
Independent Assessment
Analyze the given situation below and then identify the type of random sampling
methods being illustrated in each item. Write the letter of your answer in a separate
answer sheet.
Each of the 30 basketball high school teams has 12 players. The organizer wants to
have a quick survey to know the average height of the players.

1. Each team will be asked to place papers with its players’ names into a separate
fishbowl and randomly drew out five names. The five names from each team
will be combined to make up the sample. Which of the following sampling
techniques is being used in this situation?
A. cluster C. stratified
B. simple D. systematic
2. The organizer lists all the players in a sheet of paper and then assigns a
unique number for each. Sixty numbers will be picked to get the samples.
Which random sampling technique did the organizer apply?
A. cluster C. stratified
B. simple D. systematic
3. All players will be grouped according to their age and will randomly choose
players from each group to measure their height. Which random sampling
technique did he apply?
A. cluster C. stratified
B. simple D. systematic
4. The organizer created a list of all players, decided to survey every sixth name
on the list, and later asked those players that were selected. Which random
sampling technique did he apply?
A. cluster C. stratified
B. simple D. systematic
5. A team is randomly selected to answer the question prior to the study. Which
random sampling technique did he apply?
A. cluster C. stratified
B. simple D. systematic

177
What I Have Learned
Fill in the blank of word or group of words to complete the sentence to gauge what
you have learned with this module.

_________________________ is the collection of the entire objects, individuals or


data being examined in the study.

_________________________ is a part of population that is chosen as


representatives of the entire population.

__________________________ is the method of getting samples.

__________________________ is a sampling method of choosing representatives


from the population wherein every sample has an equal chance of being
selected.

There are different types of random sampling.

a. _______________________is the most basic random sampling wherein each


element in the population has an equal probability of being selected. They are
usually represented by a unique identification number that is written on
equal-sized and shaped papers and then selection of samples is possible
through the lottery method. Random numbers selected to decide which
elements are included as the sample. The drawing number of papers is based
on the desired number of samples.
b. _______________________is a random sampling which uses a list of all the
elements in the population and then elements are being selected based on the
kth consistent intervals. To get the kth interval, divide the population size by
the sample size.
c. ________________________ is a random sampling wherein the population is
divided into different strata. Selection of samples will be proportionately
randomly picked in each stratum that is why all strata have representatives in
forming the samples.
d. ________________________ is a random sampling wherein population is divided
into clusters and then the clusters is randomly selected. All elements of the
clusters randomly selected are considered the samples of the study.
There are also sampling techniques that do not involve random selection of data.
Example of which is the _________________________ wherein the researcher gather
data from nearby sources of information exerting a minimal effort.
____________________________ is also not considered a random sampling since the
respondents are being selected based on the goal of the studies of the researcher.

What I Can Do
Decide a specific problem that you want to study about the COVID-19 pandemic.
1. Who are your target respondents or sources of data?
____________________________________________________________________________
____________________________________________________________________________
_____________________________________________________
2. How are you going to obtain your data?
____________________________________________________________________________
____________________________________________________________________________
_____________________________________________________

178
3. What sampling method would be best for your study? Why would you
select that method?
____________________________________________________________________________
____________________________________________________________________________
_____________________________________________________
4. What do you think would be the greatest description of a best sample?
Why?
____________________________________________________________________________
____________________________________________________________________________
_____________________________________________________

5. If you are to create your own research, will you use random sampling?
Why or why not? If yes, what kind of random sampling are you going to
use? If not, how will you obtain your samples?
____________________________________________________________________________
____________________________________________________________________________
______________________________________________________

Assessment
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. What do you call the set of all data that researchers want to investigate?
A. data C. random
B. population D. sample
2. Which of the following can be considered as the best sample?
A. conveniently chosen C. selected at random
B. purposefully selected D. self – selected
3. From the given populations, which of the following sample is most likely to
be representatives of the population from which it is drawn?
A. population: SAP beneficiaries sample: car owners
B. population: lawmakers sample: students
C. population: online shoppers sample: cellphone users
D. population: graduating students sample: grade 8 students
4. In random sampling, the sample should be_______ the population
A. as large as C. not taken from
B. different from D. representative of
5. Which of the following is NOT a type of random sampling?
A. cluster C. simple
B. purposive D. systematic
6. Which of the following is random sampling?
A. convenience C. purposive
B. non-probability D. stratified
7. As a researcher, you ask the people of your barangay on how they spent
their money during the community quarantine. You decided to group them
according to their socio-economic status and then proportionately and
randomly selecting samples from these groups. Which type of sampling are
you using?
A. convenience C. stratified
B. purposive D. systematic
8. As a researcher, you want to know the plans of graduating senior high
school students. You ask all by-passers to participate in your research.
Which type of sampling are you using?
A. convenience C. stratified

179
B. purposive D. systematic
9. You wish to examine the effects of community quarantine on your fellow
students. You decided to borrow a list from the guidance office and select
every 10th student on it. What sampling technique are you using?
A. convenience C. stratified
B. purposive D. systematic
10. You want to know the effects of the COVID-19 to the Filipino people and you
will ask the medical frontliners. Which type of sampling will you use?
A. convenience C. stratified
B. purposive D. systematic

For numbers 11-15, refer to the situation below:


A National High School has 2,000 first year high school students. Mrs. Mogol, the
school principal, wants to obtain information from these students about last year’s
lesson that has not been tackled.

11. What is the target population in her study?


A. all students in her school
B. parents of all students in her school
C. first year high school students in her school
D. parents of first year high school in her school
12. The principal creates a list of all first year high school students, decides to
survey every seventh student on the list. Which random sampling technique
does she apply?
A. cluster C. stratified
B. simple D. systematic
13. Mrs. Mogol writes each name of all first year high school students on small
pieces of paper, he then put them in a box and draws 300 names to
participate in the study. Which random sampling technique does she apply?
A. cluster C. stratified
B. simple D. systematic
14. The principal groups the first year high school students according to the
barangay where they live. She randomly picks a barangay and all of the
students living in that barangay answer the questionnaire. Which random
sampling technique does she apply?
A. cluster C. stratified
B. simple D. systematic
15. Mrs. Mogol groups the first year high school students according to their
school’s last attended. She proportionately and randomly chooses students
from each group. Which random sampling technique does she apply?
A. cluster C. stratified
B. simple D. systematic

Additional Activities
In the following situation, give the target population and identify possible samples
(should be taken from the target population).
The first one is done as an example for you.

1. The opinions of the Catholic Bishops’ Conference of the Philippines (CBCP)


regarding guidelines for the Liturgical Celebration in the “New Normal”
condition.
Population: Catholic Bishops in the Philippines
Sample: Bishops in Luzon

180
2. A survey on the most popular TV noontime show in the Philippines
Population: ________________________________________________________________
Sample: ___________________________________________________________________

3. The study habits of public senior high school students in selected schools in
Manila.
Population: ________________________________________________________________
Sample: ___________________________________________________________________

4. The degree of satisfaction of parents about the quality of education their


children will get from online classes in Quezon Province.
Population: ________________________________________________________________
Sample: ___________________________________________________________________

5. The characteristics of the most likable teachers according to students.


Population: ________________________________________________________________
Sample: ___________________________________________________________________

181
Statistics and
Probability
Computing for the Parameter
and Statistic
Statistics and Probability
Computing for the Parameter and Statistic
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module

Writers: Coren R. Manalo


Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

182
Week

5
What I Need to Know
This module was designed and written with you in mind on how you can describe
the given population and sample based on their characteristics. It is here to help you
see the importance of having different statistical tools to be used in your future
researches. The scope of this module is the different parameter and statistic
measures that can accompany you in your upcoming studies. This will also help you
analyze real-life situated problems statistically in terms of relevant questions for you
to better understand them. Your adept at analysis will help you appreciate the
richness, and beauty of Statistics which will motivate you to apply to similar events
and create statistical measures of your own. This module was written and intended
for senior high students like you who are taking Statistics and Probability subject.

The module has one lesson:


 Computing for the parameter and statistic.
Your patience in solving offered problems here in the module will help you improve
your computational skills as it tackles relevant culture-based situated problems.
Your ability to interpret, reason–out, and make a judgment or even decision out of
statistical measures will also be practiced here. The scope of this module permits it
to be used in many different learning situations. The language used recognizes the
diverse vocabulary level of students like you. The lessons are arranged to follow the
standard sequence of the course. But the order in which you read them can be
changed to correspond with the textbook you are now using.

After going through this module, you are expected to:


1. distinguish between a sample statistic and a population parameter; and
2. calculate the parameter or statistic of the given data.

What I Know
For you to be able to have an insight regarding parameter and statistic, you must
first answer the pre-test before proceeding to the self-learning part of this module.
Read and analyze each question carefully then choose the letter of the best answer.
Write your chosen letter on a separate sheet of paper.

1. What do you call a number that describes a population characteristic?


A. data C. samples
B. parameter D. statistic
2. What do you call a number that describes a sample characteristic?
A. data C. samples
B. parameter D. statistic
3. Which of the following is a parameter?
A. 𝜎 2 C. 𝑥̅
B. s D. s2

For numbers 8-11, the following are scores of students:

9, 7, 8, 9, 7, 5, 8, 10, 7, 8, and 10.

4. What is the value of µ? (2 points)


A. 7 C. 8
B. 7.33 D. 8.33

183
5. What is the value of the population variance? (2 points)
A. 1.56 C. 2
B. 1.87 D. 2.2
6. What is the value of 𝜎? (2 points)
A. 1.25 C. 1.41
B. 1.37 D. 1.48

For numbers 12-15, the following are the six scores of the students above that are
randomly selected:
9, 7, 5, 8, 7, 8

7. What is the value of the sample mean? (2 points)


A. 7 C. 8
B. 7.33 D. 8.33
8. What is the value of s ? (2 points)
2

A. 1.56 C. 2
B. 1.87 D. 2.2
9. What is the sample standard deviation? (2 points)
A. 1.25 C. 1.41
B. 1.37 D. 1.48
Rubric for 2-point item
Equivalent Points Description
Complete solution with the correct
2 points
answer
1 point Half of the solution is correct
0 point Wrong solution or no answer

Lesson 1 Computing for the Parameter and


Statistic
In Statistics, it is sometimes impossible to gather data from the entire
population but through sampling, researchers get a clearer picture of the whole
group being studied. One can draw inferences about a certain population by carefully
selecting random samples and using those data to make interpretations about the
entire population. Most of the researches need numerical values to easily interpret
the results of the study. Those values are computed based on all the elements of the
population or from the randomly selected samples that were taken. The values can
be classified into parameters and statistics. Parameter and statistics are usually
computed through different measures of central tendency and variability.

What’s In
Can you still remember how to calculate for the mean, variance, and standard
deviation? Do the following exercise.
Teacher Jane randomly selected the grades in Statistics and Probability of 7 Grade
11 students. Calculate the mean grade, variance, and standard deviation of the
given set of data.

x ̅
𝐱−𝒙 ̅)𝟐
(𝐱 − 𝒙
94

184
88
79
89
91
84
77
∑x = ̅)𝟐 =
∑(𝐱 − 𝒙

Mean Variance Standard Deviation


∑x ∑(x−𝑥̅ )2
𝑥̅ = 𝑠2 = 𝑠=√
∑(x−𝑥̅ )2
𝑛 𝑛−1
𝑛−1

What’s New
Solve for the mean, variance, and standard deviation of the given set of data of a
population.
X X-µ (X − µ)2
94
85
88
79
78
75
89
91
84
77
∑X = ∑(X − µ)2 =

Mean Variance Standard Deviation


∑X ∑(X−µ)2
µ= 𝜎2 = 𝜎=√
∑(X−µ)2
𝑁 𝑁 𝑁

185
Answer the following questions:
1. On the first set of data, your computed values are what we call statistics.
What do statistics mean?
____________________________________________________________________________
____________________________________________________________________________
____________________________________________________________________________
_____________________________________________________________________

2. On the second set of data, your computed values are what we call
parameters. What does the parameter mean?
____________________________________________________________________________
____________________________________________________________________________
____________________________________________________________________________
____________________________________________________________________

3. How do parameters and statistics differ from one another?


____________________________________________________________________________
____________________________________________________________________________
____________________________________________________________________________
____________________________________________________________________
What is it
A parameter is a measure that is used to describe the population while statistic is a
measure that is used to describe the sample. To understand more, let us discuss the
two measures.

PARAMETER
Below are grades in Statistics of Grade 11 students during the third quarter.

94 85 88 79 78
75 89 91 84 77
Let us compute the population mean, population variance, and population standard
deviation.

POPULATION MEAN
The mean is the sum of the data divided by the number of data. The mean is used to
describe where the set of data tends to concentrate at a certain point.
Population mean is the mean computed based on the elements of the population or
data. The symbol µ (read as “mu”) is used to represent population mean. To compute
for the population mean, we simply add all the data (X) and then, divide it by the
∑X
number of elements in the population (N). We apply the formula: µ =
𝑁

where: µ = the population mean


∑x = the summation of x (sum of the measures)
𝑁 = number of elements in the population

Grades in Statistics of Grade 11 Students during the Third Quarter


Student’s Number Grade (X)
1 94
2 85
3 88
4 79

186
5 78
6 75
7 89
8 91
9 84
10 77
N = 10 ∑X = 840

In our case, adding all the 10 grades will give us a sum of 840.
∑X 840
We substitute the values obtained to the formula µ = , therefore, µ = = 84
𝑁 10
Our computed population mean µ is 84.

POPULATION VARIANCE AND POPULATION STANDARD DEVIATION


Variance and standard deviation determine how to spread or to scatter each
data on the set from the mean. Standard deviation is simply the square root of the
variance.
Population variance is the computed variance of the elements of the
population. The symbol 𝜎 2 (read as “sigma squared”) is used to represent population
variance.
To compute for the population variance, we apply the formula:

∑(X−µ)2
𝜎2 =
𝑁

where: X = given data


µ = the population mean
𝑁 = number of elements in the population
Population standard deviation is the computed standard deviation of the
elements of the population. The symbol 𝜎 (read as “sigma”) is used to represent
population standard deviation.

To compute for the population standard deviation, we use the formula:

∑(X−µ)2
𝜎=√
𝑁

where: X = given data


µ = the population mean
𝑁 = number of elements in the population
Consider the data given above, to solve for the population variance and
population standard deviation, we have this table:

Student’s Number Grade (X) X−µ (X − µ)2


1 94 94 – 84 = 10 102 = 100
2 85 85 – 84 = 1 12 = 1
3 88 88 – 84 = 4 42 = 16
4 79 79 – 84 = – 5 52 = 25
5 78 78 – 84 = – 6 62 = 36

187
6 75 75 – 84 = – 9 92 = 81
7 89 89 – 84 = 5 52 = 25
8 91 91 – 84 = 7 72 = 49
9 84 84 – 84 = 0 02 = 0
10 77 77 – 84 = 7 72 = 49
N = 10 ∑X = 840 ∑(X − µ)2 = 382

The third column is computed through subtracting the mean to the scores, while the
fourth column is computed by squaring the third column. Since there is a symbol ∑
or summation in the formula, we need to add the computed values in the fourth
column.
Again, for the population mean,
∑X 840
µ= = = 84
𝑁 10
For the population variance, we substitute the computed values to our formula, thus
∑(X−µ)2 382
𝜎2 = = = 38.2
𝑁 10
For the population standard deviation, we can also substitute the computed values
to the formula, or we can simply get the square root of the variance.
∑(X−µ)2 382
𝜎=√ =√ = √38.2 ≈ 6.18
𝑁 10

Population mean (µ), population variance ( 𝜎2) and population standard deviation (𝜎)
are what we called parameters.

STATISTIC
From the previous data of the population, suppose that we randomly select only 7
data out of the total 10 data in the population.

94 85 88 79 78
75 89 91 84 77

Compute the sample mean, sample variance, and sample standard deviation. Here
is the result:

SAMPLE MEAN
The sample mean is the average of all the data of the samples. The symbol 𝑥̅
(read as “x bar”) is used to represent the sample mean. To compute for the sample
mean, we simply add all the data and divide it by the number of elements in the
∑x
sample (n). We apply the formula: 𝑥̅ =
𝑛

where: 𝑥̅ = the sample mean


∑x = the summation of x (sum of the measures)
𝑛 = number of elements in the sample

188
Grades in Statistics of Grade 11 Students during the Third Quarter
Population Sample
Student’s Number
(Student’s Grades) (Student’s Grades)
1 94 94
2 85
3 88 88
4 79 79
5 78
6 75
7 89 89
8 91 91
9 84 84
10 77 77
N=10; n = 7 ∑x = 840 ∑x = 602
In our case, adding the 7 samples will give us a sum of 602.
∑x 602
We substitute to the formula 𝑥̅ = , therefore, 𝑥̅ = = 86.
𝑛 7
Our computed sample mean 𝑥̅ is 86.

In this example, there is a slight difference between the population mean and the
sample mean. But notice that there is no difference regarding the method in
determining the value of the population mean and the sample mean. For the divisor,
the population mean µ uses N (population size) while sample mean x applies n
(sample size).
SAMPLE VARIANCE AND SAMPLE STANDARD DEVIATION
Sample variance is the computed variance of the elements of the sample. s 2 is used
to represent sample variance.
∑(x−𝑥̅ )2
To compute for the sample variance, we apply the formula: 𝑠2 =
𝑛−1
where: x = given data
𝑥̅ = the sample mean
𝑛 = number of elements in the sample
Sample standard deviation is the computed standard deviation of the elements of the
sample. s is used to represent sample standard deviation.
∑(x−𝑥̅ )2
To compute for the sample standard deviation, we use the formula: 𝑠 = √
𝑛−1
where: x = given data
𝑥̅ = the sample mean
𝑛 = number of elements in the sample
As you would notice, the sample standard deviation is also the square root of the
sample variance.
Population Sample
Student’s
(Student’s (Student’s x − 𝑥̅ (x − 𝑥̅ )2
Number
Grade) Grade)
1 94 94 94 – 86 = 8 82 = 64
2 85
3 88 88 88 – 86 = 2 22 = 4
4 79 79 79 – 86 = – 7 – 72 = 49

189
5 78
6 75
7 89 89 89 – 86 = 3 32 = 9
8 91 91 91 – 86 = 5 52 = 25
9 84 84 84 – 86 = – 2 – 22 = 4
10 77 77 77 – 86 = – 9 – 92 = 81
∑x = 840 ∑x = 602 ∑(x − 𝑥̅ )2 = 236

The fourth column is computed by subtracting the mean to the grades, while the last
column is computed by squaring the third column. Since there is a symbol ∑ or
summation, we need to add the computed values.

Again, the sample mean is


∑x 602
𝑥̅ = = = 86
𝑛 7
For the sample variance, that is
∑(x−𝑥̅ )2 236 236
𝑠2 = = = ≈ 39.33
𝑛−1 7−1 6

For the sample standard deviation, we have


∑(x−𝑥̅ )2 236 236
𝑠=√ =√ = √ ≈ √39.33 ≈ 6.27
𝑛−1 7−1 6

Sample mean (𝑥̅ ), sample variance (s2) and sample standard deviation (s) are what
we call statistic.

Remember that parameters are for population while statistics are for sample.
Other examples of parameters and statistics are the population proportion and
correlation coefficient. For population proportion, we use “p” for sample and “P” for
the population. In correlation coefficient, we use “r” for the sample and “𝜌” (read as
rho) for the population. These will be discussed in the latter part of this course.
What’s More
Since the difference between parameter and statistic is already clear to you,
let us now apply the different parameters and statistics of the following data. Believe
me, you can do this!
Independent Practice 1A
Complete the table below by writing the correct symbols and formula in each
parameter and statistic.

Different Parameter Statistic


Measures Symbol Formula Symbol Formula

Mean

Variance

Standard
Deviation

190
Independent Practice 1B
Complete the table below by computing for the different parameters and statistics.

Population: 1, 2, 3, 4 and 5
Sample: 2, 4 and 5

Measures Parameter Statistic

Mean

Variance

Standard
Deviation

Independent Assessment

I. The following are the scores of ten ABM students in their fourth-quarter
examination:

26 37 45 35 31
47 23 34 44 38

Complete the table and compute for the population mean, population variance, and
population standard deviation.

Student’s No. Scores (X) X−µ (X − µ)2


1
2
3
4
5
6
7
8
9
10
N= ∑X = ∑(X − µ)𝟐 =

191
Write your computations here:

population mean population variance population standard


deviation

II. The scores of the 5 randomly selected HUMSS students are given below:
26, 45, 31, 34, 44

Complete the table and compute for the sample mean, sample variance, and sample
standard deviation.

Student’s No. Scores (x) x − 𝑥̅ (x − 𝑥̅ )2


1
2
3
4
5
n= ∑x = ̅)𝟐 =
∑(𝐱 − 𝒙

Write your computations here:

sample mean sample variance sample standard


deviation

What I Have Learned


Let’s see what you have learned in this module by answering the following questions.
____________________ 1. What is the measure that is used to describe the population?
____________________ 2. What type of letters does parameter is usually denoted?
____________________ 3. What is the formula for the population mean?
____________________ 4. What is the formula for the population variance?
____________________ 5. What is the formula for the population standard deviation?
____________________ 6. What type of measure is used to describe the sample?
____________________ 7. What type of letters does statistic is usually denoted?
____________________ 8. What is the formula for the sample mean?
____________________ 9. What is the formula for the sample variance?
____________________ 10. What is the formula for the sample standard deviation?

192
What I Can Do
Create a group with 8 members or ask any 8 people nearby. Inquire a specific
question that a possible answer is a number. Record those answers of each member
of the group or the people you have surveyed. Solve for the sample statistic using the
computations that we have discussed.
You may also choose from any of the following topics for survey.
 the number of Philippine festivals you have attended
 the number of historical places you have visited
 the number of Filipino heroes you have known
 the number of members of your family
 the number of gadgets you have
Holistic Rubric:

The student demonstrates a clear and thorough understanding of


100% the concepts and procedures embodied in the task. He/she
completed the task accurately.

The student demonstrates a partial understanding of the concepts


80% and procedures embodied in the task. He/she completed the task
inaccurately.

The student demonstrates a partial understanding of the concepts


60% and procedures embodied in the task. He/she does not complete
the task.

The student demonstrates a limited understanding of the concepts


30% and procedures embodied in the task. He/she does the task
inaccurately and incompletely

0% The student does not attempt anything in the task.

Assessment
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. What do you call a number that describes a sample characteristic?
A. data C. sample
B. parameter D. statistic
2. What do you call a number that describes a population characteristic?
A. data C. sample
B. parameter D. statistic
3. Which of the following is a statistic?
A. 𝜎 2 C. 𝜎
B. µ D. s

193
For numbers 4-6, the following are scores of students in their quiz:
7, 11, 8, 8, 19, 15, 7, 9, 9, 20, 17, 14

4. What is the value of µ? (2 points)


A. 10.4 C. 12
B. 11 D. 14.4
5. What is the value of the population variance? (2 points)
A. 21 C. 23.43
B. 22.91 D. 27.33
6. What is the value of 𝜎? (2 points)
A. 4.58 C. 4.84
B. 4.79 D. 5.23

For numbers 13-15, the following are the seven scores of the students above that are
randomly selected:

7, 8, 8, 15, 9, 20, 17

7. What is the value of the sample mean? (2 points)


A. 10.4 C. 12
B. 11 D. 14.4
8. What is the value of s2? (2 points)
A. 21 C. 23.43
B. 22.91 D. 27.33
9. What is the sample standard deviation? (2 points)
A. 4.58 C. 4.84
B. 4.79 D. 5.23

Rubric for 2-point item


Equivalent Points Description
Complete solution with the correct
2 points
answer
1 point Half of the solution is correct
0 point Wrong solution or no answer

Additional Activities
Given a set of data, which is more advantageous, computing the population
parameter or the sample statistic? Why?

What are the advantages and disadvantages of choosing parameter over statistic and
vice versa?
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________

194
Statistics and
Probability
Identifying Sampling
Distribution of Statistics
Statistics and Probability
Identifying Sampling Distribution of Statistics
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module

Writer: Coren R. Manalo


Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Edna E. Eclavea
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

195
Week

5
What I Need to Know
This module was designed and written with you in mind on how you can
illustrate the sampling distribution of the sample mean. It is here to help you see the
importance of having different statistical tools to be used in your future researches.
The scope of this module is the sampling distribution of the sample mean that can
accompany you in your upcoming studies. This will also help you analyze real-life
situated problems statistically in terms of relevant questions for you to better
understand them. Your adept at analysis will help you appreciate the richness, and
beauty of Statistics which will motivate you to apply to similar events and create
statistical measures of your own. This module was written and intended for senior
high students like you who are taking Statistics and Probability subject.

The module has one lesson:


 Sampling Distributions of Sample Mean.
Your patience in solving offered problems here in the module will help you
improve your computational skills as it tackles relevant culture-based situated
problems. Your ability to interpret, reason–out, and make a judgment or even
decision out of statistical measures will also be practiced here. The scope of this
module permits it to be used in many different learning situations. The language
used recognizes the diverse vocabulary level of students. The lessons are arranged
to follow the standard sequence of the course. But the order in which you read them
can be changed to correspond with the textbook you are now using.

After going through this module, you are expected to:


1. illustrate sampling distribution;
2. construct a sampling distribution of sample mean; and
3. create a histogram of the sampling distribution of sample mean.

What I Know
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.

1. What distribution pertains to the frequency distribution of the sample mean from
all the possible random samples of a particular sample size n taken from the given
population?
A. frequency C. population
B. normal D. sampling
2. Which of the following is NOT a step-in creating sampling distribution of
the sample mean?
A. Determine the number of sets of all possible random samples.
B. Compute for the standard deviation and variance of the samples.
C. Construct a frequency distribution table of the sample mean and
probability.
D. List all the possible random samples and solve for the sample mean of
each set of samples.
3. Which of the following is the mean of the samples 7, 11, 22, 24, and 27?
A. 15.17 C. 22
B. 18.20 D. 22.75

196
For numbers 4-9, a population consists of the data {1, 2, 3, 4}.
4. How many different samples of size n = 2 can be drawn from the
population?
A. 6 C. 4
B. 5 D. 3
5. Which of the following sample mean has the greatest frequency in the
sampling distribution?
A. 1.5 C. 2.5
B. 2 D. 3
6. What is the frequency of sample mean 4?
A. 3 C. 1
B. 2 D. 0
7. Wat is the probability of the sample mean 3.5 in the sampling
distribution?
1 2
A. C.
6 3
2 3
B. D.
6 4
8. What is the lowest value of the sample mean in this sampling
distribution?
A. 0 C. 2
B. 1.5 D. 2.5
9. Which of the following histogram correctly shows the sampling
distribution of sample mean of size n = 2?

A. 2 C. 2
6 6

1 1
6 6

B. 0 1.5 2 2.5 3 3.5 D. 0 1.5 2 2.5 3 3.5


4 3
4 3
3
2
4
3
2
4 1
1
3
4

0 2 2.5 3 3.5 4 0 2 2.5 3 3.5 4


For numbers 10-15, refer to the given below:
a population consists of the data {3, 5, 7, 9, 11}.

10. How many different samples of size n = 3 can be drawn from the
population?
A. 5 C. 8
B. 7 D. 10
11. Which of the following sample mean appears most frequent in the
sampling distribution?
A. 5 C. 7
B. 5.67 D. 8.33

197
12. Which of the following sample mean has the least frequency in the
sampling distribution?
A. 5 C. 7
B. 6.33 D. 8.67
13. What is the probability of the sample mean 8.33 in the sampling
distribution of sample means?
2 1
A. C.
10 5
2 1
B. D.
5 10
14. What is the lowest value of the sample means in this sampling
distribution?
A. 5 C. 7
B. 6.33 D. 8.67
15. Which of the following histogram correctly shows the sampling
distribution of sample means of size n = 3?
2 2
A. C. 10
10

1 1
10 10

0 5 5.67 6.33 7 7.67 8.33 9 0 5 5.67 6.33 7 7.67 8.33 9


B. 2 D. 2
10 10
2 2
5 5
1 1
10 10
1
5

0 5 5.67 6.33 7 7.67 8 9 0 5 5.67 6.33 7 7.67 8 9

Lesson 1 Sampling Distribution of the Sample


Mean
In almost all fields of study, we gather data. These data are used to estimate
things about the nature of human beings, phenomena, and even our world. If the
number of elements in the population is too many, it will be extremely hard for the
researchers to collect all data from the group. In some cases, we need to use random
sampling to get the appropriate statistic from the samples to draw conclusions that
would represent the population. However, there can be numerous different sets of
possible samples of the same size that can be drawn from a given population. If we
gather another set of a random sample, wouldn’t be the sample mean become
different? How can we be sure that one set of random samples can represent the
entire population? How can we tell that our statistic is accurate?

During your junior high school classes, you already discussed frequency
distribution and on the earlier part of this course, you also tackled the probability
distribution. In this module, we will discuss another distribution, the sampling
distribution of the mean. Sampling distribution shows all the possible samples, in a
given population, together with their mean. It describes the probability of the mean
of all sets of samples with the same sample size n. The idea of sampling distribution

198
helps us to recognize how we use one random sample to conclude for the population.
It defines the entire possible statistic we could conclude up with.

The sampling distribution is a significant idea in Statistics. It is considered to


be a foundation and essential to learn. We could have better knowledge about the
population using statistical inference.
What’s In
In this lesson, you will learn how to construct a sampling distribution of the
sample mean. This part consists of three activities to prepare you for the lesson.
To begin with, the concept of combination tells us the number of all possible
random samples of population size N taken at sample size n.

A. Evaluating the Combination of N Objects Taken n at a Time


Evaluate the following.
1. 4C2

2. 6C4

3. 5C3

4. 3C2

5. 7C5

B. Computing for the Mean of a Sample


Find the mean of the following sets of numbers.
Sample Mean
6. 4, 7, 11
7. 2, 5, 7, 1,
8. 18, 5, 13, 9, 11
9. 12, 19, 14, 7, 10, 4
10. 6, 15, 27, 22, 16, 19, 24

C. Constructing Frequency Distribution


The following are the mean scores of Grade 11 students of Quezon National
High School. Construct a frequency distribution for different means.
15.5 13.5 15.5 14 15

14 14.5 16 14.5 13.5

15 16.5 15 13 15.5

16 14.5 14 15.5 15

16 15 16.5 14.5 17

Means Tally Frequency

199
What’s New
A population of Senior High School consists of numbers 1, 2, 3, 4, and 5.
Let us create a sampling distribution of size 3 by following the steps.
A. First, let us evaluate the combination of N (population size) objects taken
n (sample size) at a time.
Therefore, we have
NCn = 5C3 = ____________________ samples
To get this, we apply the formula for a combination of N taken n at a time,
NCn, where N is the size of the population, and n is the sample size.
B. For the second step, write all the possible samples and compute for the
mean of each sample. To complete the table, compute for the sample mean,
samples are already given.
Observation Sample Mean
1 1, 2, 3 (1 + 2+ 3) ÷ 3 = 6 ÷ 3 = 2
2 1, 2, 4 2.33
3 1, 2, 5
4 1, 3, 4
5 1, 3, 5
6 1, 4, 5
7 2, 3, 4
8 2, 3, 5
9 2, 4, 5
10 3, 4, 5
C. Then, construct a frequency distribution of the sample mean.
Sample Mean Frequency
2
2.33
2.67
3
3.33
3.67
4

200
Sampling distribution of the sample means is a frequency distribution
using the computed sample mean from all the possible random samples of
a particular sample size taken from the given population.

D. Lastly, add another column for the probability distribution. The probability
distribution of the sample means is also called the sampling distribution
of the sample mean. Since we have 10 possible outcomes, for the
probability, the denominator will be 10 and the number on the numerator
part will be the number of frequencies.

E.
Sample Means Frequency Probability P(x)
1
2 1 = 0.10
10

2.33 1

2.67 2

3 2

3.33 2

3.67 1

4 1

If we are to create a histogram, the sample means will be placed on the x-axis
while the probability will be placed on the y-axis.
2
10

1
10

0 2 2.33 2.67 3 3.33 3.67 4


What is It
Sampling distribution of the sample means is a frequency distribution using
the computed sample mean from all the possible random samples of a particular
sample size taken from the given population.

Steps to follow in making a sampling distribution of the sample mean:


1. Determine the number of sets of all possible random samples that can be drawn
from the given population by using the formula, NCn, where N is the population
size and n is the sample size.

In our activity, we are given with a population of 1, 2, 3, 4, and 5 and sample


size of 3, therefore we have

201
𝑁! 5! 5! 5∙4∙3∙2∙1
NC n = 5C3 = = = = = 10
[𝑛!(𝑁−𝑛)!] 3!(5−3)! 3!2! (3∙2∙1)(2∙1)
*or use your calculator function nCr
2. List all the possible random samples and solve for the sample mean of each set
of samples.
Sample Mean

1, 2, 3 2

1, 2, 4 2.33

1, 2, 5 2.67

1, 3, 4 2.67

1, 3, 5 3

1, 4, 5 3.33

2, 3, 4 3

2, 3, 5 3.33

2, 4, 5 3.67

3, 4, 5 4

3. Construct a frequency and probability distribution table of the sample means


indicating its number of occurrence or the frequency and probability.

Sample Means Frequency Probability P(x)


1
2 1 = 0.10
10
1
2.33 1 = 0.10
10
2
2.67 2 = 0.20
10
2
3 2 = 0.20
10
2
3.33 2 = 0.20
10
1
3.67 1 = 0.10
10
1
4 1 = 0.10
10

What’s More
Independent Practice 1A
How many different samples of size n can be obtained from the following
population N sizes?
N N NC n

1. 4 2
2. 6 4
3. 8 3
4. 10 4

202
5. 15 5

Independent Practice 1B
I. A population consists of the values (1, 4, 3, 2). Consider samples of size 2 that can
be drawn from this population.
a. List down all the possible samples and corresponding sample mean
Sample Sample Means

b. Construct the sampling distribution of the sample means.


Sample Means Frequency Probability P(x)

Independent Assessment
Coren was able to sell several pairs of Marikina shoes that have sizes of 4, 5, 6, 7,
and 8. Consider samples of size 3 that can be drawn from this population.
a. List down all the possible samples and corresponding sample mean.
Sample Sample Mean

b. Construct the sampling distribution of the sample means.


Sample Mean Frequency Probability P(x)

203
c. Draw a histogram of the sampling distribution of the sample mean.

What I Have Learned


Let us see what you have learned in this module by completing the following
statements.
A _________________________ of the sample mean is a frequency distribution using the
computed sample mean from all the possible random samples of a particular sample
size taken from the given population.

Steps to follow in making a ______________________________________________:


1. Determine the number of set of all possible random samples that can be drawn
from the given population by using the formula ______ where N is the
___________________ and n is the ___________________.
2. List all the possible ___________________ and solve for the ___________________ of
each set of samples.
3. Construct a ___________________ of the sample mean indicating its number of
occurrence or the frequency and together with its ___________________ .
What I Can Do
Since you already know how to construct the sampling distribution of the sample mean
and its corresponding histogram, let us apply your knowledge in the following data.

The ages of ALS learners in Bagong Pag-asa Community Learning Center (CLC) are 19,
20, 15, 17, 27, and 22. Consider samples of size 4 that can be drawn from this
population.
a. List down all the possible samples and corresponding sample mean.

Sample Sample Mean

204
b. Construct the sampling distribution of the sample mean.

Sample Mean Frequency Probability P(x)

c. Draw a histogram of the sampling distribution of the sample mean.

Assessment
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. What distribution pertains to the frequency distribution of the sample mean from
all the possible random samples of a particular sample size n taken from the given
population?
A. frequency B. normal C. population D. sampling
2. Which of the following is NOT a step-in creating sampling distribution of the
sample mean?
A. Determine the number of sets of all possible random samples.
B. Compute for the standard deviation and variance of the samples.
C. Construct a frequency distribution table of the sample means and
probability.
D. List all the possible random samples and solve for the sample mean of
each set of samples.
3. Which of the following is the mean of sample 6, 10, 21, 25, and 28?
A. 15.17 B. 18 C. 21.2 D. 22

For numbers 4-9, refer to the following set of data of a population {11, 12, 13, 14}.
4. How many different samples of size n = 2 can be drawn from the
population?
A. 6 B. 5 C. 4 D. 3
5. Which of the following sample mean appears most frequent in the
sampling distribution?
A. 10 B. 11.5 C. 12 D. 12.5
6. What is the lowest value of the sample mean in this sampling
distribution?
A. 11.5 B. 12 C. 12.5 D. 13
7. What is the frequency of the sample mean 12.5?
A. 0 B. 1 C. 2 D. 3

205
8. What is the probability of the sample mean 12 in the sampling
distribution?
1 2 2 3
A. B. C. D.
6 6 3 4

9. Which of the following histogram correctly shows the sampling


distribution of sample mean of size n = 2?
A. C. 2
2 6
6
1
1 6
6

B. 2 0
D. 0 11.5 12 12.5 13 13.5
11.5 12 12.5 13 3
6 13.5 3
2
1 3
6 1
3

0 12 12.5 13 13.5 4 0 12 12.5 13 13.5 4

For numbers 10-15, refer to the following population consists of the data
{2,4, 6, 8, 10}.
10. How many different samples of size n = 3 can be drawn from the
population?
A. 6 B. 8 C. 10 D. 12
11. Which of the following sample mean appears most frequent in the
sampling distribution?
A. 3.67 B. 4 C. 4.67 D. 6
12. What is the lowest value of the sample means in this sampling
distribution?
A. 3.67 B. 4 C. 4.67 D. 6
13. What is the frequency of the sample mean 4.67?
A. 0 B. 1 C. 2 D. 3
14. What is the probability of the sample mean 5.33 in the sampling
distribution of the sample mean?
1 1 1 2
A. B. C. D.
10 6 5 5
15. Which of the following histogram correctly shows the sampling
distribution of sample means of size n = 3?
A. 2 C. 2
10 10

1
1
10
10

2
B. 10 0 4 4.67 5.33 6 6.67 7.33 8 D. 2
10 0 4 4.67 5.33 6 6.67 7.33 8

1
1
10
10

0 4.67 5.33 6 6.67 7.33 8 8.67


0 4.67 5.33 6 6.67 7.33 8 8.67

206
Additional Activities
Ask a family with five members in your community. Now, consider samples of size 3
that can be drawn from the data.

Answer the following questions:


1. How many possible samples of size 4 can be drawn?
2. List all the possible samples and compute the sample mean.
3. Construct the sampling distribution of the sample mean.
4. Draw a histogram corresponding to the sampling distribution of the sample
mean.

Sample Sample Mean Frequency Probability P(x)

207
Statistics and
Probability
Finding the Mean and Variance
of the Sampling Distribution of
the Sample Mean
Statistics and Probability
Finding the Mean and Variance of the Sampling Distribution of the Sample Mean
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module

Author: Hermenegildo R. Pineda


Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Edna E. Eclavea
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

208
Week

6
What I Need to Know
This module was designed and written with you in mind on how you can
determine the mean and variance of the sampling distribution of means in solving
real-life situated problems. Also, you will learn the properties of the sampling
distribution of the sample mean for finite and infinite populations. These topics in
statistics are the foundation of all researchers in various fields (engineering,
education, science, e.g.) that would like to understand the behavior, properties, and
characteristics of the population being observed.
The module has one lesson:
 Lesson 1 - Mean and Variance of the Sampling Distribution of the
Sample Mean

At the end of this module, you are expected to find the mean and variance of the
sampling distribution of the sample mean.

What I Know
Let us see how far you will go and how much you know about this pretest. Let’s get
started!

Direction: Read and analyze each item carefully. Shade the entire circle for each
statement that corresponds to your answer. The test has a total of 15 points.
A B C D 1. Which of the following measures the dispersion of the
O O O O sampling distribution of the sample mean and is
calculated using the relation
∑(𝑋 − 𝜇)2 • 𝑃(𝑋) or ∑[𝑋2 𝑃(𝑥)] − 𝜇2 ? (1 point).

A. mean of the sampling distribution of sample mean


B. variance of the sampling distribution of the sample
mean
C. standard deviation of the sampling distribution of
the sample mean
D. average deviation of the sampling distribution of the
sample mean

A B C D 2. Which of the following is the notation used for the mean


O O O O of the sampling distribution of sample means? (1 point)
A. 𝜇 C. 𝜎
B. 𝜇𝑥̅ D. 𝜎𝑥̅

A B C D 3. Which of the following equations is the formula used to


O O O O determine?
𝜎 1−𝑛
A. 𝜎𝑥̅ = •√ C. 𝜎𝑥̅2 = ∑[𝑃(𝑋̅ ) • (𝑋̅ − 𝜇)2 ]
√𝑛 𝑁−1
𝜎2 1−𝑛
B. 2
𝜎𝑥̅ = • D. 𝜎𝑥̅2 = ∑[𝑃(𝑋̅) • (𝜇̅ − 𝑋̅)2 ]
𝑁 𝑁−1

209
A B C D For numbers 4-8, please refer to the problem below.
O O O O A population consists of five (5) measurements 2, 3, 6, 5,
and 7.

4. What is the mean of the population? (2 points)


A. 3.44 C. 4.92
B. 4.60 D. 5.20

A B C D 5. What is the variance of the population? (2 points)


O O O O A. 3.44 C. 4.92
B. 4.60 D. 5.20

A B C D 6. How many different samples of size 𝑛 = 2 can be


O O O O drawn from the population? (2 points)
A. 7 C. 9
B. 8 D. 10

A B C D 7. What is the mean of the sampling distribution of


O O O O the sample means? (3 points)
A. 3.44 C. 4.92
B. 4.60 D. 5.20

A B C D 8. What is the variance of the sampling distribution of the


O O O O sample means with a sample size of 2? (3 points)
A. 0.89 C. 2.20
B. 1.29 D. 3.90

Rubric for 3-point item:

Equivalent points Description


3 points Complete solution with the correct answer
2 points Last major step of the solution is incorrect
1 point One major step of the solution is correct
0 point Wrong solution or no answer

Rubric for 2-point item:

Equivalent points Description


2 points Complete solution with the correct answer
1 point Half of the solution is correct
0 point Wrong solution or no answer
Lesson 1 Mean and Variance of the Sampling
Distribution of the Sample Mean
Statisticians do not just define the discrepancy of the individual data values
about the mean of the population. They are also concerned to know how the means
of the samples of the same size taken from the same group vary about the population
mean. In this lesson, you will learn how to describe the sampling distribution of the
sample mean by computing its mean and variance.

210
What’s In
Let’s have this first for a review. Find out if you are ready to learn the next lesson by
answering the activity.
A. The following are sample data obtained from the Statistics quiz of 22 students
in Math High School in Quezon Province. Compute the mean and variance of
the frequency distribution.

Scores F Scores f
61 1 84 2
68 1 86 2
71 1 88 2
72 1 89 1
75 1 90 1
81 1 91 3
82 1 95 1
83 2 96 1
Total 22
B. Mean and Variance of the Probability Distribution
Complete the table below and find the mean and variance of the given
probability distribution.

𝑿 𝑷(𝑿) 𝑿 • 𝑷(𝑿) (𝑿 − 𝝁) (𝑿 − 𝝁)𝟐 (𝑿 − 𝝁)𝟐 •𝑷(𝑿)


1 0.134
2 0.231
3 0.09
4 0.165
5 0.112
6 0.07
7 0.198
What’s New
Can you identify the steps in finding the mean and variance of the sampling
distribution of sample means? Study the following problem to perform this task.
Instruction: Identify the steps in solving the mean and variance of the sampling
distribution of the sample means by analyzing each given illustration.
Mark is conducting a survey on grade 12 students of Nasyonalismo High
School. He found out that there are only few students who knew about the makers
of the Philippine flag consisting of 1, 2, 3, 4, and 5 SHS students from 5 sections.
Suppose that the sample size of 2 sections were drawn from this population (without
replacement), describe the sampling distribution of the sample means.

211
1. What is the mean and variance of the sampling distribution of the
sample means?
2. Compare these values to the mean and variance of the population.
Illustration/Solution Steps
(1) µ = 𝛴𝑥/𝑁
= (1+2+3+4+5) / 5 1. _______
= 3.00

(2) σ2 = Σ(X- µ)2/ N 2._______


𝑿 (a) X−µ (b)(𝑿 − µ)𝟐 = 10 / 5
1 -2 4 =2 (a)_______
2 -1 1
3 0 0 So, the variance of the (b)_______
4 1 1 population is 2.
5 2 4
Σ(x- µ)2=10

𝑵!
Use the formula (3) NCn = . Here N=5 and n=2.
𝒏!(𝑵−𝒏)!
3. _______
5C2 = 10
So, there are 10 possible samples of size 2 that can be drawn.

(4) Samples (4) Mean 4. _______


1,2 1.5
1,3 2.0
1,4 2.5
1,5 3.0
2,3 2.5
2,4 3.0
2,5 3.5
3,4 3.5
3,5 4.0
4,5 4.5

212
(5) Sampling Distribution of Sample 5. _______
Means
Sample Frequency Probabilit
Mean y
̅)
(𝑿 ̅)
𝑷(𝑿
1.5 1 1/10
2.0 1 1/10
2.5 2 1/5
3.0 2 1/5
3.5 2 1/5
4.0 1 1/10
4.5 1 1/10
Total 10 1.00

(6)
6. _______
Sample Mean Probability (a)
̅)
(𝑿 ̅)
𝑷(𝑿 (𝑿̅ ) • 𝑷(𝑿
̅)
a. _______
1.5 1/10 0.15
2.0 1/10 0.20 b._______
2.5 1/5 0.50
3.0 1/5 0.60
3.5 1/5 0.70
4.0 1/10 0.40
4.5 1/10 0.45
Total 1.00 (b) 3.00
𝜇𝑥̅ = ∑[𝑋̅ • 𝑃(𝑋̅)]
𝝁𝒙̅ = 𝟑. 𝟎𝟎

Sample Probability (a) (b) (c) 7._______


Mean S. Mean – (S. Mean- Probability•((S
P.Mean P. Mean)2 . Mean-P. a._______
̅)
(𝑿 ̅)
𝑷(𝑿 ̅ − 𝝁)𝟐
(𝑿 Mean)2
̅ − 𝝁) ̅ ) • (𝑿
̅ − 𝝁)𝟐 b._______
(𝑿 𝑷(𝑿
1.5 1/10 -1.5 2.25 0.225
c._______
2.0 1/10 -1.0 1.00 0.100
2.5 1/5 -0.5 0.25 0.050 d._______
3.0 1/5 0.0 0.00 0.000
3.5 1/5 0.25 0.25 0.050
4.0 1/10 1.00 1.00 0.100
4.5 1/10 2.25 2.25 0.225
Total 1.00 (d) 0.750

Note: S. Mean = sample mean, P. Mean= population mean

So, the (7) variance of the sampling distribution is 0.75

213
Solution:
1. Compute the mean of the population using the formula µ = 𝛴𝑥/𝑁. The value
equals to 3.0.
2. Compute the variance of the population using the formula 𝜎 2 = 𝛴(𝑥 − µ)2 / 𝑁.
a. Subtract each measurement by the computed population mean.
b. Square the results obtained in (a) then add. Divide the sum by the frequency
of measurement to get the value of the population variance. The value equals
to 2.0.
3. Determine the number of possible samples of size 2 (without replacement). Use
the combination formula NCn where N is the population size and n is the sample
size.
4. List all possible samples and compute the corresponding means.
5. Construct the sampling distribution of the sample means.
6. Compute the mean of the sampling distribution of the sample means.
Follow these steps:
a. Multiply each sample mean by the corresponding probability.
b. Add the results obtained in a. (The sum of the values corresponding
to that column is the mean of the sampling distribution of the sample
means.)
̅̅̅ • 𝑃(𝑋̅)]
𝜇𝑥̅ = ∑[𝑋
7. Compute the variance of the sampling distribution of the sample means
using the formula 𝜎𝑥̅2 = ∑[𝑃(𝑋̅) • (𝑋̅ − 𝜇)2 ].

Follow these steps:


a. Subtract the population mean from each sample mean.
b. Square the difference obtained in a.
c. Multiply each result in b by the corresponding probability.
d. Add the results in c. (The sum is the variance of the sampling
distribution of the sample means.).

MIND THIS: The mean of the population is equal to the sampling


distribution of sample mean

What is It
Based on the previous activity, the mean of the population is equal to the
sample mean. This happens because all samples obtained of n size came from the
same population. The variance of the population is greater than the variance of the
sample means since a sample of n is less than the population N then the data vary
by a small amount as compared to the population. The activity also taught you the
steps in finding the mean and variance of the sampling distribution of the sample
mean.
Mean of the Sampling Distribution of the Sample Mean
𝜇𝑥̅ = ∑[𝑋̅ • 𝑃(𝑋̅)]
Variance of the Sampling Distribution of the Sample Mean

214
𝜎𝑥̅2 = ∑[𝑃(𝑋̅) • (𝑋̅ − 𝜇)2 ] 𝑜𝑟 ∑[𝑋̅ 2 • 𝑃(𝑋̅)] − 𝜇2 or
𝜎2 𝑁−𝑛
𝜎𝑥2 = • for finite population
𝑛 𝑁−1

𝜎2
𝜎𝑥̅2 = for infinite population
𝑛

Where: 𝜇𝑥̅ = mean of the sampling distribution of the sample mean


𝜎𝑥̅2 = variance of the sampling distribution of the sample mean
∑[𝑋̅ • 𝑃(𝑋̅)] = sum of the products of the sample mean and the
probability of the sample mean
𝜇 = population mean
𝜎 2 = population variance
𝑛 = sample size
𝑁 =population size
𝑋̅ = sample mean
𝑃(𝑋̅) = probability of the sample mean
(𝑋̅ − 𝜇)2 = square of the difference between the sample mean and
population mean
∑[𝑃(𝑋̅) • (𝑋̅ − 𝜇)2 ] = summation of the products of probability of the
sample mean and the square of the difference between the sample mean
and the population mean
𝜇2 = square of the population mean
𝑋̅ 2 = square of the sample mean
𝑋̅ 2 • 𝑃(𝑋̅)= product of the square of the sample mean and the
probability of the sample mean
∑[𝑋̅ 2 • 𝑃(𝑋̅ )] = sum of the product of the square of the sample mean and
the probability of the sample mean

What’s More
Independent Activity
MEAN AND VARIANCE OF THE SAMPLING DISTRIBUTION OF THE SAMPLE
MEAN.

Direction: Perform what is being asked. Follow the steps indicated in the first
column of the table to complete the activity. In the second column, write your
solution on the spaces provided.
Situation: A population of PWD learners in 5 schools in a certain municipality of
Quezon Province are (𝒙) 9, 5, 6, 12, and 15. Suppose that two schools were selected
as samples, determine the mean and variance of the sampling distribution of sample
mean.

215
Steps Illustration/Solution
1. Compute the mean Formula:
of the population µ. (1) µ = 𝛴𝑋/𝑁= __________

The mean of the population is ________.

2. Compute the
variance of the 𝑿 (a) 𝑿 − µ (b) (𝑿 − µ)𝟐
population 𝜎 2 .

(a) Subtract each


measurement by
the computed
population mean
(𝒙 − µ) (2) 𝜎 2 = 𝛴(𝑋 − µ)2 / 𝑁 =_________
(b) Square the results So, the variance of the population is ______.
obtained in (a)
then add. Divide
the sum by the
frequency of
measurements to
get the value of the
population
variance.
𝛴(𝑥 − µ)2 / 𝑁

3. Determine the
number of possible (3) Use the formula NCn. =
𝑁!
=__________
𝑛!(𝑁−𝑛)!
samples of size n = 2.

4. List all possible


samples and their Samples of size 2 (4) Mean (4)
corresponding
means.

216
5. Construct the (5) Sampling Distribution of Sample Means
sampling distribution Sample Mean Frequency Probability
of the sample means. ̅
(𝑿) (𝒇) ̅)
𝑷(𝑿

Total (f)

Sample Probability (a) S. Mean • Probability


6. Compute the
Mean
mean of the ̅) ̅) ̅ ) • 𝑷(𝑿
̅)
(𝑿 𝑷(𝑿 (𝑿
sampling
distribution of the
sample means.
Follow these steps:

a. Multiply the
sample mean by
the corresponding
probability. Total (b)

b. Add the results.

Note: S. Mean = Sample Mean


(6) 𝜇𝑥̅ = ∑[𝑋̅ • 𝑃(𝑋̅)]
𝜇𝑥̅ = ______

217
7. Compute the
variance of the
sampling distribution Sampl Probability (a) (b) (c)
of the sample mean. e Mean S. Mean (S. Mean- Probability•(S.
Follow these steps: – P.Mean P. Mean)2 Mean-P.
a. Subtract the Mean)2
population mean from ̅)
(𝑿 ̅)
𝑷(𝑿 ̅ − 𝝁)
(𝑿 ̅ − 𝝁)𝟐
(𝑿
each sample mean. ̅ ) • (𝑿
𝑷(𝑿 ̅ − 𝝁)𝟐

b. Square the
difference.

c. Multiply the results


by the corresponding
probability.
Total (d)
d. Add the results. Note: S. Mean = Sample Mean, P. Mean = Population Mean
(7) Variance:

𝜎𝑥̅2 = ∑[𝑃(𝑋̅) • (𝑋̅ − 𝜇)2 ]= ____________

So, the variance of the sampling distribution of sample the


means is _________.

1. How did you find the activity?


2. How do you compare the mean of the sample mean and the mean of the
population?
3. How does the variance of the sample mean and the variance of the population
differ?

MIND THIS: In computing for the variance of the sampling distribution of


the sample means, you may use the formula 𝜎𝑥̅2 = ∑[𝑃(𝑋 ̅ ) • (𝑋̅ − 𝜇)2 ] or
𝜎 2 𝑁−𝑛
𝜎𝑥̅2 = 𝑛 • 𝑁−1 for finite population (sampling without replacement) and
𝜎2
𝜎𝑥̅2 = for infinite population (sampling with replacement). Also, the
𝑛
𝜎
standard error of the mean is computed using the formula 𝜎𝑋̅ =
√𝑛

Independent Assessment
Direction: Provide step-by-step solutions to the given problem below. Write your
answers on the spaces provided.
Consider all samples of size 2 from the population 2, 5, 6, 9, 11, and 13. Compute
the mean, and variance of the sampling distribution of the sample mean.

218
First step:

Second step
a)

b)

Third step

Fourth step

Fifth step

Sixth step
a)

b)

Seventh step
a)

b)

c)

d)

What I Have Learned


Answer the following questions.
1. What are the 7 step-rule in finding the mean and variance of the sampling
distribution of the sample mean?

2. What are the formulas that can be used to determine the mean and variance of
the sampling distribution of the sample mean for finite population (sampling
without replacement) and for infinite population (sampling with replacement)?

What I Can Do
Make use of your acquired skills in this lesson.
Direction: Perform what is being asked. Solve the problem by showing the step-by-
step process.

219
Situation: Joe, a DOH COVID-19 tracker, found out that the number of infected
persons of COVID-19 in seven low-risk provinces of the country are 1, 9, 2, 12, 8, 7,
and 10. Suppose that 5 provinces are drawn as sample.

1. What is the mean (𝜇), variance (𝜎 2 ) and standard deviation of the population (σ)?
2. How many different samples of size n =5 can be drawn from the population?
List them with their corresponding mean.
3. Construct the sampling distribution of the sample mean.
4. What is the mean (𝜇𝑥̅ ) of the sampling distribution of the sample mean? Compare
this to the mean of the population.
5. What is the variance of the (𝜎𝑥̅2 ) of the sampling distribution of the sample mean?
Assessment
Direction: Read and analyze each item carefully. Shade the entire circle for each
statement that corresponds to your answer. This test is equivalent to 15 points.
A B C D 1. Which of the following is described as the quotient of
O O O O the population variance and the sample size n? (1 point)
A. mean of the sampling distribution of sample
mean
B. variance of the sampling distribution of the
sample mean
C. standard deviation of the sampling
distribution of the sample mean
D. average deviation of the sampling distribution
of the sample mean

A B C D 2. Which of the following is the notation used for the


O O O O variance of the sampling distribution of the sample
mean? (1 point)
A. μ C. 𝜎 2
2
B. 𝜎𝑥̅ D. 𝜇𝑥̅

A B C D 3. Which of the following is the formula used to


O O O O determine the variance of the sampling distribution of
the sample mean? (1 point)
𝜎 1−𝑛
A. 𝜎𝑥̅ = •√ C. 𝜎𝑥̅2 = ∑[𝑃(𝑋̅) • (𝑋̅ − 𝜇)2 ]
√𝑛 𝑁−1

𝑠2
B. 𝜎𝑥̅2 = D. 𝜎𝑥̅2 = ∑[𝑃(𝜇̅ ) • (𝑋̅ − 𝜇)2 ]
𝑁+1

A B C D For numbers 4-8, please refer to the problem below.


O O O O A population of a certain LGBTQ+ community is consists
of six (6) measurements 2, 3, 6, 9, 11, and 5.

4. What is the mean of the population? (2 points)


A. 10 C. 6
B. 8 D. 5

A B C D 5. What is the variance of the population? (2 points)


O O O O A. 10 C. 6
B. 8 D. 5

220
A B C D 6. How many different samples of size n= 4 can be drawn
O O O O from the population? (2 points)
A. 20 C. 8
B. 15 D. 6

A B C D 7. What is the mean 𝑥̅ of the sampling distribution of the


O O O O sample mean? (3 points)
A. 4 C. 6
B. 5 D. 7

A B C D 8. What is the variance of the sampling distribution of the


O O O O sample mean? (3 points)
A. 4 C. 2
B. 3 D. 1

Rubric for 3-point item:

Equivalent point/s Description


3 points Complete solution with the correct answer
2 points Last major step of the solution is incorrect
1 point One major step of the solution is correct
0 point Wrong solution or no answer

Rubric for 2-point item:

Equivalent Description
point/s
2 points Complete solution with the correct answer
1 point Last major step of the solution is incorrect
0 point Wrong solution or no answer
Additional Activities
Miss Mathica Kumpleto (MMK), a SHS Math teacher in Mathalino National
High School would like to find the mean and variance of the sampling distribution
of the sample mean of the height of her students by forming a group consisting of
6 members. She also said that they need to determine the height (in meters) of each
member of the group and to get a list of the average height of each group. She
further instructed her students to submit their answer with the following data.

1. List all possible samples of size 2 and their corresponding mean.


2. Construct the sampling distribution of the sample mean.
3. Calculate the mean of the sampling distribution of the sample mean. Compare
this to the mean of the population.
4. Calculate the variance of the sampling distribution of the sample mean.
Compare this to the variance of the population.

221
Statistics and
Probability
Defining Sampling Distribution
of the Sample Mean for Normal
Population
Statistics and Probability
Defining Sampling Distribution of the Sample Mean for Normal Population
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module

Writers: Hermenegildo R. Pineda


Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Kelvin A. Ramintas and Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

222
Week

6
What I Need to Know
This module was designed and written for you to learn the definition of the
sampling distribution of the sample mean for the normal population when the
variance is known and unknown. Also, you will learn some of the probability
distributions that are used to determine the probability value of a certain event for
small and large samples. This serves as a tool for statisticians or any interesting
group who wants to test the sample mean using statistical formulas and to make a
rightful decision in the future.
The module has one lesson:
 Lesson 1 - Defining sampling distribution of the sample mean for
normal population when the variance is:
a) known
b) unknown
At the end of this module, you are expected to define the sampling distribution
of the sample mean for the normal population when the variance is known and
unknown.

What I Know
Let us see how far you will go and how much you know about this pretest. Let’s get
started!

Direction: Read and analyze each item carefully. Shade the entire circle for each
statement that corresponds to your answer.

A B C D 1. This distribution is used to estimate population parameters


O O O O when the population variance is unknown, and the sample size
is less than 30.
A. Z-Distribution
B. t-Distribution
C. Chi-Square Distribution
D. Pearson Correlation Coefficient
A B C D 2. Which of the following notations is used for the standard
O O O O deviation of the sampling distribution of the sample mean?
A. 𝜇 C. 𝜎
B. 𝜇𝑥̅ D. 𝜎𝑥̅
A B C D 3. Which of the following equations is the formula used to
O O O O transform the sample mean to an approximately standard
normal variable Z when the population variance is known?
𝑥−𝜇 𝑥̅ −𝜇
A. 𝑧 = 𝑠 C. 𝑧 = 𝑠
( ) ( )
√𝑛 √𝑛
𝑥−𝜇 𝑥̅ −𝜇
B. 𝑧 = 𝜎 D. 𝑧 = 𝜎
( ) ( )
√𝑛 √𝑛
A B C D 4. The mean weight of two thousand (2000) 11-year-old children
O O O O is 35 kilograms with a standard deviation of 5 kilograms.
Samples of 25 children were selected and the mean weight of
this group was found to be 34 kilograms with a standard
deviation of 3 kilograms. What condition of the sampling

223
distribution of the sample mean for normal population
manifests in the situation?
A. Population variance is known
B. Population variance is unknown
C. Sample standard deviation is small
D. Sample standard deviation is large
A B C D 5. Which of the following formulas is used to calculate the
O O O O standard deviation of the sampling distribution of the sample
mean when the population standard deviation is unknown?
𝜎 𝑁+𝑛 𝑠
A. 𝜎𝑥̅ = • √ C. 𝑠𝑥̅ =
√𝑛 𝑁−1 √𝑛
𝜎 𝑠+1
B. 𝜎𝑥̅ = D. 𝜎𝑥̅ =
√𝑛 √𝑛
A B C D 6. A population consists 1, 9, 2, 12, 8, 7, and 10. Suppose samples
O O O O of size 5 are drawn from this population. This situation is an
example of the sampling distribution of the sample mean for a
normal population wherein population variance is __________.
A. unknown C. large
B. known D. small

A B C D 7. Suppose that the rice mill wants to test its product to find out
O O O O whether it meets the average weight of 50 kilograms.
Twenty-five sacks of rice are weighed and found to have an
average weight of 49.8 kg with a standard deviation of 0.12 kg.
The rice mill owner is in a situation of the sampling
distribution of the sample mean wherein the population
variance is __________.
A. unknown C. large
B. known D. small
A B C D 8. Suppose the manager of a Jollibee Corp. wishes to know
O O O O whether their chicken packaging machine still packs accurately
after 6 months of use. The machine was rated to pack
1,500-gram pack of chicken with a standard deviation of 30
grams. The statistician tests a sample of 40 packs to test the
accuracy of the machine. What example of sampling
distribution of the sample mean for normal population portrays
the situation?
A. variance is known
B. variance is unknown
C. standard deviation is small
D. standard deviation is large
A B C D 9. This distribution is used to determine the probability of an
O O O O event by transforming the mean of the sample to an
approximately standard normal variable if the population
variance is known and sample size is greater than 30.
A. t-Distribution
B. Z-Distribution
C. Chi-Square Distribution
D. Pearson Correlation Coefficient

224
A B C D 10. Given 𝜇 = 15, 𝑠 = 5, and 𝑛 = 16. This is an example of the
O O O O sampling distribution of the sample mean for a normal
population when the population variance is _______.
A. unknown
B. known
C. large
D. small
A B C D 11. Which of the following formulas is used to calculate the
O O O O standard deviation of the sampling distribution of the sample
mean when the population standard deviation is given?
𝜎 𝑁−𝑛 𝑠
A. 𝜎𝑥̅ = • √ C. 𝑠𝑥̅ =
√𝑛 𝑁−1 √𝑛
𝜎 𝑠+1
B. 𝜎𝑥̅ = D. 𝜎𝑥̅ =
√𝑛 √𝑛

A B C D 12. Mr. Pineda administered a Mathematics Achievement Test to a


O O O O random 45 graduating SHS students. In this example, the mean
and standard deviation are 95 and 11. The population
parameters are μ=89 and σ=8 . What example of the sampling
distribution of the sample mean for the normal population
portrays the situation?
A. variance is known
B. variance is unknown
C. standard deviation is small
D. standard deviation is large
A B C D 13. Given μ=90, 𝑋̅=92 s=5, and n=28. This is an example of the
O O O O sampling distribution of the sample mean for a normal
population when the population variance is _______.
A. unknown C. large
B. known D. small

A B C D 14. A sample of five measurements, randomly selected from


O O O O approximately normally distributed population, resulted in the
summary statistics: 𝑋̅=34.7, s=10.2. The population mean is
equal to 36. The given data is an example of the sampling
distribution of the sample mean for a normal population
wherein the variance is _________?
A. unknown C. large
B. known D. small
A B C D 15. Given μ=15, σ=5, 𝑋̅=12.8 and n=100. This is an example of the
O O O O sampling distribution of the sample mean for a normal
population when the population variance is _______.
A. unknown C. large
B. known D. small

Lesson 1 Defining Sampling Distribution of


the Sample Mean for Normal Population
As stated from the previous lessons, one of the most important uses of statistics is
to arise a valid generalization of the population based on statistics of the sample.
Since many samples can be chosen from the same population, each sample having
its own statistics- the mean, variance etc.-quite different from those of other samples,
the variation poses a problem regarding the reliability of the characteristics of the

225
population estimated from one sample. However, if the samples are probability
samples, the reliability of the results can be measured and at the same time, the
parameters can be estimated with a high degree of confidence. This lesson is intended
to define those statistical techniques designed to meet these ends.
What’s In
Let’s have a review first. Find out if you are ready to learn the next lesson by answering
the activity.
Mark is conducting a survey on Senior High School students who are active on
environmental activities on their community. Given a population consisting of the
following measurements (𝑋) 2, 8, 11, 7, 𝑎𝑛𝑑 3. Suppose that samples of size 2 are drawn
from the population. Compute the mean and variance of the sampling distribution
of the sample mean. Complete the table below to accomplish this task.
Table 1: List of samples of size 2 and the corresponding sample mean.
Sample Frequency Probability
Mean (𝒇) ̅)
𝑷(𝑿
̅)
(𝑿

Total (f)
Table 2: Sampling Distribution of the Sample Mean
Samples of size 2 Sample Mean

226
Table 3: Mean of the Sampling Distribution of the Sample Mean.

Sample Probability Sample Mean •


Mean Probability
̅)
(𝑿 ̅)
𝑷(𝑿 ̅ ) • 𝑷(𝑿
(𝑿 ̅)

Total

Table 4: Variance of the Sampling Distribution of the Sample Mean

Sample Probability Sample Mean – (Sample. Mean- Probability• (Sample.


Mean Population. Population. Mean-Pop. Mean)2
Mean Mean) 2

(𝑋̅) 𝑃(𝑋̅) (𝑋̅ − 𝜇) (𝑋̅ − 𝜇)2 𝑃(𝑋̅) • (𝑋̅ − 𝜇)2

Total

What’s New
Activity 1: KNOWING MO NA?
Instruction: Read and analyze the following situations. Determine whether the
following statements have a known or unknown population variance. Write your
answer on the space provided. Identify also the formula to be used to estimate the
standard error of the mean by writing the symbol 𝜎𝑥̅ , when the population variance
is known and the symbol 𝑠𝑥̅ when the population variance is unknown

Situation Answer Standard Error


Formula
1. Because of Inclusive Education, learners with
disabilities are also part of the normal students.
Consider a population of the PWD learners
consisting of 1, 2, 3, 4, and 5. Samples of size 2 are
drawn from this population.
2. Given 𝜇 = 15, 𝑠 = 6 𝑎𝑛𝑑 𝑛 = 16
3. IQ scores are normally distributed with standard
deviation 𝜎 = 15 𝑝𝑜𝑖𝑛𝑡𝑠. You wish to determine the
𝜇 IQ of all students in Matiyaga National High
School by taking a random sample of 16 students.

227
4. The general weighted average (GWA) of the Grade 7
applicants for admission in QSHS over the past 7
years has been consistently
𝜇 = 92.78. This year a sample data of the applicant’s
GWA are 98, 89, 92, 95, 96, 90, 91, 92, 97 and 93
5. A finite population composed of six items whose
values are 2, 3, 6, 7, 8, and 10. Samples of 3 items
are drawn at random without replacement.

Solution:
8. Known – This is a population data. Although the value of the variance is not
given, you can still determine the population mean and population variance of
∑𝑋
the data using the formulas 𝜇 = , 𝜎 2 = 𝛴(𝑥 − µ)2 / 𝑁 respectively. So, this
𝑁
situation is an example of the sampling distribution when the variance is known.
𝜎
Since the population variance can be calculated, used 𝜎𝑥̅ = in the computation
√𝑛
of the standard error of the mean.
9. Unknown- The population variance is unknown. The given values are the
population mean 𝜇 = 15, the sample standard deviation 𝑠 = 6, and the sample
𝑠
size 𝑛 = 16. Use the formula 𝑠𝑥̅ = for the standard error of the mean.
√𝑛
10. Known- the population variance is given with the value of the square of the
𝜎
standard deviation 𝜎 2 = 152 . Apply the formula 𝜎𝑥̅ = in computing for the
√𝑛
standard error of the mean.
11. Unknown- The population variance is unknown. The only given values are the
𝑠
population mean 𝜇 = 92.78 and the sample size 𝑛 = 10. Use the formula 𝑠𝑥̅ = for
√𝑛
the standard error of the mean.
12. Known – This is a population data, you can determine the population mean and
∑𝑥
variance of the data using the formulas 𝜇 = , 𝜎 2 = 𝛴(𝑥 − µ)2 / 𝑁. So, this
𝑁
situation is an example of the sampling distribution where the population
𝜎
variance can be computed. Apply the formula 𝜎𝑥̅ = to solve for the standard
√𝑛
error of the mean.
What is It
Based on the previous activity, you learned from the situations presented that by
analysis, you can easily find out if the given problem provides the value of the
population variance or if the population variance is unknown. Also, when the
population variance is known, the standard deviation of the sampling distribution of
𝜎 𝑠
the mean is computed using the formula 𝜎𝑥̅ = , while the formula 𝑠𝑥̅ = is used to
√𝑛 √𝑛
estimate the standard error of the mean when the population variance is unknown.
Distribution of the Sample Mean for Normal Population
1. Population variance 𝜎 2 is known
The population has a mean μ and variance of 𝜎 2 , the distribution of the
sample mean is (at least approximately) normal and standard error of the
𝜎
mean 𝜎𝑥̅ = , where σ is the population standard deviation and 𝑛 is the sample
√𝑛
size. To determine the probability of a certain event, we can use the 𝑧 −
𝑑𝑖𝑠𝑡𝑟𝑖𝑏𝑢𝑡𝑖𝑜𝑛 by transforming the mean of the sample data to an approximately

228
𝑥̅ −𝜇
normal variable 𝑧 , using the relation 𝑧 = 𝜎 . This distribution is best applied
√𝑛
for large sample sizes, say 𝑛 ≥ 30.

2. Population variance 𝜎 2 is unknown


𝑠
The standard error of the mean becomes 𝑠𝑥̅ = , where 𝑠 is the point-
√𝑛
estimate of 𝜎 (population standard deviation) or the sample standard deviation
and 𝑛 is the sample size. To estimate the population parameters, we can use
𝑥̅ −𝜇
the 𝑡 − 𝑑𝑖𝑠𝑡𝑟𝑖𝑏𝑢𝑡𝑖𝑜𝑛 by using the formula 𝑡 = 𝑠 . Remember that as n the
√𝑛
sample size is very large, the standard deviation 𝑠 is almost indistinguishable
from the population standard deviation 𝜎 and therefore 𝑡 and 𝑧 distributions
are essentially identical. Remember that, we use the 𝑡 − 𝑑𝑖𝑠𝑡𝑟𝑖𝑏𝑢𝑡𝑖𝑜𝑛 for small
sample size, say 𝑛 < 30.

What’s More
Independent Activity
DEFINE THE SAMPLING DISTRIBUTION OF THE SAMPLE MEAN FOR NORMAL
POPULATION WHEN THE VARIANCE IS KNOWN AND UNKNOWN.
Instruction: Read and analyze the following situations and supply the values of the
following variables (if there is any). On the third column, write known on the space
provided if the situation gives or can compute the value of the variance, otherwise
write unknown. Identify also the formula to be used to estimate the standard error
of the mean by writing the symbols 𝜎𝑥̅ , when the population variance is known and
𝑠𝑥̅ when the population variance is unknown.

Situation Given Answer Standard


error
formula
1. Consider a population 𝑁 = ____, 𝑛 = ______,
consisting of 1, 2, 3, 4, 5, 8,
9, and 11. Samples of size 3
are drawn from this
population.
2. Given the population mean 𝑛 = ______
of 12, and a sample
𝜇 = ______
standard deviation of 3 in a
sample size of 125 𝑠 = ______
3. A population composed of 𝑁 = ____, 𝑛 = ______,
11 items whose
measurements are 12, 7,
9,11,8, 20, 23, 18, 13, 22,
and 10. Samples of 5 items
are drawn at random
without replacement.
4. The owner of the drugstore 𝑁 = ____, 𝑛 = ______,
chain has 100 drugstores
𝜇 = ______,
that sell Flexcoslow, a
surgical mask that prevents 𝜎 2 = ______,

229
a person from catching ̅
𝑋 = _______
liquid droplets of viruses.
His operation manager said 𝑠 = _______
that the average drugstores
in his chain sell 12 boxes of
the Flexcoslow a day with a
standard deviation of 2
boxes. To find out if this is
true, the owner randomly
selected 10 stores and
found that the mean is 15
boxes with a standard
deviation of 1 box.
5. The mean height of the 𝑛 = _____,𝜇 = ______,
students at Maligaya ̅
𝑋 = _____, 𝑠 = ______
Elementary School is
137.16 cm. To find out if the
statement is true, a sample
of 15 students is taken from
the population. The mean of
the sample is 139.1 cm with
a standard deviation of 2.23
cm.
4. How did you find the activity?
5. How does each situation differ from each other?
6. What is the difference between the two symbols 𝜎𝑥̅ and 𝑠𝑥̅ in finding the
standard error of the mean?
Independent Assessment
Instruction: Read and analyze the following situations and supply the values of the
following variable in the second column. Determine whether the following statements
have a known or unknown population variance and provide a brief explanation to
support your answer. Also, identify the formula to be used for the standard error of
the mean by writing 𝜎𝑥̅ , when the population variance is known and 𝑠𝑥̅ when the
population variance is unknown.

Situation Given Answer Standard


Error Formula
1. Consider a population consisting of 15, 2, 𝑁 = ____
12, 4, 5, 8,6, 9 and 17. Samples of size 5
𝑛 = _____
are drawn from this population.

2. Given the population mean of 98, and a 𝑛 = ______,


sample standard deviation 2 in a sample
𝜇 = ______,
size of 45.
𝑠 = ______
3. A population composed of 10 items whose 𝑁 = _____,
measurements are 12,11, 15, 8, 20, 23, 𝑛 = ______,
18, 13, 22, and 10. Samples of 6 items are
drawn at random without replacement.

230
4. An SHS teacher claims that the average 𝑛 = _____
time it takes a group of students to
𝜇 = ______
complete the Mathematics examination is
50.5 minutes with a variance of 17.64 𝜎 2 = ______
𝑚𝑖𝑛𝑢𝑡𝑒𝑠 2 . She randomly selected 45 𝑋̅ = _______
students and found to have a mean of 52
minutes and a standard deviation of 3.5 𝑠 = _______
minutes. She then used the z-distribution
to find out if the group can complete the
exam faster than the population.
5. A manufacturer of lightbulbs produces 𝜎 = ______ , ,
bulbs that last a mean of 800 hours with
𝑛 = ______ ,,
a standard deviation of 100 hours. To
assess the claim of the manufacturer, a 𝜇 = ______ ,
random sample of 12 of these bulbs was 𝜎 2 = _____
tested and found to have a mean of 790 𝑋̅ = _______
hours. He then used the test variate 𝑡 = 𝑠 = _______
𝑥̅ −𝜇
𝑠 .
√𝑛

What I Have Learned


1. What formula is used to determine the standard error of the mean when
the population variance is unknown?
2. What necessary formula should be used to find the mean of the sampling
distribution of the sample mean when the population variance is known?
3. What is the test variate used when the population variance is unknown and
when the population variance is known?

What I Can Do
Make use of your acquired skills in this lesson.
Instruction: Read and analyze the following situations. Determine whether the
following statements have a known (or can be computed) or unknown population
variance. Write your answer on the space provided. Also, identify the formula to be
used for the standard error of the mean by writing 𝜎𝑥̅ , when the population variance
is known and 𝑠𝑥̅ when the population variance is unknown.

Situation Answer Standard Error


Formula
1. A population has a mean of 85 and a
standard deviation of 6. A random sample
of 22 measurements is drawn from this
population.
2. The weights of female SHS students are
normally distributed with a mean of 45kg.
Twenty samples consisting of 10 students
are drawn from the population and found
to have a mean of 46 kg and a standard
deviation of 1.24kg.

231
3. A random of 𝑛 = 70 measurements are
obtained from a population with a mean of
156 and a standard deviation of 23.
4. A manufacturer of nylon synthetic fibers
for fishing line, claims that the mean
tensile strength of their product is 30 lb. of
a 20 meter-line. To test the claim of the
manufacturer, a sample of 25, the 20-
meter line was tested and found to have a
mean weight of 32 lbs and a standard
deviation of .78 lb.
5. Given population mean of 200, a sample of
25 has a standard deviation of 3.
Assessment
Direction: Read and analyze each item carefully. Shade the entire circle for each
statement that corresponds to your answer.

A B C D 1. This distribution is used to determine the probability of an


O O O O event by transforming the mean of the sample to an
approximately standard normal variable when the population
variance is given.
A. t-Distribution
B. Z-Distribution
C. Chi-Square Distribution
D. Pearson Correlation Coefficient
A B C D 2. Which of the following notations is used for the mean of the
O O O O population?
A. 𝜇 B. 𝜇𝑥̅ C. 𝜎 D. 𝜎𝑥̅
A B C D 3. Which of the following equations is the formula used to
O O O O estimate the numerical characteristics of the population where
the population variance is unknown, and the sample size is
less than 30?
𝑥−𝜇 𝑥−𝜇
A. 𝑧 = 𝑠 C. 𝑡 = 𝑠
( ⁄ ) ( ⁄ )
√𝑛 √𝑛
𝑥−𝜇 𝑥−𝜇
B. 𝑧 = 𝜎 ) D. 𝑡 = 𝜎
( ⁄ ) ( ⁄ )
√𝑛 √𝑛
A B C D 4. It is the distribution used to estimate the population
O O O O parameters for small sample size and when the population
variance is unknown?
A. z-Distribution
B. t-Distribution
C. Chi-Square Distribution
D. Pearson Correlation Coefficient Distribution

A B C D 5. Which of the following formulas is used to calculate the


O O O O standard deviation of the sampling distribution of the sample
mean (for infinite population) when the population standard
deviation is given?
𝜎 𝑁−𝑛 𝜎 𝑁−𝑛
A. 𝜎𝑥̅ = • √ C. 𝜎𝑥̅ = •√
√𝑛 𝑁−1 √𝑛+1 𝑁−1
𝜎 𝑠
B. 𝜎𝑥̅ = D. 𝜎𝑥̅ =
√𝑛 √𝑛

232
A B C D 6. A population consists 11, 19, 22, 2, 8, 9, and 15. Suppose
O O O O samples of 2 are drawn from this population. This situation
is an example of the sampling distribution of the sample mean
for a normal population with population variance is
__________.
A. unknown C. large
B. known D. small
A B C D 7. Suppose that the rice mill wants to test its product to find out
O O O O whether it meets the average weight of rice of 50 kilograms with
a standard deviation of 0.18 kg. Thirty sacks of rice are weighed
and found to have an average weight of 50.8 kg and a standard
deviation of 0.15kg. The rice mill owner is in a situation of the
sampling distribution of the sample mean with population
variance is __________.
A. unknown C. large
B. known D. small
A B C D 8. The average family income in the Philippines in 2019 was
O O O O P22,250 with a standard deviation of P1,250. In a certain
municipality, a sample of 50 families had an average income of
P25,000. This situation is about the sampling distribution of
the sample mean where the population ________.
A. Variance is known
B. Variance is unknown
C. Standard deviation is small
D. Standard deviation is large
A B C D 9. This is used to determine the probability of an event by
O O O O transforming the mean of the sample to a variable of this
distribution where the population variance is unknown, and
the sample size is less than 30.
A. Z-Distribution
B. t-Distribution
C. Chi-Square Distribution
D. Pearson Correlation Coefficient
A B C D 10. Given 𝜇 = 15, 𝜎 = 5, and 𝑛 = 50. This is an example of the
O O O O sampling distribution of the sample mean for a normal
population when the population variance is _______.
A. unknown C. large
B. known D. small
A B C D 11. Which of the following formulas is used to calculate the
O O O O population standard deviation?
𝑠
A. 𝜎𝑥̅2 = 𝛴(𝜇 − 𝑋)2 /𝑁 C. 𝑠𝑥2 =
√𝑛
𝜎
B. 𝜎𝑥2 = D. 𝜎 2 = 𝛴(𝑋 − µ)2 / 𝑁
√𝑛
A B C D 12. The Department of Health has the following data of the number
O O O O of infected persons of Covid-19 in five high-risk areas in the
country, 51, 89, 129, 215, and 297. Suppose samples of size 2
high-risk areas are drawn from this population, what example
of the sampling distribution of the sample mean for normal
population portrays the situation?
A. variance is known
B. variance is unknown
C. standard deviation is small
D. standard deviation is large

233
A B C D 13. Given 𝜇 = 88, 𝑋̅ = 90 𝑠 = 2, and 𝑛 = 15. This is an example of
O O O O the sampling distribution of the sample mean for a normal
population when the population variance is _______.
A. unknown C. large
B. known D. small
A B C D 14. A researcher used to develop a problem-solving test to
O O O O randomly selected 100 Grade 11 students. From this sample,
the mean and standard deviation is 90 and 4 respectively. The
mean of the population is 88. The researcher is in a situation
of the sampling distribution of the sample mean wherein the
population variance is __________.
A. unknown C. large
B. known D. small
A B C D 15. Given 𝜇 = 54.2, 𝜎 = 7.8, 𝑋̅ =60.2 and 𝑛 = 76. This is an example
O O O O of the sampling distribution of the sample mean for a normal
population when the population variance is _______.
A. unknown C. large
B. known D. smal
Additional Activities
Instruction: Determine whether the following statements has a known or unknown
population variance and write your answer on the second column. Identify also the
formula to be used to estimate the standard error of the mean by writing the symbol
𝜎𝑥̅ , when the population variance is known and the symbol 𝑠𝑥̅ when the population
variance is unknown.

Situation Answer Standard


Error
Formula
1. Hereza Start is an enrichment program for Grade 11
SHS students. The program’s heads would like to know
the mean intelligence (IQ) of its participants. They know
that IQ has standard deviation of 15 points. They take
a random sample of 20 students and administer IQ
tests. These are the scores:98, 82, 100, 102, 78, 79, 95,
89, 90, 99, 105, 103, 93, 99, 101, 89, 102, 76, 91, 100.
2. Given the following values: 𝑋̅ = 110, 𝑠 = 6, 𝑛 = 16, μ =
102, and σ = 7
3. Joel administered a mathematics achievement test to a
random sample of 40 Grade 10 students. In this
sample, he obtained the mean and variance to 89 and
81 respectively. The population mean is 85.
4. A random sample of 100 school heads were
administered a Management Skills Test. The sample
mean and the variance were 81 and 26.98. In the
standardization test, the mean was 79 and the standard
deviation was 7.
5. A finite population composed of eight items whose
values are 9, 3, 12, 7, 8, 11, 15 and 10. Samples of 4
items are drawn at random without replacement.

234
Statistics and
Probability
Illustrating the Central Limit
Theorem
Statistics and Probability
Illustrating the Central Limit Theorem
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module

Writers: Rolando S. Merle


Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

235
Week

7
What I Need to Know
This module intends to let you illustrate the Central Limit Theorem using
varied situations, diagrams, and illustrations. Always remember that the quality of
your education in this time of crisis depends on you now. Have fun, be confident that
you can do it, and understand this module on your own. This module discusses the
context of our current situation right now.
After going through this module, you are expected to:

1. illustrate the Central Limit Theorem;


2. define the sampling distribution of the sample mean using the Central Limit
Theorem; and
3. create a histogram of the sampling distribution of the sample mean.
What I Know
Let us see how far you will go and how much you know about this pre-test. Let’s get
started!

Directions: Read and analyze each item carefully. On a separate paper, write the
letter of the correct answer for each statement that corresponds to your answer. The
perfect score is 15 points. Use the rubric provided after the test.

1. If the mean of the sampling distribution of the mean is 6.5, which of the following
statements best describe the population mean?
A. The population decreases by 6.5.
B. The population is greater than 6.5.
C. The population mean is also equal to 6.5.
D. The population mean and mean of the sampling distribution of the means
cannot be compared.
2. What does the Central Limit Theorem state?
A. the sample size is large
B. all possible samples are selected
C. the standard error of the sampling distribution is small
D. the standard deviation is sufficiently large than the normal
3. The Central Limit Theorem states that the mean of the sampling distribution of
the sample mean is
A. larger than the population mean.
B. exactly equal to the population mean.
C. close to the population means if the sample size is large.
D. equal to the population mean divided by the square root of the sample size.
4. The _________ states: “As the sample size n increases, the shape of the distribution
of the sample means taken without replacement from a population with mean 𝜇
and the standard deviation 𝜎 will approach a normal distribution.”
A. Z distribution
B. Central limit theorem
C. chi distribution
D. Pythagorean theorem
5. If a population has a mean of 12.8, what is the mean of the sampling distribution?
A. less than 12.8
B. larger than 12.8
C. closer to 12.8
D. exactly the same as 12.8

236
6. As the sample size n increases, the standard deviation of the sampling distribution
__________. (2 points)
A. Increases
B. decreases
C. stays the same
D. not enough information is given

7. Which of the following descriptions does NOT illustrate the Central Limit Theorem?
(2 points)
A. The Central Limit Theorem is used to approximate the distribution of the
sample mean over the distribution of the population mean.
B. If the sample size n, where n is sufficiently large is drawn from any
population with mean 𝜇 and a standard deviation 𝜎, then the sampling
distribution of sample means approximates a normal distribution.
C. Whenever the population is not normally distributed, or if we do not know
distribution, the Central Limit Theorem allows us to conclude that the
distribution of sample means will be normal if the sample size is sufficiently
large.
D. Given a random variable X with mean 𝜇 and variance 𝜎 2 , then regardless of
whether the population distribution of X is normally distributed or not, the
shape of the distribution of the sample means taken from the population
approaches a normal distribution.
8. Which of the following statements is NOT true about Central Limit Theorem? (2
points)
A. The population mean and the mean of the sampling distribution of the
mean are equal.
B. The variance of the sampling distribution of the mean and the
population variance is exactly the same.
C. The central limit theorem tells us exactly what the shape of the
distribution of the mean will be when we draw repeated samples from a
given population.
D. If you take repeatedly independent random samples of size n from
any population, then when n is large, the distribution of the sample
mean will approach a normal distribution.
9. Consider the population consisting of values {2, 4, 6}. List all the possible samples
of size 2 which can be drawn with replacement. (2 points)
A. {2,4,6}
B. {(4,2),(4,4),(4,6),(6,2),(6,4),(6,6)
C. {(2,2),(2,4),(2,6) ,(4,2),(4,4),(4,6)}
D. {(2,2),(2,4),(2,6),(4,2),(4,4),(4,6),(6,2),(6,4),(6,6)}
10. If the population has a variance of 4.6, what is the variance of the sampling
distribution of its means if the sampling distribution was derived with sample
size n = 2 and all possible samples are drawn with replacements? (2 points)
A. 21.16
B. 19.22
C. 10.58
D. 2.3

Rubric for Two -Point Items


Number of Points Criteria
2 points With a complete solution and correct answer
1 Point The answer is correct but without a solution
0 Point ( no point) Incorrect answer without a solution

237
Lesson 1 Illustrating the Central Limit
Theorem
Many common statistical procedures require data that approximately follows
a normal distribution. However, normally distributed data isn’t always the case.
There are cases wherein the population is skewed. This is where the Central Limit
Theorem becomes useful.
For you to be able to understand clearly Central Limit Theorem, you need to
remember the prerequisite concepts. Make sure that you still remember your lesson
about parameter and statistic and the sampling distribution of the sample mean
which were previously discussed. A portion of this module will help you recall these
topics.
As a start, this module presents the current concerns of our community. I
believed that you are updated on the news about the impact of COVID-19 in many
of our fellow Filipino people and to our community. Here is a glimpse of how it was
started.
To visualize the content of this module, situations and data set used here are
based on this unfortunate event that devastated the lives of many people. Statistics
is a tool that somehow helps you to appreciate the importance of data analysis.
Some situations and images placed here were based on the personal
experience of the writer of this module to help you understand and illustrate the
Central Limit Theorem.
What’s In
Activity 1
Consider a population of Senior High School consisting of the values 1, 2, 3,
4, 5, and 6.
Compute the following:
1. population mean
2. population variance
3. population standard deviation
4. illustrate the probability histogram of the sampling distribution of the means
Activity 2
Consider all the possible samples of size 2 that can be drawn with replacement
from the population 1, 2, 3, 4, 5, and 6. Create a sampling distribution of the sample
mean.
Compute the following:
1. Mean of the sampling distribution of the sample mean
2. Variance of the sampling distribution of the sample mean
3. Standard deviation of the sampling distribution of the sample mean
4. Illustrate the probability histogram of the sampling distribution of the mean.
What’s New
From Activities 1 and 2, compare the following:
1. population mean and the mean of the sampling distribution of the sample
mean
2. population variance and the variance of the sampling distribution of the
sample mean
3. population standard deviation and the standard deviation of the sampling
distribution of the sample mean
4. probability histogram of the population and the sampling distribution of the
mean with size 2
Show your comparisons

238
What is It
𝛴𝑋
In Activity 1, we get the population mean using the formula 𝜇 =
𝑁
where:
𝜇 – population mean
𝛴𝑋 − sum of all data in terms of X
N - population size
Therefore, we have:

𝛴𝑋
𝜇= Substitute
𝑁
1+2+3+4+5+6
= Add each data divided by the population size
6
21
= Simplify
6
𝜇 = 3.5 (population mean)
To solve for the population variance, subtract each data to the population mean and
square the result.
The table below shows the summary of the difference between each data and the
computed population mean in the second column while the square of each is on the
third column. While the fourth column shows the probabilities of each data.

Data – Population Square of the Data – Population Probability


X
Mean (X- 𝜇) Mean (X- 𝜇)2 P (X)
1
1 1-3.5 = -2.5 (2.5)2 = (2.5)(2.5) = 6.25
6
1
2 2-3.5 = -1.5 (-1.5)2 = (-1.5)(-1.5) = 2.25
6
1
3 3-3.5 = -0.5 (-0.5)2 = (-0.5)(-0.5)= 0.25
6
1
4 4-3.5 = 0.5 (0.5)2 = (0.5)(0.5) = 0.25
6
1
5 5-3.5 = 1.5 (1.5)2 = (1.5)(1.5) = 2.25
6
1
6 6-3.5 = 2.5 (2.5)2 = (2.5)(2.5) = 6.25
6
N=6 𝛴 (x -𝜇)2 = 17.5 𝛴𝑃(𝑋) = 1

2. Solve for the population variance using the formula


𝛴 (𝑋 −𝜇)2
σ2 =
𝑁
17.5
=
6
σ2 ≈ 2.92
∑(𝑋−𝜇)2
3. To solve for the population standard deviation, use the formula, 𝜎 = √
𝑁
17.5
thus, we have 𝜎 = √
6
≈ √2.92
σ ≈ 1.71

239
4. For the probability histogram of the population wherein X = x and n = 1, we have

0 1 2 3 4 5 6
For Activity 2, the table shows and summarizes the information needed to compute
the mean, variance, and standard deviation of the sampling distribution of the
sample mean.

Samp Samp
Obser Samp le ̅
(𝑿 ̅
(𝑿 Obser Sampl le ̅
(𝑿 ̅
(𝑿
vation le Mean − 𝝁𝑿̅ ) − 𝝁𝑿̅ )𝟐 vation e Mean − 𝝁𝑿̅ ) − 𝝁𝑿̅ )𝟐
( 𝑋̅) ( 𝑋̅)
1 (1,1) 1 -2.5 6.25 19 (4,1)
-1 2.5 1
2 (1,2) 1.5 -2 4 20 (4,2)
-0.5 3
0.25
3 (1,3) 2 -1.5 2.25 21 (4,3)
0 3.5 0
4 (1,4) 2.5 -1 1 22 (4,4)
0.5 4
0.25
5 (1,5) 3 -0.5 0.25 23 (4,5)
1 4.5 1
6 (1,6) 3.5 0 0 24 (4,6)
1.5 5
2.25
7 (2,1) 1.5 -2 4 25 (5,1)
-0.5 3
0.25
8 (2,2) 2 -1.5 2.25 26 (5,2)
0 3.5 0
9 (2,3) 2.5 -1 1 27 (5,3)
0.5 4
0.25
10 (2,4) 3 -0.5 0.25 28 (5,4)
1 4.5 1
11 (2,5) 3.5 0 0 29 (5,5)
1.5 5
2.25
12 (2,6) 4 0.5 0.25 30 (5,6)
2 5.5 4
13 (3,1) 2 -1.5 2.25 31 (6,1)
0 3.5 0
14 (3,2) 2.5 -1 1 32 (6,2)
0.5 4
0.25
15 (3,3) 3 -0.5 0.25 33 (6,3)
1 4.5 1
16 (3,4) 3.5 0 0 34 (6,4)
1.5 5
2.25
17 (3,5) 4 0.5 0.25 35 (6,5)
2 5.5 4
18 (3,6) 4.5 1 1 36 (6,6)
2.5 6
6.25
∑(𝑿̅ − 𝝁𝑿̅ )𝟐 =
∑ 𝑋̅ = 126
52.5
Let us now solve for the mean, variance, and standard deviation of the sampling
distribution with sample size 2 (with replacement).

1. Mean of the sampling distribution of the sample mean


𝛴𝑋̅
𝜇𝑋̅ = where n is the number of all possible samples of size 2
𝑛

126
=
36
= 3.5
2. Variance of the sampling distribution of the sample mean
̅ −𝝁𝑿
𝛴( 𝑿 𝟐
̅)
𝜎 2 𝑋̅ = where n is the number of all possible samples of size 2
𝑛
52.5
=
36
≈ 1.46

240
3. Standard Deviation of the sampling distribution of the sample mean
̅ −𝝁𝑿
𝛴( 𝑿 ̅)
𝟐
𝜎𝑋̅ = √ where n is the number of all possible samples of size 2
𝑛
52.6
=√
36
≈ 1.21
5. To illustrate the probability histogram of the sampling distribution of the
sample, the following table is needed:

Sampling Distribution of the sample mean with replacement

𝑋̅ f ̅)
Probability P(𝑿
1
1 1 36
2
1.5 2 36
3
2 3 36
4
2.5 4 36
5
3 5 36
6
3.5 6 36
5
4 5 36
4
4.5 4 36
3
5 3 36
2
5.5 2 36
1
6 1 36

In What’s New part, we compared the values solved through the data in the
population and the sampling distribution.

241
1. The population mean 𝜇 = 3.5 while the mean of the sampling distribution
𝜇𝑋̅ = 3.5. Notice that the population mean is equal to the mean of the sampling
distribution. That is 𝜇 = 𝜇𝑋̅ .
2. The population variance σ2 ≈ 2.92 while the variance of the sampling distribution
𝜎𝑋2̅ ≈ 1.46. Notice that if we divide the population variance by the sample size of
2, we have 1.46 which equal to the variance of the sampling distribution of the
2.92 𝜎2
sample mean. That is, = 1.46 therefore 𝜎𝑋2̅ =
2 𝑛

3. The population standard deviation σ ≈ 1.71 while the standard deviation of the
sampling distribution 𝜎𝑋̅ ≈ 1.21. Notice that if we divide the population standard
deviation by the square root of the sample size of 2, we have 1.21 which is equal
1.71
to the standard deviation of the sampling distribution. That is, =
√2
𝜎
1.21. Therefore, 𝜎𝑋̅ = .
√𝑛
4. The probability histogram of the population wherein n=1 looks like this

P(x)

X
0 1 2 3 4 5 6

While the probability histogram of the sampling distribution of the sample mean with
size n = 2 looks like this.

The illustration above is more likely to have a curve of a normal distribution while
the probability histogram of the population is not normally distributed.
Based on our activity, it was evident in the Central Limit Theorem that the
sampling distribution will be approximately normally distributed for large sample
sizes regardless of the distribution from which we are sampling. Also, it is important
to recall that the mean of the sampling distribution of the sample mean is equal to
the population mean: 𝜇𝑋̅ = 𝜇 and the standard deviation of the sampling distribution
𝜎
of the sample mean (sampling with replacement) is equal to: 𝜎𝑋̅ = .
√𝑛

242
As discussed previously, if the population is normally distributed, the
sampling distribution of the sample mean is also normally distributed. But what if
the population is not normal?
That is where the Central Limit Theorem addressed this question. The
distribution of the sample mean tends toward the normal distribution as the sample
size increases, regardless of the distribution from which we are sampling. As a simple
guideline, the sample mean can be considered approximately normally distributed if
the sample size is at least 30 (n ≥ 30 ).
If the sample size is sufficiently large, the Central Limit Theorem can be used
to answer the sample mean in the same manner that a normal distribution can be
used to answer questions about individual samples. This also means that even if the
population is not normally distributed, or if we don’t know of its distribution, the
Central Limit Theorem allows us to conclude that the distribution of the sample mean
will be normal if the sample size is sufficiently large. It is generally accepted that a
sample size of at least 30 is large enough to conclude that the Central Limit Theorem
will ensure a normal distribution in the sampling process regardless of the
distribution of the original population. Further, we can continue to use the z
conversion formula in our calculations. This time we will use the formula,
𝑋̅−𝜇
z= 𝜎
√𝑛

Why it is important to know the Central Limit theorem?


Many statistics have distributions that are approximately normal for large
sample sizes, even when we are sampling from a distribution that is not normal. This
means that we can develop statistical inference procedures that are based on the
normal distribution, even if we are sampling from a population that is not normal,
provided we have a large sample size.
Let us illustrate this through this example:
Suppose that the average age of the people living in a Barangay is 34 with a
standard deviation of 4. If 100 residents of a certain Barangay decided to take summer
outing after COVID-19 pandemic and Enhanced Community Quarantine has been lifted
for bonding and relaxation, what is the probability that the average age of these
residents is less than 35?
Solution:
It is not given that the population is normally distributed but since n > 30,
then you can assume that the sampling distribution of the mean ages of 100
barangay residents is normal according to the Central Limit Theorem.
The Central Limit Theorem describes the normality of the distribution of the
sample mean taken from a population that is not normally distributed.
Step 1: Write the given data.
𝜇 = 34, 𝜎 = 4, 𝑋̅ = 35, 𝑛 = 100
Step 2: Convert the raw score to the standard score using the formula.
𝑋̅ − 𝜇
𝑍= 𝜎
√𝑛

243
35−34 10
Z= 4 = = 2.5
4
√100
Step 3: Use the Z table to find P (Z < 2.5).

𝑃( 𝑍 < 2.5) = 0.9938


Therefore, the probability that the random sample of 100 persons has an
average of fewer than 35 years is 0.9938 or 99.38%
Consider the illustrations below for a better understanding of the Central
Limit Theorem. Still refer to our previous example:

Suppose that the average age of the people living in a Barangay is 34 with a
standard deviation of 4. One hundred (100) residents of a certain Barangay decided
to take summer outing after COVID-19 pandemic and Enhanced Community
Quarantine has been lifted for bonding and relaxation.
If we make a relative histogram of samples with various sample sizes, it would
look like the histograms below.
POPULATION (individuals) Distribution of ages of
individuals living in a Barangay. This is an
example of uniform distribution whenever the
sample size n =1. Each individual has an equal
probability of selecting as a sample. The
histogram will look like on the left illustration not
so interesting!

Sampling Distribution of the SAMPLE MEAN


(n=40). Different samples of sizes 40 were
selected then calculate the mean of each sample.
The histogram for those mean will look like this
illustration which becomes a little bit interesting!
The curve starts to show.

Sampling Distribution of the SAMPLE MEAN


(n=80). Different samples of sizes 80 were
selected, then calculate the mean of each sample.
The histogram for those mean will look like this
illustration that is almost
As we continuously close
increase to samples
the a normaland
curve.

calculate the sample mean, the histogram of the illustration comes to be


normally distributed. And that is where the Central Limit Theorem is used to make
better inferences.

What’s More
A. YOU COMPLETE ME. Choose from the pool of words/mathematical formulas
inside the box to complete the following paragraph on the important properties of
the Sampling Distribution of the Sample Mean. Then write the corresponding
words below each term to complete the sentence.
Recall on the Properties of the Sampling Distribution of Sample Mean

244
If all possible sample size n are drawn from a population of size N with mean 𝜇
and (1)_________𝜎 2 , then the sampling distribution of the sample mean has the
following properties: First, the (2)__________of the sampling distribution of the
(3)__________ is (4)__________to the (5)_____________ 𝜇. That is, 𝜇𝑋̅ = 𝜇. Second, the
𝜎2
variance of the sampling distribution of the sample mean 𝜎𝑋̅ 2 is given by: 𝜎𝑋2̅ = for
𝑛
𝜎2 𝑁−𝑛
(6)______________ and 𝜎𝑋2̅ = . for (7) ___________________. The (8)
𝑛 𝑁−1
σ
_________________of the sampling distribution of the sample mean is given by 𝜎𝑋̅ =
√𝑛
𝑁−𝑛 𝑁−𝑛
·√ for finite population (sampling without replacement) where √ is the finite
𝑁−1 𝑁−1
population correction factor and (9) _____________________ for infinite population
(sampling with replacement).

Mean Infinite Population Population Mean


US WITH LORD

Equal Standard Deviation Variance


THE AND LET
𝜎2
Sample Mean Finite Population 𝜎𝑋̅ =
√𝑛
SERVE JOY GLADNESS
DECODE:
Answer:
(1) (2) (3) (4) (5)

(6) (7) (8) (9)


B. Read and answer the following questions independently. Show your solution if
necessary.
1. If the population of ABM students of Malaya Integrated High School has a mean
of 12.45, what is the mean of the sampling distribution of its mean?
2. If the mean of the sampling distribution is 24.29, what is the mean of the
population?
3. If a population of HUMSS students of Makabayan National High School has a
variance of 6.4, what is the variance of the sampling distribution of the sample
mean if the sample size is 4 and all the possible samples are drawn with
replacements?
4. If the population of STEM students of Siyensiya Stand Alone Senior High School
has a standard deviation of 9.4, what is the standard deviation of the sampling
distribution of its means? The sampling distribution was derived with sample
size n=3, and all the possible samples were drawn with replacements.
5. If the population standard deviation is 5.6, what is the population variance?

C. Samples of size n were randomly selected from populations with means (with
replacements) and variances given below. In each case, find the mean and
standard deviation of the sampling distribution of the means.
1. n = 4 𝜇 = 4.7 σ2 = 3
2. n = 7 𝜇=4 σ2 = 2
3. n = 16 𝜇=7 σ2 = 6
4. n = 22 𝜇 = 22 σ2 = 4.5
5. n = 44 𝜇 = 40 σ2 = 9.6

245
What I Have Learned
Let us see what you have learned in this module by completing the following
statements.
The Central Limit Theorem implies the following important ideas in statistics:
1. When the sample size tends to infinity (a very large sample) the
distribution of the sample mean X will be ______________ distributed.
2. If the sample size tends to infinity, the sample mean 𝜇𝑋̅ _____________ the
population mean 𝜇.

3. When the original variable is normally distributed, the distribution of the


sample means will be __________________ distributed, for any sample size n.
4. When the distribution of the original variable might not be normal, a
sample size of ______ or more is needed to use a normal distribution to
approximate the distribution of the sample means. The larger the sample,
the better the approximation will be.
What I Can Do
A. Consider a population consisting of values (1,3,5).
1. List all the possible samples of size 2 that can be drawn from the population with
replacement.

Observation Sample ̅
𝑿 ̅ − 𝝁𝑿̅ )
(𝑿 ̅ − 𝝁𝑿̅ )𝟐
(𝑿

2. Compute for the mean of the sampling distribution of the sample means.
3. Compute for the variance of the sampling distribution of the sample means.

𝑋̅ f Probability
̅)
P(𝑿

4. Construct the probability histogram of means with replacements when n = 2.

246
Assessment
Answer the following questions by writing the letter of the correct answer from the
choices below.
1. Which of the following does NOT illustrate the Central Limit Theorem?
A. The Central Limit Theorem is used to approximate the distribution of the
sample means over the population means.
B. If the sample size n, where n is sufficiently large is drawn from any population
with mean 𝜇 and a standard deviation 𝜎, then the sampling distribution of
sample means approximates the normal distribution.
C. Whenever the population is not normally distributed, or if we do not know of
its distribution, the Central Limit Theorem allows us to conclude that the
distribution of sample means will be normal if the sample size is sufficiently
large.
D. Given a random variable X with mean 𝜇 and variance 𝜎 2 , then regardless of
whether the population distribution of X is normally distributed or not, the
shape of the distribution of the sample means taken from the population
approaches a normal distribution.
2. The Central Limit Theorem states that:
A. the sample size is large.
B. all possible samples are selected.
C. the standard error of the sampling distribution is small.
D. the standard deviation is sufficiently large than the normal.
3. The Central Limit Theorem states that the mean of the sampling distribution of
the sample mean is
A. larger than the population mean.
B. exactly equal to the population mean.
C. equal to the population mean divided by the square root of the sample size.
D. close to the population means if the sample size is large.
4. Which of the following descriptions about Central Limit Theorem is NOT essential?
A. The larger the sample, the better approximation will be.
B. The smaller the sample, the bigger the approximation will be.
C. When the original variable is normally distributed, the distribution of the
sample means will be normally distributed for any sample size n.
D. When the distribution of the original variable might not be normal, a sample
size of 30 or more is needed to use a normal distribution to approximate the
distribution of the sample means.
5. Suppose the average age of family members is 34 with a standard deviation of 4.
If 100 members of the community decided to have a summer outing bonding and
relaxation, find the probability that the average of these members is less than 35?
A. 91.32% C. 95%
B. 94.45% D. 99.38%
6. If a population has a mean of 12.8, what is the mean of the sampling distribution?
A. less than 12.8 C. closer to 12.8
B. larger than 12.8 D. exactly the same as 12.8
7. If the mean of the sampling distribution of the means is 6.5, which of the following
statements best describe the population mean?
A. The population decreases by 6.5.
B. The population is greater than 6.5.
C. The population mean is also equal to 6.5.
D. The population mean and mean of the sampling distribution of the means
cannot be compared.

247
8. Which of the following statements is NOT true about Central Limit Theorem?
A. The population mean and the mean of the sampling distribution of the means
are equal.
B. If you take repeatedly independent random samples of size n from any
population, then when n is large, the distribution of the sample means will
approach a normal distribution.
C. The central limit theorem tells us exactly what the shape of the distribution of
the means will be when we draw repeated samples from a given population.
D. The mean of the sampling distributions of the means, the standard deviation
of the sampling distribution of the means, and variance is the same as the
population means, variance of the population, and standard deviation.
9. Consider the population consisting of values {2, 4, 6}. List all the possible samples
of size 2 which can be drawn with replacement. (2 Points)
A. {2,4,6}
B. {(4,2),(4,4),(4,6),(6,2),(6,4),(6,6)
C. {(2,2),(2,4),(2,6) ,(4,2),(4,4),(4,6)}
D. {(2,2),(2,4),(2,6),(4,2),(4,4),(4,6),(6,2),(6,4),(6,6)}
10. If the population has a variance of 4.6, what is the variance of the sampling
distribution of its means if the sampling distribution was derived with sample
size n = 2 and all possible samples are drawn with replacements?
A. 21.16 C. 10.58
B. 19.22 D. 2.3
11. If a population has a mean of 16.7, what is the mean of the sampling distribution?
A. less than 16.7 C. closer to 16.7
B. larger than 16.7 D. exactly the same as 16.7
12. If the mean of the sampling distribution of the means is 29.2, which of the
following statements best describes the population mean?
A. The population decreases by 29.2
B. The population is greater than 29.2
C. The population mean is also equal to 29.2
D. The population mean and mean of the sampling distribution of the means
cannot be compared.
13. As the sample size n increases, the mean of the sampling distribution
_____________
A. increases C. stays the same
B. decreases D. not enough information is given
14. Consider the population consisting of values (1, 3, 5). List all the possible samples
of size 2 which can be drawn without replacement.
A. {1,3,5}
B. {(3,1), (5,1), (5,3)}
C. {(1,2), (1,3), (1,5), (3,1), (3,3), (3,5)}
D. {(1,1), (1,3), (1,5), (3,1), (3,3), 3,5), (5,1), (5,3), (5,5)}
15. If the variance of a certain probability distribution is 16, find its standard
deviation.
A. 1 C. 3
B. 2 D. 4

248
Additional Activities
Complete the following table and solve for the unknowns.
A population of a certain ethnic group is consisting of three numbers (3, 5, 7). If
samples of size 2 will be taken with replacements, complete the following tables, and
answer the questions that follow.

Sample
Observation (
Sample Mean 𝑋̅ − 𝜇
s (𝑋̅ − 𝜇) 2
( 𝑋̅ )
1
2
3
4
5
6
7
8
9
Total

𝑋̅ f Probability
̅)
P(𝑿

1. What is the value of the following?


a. population mean
b. population variance
c. population standard deviation
d. mean of the sampling distribution of the sample mean
e. variance of the sampling distribution of the sample mean
f. standard deviation of the sampling distribution of the sample mean
2. Illustrate the probability histogram of the sampling distribution of the
means.

249
Statistics and
Probability
Solving Problems Involving
Sampling Distribution of the
Sample Mean
Statistics and Probability
Solving Problems Involving Sampling Distribution of the Sample Mean
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module

Writer: Maria Cristina O. Rosas


Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

250
Week

7
What I Need to Know
This module was designed and written with you in mind on how you can solve
problems involving sampling distribution of the sample mean. This will also help you
analyze real – life situated problems statistically in terms of relevant questions for
you to better understand them. Your adept at analysis will help you appreciate the
richness, and beauty of Statistics which will motivate you to apply to similar events
and create statistical measures of your own. This module was written and intended
for senior high students like you who are taking Statistics and Probability subject.

Your patience in solving offered problems here in the module will help you
improve your computational skills as it tackles relevant culture – based situated
problems. Your ability to interpret, reason – out and make judgement or even
decision out of statistical measures will also be practiced here. The scope of this
module permits it to be used in many different learning situations. The language
used recognizes the diverse vocabulary level of students. The lessons are arranged
to follow the standard sequence of the course but the order in which you read them
can be changed to correspond with the textbook you are now using.

After going through this module, you are expected to:


1. identify the steps in solving problems on sampling distribution of the
sample mean; and
2. solve problems involving sampling distribution of the sample mean.

What I Know
Let us see how far you will go and how much you know about the topic. Let’s get
started.
Directions: Choose the letter of the best answer. Write the chosen letter on a separate
sheet of paper.
For numbers 1-4, refer to the problem below.
A mayor of a certain town in Quezon province procures a number of face masks
from different private distributors. He found out that the mean pore size of all
face masks is 40 𝜇𝑚 and standard deviation of 5 𝜇𝑚. Assume that the
distribution is normally distributed, and 64 samples face masks were taken.
1. What is the mean of the sampling distribution?
A. 64
B. 40
C. 8
D. 5
2. Which of the following is the standard deviation of mean sampling
distribution?
A. 12.8
B. 1.6
C. 0.625
D. 0.078
3. The sampling distribution of 64 sample face masks falls under what kind of
distribution?
A. exponential
B. normal
C. uniform
D. straight

251
4. Which of the following shows the computed standard error of the mean?
A. 95
B. 8
C. 0.625
D. 0.078
5. The population of Senior High School learners in Mapayapa Integrated
School has a mean of 54.5 and a standard deviation of 8.6. If a sample of
size 12 is taken at random from this population, which of the following
statements is correct?
A. The mean of the sampling distribution of the sample mean is equal
to 54.5.
B. The mean of the sampling distribution of the sample mean is less
than 54.5
C. The standard deviation of the sampling distribution of the sampling
mean is 8.6.
D. The standard deviation of the sampling distribution of the sampling
mean is 54.5.
6. Taken from the same population, which sample size will give a smaller
standard error of the mean?
A. 35
B. 18
C. 12
D. 9
7. Standard error is always non-negative.
A. true
B. false
C. maybe
D. not sure
8. Which is NOT true about the standard error?
A. The standard error is also the sampling error.
B. The standard error is the approximate standard deviation of a
statistical sample population.
C. The more data points involved in the calculations of the mean, the
smaller the standard error tends to be.
D. The standard error can include the variation between the calculated
mean of the population and one which is considered known, or
accepted as accurate.

For numbers 9 – 12, refer to the problem below.

A certain condominium in Manila has 35 floors. Each floor has 2 elevators.


In every elevator there is a sign inside stating a limit of 16 persons and a weight
limit of 2500 pounds. The average weight of the condominium residents is 150
pounds with a standard deviation of 27 pounds. Assume that the weight
distribution is approximately normal and 16 persons from the condominium are
to be taken as samples.

9. What is the expected value of the weight distribution of the sample mean?
A. 27
B. 35
C. 150
D. 2500

252
10. What is the standard deviation of the weight distribution of the sample
mean?
A. 16.7
B. 14.5
C. 6.75
D. 0.06
11. Which of the following average weight of 16 sample residents will reach
the limit of an elevator’s carrying capacity of 2500 pounds?
A. 156.25
B. 16.67
C. 200.05
D. 180.12
12. What is the probability that a random sample of 16 people will exceed the
weight limit capacity of an elevator?
A. 21%
B. 17%
C. 13%
D. 14%
13. If a sample came from a given population, then the mean of the sampling
distribution of the sample must be equal to _____
A. mean of population
B. sample of population
C. variance of population
D. standard deviation of population
14. Which of the following cases is the sample size considered large?
A. 𝑛 ≥ 30
B. 𝑛 ≥ 50
C. 𝑛 ≤ 30
D. 𝑛 ≤ 50
15. A population has a mean of 50 and a standard deviation of 6. What are
the mean and the standard deviation of the sampling distribution of the
mean for n = 16?
A. 𝑋̅ = 50, s= 25
B. 𝑋̅ = 50, s =1.5
C. 𝑋̅ = 25, s = 6
D. 𝑋̅ = 25, s = 1.
Lesson 1 Solving Problems Involving
Sampling Distribution of the Sample
Mean
The concept of the standard error of the mean is very significant, since it
measures the degree of accuracy of the sample mean as an estimate of population
mean. There is a good estimate if the standard error is small or close to zero and a
poor estimate if it is large. The value of the standard error depends mainly on the
sample size n. Specifically, as n increases, 𝜎𝑥̅ decreases. The most remarkable
theorem in statistical inference is the Central Limit Theorem which assures that no
matter what the shape of the population distribution is, the sampling distribution of
the mean is closely normally distributed whenever n is large. This fact holds true for
sample sizes greater than or equal to 30.

253
What’s In
In the previous topics, you have learned how to use the normal
distribution to calculate the probability of a random variable X at a given value. In
this lesson, you will use the sampling distribution of the mean to get the probability
of the given sample mean taken from the population.
Sampling distribution of sample mean is a frequency distribution of the
mean computed from all possible random samples of a specific size taken from a
population. The probability distribution of the sample mean is also called the
sampling distribution of the sample mean.

1. What do we mean by sampling distribution of the sample mean?


__________________________________________________________________________________
____________________________________________________
2. When do we use the following formulas?
𝑥−𝜇 𝑥̅ − 𝜇
a. z = , b. z = 𝜎
𝜎
√𝑛
__________________________________________________________________________________
____________________________________________________
3. How will you solve problems involving sampling distribution of the sample
mean with “less than”, “greater than” and “between” cases? Is the idea applicable
to a single observation (x)?
__________________________________________________________________________________
____________________________________________________
What’s New
Probable Time
The mean time it takes a group of senior high students to complete a
certain examination is 50.6 minutes. The standard deviation is 6 minutes.
Assume that the variable is normally distributed.

1. What is the probability that a randomly selected senior high school


student will complete the examination in less than 48 minutes?
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
2. If 49 randomly selected senior high school students take the
examination, what is the probability that the mean time it takes the group to
complete the test will be less than 48 minutes?
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
3. If 49 randomly selected senior high school students take the
examination, what is the probability that the mean time it takes the group to
complete the test will be more than 51 minutes?
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
4. If 49 randomly selected senior high students take the examination, what
is the probability that the mean time it takes the group to complete the test
is between 47.8 and 53 minutes?

254
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
5. If you are one of the selected senior high school students to take the test,
how long would it take you to complete the examination?
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
6. What other factors can you give that contribute to the length of
examination?
___________________________________________________________________________
___________________________________________________________________________
___________________________________________________________________________
Rubric for numbers 5 and 6 questions for 3-point score.

Equivalent Points Description


Coherent content with correct use of grammar. It includes
3 points
very timely and interesting ideas.
2 points Inconsistent content with correct use of grammar.
1 point Limited ideas. Grammatical error.
0 point No answer at all.
What is It
A sampling distribution of the sample mean is a frequency distribution of the
sample mean computed from all possible random samples of a specific size n taken
from a population. The probability distribution of the sample mean is also called the
sampling distribution of the sample mean. The standard distribution of the sampling
distribution of the sample mean is also known as the standard error of the mean.
Here are the steps to solve problems involving sampling distribution of the
sample mean. (Refer to the problem in What’s New)
1. What is the probability that a randomly selected senior high school
student will complete the examination in less than 48 minutes?
To answer the problem, follow the steps:
Step 1. Identify the given information:
𝜇 = 50.6
𝜎=6
X = 48
Step 2. Identify what is asked for:
P(X ˂ 48)
Step 3. Identify the formula to be used:
The problem is dealing with an individual data obtained from the
𝑋−𝜇
population so the formula to be used is z = to convert 48 to
𝜎
standard score.
Step 4. Compute for the Probability:
𝑋−𝜇
z=
𝜎
48−50.6
z =
6
z = - 0.43
Find P(X< 48) by getting the area under the normal curve.
P(X˂ 48) = P(z ˂ -0.43)
= 0.3336

255
Therefore, the probability that a randomly selected college student will
complete the examination in less than 48 minutes is 0.3336 or 33.36%
2. If 49 randomly selected senior high school students take the examination,
what is the probability that the mean time it takes the group to complete the
test will be less than 48 minutes?
Step 1: Identify the given information:
𝜇= 50.6
𝜎=6
𝑋̅= 48
𝑛= 49
Step 2: Identify what is asked.
P(𝑋̅ ˂ 48)
Step 3: Identify the formula to be used.
The problem is dealing with data about the sample mean or n
𝑋̅ −𝜇
observations, so the formula to be used to standardize 48 is z = 𝜎
√𝑛
Step 4: Compute for the Probability:
𝑋̅−𝜇
z= 𝜎
√𝑛
48−50.6
= 6 = - 3.03
√49
Find P(𝑋̅ ˂ 48) by getting the area under the normal curve.
P(𝑋̅ ˂ 48) = P(z ˂ -3.03)
= 0.0012

The probability that 49 randomly selected senior high school students will
complete the test in less than 48 minutes is 0.0012 or 0.12%
3. If 49 randomly selected senior high school students take the examination,
what is the probability that the mean time it takes the group to complete the
test will be more than 51 minutes?
Step 1: Identify the given information:
𝜇= 50.6
𝜎=6
𝑋̅= 51
𝑛= 49

256
Step 2: Identify what is asked.
P(𝑋̅ > 51)

Step 3: Identify the formula to be used.


The problem is dealing with data about the sample mean or n
𝑋̅ −𝜇
observations, so the formula to be used to standardize 51 is z = 𝜎
√𝑛
Step 4: Compute for the Probability:
𝑋̅−𝜇
z= 𝜎
√𝑛
51−50.6
= 6
√49
= 0.47
Find P(𝑋̅ > 51) by getting the area under the normal curve.

P(𝑋̅ > 51) = P(z > 0.47)


= 1 −P(𝑧 < 0.47)
= 1 – 0.6808
= 0.3192

The probability that 49 randomly selected senior high students will complete
the test in more than 51 minutes is 0.3192 or 31.92%
4. If 49 randomly selected senior high school students take the examination,
what is the probability that the mean time it takes the group to complete the
test is between 47.8 and 53 minutes?
Step 1: Identify the given information:
𝜇= 50.6
𝜎=6
𝑋̅= 47.8 and 53, 𝑛 = 49
Step 2: Identify what is asked.
P(47.8 < 𝑋̅ < 53)
Step 3: Identify the formula to be used.
The problem is dealing with data about the sample mean or n
observations, so the formula to be used to standardize 47.8 and 53
𝑋̅ −𝜇
is z = 𝜎
√𝑛
Step 4: Compute for the Probability:

(4.a) P( 𝑋̅ <47.8) (4.b) P(𝑋̅ < 53)


𝑋̅−𝜇 𝑋̅ −𝜇
𝑧1 = 𝜎 𝑧2 = 𝜎
√𝑛 √𝑛
47.8−50.6 53−50.6
= 6 = 6
√49 √49
= - 3.27 = 2.8
Find P(𝑋̅ > 47.8) by getting the Find P(𝑥̅ < 53) by getting the

257
area under the normal curve. area under the normal curve
P(𝑥̅ > 47.8) = P(z > - 3.27) = 0.0005 P(𝑥̅ < 53) = P(z < 2.8) = 0.9974
To find the probability that 49 randomly selected senior high school
students will complete the test between 47.8 and 53 minutes, subtract the
smaller area from the bigger area under the normal curve.

That is 0.9974– 0.0005 =0.9969 or 99.69%

What’s More
Below are problems you need to ponder. Read each statement carefully and answer
the questions. Use separate sheet/s of paper for your answers.

Independent Activity 1:

My Favorite Breed!
A veterinarian studies a large population of one breed of dogs which is shih tzu.
The shih tzu’s mean length is 50 inches with standard deviation 26 inches. If
a sample of 169 shih tzu are randomly selected, what is the chance that the
sample mean is:

a. less than 49 inches?


________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
b. at least 47 inches?
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
c. between 46 and 48 inches?
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
d. Describe your favorite pet. State at least 3 characteristics your pet
possesses.
_______________________________________________________________________
_______________________________________________________________________
_______________________________________________________________________

258
Rubric for question (d) for 3-point score.
Equivalent Points Description
Coherent content with correct use of grammar. It
3 points
includes very timely and interesting ideas.
2 points Inconsistent content with correct use of grammar.
1 point Limited ideas. Grammatical error.
0 point No answer at all.
Independent Assessment 1:
Solve each of the following problems. All answers should be in two-decimal places.

IQ Level
The IQs of Grade 11 students in MAKATAO NATIONAL HIGH SCHOOL were
measured and found to be normally distributed with a mean of 98 and a standard
deviation of 8.

a. If a student from the school is chosen at random, what is the probability


that his score is higher than 110?
________________________________________________________________________
________________________________________________________________________
_______________________________________________________________________
b. What is the probability that a random sample of 4 students will have an
average of above 110?
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
c. The table below is an excerpt from the report of Patricia Anne R. San
Buenaventura, Chief Statistical Specialist, Statistical Standard Division,
Standard Office, Philippine Statistics Authority.
Based on the table below and the corresponding standard deviation, make
your own problem involving sampling distribution of the sample mean,
taking the given Statistics as 𝜇. Show the complete solution and the answer.

259
*standard deviation of passing rate in licensure exam is 4.6
*standard deviation of faculty qualification with MS/MA degree/s is 5.1
*standard deviation of faculty qualification with Ph.D. degree/s is 4.8.
Rubric for rating the output:
Equivalent Points Description
The problem is constructed perfectly with complete
5 points
solution and correct answer.
The problem is constructed perfectly with complete
4 points
solution but wrong answer.
The problem is constructed perfectly with incomplete
3 points
solution and wrong answer.
The problem is not perfectly done, incomplete solution
2 points
and wrong answer.
1 point The problem is incorrect.

Independent Activity 2

Is It Worth It?
The mean annual salary of all the frontlines (nurses, medical technologists,
radiologic technologists, phlebotomists) in the Philippines is Php 42,500.
Assume that this is normally distributed with standard deviation Php 5,600.
A random sample of 25 health workers is drawn from this population, find
the probability that the mean salary of the sample is:

a. less than Php 40,500?


_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
b. greater than Php 41,000?
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
c. between Php 40,400 and Php 45,000?
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
d. As a senior high school student, Maria and her classmates made a move on
how they can show their love and concerns to the front liners. They create
a website called “ENCOURAGEMENT WALL” where netizens can post their
words for the front liners.
Find words that best describe our front liners in the grid below running in
one of TEN possible directions, horizontally, vertically, diagonally or
backward. Write the answers on the blank below the grid.

260
D E T T I M M O C P F
E N G L S H E R O R O
T I G H A D G E N O R
E B R A V E E W F A F
R R F T L K T A I C L
M A V E U L T N D T E
I M J D J M A S E I X
N A O S I O W N N V I
E N L U N F E D T E B
D S S S E L F L E S L
R E S P O N S I B L E
Independent Assessment 2
In this part, you need to solve the problem then shade the region under the
normal curve that corresponds to the required area.

Harvest More!
Jose who lives in Zambales harvested mangoes in her plantation for
export. The average weight of the harvested fruits is 2 kilos with a
standard deviation of 0.4 kilo. Assume that the variable is normally
distributed.

a. What is the probability that a randomly selected mango will weigh less
than 1.5 kilos?
_____________________________________________________________________
_____________________________________________________________________

b. A potential importer intends to take a sample of 4 mangoes and will not


place an order if the sample mean is less than 1.5 kilos. What is the
probability that the importer will not place an order?
_____________________________________________________________________
_____________________________________________________________________

261
Independent Assessment 3
Given the situation or problem, determine what is being asked.
a. Suppose a random sample of size n = 25 taken from a very large
population with mean 𝜇 = 90 and a standard deviation 𝜎 = 6.4.
Describe the sampling distribution of the mean for n = 25, in terms
of its expected value and the standard deviation.
_____________________________________________________________________
_____________________________________________________________________

b. A random sample of size n = 16 was drawn from a population. If it is


known that the population has mean 𝜇 = 32 and a standard
deviation 𝜎 = 5, determine the standard error of the sampling distribution
of the mean.
_____________________________________________________________________
_____________________________________________________________________

c. c. Find the standard error of the sampling distribution of the mean when a
random sample of size 20 is chosen from a population of size 167 having a
mean of 75 and a standard deviation of 3.9.
_____________________________________________________________________
_____________________________________________________________________
What I Have Learned
The formula used when dealing with an individual or single data obtained from
the population is __________________ while the formula to be used when dealing
with data about the sample mean or n observations is ________________.
The mean of the sampling distribution of the sample mean is equal to the
___________________. 𝜇𝑥̅ = 𝜇
𝜎
The ____________________ of the sampling distribution of 𝑥̅ is equal to 𝜎𝑥̅ = .
√𝑛
Given a normal or sampling distribution, the probability of a random variable (X)
or sample mean (𝑋̅) can be calculated depending on the following cases:
Case 1. For “less than” problem,
The probability is equal to the area that corresponds to z value.
Case 2. For “greater than” problem,
The probability is equal to 1 minus the area that corresponds to z value.
Case 3. Of “between” problem,
The probability is equal to the difference of the larger and smaller area.
Remember that we do not subtract the z value to get the probability. We
only subtract the areas under the normal curve
What I Can Do
Now let’s apply the knowledge you have gained in this module by answering the
problem below. Use separate sheet of paper for your answer.

Online Testing
Because of the immediate suspension of classes nationwide due to the
COVID-19 pandemic, many schools opted to give the final examination online.
The scores on a certain exam are normally distributed with a mean of 65 and
a standard deviation of 9. Find the probability of the score
a. less than 54
_______________________________________________________________________________
_______________________________________________________________________________
_______________________________________________________________________________

262
b. at least 80
_______________________________________________________________________________
_______________________________________________________________________________
_______________________________________________________________________________
c. The average age of the ABS CBN employees is known to be 32 years and the
standard deviation is 5 years. Find the probability that a random sample of 36
employees will have an average age of less than 30 years.
________________________________________________________________________________
________________________________________________________________________________
________________________________________________________________________________
d. As a response to the personal, social and psychological impact of COVID-19,
Ateneo De Manila University President Fr. Jett Villarin, S.J. declared in a
memorandum, the suspension of online classes during the Enhanced
Community Quarantine measures of the government. With this memorandum,
some schools reacted positively and some the other way around.
If you are the president of a certain university, will you do the same thing? Cite
three reasons or less for your answer.
________________________________________________________________________________
________________________________________________________________________________
________________________________________________________________________________
Rubric for question (c) for 3-point score.
Equivalent Points Description
Coherent content with correct use of grammar. It
3 points
includes very timely and interesting ideas.
2 points Inconsistent content with correct use of grammar.
1 point Limited ideas. Grammatical error.
0 point No answer at all.

Assessment
Multiple Choice: Choose the best answer in each question. Write the letter of your
choice on a separate pad paper.

For numbers 1 - 4, refer to the problem below.

Suppose that a random sample of size 64 is to be selected from a


population with a mean 40 and standard deviation 5.

1. What is the population mean?


A. 5
B. 25
C. 30
D. 40
2. What is the standard deviation of the sampling distribution?
A. 0.625
B. 2.10
C. 21.5
D. 22.5
3. What is the approximate probability that 𝑥̅ will be within 0.5 of the
population mean 𝜇?
A. 31.5%
B. 32.80%
C. 57.62%
D. 75.57%

263
4. What is the approximate probability that 𝑥̅ will differ from 𝜇 by more than 0.7?
A. 26.27%
B. 34.23%
C. 39.3%
D. 40.7%

For numbers 5 – 7, refer to the problem below.

The GPAs of all students enrolled at a large university have an


approximately normal distribution with a mean of 3.05 and a standard deviation
of 0.29. Find the probability that the mean GPA of a random sample of 20 students
selected from this university is:

5. less than 298?


A. 12.58%
B. 14.01%
C. 15.62%
D. 85.23%
6. greater than 3.01?
A. 19.32%
B. 22.41%
C. 55.56%
D. 73.24%
7. between 2.90 and 3.07?
A. 62.17%
B. 61.13%
C. 42.34%
D. 1.04%
8. The standard error of the mean is smaller when N = 20 than when N = 10.
A. true
B. false
C. sometimes
D. maybe
9. A normal distribution is standardized by the formula
A. nπ
B. nπ(1-π)
C. (x - µ)/ σ
D. (x - µ)/ n
10. The mean of the standard normal distribution is
A. 0 B. 1 C. 100 D. 1000

For numbers 11 - 13, refer to the problem below.


Assume that woman’s heights are normally distributed with a mean given
by 𝜇 = 62.3 inches and a standard deviation σ = 1.8 inches.

11. If a woman is randomly selected, find the probability that her height is between
61.7 inches and 62.7 inches?
A. 20.75%
B. 72%
C. 21.64%
D. 86.65%

264
12. If a sample of 16 women is selected, what is the probability that the mean of
the sample will be larger than 63.2 inches?
A. 19.32%
B. 15.56%
C. 4.75%
D. 2.28%
13. If a sample of 16 women is selected, what is the probability that the mean of
the sample will be less than 63.2 inches?
A. 97.72%
B. 41.56%
C. 28.74%
D. 16.67%
14. Which is NOT true about the standard error?
A. The standard error is also the sampling error.
B. The standard error is the approximate standard deviation of a statistical
sample population.
C. The more data points involved in the calculations of the mean, the
smaller the standard error tends to be.
D. The standard error can include the variation between the calculated
mean of the population and one which is considered known, or accepted
as accurate.
15. The number of total carbohydrates in an order of camote fries in a native
restaurant is approximately normally distributed with a mean of 240 and a
standard deviation of 20. What is the probability that a randomly selected
order has at least 265 total carbohydrates?
A. 54.13%
B. 34.67%
C. 21.56%
D. 10.56%

Additional Activities
1. The mean income of a certain government employee is Php 53,200 with a
standard deviation Php 6750. Assume that their income is normally distributed.

Find the following:


a. 𝜇 = ________
b. 𝜎 = ________
c. 𝑃(𝑥 ≥ 50,000) = _____________
d. Show the region under normal curve by shading the part that corresponds to
the required area of P(x ≥ 50,000)

e. If a random sample of 49 employees is selected, what is the probability that


the sample mean had an income of at least Php 50,000?
1. The length of time in hours it takes an “over 40” group of men to play one
basketball game is normally distributed with a mean of 2 hours and a standard

265
deviation of 0.5 hours. A sample of size n = 30 is drawn randomly from the
population.
Answer the following questions:
a. 𝜇 = __________, 𝜎 = ____________ and n = ___________
d. Find the probability that the sample mean is between1.8 hours and 2.1 hours
hours.
e. Shade the area under the normal curve that corresponds to the required
value.

266
Statistics and
Probability
Illustrating the t-Distribution
Statistics and Probability
Illustrating the t-Distribution
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module

Writers: Gilberto M. Delfina


Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

267
Week

8
What I Need to Know
This module was designed and written with you in mind on how you can
illustrate the t-distribution. This will also help you analyze real-life situated problems
statistically in terms of relevant questions for you to better understand them. Your
adept at analysis will help you appreciate the richness, and beauty of Statistics which
will motivate you to apply to similar events and create statistical measures of your
own.

Your patience in solving offered problems here in the module will help you
improve your computational skills as it tackles relevant culture-based situated
problems. Your ability to interpret, reason–out, and make a judgment or even
decision out of statistical measures will also be practiced here. The scope of this
module permits it to be used in many different learning situations. The language
used recognizes the diverse vocabulary level of students. The lessons are arranged
to follow the standard sequence of the course. But the order in which you read them
can be changed to correspond with the textbook you are now using.
After going through this module, you are expected to:

1. illustrate the t-distribution and enumerate its properties; and


2. differentiate t-distribution from the z- distribution.
What I Know
Direction: Read and analyze each item carefully. Choose the letter of the best answer.
Write the chosen letter on a separate sheet of paper.
1. In the absence of the population variance and/or if the sample size is small,
which sampling distribution is being used?
A. chi distribution
B. z-distribution
C. t-distribution
D. p distribution
2. Who developed the t-distribution in 1908?
A. William S. Gosset
B. Wilhem G. Student
C. Ronald A. Fisher
D. a student from Gosset Academy
3. When do we say that the sample size is sufficiently large?
A. When it is greater than or equal to 10.
B. When it is greater than or equal to 20.
C. When it is greater than or equal to 30.
D. When it is greater than or equal to 40.
4. In Student’s t-distribution, if the sample size is 25, what is the degree of
freedom?
A. 5
B. 24
C. 25
D. 26
5. How does t distribution differ from a normal distribution?
A. The t-distribution has a thicker tail.
B. The t-distribution has a higher peak.
C. The t-distribution is centered at 0.
D. The t-distribution is symmetric in the middle.

268
6. When do you use t-distribution instead of normal distribution?
A. When the sample size is less than 30.
B. If the sample standard deviation is unknown
C. If the population standard deviation is known
D. When the sample size is greater than or equal to 30
𝑥−𝜇
7. In the estimation of a parameter using the t statistic 𝑠 , why did we replace the
⁄ 𝑛

population standard deviation, 𝜎 by the sample standard deviation, s?
A. because the given sample size is small
B. because 𝜎 and s differ by a very small amount
C. because 𝜎 is unknown and s is a good estimator of 𝜎
D. because 𝜎 is equal to s when we are estimating parameters
8. The t-distribution curve has thicker tails than the normal curve. What does it
imply?
A. The normal distribution has a greater mean than the t-distribution.
B. The t-distribution has lesser variability than the normal distribution.
C. The t-distribution has a greater chance for extreme values than the
normal distribution.
D. Estimation of the parameter using the z-distribution is more accurate
than using the t-distribution.

For numbers 9 to 11,


The CEO of Flying Fire Corporation claims that an average flying fire light bulb
lasts 300 days. A researcher randomly selects 15 bulbs for testing. The sampled
bulbs last an average of 290 days, with a standard deviation of 50 days.
9. Identify the population mean.
A. 300 days
B. 290 days
C. 50 days
D. 15 bulbs
10. What is the number of degrees of freedom?
A. 299
B. 289
C. 49
D. 14
𝑣
11. Compute for the variance of the t distribution using the formula, where 𝑣 is
𝑣−2
the number of degrees of freedom.
A. 1.17
B. 1.10
C. 1.04
D. 1.01
12. Which property of t-distribution is also a property of normal distribution?
A. In t-distribution, the variance is always greater than 1.
B. In t-distribution, the standard deviation is always greater than 1.
C. The tails of the t-distribution curve are asymptotic to the horizontal axis.
D. The shape of the t-distribution curve depends on the degrees of freedom.
13. Which of the following statements is NOT true about the t-distribution?
A. The variance of the t-distribution is equal to 1.
B. The t-distribution has thicker tails than the normal distribution.
C. As the degrees of freedom increase, the t distribution tends to normal
distribution.
D. The exact shape of the t-distribution depends on the number of degrees of
freedom.

269
14. Which of the following probability distribution curves has the lowest peak?
A. the standard normal distribution
B. a t-distribution with 20 degrees of freedom
C. a t-distribution with 15 degrees of freedom
D. a t-distribution with 10 degrees of freedom
15. Which of the following probability distribution curves has the thickest tails?
A. the standard normal distribution
B. a t-distribution with 21 degrees of freedom
C. a t-distribution with 19 degrees of freedom
D. a t-distribution with 20 degrees of freedom
How do you find this pre-test? Did you encounter both familiar and unfamiliar
terms? Kindly compare your answer in the Answer Key on the last part of this module
If you obtain 100% or a perfect score, skip the module and immediately move
to the next module. But if you missed a point, please proceed with the module as it
will enrich your knowledge in t-distribution.
Lesson 1 Illustrating the t-Distribution
This module is about the t-distribution (Student’s t-distribution) and its
properties. Information about the t-distribution will help you in solving for the
interval estimates of a population when the population variance is unknown. This
module will illustrate the t-distribution and discuss its uses. After completing this
module, you are expected to illustrate the t-distribution, enumerate its properties,
and state its similarities and/ or differences with the z- distribution. It is assumed
that you already learned the topic about the normal distribution before proceeding
in this module.
What’s In
Let us review your lesson on the properties of z-distribution. Write “YES” if the
statement is true about the z-distribution and “NO” if it’s not. Then check your
answer by going back to the previous module on z-distribution.

__________1. The z-distribution is a normal distribution with a mean of 0 and


standard deviation of 1.
__________2. The normal curve is bell-shaped.
__________3. The tails of the normal curve approach the vertical axis but never
touch it.
__________4. The mean is always greater than either the median or the mode.
__________5. The curve is symmetrical about its center.
__________6. The total area under the normal curve is always less than 1.
__________7. The mean, median and mode coincide at the center.
__________8. The width of the normal curve depends on the standard deviation of
the distribution.
__________9. Almost 99.7% of the distribution falls within three standard deviations
from the mean.
__________10. The left tail of the normal curve is flatter than its right tail.

What’s New
ACTIVITY 1. “To z or not to z, that is the question”

In everything you do, your decision matters. And in every decision you make,
you must be responsible for its consequences or outcomes. In the illustration below,
fill in the missing boxes with “to z” if you think a z-table is required in the distribution
and “not to z” if not.

270
1
Is the sample
size less than
30?

YES NO

_____________ ____________

2 Is there a given population


standard deviation?

YES NO

_____________ ____________
Answer the following questions.
1. How did you find the activity?
2. Was it hard for you to fill in each box?
3. Based on the activity, when do you use the z-distribution? When not to use it?
4. If the sample size is less than 30 and the population standard deviation is
unknown, what do you use instead of the z-distribution?

What is It
According to the Central Limit Theorem, the sampling distribution of a
statistic (like a sample mean, 𝑥̅ ) will follow a normal distribution, as long as the
sample size (𝑛) is sufficiently large. Therefore, when we know the standard deviation
of the population, we can compute a z-score and use the normal distribution to
evaluate probabilities with the sample mean.

271
But sample sizes are sometimes small, and often we do not know the standard
deviation of the population. When either of these problems occurs, the solution is to
use a different distribution.
Student’s t-distribution
The Student’s t-distribution is a probability distribution that is used to
estimate population parameters when the sample size is small (𝑖. 𝑒. 𝑠𝑎𝑚𝑝𝑙𝑒 𝑠𝑖𝑧𝑒 < 30)
and/or when the population variance is unknown. It was developed by William Sealy
Gosset in 1908. He used the pseudonym or pen name “Student” when he published
his paper which describes the distribution. That is why it is called “Student’s t-
distribution”. He worked at a brewery and was interested in the problems of small
samples, for example, the chemical properties of barley. In the problem he analyzed,
the sample size might be as low as three.
Suppose you are about to draw a random sample of n observations from a
normally distributed population, you previously learned that,
𝑥−𝜇
𝑧=𝜎
⁄ 𝑛

where 𝑧 is the z-score, 𝑥 is the sample mean, 𝜇 is the population mean, 𝜎 is the
population standard deviation and 𝑛 is the sample size, have the standard normal
distribution. (Note that if we are standardizing a single observation, the value of n is
𝑥−𝜇
1. Hence, the formula becomes 𝑧 = . You can use this concept to construct a
𝜎
confidence interval for the population mean, 𝜇. But in practice, you encounter a
problem, and that problem is that you don’t know the value of the population
standard deviation, 𝜎. The standard deviation for the entire population 𝜎 is a
parameter and you don’t typically know its value, so you can’t use that in your
formula. If that happens, you could do the next best thing, instead of using the
“population” standard deviation, 𝜎; you are going to use your “sample” standard
𝑥−𝜇 𝑥−𝜇
deviation s, to estimate it. And instead of 𝜎 , you are going to have 𝑠 where s is
⁄ 𝑛 ⁄ 𝑛
√ √
your sample standard deviation.
You must take note of the change in the formula. The quantity 𝜎 is a constant
but you don’t know its value, so you used s which is a statistic and this statistic s
has a sampling distribution and its value would vary from sample to sample. And so,
𝑥−𝜇
the quantity 𝑠 would no longer have the standard normal distribution. This
⁄ 𝑛

quantity is labeled as t because it has a t-distribution. When you are sampling from
a normally distributed population, the quantity
𝑥−𝜇
𝑡= 𝑠
⁄ 𝑛

has the t-distribution with n-1 degrees of freedom. Note that the number of degrees
of freedom is one less than the sample size. So, if the sample size n is 25, the number
of degrees of freedom is 24. Similarly, at t distribution having 16 degrees of freedom,
the sample size is 17.
𝑥−𝜇
What does the t-distribution look like? If you look at the statistic 𝑠 , it looks
⁄ 𝑛

like a z-statistic which has standard normal distribution except that you replaced
the population standard deviation, 𝜎, by the sample standard deviation s. You are

272
estimating a parameter with a statistic, so there is a greater variability. Hence, your
t-distribution is going to look like the normal distribution except with greater
variance.

You have here a plot of standard normal distribution in black and t-


distributions with 3, 5, 20, and 30 degrees of freedom in red, green, violet, and blue
respectively. You can see that both the z-distribution and t-distributions are
symmetric about 0 and bell-shaped. But the t-distributions have heavier tails (more
area in the tails) and lower peaks.
The exact shape of the t-distribution depends on the degrees of freedom. The
figure above tells you that as the degrees of freedom increase, the t-distribution tends
toward the standard normal distribution. At 30 degrees of freedom, the blue curve
might look very close to the normal curve. But if you look very closely, you would see
that the t-distribution still has slightly heavier tails and slightly lower peak. But if
you let those degrees of freedom continue to increase, the t-distribution is going to
get closer and closer to the standard normal distribution.

Properties of t-distribution
The t-distribution has the following properties:
1. The t-distribution is symmetrical about 0. That means if you draw a segment
from the peak of the curve down to the 0 mark on the horizontal axis, the
curve is divided into two equal parts or areas. The t- scores on the horizontal axis
will be divided also with half of the t-scores being positive and half negative.

2. The t-distribution is bell-shaped like the normal distribution but has


heavier tails. That means it is more prone to producing values that fall
far from the mean. The tails are asymptotic to the horizontal axis. (Each
tail approaches the horizontal axis but never touches it.)

273
3. The mean, median, and mode of the t-distribution are all equal to zero.
𝑣
4. The variance is always greater than 1. It is equal to where 𝑣 is the
𝑣−2
number of degrees of freedom. As the number of degrees of freedom
increases and approaches infinity, the variance approaches 1. Using the
formula, if the number of degrees of freedom is 10, the variance is
10 10
= = 1.25
10−2 8
5. As the degrees of freedom increase, the t-distribution curve looks more
and more like the normal distribution. With infinite degrees of freedom, t-
distribution is the same as the normal distribution.

6. The standard deviation of the t-distribution varies with the sample size. It is
always greater than 1. Unlike the normal distribution, which has a standard
deviation of 1.
7. The total area under a t-distribution curve is 1 or 100%. One can say that the
area under the t-distribution curve represents the probability or the
percentage associated with specific sets of t-values.

274
What’s More
ACTIVITY 2. “Oh, Is That for Real?”
Most of us hate fake news, fake information, and even fake friends. We need
to develop our ability to distinguish what is real from what is not. Write “REAL” if
the statement is true about the t-distribution and “FAKE” if it’s not.
_________1. The t-distribution is used to estimate population parameters when the
sample size is small and/or the population variance is unknown.
_________2. The mean, median and mode are all equal to zero.
_________3. The variance is equal to 1.
_________4. The t-distribution curve is bell-shaped.
_________5. The standard deviation is always greater than 1.
_________6. Half of the total area under the t-distribution curve is equal to 1.
_________7. The curve is symmetrical about its zero.
_________8. The shape of the t-distribution curve depends on the sample mean.
_________9. The tails of the t-distribution curve approach the horizontal axis but
never touch it.
_________10. As the degrees of freedom increase, the t-distribution curve
looks more and more like the normal distribution.
What I Have Learned
Fill in the blank with the correct word or phrase to complete each sentence.
1. The Student’s t-distribution is a probability distribution that is used to estimate
population parameters when the sample size is ____________ and/or when
the_______________________ is unknown.
2. The t-distribution was developed by ___________________ in 1908.
3.Like the normal distribution, the t-distribution is ____________ -shaped,
symmetrical about ____________________ and has the total area under its
curve equal to ____________.
4. The t-distribution has tails that are asymptotic to the __________________ axis.
5. The mean, median and mode of t-distribution are equal to ___________.
6. The shape of the t-distribution curve depends on the number of ______________.
7. The t-distribution has _____________ peak and _____________ tails than the
normal curve.
8. As the degrees of freedom increases, the t-distribution tends to the
_____________.
9. The variance and the standard deviation of the t-distribution is always
_____________ than 1.
10. To compute for the variance, use the formula ____________.
What I Can Do
Tell whether each of the following statements describes a t-distribution, a normal
distribution, or both.
_________________1. The variance is equal to 1.
_________________2. The probability distribution curve is bell-shaped.
_________________3. The probability distribution curve symmetrical about
zero.
_________________4. The standard deviation is equal to 1.
_________________5. The shape of the distribution curve depends on the
degrees of freedom
_________________6. It has the total area under its curve equal to 1 or 100%.
_________________7. It has lower peak and heavier tails.

275
_________________8. Sample size is small and population variance is
unknown.
_________________9. It has tails that are asymptotic to the horizontal axis.
_________________10. Its mean, median, and mode are equal to 0.
Assessment
Direction: Read and analyze each item carefully. Choose the letter of the best answer.
Write the chosen letter on a separate sheet of paper.
1. It is the probability distribution used when the population variance is unknown
and/or if the sample size is small.
A. chi distribution
B. z-distribution
C. t-distribution
D. p-distribution
2. The t-distribution was developed by W.S. Gosset in 1908. Why was it called
“Student’s t-distribution”?
A. because it was first introduced to senior high school students in 1908
B. because he was still a college student when he first introduces the
t-distribution
C. because he used the pen name “Student” when he introduces his paper
about the t distribution
D. because he used the students as samples in his study about the
t-distribution
3. When do we consider that the sample size is small?
A. when it is less than 30
B. when it is between 20 to 40
C. when it is greater than 25 but less than 40
D. when it is only 1% of the total population
4. In Student’s t-distribution, what is the sample size if the degree of freedom is 25?
A. 5
B. 24
C. 25
D. 26
5. What is the difference between the normal distribution and the t-distribution?
A. The t-distribution has a lower peak.
B. The t-distribution is centered at 0.
C. The t-distribution is symmetric in the middle.
D. The t-distribution has tails that asymptotic to the horizontal axis
6. When do we use t-distribution instead of normal distribution?
A. If the sample variance is unknown
B. If the population variance is known
C. When the sample size is considered small.
D. When the sample size is sufficiently large.
7. In the estimation of a parameter using the t-distribution, is it possible to replace
the population standard deviation, 𝜎 by the sample standard deviation, s?
A. No, because the given sample size is small
B. No, because 𝜎 and s differ by a significant amount
C. Yes, because 𝜎 is unknown and s is a good estimator of 𝜎.
D. Yes, because 𝜎 is equal to s when we are estimating parameters
8. What will be the effect on the t distribution curve when we replace 𝜎 by s?
A. The t-distribution curve will become bell-shaped.
B. The t-distribution curve will be symmetrical to zero.
C. The t-distribution curve will be the same as the normal curve.
D. The t-distribution curve will have a lower peak and thicker tails.

276
For numbers 9 to 11,
The principal of Mapayapa Integrated National High School claimed that the
average salary of their teachers is ₱24, 000 per month. A random sample of 15
teachers in the school has a mean of ₱23, 220, and a standard deviation of ₱400.
9. Identify the population mean.
A. ₱24,000
B. ₱23,220
C. ₱400
D. 15
10. Identify the sample mean.
A. ₱24,000
B. ₱23,220
C. ₱400
D. 15
11. In testing the principal’s claim, can we use the t-distribution?
A. Yes, because the population standard deviation is unknown, and the
sample size is small.
B. Yes, because the sample standard deviation is small, and variance is
unknown.
C. No, because the population standard deviation is greater than the sample
standard deviation.
D. No, because the sample size is too small for the given sample standard
deviation.
12. Which property of t-distribution is NOT a property of normal distribution?
A. The t-distribution curve is bell-shaped.
B. The t-distribution curve is symmetrical about zero.
C. In t distribution, the variance is always greater than 1.
D. The total area under the t distribution curve equal to 1.
13. The following statements tell about the t-distribution EXCEPT _________________
A. The standard deviation of the t-distribution is equal to 1.
C. The t-distribution has thicker tails than the normal distribution.
D. The exact shape of a t-distribution depends on the degrees of freedom.
B. As the degrees of freedom increase, the t-distribution tends to normal
distribution.
14. Which of the following probability distribution curves has the lowest peak?
A. the standard normal distribution
B. a t-distribution with 5 degrees of freedom
C. a t-distribution with 7 degrees of freedom
D. a t-distribution with 9 degrees of freedom
15. Which of the following probability distribution curves has the thickest tails?
A. a t-distribution with 25 degrees of freedom
B. a t-distribution with 23 degrees of freedom
C. a t-distribution with 21 degrees of freedom
D. a t-distribution with 19 degrees of freedom

277
Additional Activities
Make a Venn diagram showing the comparison of the t-distribution and normal
distribution. List down the characteristics or properties of the two distributions on
the spaces provided.

t-distribution has Normal distribution


these characteristics has these
Both distributions characteristics
have these
__________________ characteristics ___________________
__________________ ___________________ ___________________
__________________ ___________________ ___________________
__________________ ___________________ __________________
___________________ ___________________ __________________
___________________ _________________ ____________________
___________________ _____________ __________________

278
Statistics and
Probability
Identifying Percentiles
Using the t-Table
Statistics and Probability
Identifying Percentiles Using the t-Table
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module

Writers: Gilberto M. Delfina


Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

279
Week

8
What I Need to Know
This module was designed and written with you in mind on how you can
identify percentiles using the t-table. This will also help you analyze real-life situated
problems statistically in terms of relevant questions for you to better understand
them. Your adept at analysis will help you appreciate the richness, and beauty of
Statistics which will motivate you to apply to similar events and create statistical
measures of your own.
Your patience in solving offered problems here in the module will help you
improve your computational skills as it tackles relevant culture-based situated
problems. Your ability to interpret, reason – out, and make a judgment or even
decision out of statistical measures will also be practiced here. The scope of this
module permits it to be used in many different learning situations. The language
used recognizes the diverse vocabulary level of students. The lessons are arranged
to follow the standard sequence of the course. But the order in which you read them
can be changed to correspond with the textbook you are now using.
After going through this module, you are expected to:
1. identify regions under the t-distribution corresponding to different t-
values; and
2. identify percentiles using the t-table.
What I Know
Direction: Read and analyze each item carefully. Choose the letter of the best
answer. Write the chosen letter on a separate sheet of paper.
1. If the area of the shaded part of the t-distribution below is 0.15, what is the area
of the unshaded region?

A. 0.15 B. 0.75 C. 15% D. 0.85


2. Using the t-table, if the t-value is -1.708 and the shaded region is on the left of
the t-distribution, what is the area of the unshaded region?
A. 0.05 B. 0.10 C. 0.90 D. 0.95
3. Which t-distribution corresponds to t-value of 2.567 and whose shaded area on
the right tail of the t-distribution is 0.01?
A. a t distribution with 7 degrees of freedom
B. a t distribution with 17 degrees of freedom
C. a t distribution with 38 degrees of freedom
D. a t distribution with infinite degrees of freedom
4. Using the t table, if the t-value is 2.093 and the shaded area is on the right of the
t-distribution, what are the degrees of freedom?
A. 0.025 B. 1.9 C. 19 D. 20
5. The shaded region is on the left tail of the t-distribution. If the degree of freedom
is 25 and the area of the shaded region is 0.05, what is the value of t?
A. 2.059 B. 1.708 C.1.316 D. -1.708
6. Using the t-table, find the area to the right of 1.83 under the t- distribution
with 18 degrees of freedom?
A. between 0.01 to 0.02 C. between 0.025 to 0.05
B. between 0.02 to 0.025 D. between 0.05 to 0.10

280
7. It is the value on the t distribution whose left tail probability is 93% and whose
right-tail probability is 7%.
A. left t-value C. 93rd percentile
B. right t-value D. 7th percentile
8. In identifying a percentile using the t-table, what information should be identified
first?
A. degrees of freedom only
B. degrees of freedom and the given percentage
C. degrees of freedom and the area to the left tail of the t-distribution
D. degrees of freedom and the area to the right tail of the t-distribution
9. Which of the following values is the 90th percentile of the t-distribution with 10
degrees of freedom?
A. 1.363 B. 1.372 C. 1.383 D. 1.812
10. What is the 95th percentile of the t-distribution if the sample size is 15?
A. 1.771 B. 1.761 C. 1.753 D. 1.345
11. If the degree of freedom is 24, what is the 95 th percentile of the
t-distribution?
A. 1.711 B. 1.714 C 1.717 D. 2.064
12. If the 99th percentile of the t-distribution is 2.528, what is the degree of freedom?
A. 17 B. 18 C. 19 D. 20
13. To what percentile of the t-distribution does 1.337 belong if the sample size is
17?
A. 90th B. 93rd C. 95th D. 99th
14. What is the 50th percentile of a t distribution with 45 degrees of freedom?
A. 1.684 B. 0.50 C. 0 D. -1.684
15. Using the t-table, find the area to the left of 2.15 under the t-distribution with
21 degrees of freedom?
A. between 0.98 to 0.99 C. between 0.95 to 0.975
B. between 0.975 to 0.98 D. between 0.90 to 0.95
How do you find this pre-test? Did you encounter both familiar and unfamiliar
terms? Kindly compare your answer in the Answer Key on the last part of this module
If you obtain 100% or a perfect score, skip the module and immediately move
to the next module. But if you missed a point, please proceed with the module as it
will enrich your knowledge in t-distribution.
Lesson 1 Identifying Percentiles Using the t-
Table
What does it mean if you belong to the 95th percentile” of those who took an
examination? Is it a good or a bad ranking? There are times when you need to
describe the location of a certain value from a given set of values. Sometimes you
were asked to tell whether a specific score belongs to a location that is higher or lower
than the other scores in the given distribution. This module will help you in dealing
with real-life problems.
This module is about identifying percentiles using the t-table. In addition, you
will also learn how to identify regions or areas under the t-distribution curve
corresponding to different t-values.

What’s In
BLUE RIBBON
“It’s more beautiful with a .”
Make your life more beautiful while reviewing your lesson on the properties of
t-distribution. Which of the ribbons below contain the characteristics of a t-
distribution? Color them blue.

281
Variance is always greater
than 1.

The distribution is symmetric


about 0.

The standard deviation is


equal to 1.

It has heavier tails than


normal curve.

As the degrees of freedom increases,


t distribution tends to normal.

It has higher peak than


normal curve.

The total area under the


distribution curve is 1.

What’s New
“Fif-t Shades of Gray”
Given below are five t-distributions in the shades of gray. The given percentage
represents the unshaded portion of the t-distribution. Your task is to find the area
of the remaining portion/s of the t-distribution. Remember that the total area
under the t-distribution curve is equal to 1 or 100%.

1. 3.

2. 4.

5.

Answer the following questions.


1. How did you find the activity?
2. Was it hard for you to compute for the area of the shaded part under the t-
distribution?

282
3. What basic operation did you use in finding the area of the shaded part?
4. How did you find the area of two identical shaded parts on both tails of the t-
distribution in question 5?
5. If one of the two identical shaded parts on both tails of the t-distribution has an
area of 1.5%, what is the area of the unshaded portion of the t-distribution?
What is It
You have learned from the previous module the different properties of t-
distribution, some of which are very essential in this lesson. What is the total area
under the t-distribution curve? Yes, it should be equal to 1 or 100%. The area under
the t-distribution curve also represents the probability associated with specific sets
of t-values. That means given the t-value, you can compute for the area or probability
with the use of a table or software. A t-value or t-statistic tells us how many standard
deviations from the mean is the given score. The set of t-values are usually written
below the horizontal axis of the t-distribution curve.
Another property of t-distribution is that the exact shape of the t-distribution
depends on the degrees of freedom. Remember that the lesser the degree of freedom,
the lower is its peak and the thicker is its tails. As the degree of freedom increases,
the tails become flatter, and the peak becomes higher. That means, given the area or
the probability, the t-value depends on the number of degrees of freedom. For
example, with the given area of 0.05 on the right tail of t-distribution, the t-value is
2.015 with 5 degrees of freedom. But with 20 degrees of freedom, the t-value is equal
to 1.725.

The t-Table

In finding the areas and percentiles for a t-distribution you need to familiarize
yourself with the t-table. You are going to use a table that is different from the z-
table you used in finding the area under the normal curve.
Below is an example of a t-table. It is a right-tailed t-table because the given
areas in this table are areas on the right tail of the t-distribution. Some t-tables are
slightly different in format. Look at the t-table below. In the first column in the left-
most part, you have the degrees of freedom. It ranges from 1 down to ∞. While the
first row in the upper part of the t-table represents the area under the right tail of
the t-distribution. Some of the given areas are from 0.25 down to 0.0005. The rest of
the entries in the body of the table are the values of the variable t (t-values).

283
284
By looking at the table, you can see
that the t-value for an area of 0.10 in the
right tail of the t-distribution with 10
degrees of freedom is 1.372. This is the
intersection of the row containing the 10
degrees of freedom and the column
containing the area of 0.10.

Similarly, the area to the right tail


of a t-distribution with 15 degrees of
freedom corresponding to the t-value of
2.249 is 0.02. Focus on the row
containing 15 degrees of freedom, then
look for the t-value of 2.249. The column
that you need is the column containing
the area of 0.02.

285
Identifying Percentiles Using the t-Table
A percentile is a value on a t-distribution that is less than the probability in
the given percentage. For example, the 90th percentile of the t-distribution is that t-
value whose left tail probability is 90% and whose right-tail probability is 10%. Since
the area under the t-distribution curve also represents the probability, the 90 th
percentile of the t-distribution is the t-value whose area on its left tail is 0.90 and
whose area on its right tail is 0.10.

Illustrative Example 1

Find the 95th percentile of a t-distribution with 6 degrees of freedom.


You can plot in a t-distribution and draw what you are looking for. The 95th
percentile is the value of the variable t that has an area of 95% or 0.95 to the left.
That value is somewhere roughly near the t-value of 2. You don’t need to get exactly
where it is on t-distribution when you are drawing it at this point, you are just looking
for the rough idea where it is.
And since the area of the entire curve is 1, this implies that the area to the
right of the 95th percentile is
0.05. Hence, the 95th
percentile is the value of the
variable t that has an area of
0.05 to the right. That means
finding the 95th percentile is
looking for the t-value with an
area to the right of 0.05 under
a t-distribution with 6 degrees
of freedom.
So, you are going to
focus on the 6 degrees of
freedom row, and in the
column containing the area to
the right of 0.05. (The appropriate row and column were highlighted in red).

286
From the figure above, you can see that the value that you need is 1.943.
Hence the 95th percentile is 1.943. That means the t-value of 1.943 has 95% of the
area to the left of it, or 0.95. Also, you can say that the t-value of 1.943 has an area
of 0.05 to its right. And so, using the t-table, you will find that the 95th percentile is
1.943.

Illustrative Example 2

Find the 5th percentile of a t-distribution with 6 degrees of freedom.


The 5th percentile is the value of the variable t that has an area of 5% or 0.05
to the left. And since the area of the entire curve is 1, you are convinced that the area
to the right of the 5th percentile is 0.95. Hence, the 5th percentile is the value of the
variable t that has an area of 0.95 to the right. Therefore, finding the 5th percentile is
the same as finding for the t-value with an area to the right of 0.95 under a t-
distribution with 6 degrees of freedom.
But if you look at the given areas in the first row of the t table, there is no
entry for an area of 0.95. There is no way you can find an area of 0.95 because your
table is a right-tailed t table. That means it is set to display only the areas under the
right tail of the t distribution.
Also, if you look at your illustration of the 5th percentile below you will realize
that the t-value that you are looking for lies between -1 and -2. Hence its value should

287
be a negative number. But if you observe the body of the table where t-values are
located, you cannot find any negative t-value. The table gives only positive values of
t.

At this point, you need to recall one of the properties of the t-distribution that
it is symmetric about zero. That means the right tail of the distribution is exactly the
mirror image of its left tail. So, you can easily find the values in the left tail by relying
on this “symmetry–about–zero” property. Hence, if you are going to find the value of
t such that the area to the left of it is 0.05, recall that the area to the right of 1.943
is also 0.05 (See Illustrative Example 1).
Therefore, you can say that since the t-distribution is symmetric about 0, the
t-value with an area to the left of 0.05 must be -1.943. So, you will find that the 5th
percentile is –1.943.

Illustrative Example 3
What is the area to the right of 2.4 under a t-distribution with 7 degrees of
freedom?
Remember that in the previous example, you found t-values using the given
areas under the t-distribution curve. But in this example, you will be doing the
opposite because in this problem you are given a t-value and you need to find the
area to the right of the t-distribution with 7 degrees of freedom.
You can illustrate the problem with the figure shown below. The t-value of 2.4
is somewhere between 2 and 3, and you are going to find the area to the right of it.

288
So, looking back at the table, you need to focus on the 7 degrees of freedom
line. You will observe that the t-value of 2.4 cannot be found in this row but you do
find these two values 2.365 and 2.517 that surround 2.4 (The t-value 2.4 is between
2.365 and 2.517).

The table tells you that the area to the right of 2.365 is 0.025 and the area to
the right of 2.517 is 0.02. You figure out earlier that our t-value of 2.4 falls in between
two values 2.365 and 2.517 and it tends to reason then, that the area to the right of
2.4 must be between those two values 0.025 and 0.02.
So, using the table you found that the area to the right of 2.4 under the t-
distribution with 7 degrees of freedom lies somewhere between 0.02 and 0.025.
If you need to get the exact value, you need to use software that easily
calculates the area under the t-distribution curve with the given t-value and number
of degrees of freedom. Using such software, you could find that the area to five
decimal places is 0.02373.

What if you needed to use the t-table to find the area to the left of 2.4?
Since the area under the entire curve is 1, the area to the left of 2.4 is equal
to 1 minus the area to the right of 2.4. So, based on the table the area to the left of
2.4 under the t distribution with 7 degrees of freedom must lie somewhere between
0.98 and 0.975 (1 – 0.02 = 0.98 and 1 – 0.025 = 0.975).
But since you already knew that the area to the right of 2.4 is 0.02373, you
could find the exact area to the left of 2.4 to five decimal places as 1 minus 0.02373
or 0.97627.

289
What’s More
INDEPENDENT PRACTICE: “BLACKPINK in your Area”
Use the t-table to identify the area under the t-distribution given the t-value and the
number of degrees of freedom. Draw a t-distribution for each number then choose
from BLACK or PINK to shade the required area. (The first one is done for you).
Example:
Area to the right of 1.746 under a t-distribution with 16 degrees of freedom

START HERE
1. Area to the right of 1.323 under a t-distribution with 21 degrees of freedom
2. Area to the right of 2.821 under a t-distribution with 9 degrees of freedom
3. Area to the left of -2.179 under a t-distribution with 12 degrees of freedom
4. Area to the left of 0.889 under a t-distribution with 8 degrees of freedom
5. Area to the right of 1.25 under a t-distribution with 25 degrees of freedom?

INDEPENDENT ASSESSMENT: “Percen-TILES”


Use the table and the given
degrees of freedom to identify each
percentile.

What I Have Learned


1. What is a percentile? How does it differ from a percentage?
2. What does the 98th percentile of the t-distribution mean?
3. Give three important information found in a right-tailed t-table.
4. Why does the right-tailed t-table show only positive t-values?
5. Enumerate the steps on how to find the area to the right of 2.50 under the t-
distribution with 23 degrees of freedom.
6. Explain in your own words how to find the 90th percentile of the t- distribution
with 28 degrees of freedom.

290
What I Can Do
Answer the following problems.
1. Find the values of t for which the area on the right tail of the t-distribution is 0.05
and the number of degrees of freedom is equal to:
a. 15 b. 28 c. 100
2. Find the 99th percentile of the t-distribution with 18 degrees of freedom.
3. Find the 90th percentile of the t-distribution if the sample size is 25.

Assessment
Direction: Read and analyze each item carefully. Choose the letter of the best answer.
Write the chosen letter on a separate sheet of paper.

1. It is a value in the t-distribution that is less than the probability of the given
percentage.
A. degrees of freedom C. t-value
B. percentile D. right-tailed value
2. If the area of the unshaded part of the t-distribution is 0.85, what is the area of
the shaded region?
A. 0.15 B. 0.75 C. 0.85 D. 85%
3. If the given t-value is 2.056 and the shaded region is on the right of the t-
distribution with 26 degrees of freedom, what is the area of the unshaded region?
A. 0.025 B. 0.15 C. 0.95 D. 0.975
4. What is the area on the left tail of the t-distribution with 14 degrees of freedom
corresponding to t-value of 3.787?
A. 0.001 B. 0.01 C. 0.975 D. 0.999
5. Which t distribution has an area of 0.10 to the right of 1.333?
A. A t-distribution with 5 degrees of freedom
B. A t-distribution with 11 degrees of freedom
C. A t-distribution with 17 degrees of freedom
D. A t-distribution with 23 degrees of freedom
6. The shaded region is on the right tail of the t-distribution. If the degree of freedom
is 22 and the area of the shaded region is 0.005, what is the value of t?
A. -2.819 B. -2.831 C. 2.819 D. 2.831
7. The shaded region is on the left tail of the t-distribution. If the degree of freedom
is 16 and the area of the shaded region is 0.01, what is the value of t?
A. 2.583 B. 2.120 C.-2. 120 D. -2. 583
8. What is the area to the right of 1.8 under the t-distribution with 5 degrees of
freedom?
A. between 0.20 and 0.15 C. between 0.15 and 0.10
B. between 0.10 and 0.05 D. between 0.05 and 0.025
9. Which of the following values is the 90th percentile of the t-distribution with 15
degrees of freedom?
A. 1.341 B. 1.345 C. 1.753 D. 1.837
10. What is the 95th percentile of the t-distribution when the sample size is 23?
A. 1.721 B. 1.717 C. 1.714 D. 1.703
11. If the degree of freedom is 29, what is the 95 th percentile of the t-distribution?
A. 2.048 B. 1.701 C 1.699 D. 1.547
12. If the 99 percentile of the t-distribution is 2.650, what is the degree of freedom?
th

A. 16 B. 15 C. 14 D. 13

291
13. To what percentile of the t-distribution does 1.397 belong if the sample size is
9?
A. 90th B. 95th C. 97th D. 99th
14. What is needed in order to identify a percentile using the t table?
A. degrees of freedom only
B. degrees of freedom and the given percentage
C. degrees of freedom and the area to the left tail of the t-distribution
D. degrees of freedom and the area to the right tail of the t-distribution
15. Why does the right-tailed t-table display only positive values of t?
A. Because the area cannot be negative.
B. Because t-values are always positive.
C. Because t-values on the left tail are positive
D. Because t-values on the right tail are positive
Additional Activities
Browse the internet and look for a site, a program or software that easily computes
for t-values and areas under the t-distribution. Answer the following questions using
the t-table then compare your answer using the software.
1. What is the area to the right of 1.5 under the t-distribution with 14 degrees of
freedom?
2. What is the area to the right of 2.75 under the t-distribution with 28 degrees of
freedom?
3. What is the area to the left of 2.8 under the t-distribution with 5 degrees of
freedom?
4. If the degree of freedom is 20, what is the 96 th percentile of the t-distribution?

5. In a t-distribution with 13 degrees of freedom, what is the 38th percentile?

292
Statistics and
Probability
Identifying the Length of a
Confidence Interval
Statistics and Probability
Identifying the Length of a Confidence Interval
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module

Writers: Maryquenn F. Reyes


Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

293
Week

8
What I Need to Know
The content of this module will open your mind that statistics plays an
important role in your daily life situations. The explanation and language used are
simple due to the diverse level of the learners. Each example has a step-by-step
process so that the learner can follow its flow and sequence.
In this module, you will study how to identify the length of a confidence
interval, the margin of error, confidence level, and the sample mean. You will also
solve problems involving the length of a confidence interval.

After going through this module, you are expected to:


1. differentiate confidence level from confidence interval;
2. identify the length of the confidence interval given the sample mean and
margin of error; and
3. solve problems involving the length of a confidence interval.

What I Know
Before studying this module, take this test to determine what you already know
about the topic covered.
Choose the letter of the best answer. Write the chosen letter on a separate sheet of
paper.
1. Which of the following values and percents illustrates the confidence
level?
A. 32.8
B. 30-39
C. 90%
D. 90% - 94%
2. Which of the following symbols is used to get the mean?
A. E
B. μ or 𝑋̅
C. n
D. 
3. Which of the following symbol stands for the margin of error?
A. E
B. Z𝛼 /2
C. n
D. 
4. For a 95% confidence level, the confidence coefficient is ± ____?
A. 1.645
B. 1.96
C. 2.33
D.2.58
5. What do you call the value resulting from subtracting the margin of error
from the sample mean?
A. point estimate
B. interval estimate
C. lower limit of the interval
D. upper limit of the interval

294
6. What do you call the value resulting from adding the margin of error from
the sample mean?
A. point estimate
B. interval estimate
C. lower limit of the interval
D. upper limit of the interval

For nos. 7 – 10
A survey on 200 HUMSS students of Bumasa Integrated High School shows that they
read on an average of 15. 7 hours per week. The margin of error is 2.2 hours at a
95% confidence level.

7. What is the sample mean?


A.200
B. 95
C. 15.7
D. 2.2
8. Which of the following values is the margin of error?
A.15.7
B. 2.2
C. 0.95
D. 0.05
9. Which of the following intervals is the confidence interval?
A. (13.5, 17.9)
B. (1.35, 1.79)
C. (135, 179)
D. (14.75, 16.65)
10. What is the upper limit of the interval?
A. 1.79
B. 13.5
C. 16.65
D. 17.9

For nos. 11 – 13:


A random sample of 36 scores from the Statistics examination of TVL learners is
taken and gives a sample mean of 68. At a 90% level of confidence, the interval
estimate of the population mean is between 67.18 and 68.82.

11. Which of the following values is the 𝑥̅ ?


A. 36
B. 67.18
C. 68
D. 68.82
12. Which of the following values is the margin of error?
A. 0.90
B. 0.82
C. 0.68
D. 0.05
13. What is the length of the confidence interval in the given problem?
A. from 67 to 68
B. from 671.8 to 688.2
C. from 67.18 to 68.82
D. from 6782 to 6882

295
14. Which of the following terms describes the probability that the interval
estimate contains parameters and what percentage of intervals from
many different samples contains the unknown population parameter?
A. margin of error
B. confidence level
C. confidence interval
D. level of significance
15. Which of the following terms pertains to a range of values that is used to
estimate a parameter which may or may not contain the true parameter
value?
A. margin of error
B. confidence level
C. confidence interval
D. level of significance

What’s In
For you to begin let us recall first the previous lessons which are very important for
you to succeed.
Rearranged the jumbled letters to get the correct answer.
∑𝑋
1. (NAME ) This formula : 𝑋̅ =
𝑁
,stands for______?
2. (SIGNACIFINEC VEELL) This symbol is for ______?
3. ( THUNDDHERS ) 98.7896 = 98.79 is rounded to the nearest ______?
4. ( LACITRIC SEULAV) This symbol Zc represents ______?
5. ( CONIFDENCE EVELL) It is represented by a percentage that refers to all
possible samples that can be estimated to contain the true population
parameter is called ______?
What’s New
FILL ME IN FILIPINO CITIZEN!

Complete the Preamble by answering the number that corresponds to the blank. You
may choose your answer from the given box.

We, the sovereign ____(1)_____ people, imploring the aid of Almighty God, in
order to build a ___(2)____ and humane society, and establish a ____(3)____ that shall
embody our ideals and aspirations, promote the common good, conserve and develop
our ______(4)_____, and secure to ourselves and our _____(5)____, the blessings of
____(6)____and _____(7)____ under the rule of law and a regime of truth, justice,
freedom, love, equality, and peace, do ordain and promulgate this Constitution.

Words to fill in Preamble Answers


administration Before of Lower Limit and after Upper
Limit
democracy 30%<𝜇<40%
Filipino 5%
Government Upper limit
independence 𝑋̅ − 𝐸
just margin of error
patrimony Middle of Lower Limit and Upper Limit
posterity 𝑋̅ + 𝐸

296
right Mean
Sovereignty 30%>𝜇>40%

Here is the problem to answer the questions:


Supposed the school newspaper conducts an SSG election survey and reports
that the independent candidate will receive 35% of the vote. The school newspaper
stated that the survey had a 5% margin of error and a confidence level of 95%. These
findings resulted in the following confidence interval: a 95% confidence that the
independent candidate will receive between 30% and 40% of the vote.
1. What is the margin of error in the given problem? ____________
2. Based on the given problem, the difference of 35% and 30% is the ____________.
3. If 30% is the lower limit, 40% is the ________.
4. If 35% of the vote is the average vote that an independent candidate will
receive, and a limit of 30% and 40% where does the average (also known as
mean) lies? ______
5. What would be the formula in finding the upper limit? ____________
6. What would be the formula in finding the lower limit? ____________
7. If a problem has 95% confidence level and an average of 45 with a lower limit
of 40 and upper limit of 50 and margin of error of 5, the confidence interval is
(40,50) or 40<𝑋̅<50. Following this premise, what would be the confidence
interval of the given problem above?
Questions to ponder:
1. Are you able to complete the preamble?
_________________________________________________________________________
_________________________________________________________________________
2. What do you think is the symbol for the margin of error?
_________________________________________________________________________
_________________________________________________________________________

3. What do you think is the symbol for mean?


_________________________________________________________________________
_________________________________________________________________________
4. What is the impact of identifying the confidence interval on SSG election?
_________________________________________________________________________
_________________________________________________________________________
5. If you are one of the independent candidates, what will be your reaction to the
result of the survey?
_________________________________________________________________________
_________________________________________________________________________
What is It
What is the difference between the Confidence Level and Confidence interval?

The Confidence level of an interval estimate of a parameter is the probability


that the interval estimate contains a parameter, it describes what percentage of
intervals from many different samples contains the unknown population parameter.

297
The confidence level has its corresponding coefficient which is called
confidence coefficients. These coefficients are used to find the margin of error, for
instance, the table below shows the corresponding coefficient confidence level
.
Confidence 99% 98% 96% 95% 92% 90% 85% 80% 70%
Level 0.99 0.98 0.96 0.95 0.92 0.90 0.85 0.80 0.70
Zc 2.58 2.33 2.05 1.96 1.75 1.645 1.44 1.28 1.04

Confidence interval or interval estimate is a range of values that is used to estimate


a parameter. This estimate may or may not contain the true parameter value.
For instance, we write it in this form
Lower limit < 𝜇< Upper limit
Or
(Lower limit, Upper Limit)
The Lower limit is obtained by using the formula LL= 𝑿 ̅ − 𝑬, while the Upper limit
is obtained by using the formula UL= 𝑿 ̅ + 𝑬, where E is the Margin of Error and 𝑿 ̅
is the sample mean.
As mentioned earlier, the confidence coefficient is used on finding the margin of
error, which is the range of values above and below the sample statistic. For
instance, Margin of error is obtained using the formula:
𝝈
𝑬 = 𝒁𝜶/𝟐 •
√𝒏
where, n = sample size
𝑧𝛼/2 = confidence coefficient
𝜎 = population standard deviation
E = margin of error
But with this lesson, the margin of error will be given as well as the sample
mean.
Example:
A random sample of 46 scores from the examination of ABM learners is
taken and it gives a sample mean of 78 with the interval scores between 77.18 and
78.82 having a 90% level of confidence.
Let’s answer the questions!
 Which of the following is the 𝑥̅ in the given statement?
Since it is given in the statement above, the sample mean is 78.
 What is the upper limit? What is the lower limit?
The upper limit is 78.82 while the lower limit is 77.18
 What is the margin of error in the given statement?
As we can see, the Margin of error is not directly mentioned, but the
lower limit and upper limit is there. As mentioned earlier the formula of
the upper limit and the lower limit includes the Margin of error.
LL= 𝑿̅–E
The Lower limit and mean are given so…
77.18 = 78 – E
E = 78-77.18
E = 0.82
Let’s see if we can get the same value of E if we use the formula for the
upper limit.
78.82 = 𝑿 ̅+E
The Lower limit and mean are given. So, we have
78.82 = 78 + E
E = 78.82-78
E = 0.82

298
Therefore, the margin of error is 0.82
 What is the confidence interval in the given statement? To find the
confidence interval, we have to use Lower limit < 𝝁< Upper limit and
substitute the given data. We have,
77.18 < 𝝁< 78.82 or (77.18, 78.82)
So, the Confidence interval is between 77.18 and 78.82.
 What is the confidence level? How will you conclude?
The confidence level is 90%. So, we are 90% confident that the mean score
lies between 77.18 and 78.82.
Note: Sometimes, you just need to convert the formula to find what is missing.
What’s More
In the following activities, read each situation carefully to solve each problem. Write
your answer on a separate sheet of paper.
Practice Activity 1

Online selling

An online seller of yema cake, which is very popular in Tayabas Quezon surveyed
several customers. Fifty-two percent (52) of the customers were satisfied with the
services that were offered with a 3.98% margin of error. Determine the confidence
interval using this information.

The average percentage of people who are satisfied with the product is
_______________.
The lower limit of confidence interval is _________________.
The upper limit of confidence interval is _________________.
The confidence interval is _______________________.

Independent Assessment 1

Online Schooling
You are planning to buy a cellphone with a good quality in order to attend
the online class. The average price of 50 cellphones is Php13,500 with a margin of
error of Php 273.65 and a confidence level of 99%.

What is the lower limit? _______________________


What is the upper limit? _______________________
What is the confidence interval? ________________
If you have ₱13,100.00, is the amount lies within the confidence interval? _________

Independent Assessment 2

Colored Wheel
A survey on 200 Senior High School learners on how much time they read per
week has a confidence interval of (13.5, 17.9) in hours at 95% confidence level.

What is the margin of error? _________________________


What is the average time they read per week? __________________

299
What I Have Learned
A. Complete the following statements by writing the correct word or words and
formulas.
1. The ______________________of an interval estimate of a parameter is the
probability that the interval estimate contains parameter, and it describes
what percentage of intervals from many different samples contains the
unknown population parameter
2.The confidence level has its corresponding coefficient which is called
________________. These coefficients are used to find the margin of error.
3. The ______________________ is a range of values that is used to estimate a
parameter. This estimate may or may not contain the true parameter
value.
3. The Lower limit is obtained by using the formula ____________while the Upper
limit is obtained by using the formula __________________.

What I Can Do
Make your own story!
Things to do:
Construct your own problem but put a little story on it using the data indicated
below, then answer it.
̅ = 𝟏𝟒𝟖
𝑿
𝑬 = 𝟎. 𝟗𝟏
95% Level of Confidence
Rubrics for the task:

CRITERIA

Accuracy of the solution 50%

Clarity and content of the problem 25%

Originality and creativity 25%

TOTAL 100%

Assessment
Multiple Choice. Choose the letter of the best answer. Write the chosen letter on a
separate sheet of paper.
1. Which of the following values and percents illustrates the confidence
interval?
A. 30.8
B. (35,39)
C. 92%
D. 92% and 94%

300
2. Which of the following symbols is used for the population standard
deviation?
A. E
B. μ or 𝑋̅
C. n
D. 
3. Which of the following symbol stands for the margin of error?
A. E
B. Za/2
C. n
D. 
4. For a 95% confidence level, the confidence coefficient is ± ____?
A. 1.645
B. 1.96
C. 2.33
D.2.58
5. What do you call the value resulting from adding the margin of error from
the sample mean?
A. point estimate
B. interval estimate
C. lower limit of the interval
D. upper limit of the interval
6. What do you call the value resulting from subtracting the margin of error
from the sample mean?
A. point estimate
B. interval estimate
C. lower limit of the interval
D. upper limit of the interval

For nos. 7 – 10:


A survey of 200 Senior High School learners shows that they read on average 18.5
hours per week with a margin of error of 2.2 hours at a 95% confidence level.
7. What is the sample mean?
A.200
B. 18.5
C. 15.7
D. 2.2
8. Which of the following values is the margin of error?
A.15.7
B. 2.2
C. 0.95
D. 0.05
9. Which of the following intervals is the confidence interval?
A. (18.5, 17.9)
B. (16.3, 20.7)
C. (135, 179)
D. (14.75, 16.65)
10. What is the upper limit of the interval?
A. 16.3
B. 17.9
C. 18.7
D. 20.7
E.

301
For nos 11 – 13:
A random sample of 36 scores from the examination of TVL learners is taken and
gives a sample mean of 65. At a 90% level of confidence, the interval estimate of the
population mean is between 64.18 and 65.82.
11. Which of the following values is the 𝑥̅ ?
A. 364.18
B. 67.18
C. 65
D. 64.82
12. Which of the following is the margin of error?
A. 0.65
B. 0.68
C. 0.82
D. 0.90
13. What is the length of the confidence interval?
A. (65, 68)
B. (641.8, 658.2)
C. (64.18, 65.82)
D. (6482, 6582)
14. It is a range of values that is used to estimate a parameter. This
estimate may or may not contain the true parameter value. It is called
A. confidence level
B. margin of error
C. confidence interval
D. level of significance
15. It is the probability that the interval estimate contains a parameter, and
it describes what percentage of intervals from many different samples
contains the unknown population parameter.
A. confidence level
B. margin of error
C. confidence interval
D. level of significance
Additional Activities
Martina reads that the average age of graduate students is 33 years old. She
wanted to estimate the mean age of the graduate students at her university, so
she took a random sample of 30 graduate students. She found that their mean
age was 𝑋̅=31.8 at a 95 % confidence level with 1.54 margin of error.

Questions:

1. What is the lower limit?


2. What is the upper limit?
3. What is the confidence interval?
4. Based on the interval you get, is it plausible that the mean age of
all graduate students at her university is also 33 years?

How well did you perform in this lesson?


Are you now ready for the next lesson?
If your answer is YES, I am HAPPY for you. Good luck and happy learning on
your next journey. 😊

302
Statistics and
Probability
Computing for the Length of the
Confidence Interval
Statistics and Probability
Computing for the Length of the Confidence Interval
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module

Writers: Reynante M. Velasco


Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

303
Week

8
What I Need to Know
Are you planning to put up your own business? How are you going to
determine whether you are gaining or losing in your desired business? How
important is the margin of error in the business sector or some real-life conditions?
Are you familiar with the length of the confidence interval and how it is applied in
true to life situations? What is a margin of error? The answers to these questions are
expected after studying this module. This will help you to understand, analyze, and
solve the given problems.
The content of this module will open our mind that statistics plays an
important role in our daily life situations. In this module, you will study how to
compute for the length of the confidence interval. The explanation and language used
are simple due to the diverse level of the learners. Each example has a step-by-step
process so that the learner can follow its flow and sequence.
After going through this module, you are expected to:

1. apply the formula in determining the margin of error; and


2. compute for the length of the confidence interval.
What I Know
Before studying this module, take this test to determine what you already know
about the topic covered.
Direction: Read and analyze each item carefully. Shade the entire circle for each
statement that corresponds to your answer.

A B C D 1. Which of the following formulas is used to get the margin


O O O O of error?
𝝈
A. 𝐸 = 𝑧𝛼⁄2 () C. 𝐸 = ( )
√𝒏
𝜎 𝑧𝛼⁄2
B. 𝐸 = 𝑧𝛼⁄2 ( ) D. 𝐸 = ( )
√𝑛 √𝒏

A B C D 2. Which of the following symbols is used for critical values?


O O O O A. E C. n
B. 𝑧𝑐 D. 

A B C D 3. Which of the following symbols stand for the margin of


O O O O error?
A. n C. E
B. 𝑧𝛼⁄2 D. 

A B C D 4. Which of the following symbols is used for the population


O O O O standard deviation?
A. E C. n
B. 𝑧𝛼⁄2 D. 

304
A B C D 5. Given: n=60,  =0.7, level of confidence is 96%. What is
O O O O the margin of error in the given data?
A. 0.118 C. 0.158
B. 0.151 D. 0.185

A B C D 6. What is the critical value of having a 95% confidence level?


O O O O A. 1.28 C. 1.96
B. 1.44 D. 2.10

For items 7 – 9:
A sample size of 50 has a mean of 20. The population standard deviation is 18 and
the confidence level to be used is 95%.

A B C D 7. What is the sample size in the given statement?


O O O O A. 18 C. 50
B. 20 D. 98

A B C D 8. What is the margin of error in the given statement?


O O O O A. 3.67 C. 5.93
B. 4.99 D. 6.22

A B C D 9. What is the length of the confidence interval?


O O O O A. 1.44 C. 7.76
B. 5.45 D. 9.98

A B C D 10. Which of the following values is the critical value of a 98%


O O O O level of confidence?
A. 1.44 C. 2.10
B. 1.96 D. 2.33

For items 11 – 13:

A random sample of 150 stores of Atimonan Mall has an average daily income of
Php 25,000. The population standard deviation is Php 12,000 and a 95% confidence
level is to be used.

A B C D 11. What is the 𝑥̅ in the given statement?


O O O O A. 150 C. 25,000
B. 12,000 D. 37,000

A B C D 12. What is the 𝜎 in the given statement?


O O O O A. 150 C. 25,000
B. B. 12,000 D. 37,000

305
A B C D 13. What is the length of the confidence interval for the
O O O O average daily income of all stores in Atimonan Mall.
A. 3840.80 C. 3240.65
B. 3480.98 D. 3080.23
For items 14 – 15:
The average weight of 100 sold chicken is 150 kilos. The standard deviation of the
weights of all the chicken in the store is 90 kilos and an 85% level of confidence is to
be used.

A B C D 14. Which of the following values is the Zc for the given


O O O O confidence level?
A. 1.44 C. 1.75
B. 1.645 D. 1.96

A B C D 15. What is the length of the confidence interval?


O O O O A. 25.92 C. 29.96
B. 26.73 D. 32.90

Lesson 1 Computing for the Length of the


Confidence Interval
We are enjoying many things at present because of the past. Computers,
gadgets, foods, cars, medicines, and other things came up into reality because of
research which plays an important role in our life.
Computing for the length of the confidence interval is an important part of the
research. As much as possible, we should use a high level of confidence in estimating
our parameters to give us correct information that helps us build stronger tomorrow.

What’s In
Let us have a recall!
Use a calculator to simplify the following numbers, then round off your answer:
35
1. ( ) = ___________________ (nearest thousandths)
√150
2. (4.26)(2.432) = ___________________ (nearest hundredths)
3. 350 + (1.45)(30.875) = ___________________ (nearest hundredths)
4. 200 + (3.63)(42.236) =___________________ (nearest whole number)
50
5. 350 + (1.45) ( ) =___________________ (nearest whole number)
√250

306
What’s New

Business Matters

Read and analyze carefully the problem and answer the questions that follow.
Businessperson Jennifer Velasco uses an analog weighing scale in her fruit
store. She earns ₧45.00 per kilo in selling lanzones. One time, she managed to use
a digital scale and found out that she is giving a 1.1 kilo instead of 1 kilo. She also
found out that the digital scale is more accurate than the analog weighing scale.
Questions to ponder:
1. If you are Jennifer, would you rather use an analog than digital scale?
Why? ________________________________________________________________________
2. Out of 40 boxes that she has, 15 boxes weighs using analog scale and the rest is
by using the digital scale, how many boxes of lanzones she weighs correctly?
______________________________________________________________________________
3. How much kilo of lanzones did she lost in all?___________________________________
4. How much money did she earn for the correct weight of lanzones? ________________
5. How much money did she lost for weighing lanzones incorrectly?_________________
6. How much should be the earnings of Jennifer if she weighs all the 40 packs
correctly? _____________________________________________________________________

What is It
Confidence level is the likelihood measure of the confidence interval that is
represented by a percentage that refers to all possible samples that can be estimated
to contain the true population parameter. The most common values of the level of
confidence are shown in the Zc table below. Confidence coefficient or critical values
expressed as zc.
Confidence interval is also called the interval estimate. It is a range of values
that is used to have an approximate boundary or parameter. However, this
approximate may or may not contain the correct or true parameter value.
Level of Confidence and Zc Table
Confidence 99% 98% 96% 95% 92% 90% 85% 80% 70%
Level 0.99 0.98 0.96 0.95 0.92 0.90 0.85 0.80 0.70
Zc 2.58 2.33 2.05 1.96 1.75 1.645 1.44 1.28 1.04

The margin of error is the range of values above and below the given statistical
number or sample in a confidence interval. To compute for the margin of error, use
the formula given below:
𝝈
E = 𝒛𝜶/𝟐( )
√𝒏

307
Where, 𝒛𝜶/𝟐 means the critical values or confidence coefficients,  is the
population standard deviation and n as the sample size.

Consider the given example below.


Example 1: Isabel owns a shoe store. She used 160 pairs of shoes as her samples
for the different designs. The population standard deviation of the price of the shoes
is ₱75. Suppose that Isabel wants a 95% level of confidence to determine the mean
price of all her shoes she is selling. Compute for the margin of error of her estimate.
Solution:
Step 1: Write the given data.
n = 160  = ₱75
95% confidence level where zc = 1.96
Step 2: Apply the formula and substitute the given data.
𝜎
E = zα/2 ( )
√𝑛
75
E = (1.96)( )
√160
E = (1.96)(5.929) (use three decimal places for partial answer)
E = 11.62 (round off the final answer in two decimal places)
COMPUTING THE LENGTH OF CONFIDENCE INTERVAL
Computing for the margin of error and confidence interval were discussed in
the previous module. We will use the formulas presented in the previous module to
compute the length of the confidence interval.
Recall that the confidence interval can be written in the form of
Lower limit < 𝑋̅< Upper limit
or
(Lower limit, Upper Limit)

The lower limit is obtained by using the formula LL= 𝑿 ̅ − 𝑬, while the upper limit
is obtained by using the formula UL= 𝑿 ̅ + 𝑬, where E is the Margin of Error and 𝑿 ̅
is the sample mean.
Meanwhile, the margin of error is obtained using the formula:
𝝈
𝑬 = 𝒁𝜶/𝟐 ( )
√𝒏
where, n = sample size 𝑧𝛼/2 = confidence coefficient

𝜎 = population standard deviation E = margin of error


Therefore, the length of the confidence interval (L) is simply:
L = UL – LL
where, UL is the upper limit and LL is the lower limit of the confidence interval.
However, if the sample mean is not given, we cannot compute for the upper
and lower limit of the confidence interval, the following formula can be used:
𝝈
L = 𝟐𝒛𝜶/𝟐 ( ) or L=2E
√𝒏

where E is the margin of error, 𝑧𝛼/2 is the confidence coefficient, 𝜎 is the population
standard deviation, n is the sample size and 𝑛 0

308
Example 2: The population of Sulu Hornbill (one of the endangered bird species in
the Philippines) has a standard deviation of 40. Compute for the length of the
confidence interval for a 90% confidence level having a sample size of 150 and a
sample mean of 65.

Solution:
Step 1: Write the given data.
n = 150 𝑥̅ = 65  = 40 90% confidence level where zc = 1.645
Step 2: Compute for the margin of error.
𝜎
E = 𝑧𝛼/2 ( )
√𝑛
40
E = (1.645)( )
√150
E= (1.645)(3.266) (use three decimal places for partial answer)
E = 5.37(round off final answer to two decimal places)
Step 3: Compute for the upper and lower limit of the confidence interval.

̅ +𝑬
UL = 𝑿 LL = 𝑿̅−𝑬
= 65 + 5.37 = 65 − 5.37
= 𝟕𝟎. 𝟑𝟕 = 𝟓𝟗. 𝟔𝟑
Step 4: Compute for the length of the confidence interval.

L = UL – LL
= 70.37-59.63
= 10.74
Knowing that the length of the confidence interval is just twice the margin of error,
𝝈
a shorter solution can be used using the formula: L = 𝟐𝒛𝜶/𝟐( ) = 2E. Therefore, L =
√𝒏
2 (5.37) =10.74
Example 3: Jennifer wanted to know the average price of shoes that her customer
purchased. She sampled 160 pairs of shoes that were sold and found out that the
mean average price is ₱800 with a standard deviation of ₱75. Construct a 95%
confidence interval for the mean price of all shoes that were sold. Compute for the
length of the confidence interval.

Solution:
Step 1: Write the given data,

n = 160 𝑥̅ = ₱800  = ₱75 95% confidence level where zc = 1.96


Step 2: Compute for the margin of error,
𝜎
E = 𝑧𝛼/2 ( )
√𝑛

75
E = (1.96)( )
√160

E= (1.96)(5.929) (use three decimal places for partial answer)

E = 11.62 (round off final answer to two decimal places)

309
Step 3: Compute for the upper and lower limit of the confidence interval.

̅+𝑬
UL = 𝑿 LL = 𝑿̅−𝑬
= 800 + 11.62 = 800 − 11.62
= 811.62 = 788.38
Step 4: Write the confidence interval.
788.38<𝜇 < 811.62 or (788.38, 811.62)
This means that we are 95% confident that the true mean lies between 788.38 and
811.62. In the context of this problem, Jennifer can state that she is 95% confident
that the average price of a pair of shoes purchased by her customers lies between
₱788.38 to ₱811.62 or ₱788 to ₱812 when rounded to the nearest peso.

Step 5: Compute for the length of the confidence interval.


L = UL – L
= 811.62 – 788.38
= 23.44

Or since 𝐸 = 11.62, 𝐿 = 2(11.62) = 23.24

What’s More
In the following activities, read each situation carefully to solve each problem. Write
your answer on a separate sheet of paper.

Independent Activity 1:

A random sample of size 70 has a standard deviation of 15. What is the


margin of error for a 92% confidence level?

Write all the given data based on the problem statement and show the
complete solution.
n = __________ formula to be used: ________________
 = __________ solution:
zc = __________

Independent Assessment 1:

A random sample of size 100 has a standard deviation of 25. What is the
margin of error for a 90% confidence level?

Independent Activity 2:
Compute the length of the confidence interval for estimating the population
mean using a sample size of 300 and with a standard deviation 84. Use a 92%
confidence level.

Write all the given data based on the problem statement and show the
complete solution.
n = __________ formula to be used: ________________
 = __________ solution: zc = __________

310
Independent Assessment 2:
Compute the length of the confidence interval with a sample size of 240 and
with a standard deviation 60 having an 80% confidence level.

Independent Activity 3:
The average price of 350 cellphones is ₧13,500 with a sample standard deviation
of ₧750.
a. Find the 99% confidence interval of the true average price of the
cellphones. Interpret your answer in the context of this problem.
b. Find the length of the confidence interval.
Write all the given data based on the problem statement and show the
complete solution.
n = __________ formula to be used: a. ________ b. _________
 = __________ solution:
𝑥̅ =___________
zc = __________

Independent Assessment 3:

The average price of 60 Philippine-made shirts is ₧150 and the population


standard deviation is ₧55. Find the length of the confidence interval of the true
average price of the shirts using a 95% confidence level.
Compare your answers with those found in the Answer Key on the last part of
this module. Did you get all items correct? Great! If you did, you learned the lesson
well. If you were not able to get the correct answer, review your computations.

What I Have Learned


A. Complete the following statements by writing the correct word or words and
formulas.
1. ________________ is the likelihood measure of the confidence interval that is
represented by a percentage that refers to all possible samples that can be
estimated to contain the true population parameter.
2. Confidence coefficient or ______________ is expressed as z c.
3. ___________________ is a range of values that is used to estimate a parameter.
4. To compute for the margin of error, use the formula ______________________
5. 𝑧𝛼⁄ means the ____________________ or _____________________.
2
6.  is a symbol for __________________.
7. n is the symbol used for the __________________________.
8. To compute the length of confidence interval, there are two formulas to
consider. These are: __________________ and _______________________.
9. To compute for margin of error, the formula to be used is _________________.
10. Complete the Level of Confidence and Zc Table

Confidence 99% 98% 96% 95% 92% 90% 85% 80% 70%
Level
Zc 2.58 2.05 1.75 1.44 1.04

311
What I Can Do
HAVE CONFIDENCE AND MAKE A MATH VLOG!

Materials needed:

Cellphone
Short bond paper
Calculator
Pen or pencil

Things to do:

1. Make your problem statement about computing the length of the


confidence interval. Use the two formulas for the comparison of the
answer.
2. Write down all the data and write your solution in a short bond paper.
3. Create a video using your cellphone while writing the data and the
solution.
4. Make sure to capture the explanations in your video as you write your
data and solution.
5. Focus only on writing the solution during the video filming. (showing
your face is not necessary). However, make your vlog unique.

VLOG CRITERIA
Accuracy of the solution 50%
Clarity and content of the problem 25%
Originality and creativity 25%
TOTAL 100%

Assessment
Direction: Read and analyze each item carefully. Shade the entire circle for each
statement that corresponds to your answer.

A B C D 1. What is the length of the confidence interval?


O O O O A. E B. 𝑧𝑐 C. n D. 

A B C D 2. Which of the following symbols is used for the population


O O O O standard deviation?
A. E B. 𝑧𝛼⁄ C. n D. 
2

312
A B C D 3. Which of the following symbols stand for the sample size?
O O O O A. E B. 𝑧𝛼⁄ C. n D. 
2

A B C D 4. Which of the following formulas is used to find L of the


O O O O confidence interval?
𝜎 1
A. L= 2𝑧𝛼⁄ ( ) C. L = 2𝑧𝛼⁄ ( )
2 √𝑛 2 √𝑛
𝜎 𝜎
B. L = 2𝑧𝛼⁄ ( ) D. L= 2( )
2 √𝑥 √𝑛

A B C D 5. Which of the following formulas is used to find the margin


O O O O of error?
𝜎 𝑥
A. E = 𝑧𝛼⁄ ( ) C. E = 𝑧𝛼⁄ ( )
2 √𝑛 2 √𝑛
𝑡 𝜎
B. E = 𝑧𝛼⁄ ( ) D. E = 𝑧𝛼⁄ ( )
2 √𝑛 2 √𝑠

A B C D 6. Given: n=130,  = 25, level of confidence is 92%. What is


O O O O the margin of error in the given data?
A. 3.73 B. 3.84 C. 3.99 D. 4.25

(for items, 7 – 8) Edward owns a meat store and he wants to find out the average
amount of his daily sale. He randomly recorded his sales for 16 days and found the
average to be ₱ 8,500. It is known from last year’s record that the standard deviation
of his sales is ₱150 and a 99% level of confidence is to be used.

A B C D 7. What is the sampling size in the given problem?


O O O O A. 150 B. 100 C. 50 D. 16

A B C D 8. What is the margin of error for his estimate?


O O O O A. 96.75 B. 58.7 C. 48.7 D. 38.7

A B C D 9. Given: n= 200,  = 35, and a 96% confidence level. Which


O O O O of the following values is the length of the confidence
interval?
A. 10.15 B. 14.54 C. 15.23 D. 18.25

A B C D 10. Given a sample size of 125 and a population standard


O O O O deviation of 88 having an 80% level of confidence. Which
of the following values is the length of the confidence
interval?
A. 16.15 B. 18.15 C. 20.15 D. 24.15

313
(For items, 11 – 12) Given a sample size of 200 with a mean of 80 and the population
standard deviation is found to be 35 having a 96% confidence level.

A B C D 11. Which of the following values is the critical value or


O O O O confidence coefficient to be used?
A. 1.96 B. 2.05 C. 2.33 D. 2.58

A B C D 12. What is the length of the confidence interval?


O O O O A. 10.15 B. 14.54 C. 15.23 D. 18.25

(For items, 13 – 14) The average amount by purchasing products of 255 customers
in JYEL store is ₧150. The population standard deviation is ₧65 with an 85% level
of confidence.

A B C D 13. Which of the following values is the critical value to be


O O O O used in the given problem?
A. 1.44 B. 1.64 C. 1.75 D. 1.96

A B C D 14. The customers of JYEL store purchases products in the


O O O O average amount _________.
A. between ₧189.87 to ₧194.67
B. between ₧160.23 to ₧178.45
C. between ₧154.87 to ₧162.49
D. between ₧144.14 to ₧155.86

A B C D 15. The average weight of sold tilapia in 205 stores is 100


O O O O kilos. The population standard deviation is 80 kilos. Find
the length of the confidence interval for a 95% level of
confidence.
A. 30.45 B. 29.67 C. 27.38 D. 21.90
Additional Activities
Study and analyze each situation. Apply the formula to solve the stated problems.

1. In a sample of 100 cars of a population having a standard deviation on the speed


of 25km/h, find the margin of error if the level of confidence to be used is 90%.
2. Jennifer has sampled 300 persons for her survey project. If she uses a 99%
confidence level with a population standard deviation of 400, help her to
compute for the length of the confidence interval that she needs.
3. Mr. Velasco wanted to know the average weight (in kilos) of students at LAO
University. He took a random sample of 300 students and discovered that their
average weight is 55kg. The population standard deviation is 0.8kg. Construct
a 95% confidence interval for the mean weight of students in the University.

How well did you perform in this lesson?


Are you now ready for the next lesson?
If your answer is yes, I am HAPPY for you! Good luck and happy learning on your
next journey!

314
Statistics and
Probability
Computing Appropriate Sample
Size Using the Length of the
Interval
Statistics and Probability
Computing Appropriate Sample Size Using the Length of the Interval
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module

Writers: Dhojie N. Yanto, Jerson Jolo and Reynante M. Velasco


Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Ronnjemmele A. Rivera
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

315
Week

8
What I Need to Know
This module was designed and written collaboratively to help you in
computing for an appropriate sample size using the length of the interval.
Understanding how to determine the appropriate sample size will be a great help in
conducting a research study. A sufficient number of samples will give you enough
and unbiased pieces of evidence to claim about the parameter.
These will serve as your guide in computing for a sufficient number of
respondents needed in your investigation about a certain topic of your interest and
coming up with a reliable result. The scope of this module permits it to be used in
many different learning situations. The language used recognizes the diverse
vocabulary level of students. The lesson is arranged to follow the standard sequence
of the course.
In this module, you will compute for an appropriate sample size using the
length of the interval.
The module consists of only one lesson entitled Computing Appropriate
Sample Size Using the Length of the Interval.
After going through this module, you are expected to:
1. derive the formula for computing the appropriate sample
size; and
2. compute for an appropriate sample size using the length of
the interval.
What I Know
Let us see how far you will go and how much you know about this pretest.
Please answer all questions with all your best and utmost honesty so that we can
gauge your learning in the target competency of this lesson.
If you obtain 100% or a perfect score, you may skip the module and
immediately move to the next module. While in the event you missed a point, please
proceed with the module as it will enrich your knowledge in computing for an
appropriate sample size using the length of the interval. Let’s get started!
Direction: Read and analyze each item carefully. Choose the letter of the best
answer. Write the chosen letter on a separate sheet of paper.
1. Which of the following coefficients are the 90% confidence coefficients?
A. ±1.645
B. ±1.96
C. ±2.40
D. ±2.58
2. What do you call a subset of a population?
A. estimation
B. interval
C. proportion
D. sample
3. Which of the following values illustrates confidence level?
A. 90 - 100
B. 65.0
C. 90%
D. 35.75
4. When alpha is 0.05, confidence level is equal to _________.
A. 10%
B. 95%
C. 90%
D. 99%

316
5. Confidence interval is also known as _____________________.
A. interval estimate
B. confidence coefficient
C. interval coefficient
D. confidence interval
For numbers 6-11, refer to the following values:
90% confidence level, 𝜎 = 3.5 and 𝐸 = 0.5.
6. What is the given confidence level?
A. 90%
B. 0.10
C. 0.01
D. 0.05
7. What is 𝛼?
A. 95%
B. 0.10
C. 0.01
D. 0.05
8. What are the confidence coefficients?
A. ±1.645
B. ±1.96
C. ±2.20
D. ±2.58
9. Which of the following values is the given population standard deviation?
A. 0.01
B. 3.5
C. 0.5
D. 90%
10. What is the margin of error?
A. 0.01
B. 3.5
C. 0.5
D. 90%
11. Compute for the appropriate sample size.
A. 131
B. 132
C. 133
D. 134
12. What is the sample size needed if the margin of error is 30 and the standard
deviation of 45 with a 98% confidence level?
C. 10
D. 11
E. 12
F. 13
13. Calculate the sample size of the given data: 𝐸 = 5,  = 20, with a 95%
confidence level.
A. 59
B. 60
C. 61
D. 62

317
14. Which of the following variables is NOT needed in computing the minimum
sample size for the estimation of the population mean?
A. confidence level
B. sample mean
C. population standard deviation
D. margin of error
15. Which of the following formulas is used is used to get the sample size of the
population?
𝑍𝛼⁄ 𝐸
A. 𝑛 = ( 2
)2
𝜎
𝑍𝛼⁄ 𝐸
B. 𝑛 = ( 2
)2
𝜎
𝐸𝜎 2
C. 𝑛 = ( )
𝑍𝛼⁄
2

𝑍𝛼⁄ 𝜎
D. 𝑛 = ( 2
)2
𝐸

Lesson 1 Computing Appropriate Sample Size


Using the Length of Interval
In the previous lessons, you learned how to identify and compute for the
length of the confidence interval. This time, using the length interval, you will learn
how to compute for an appropriate sample size.
Your knowledge in determining the sample size is essential particularly in
conducting a research study. A researcher does not just declare a number to
represent the number of respondents he/she wants to include in the investigation.
An appropriate sample size and sampling techniques must be implemented to come
up with the best estimate of the parameter. Therefore, it is one of the requirements
so that the findings of the study can be generalized to every member of the
population.

What’s In
LET’S REVIEW
Direction: Use your calculator to evaluate the following, then round off your answer
as indicated.
40
1. 65 − (1.645) ( ) = _________ (rounded to the nearest hundredths)
√150
15
2. (2.58) ( ) = _________________(rounded to the nearest tenths)
√70
3. (1.96)2 = _________________ (rounded to the nearest thousandths)
4. (1.96) 2 (2.5)2
=_________________(rounded to the nearest whole number)
(1.96)2 (25)2
5. = __________________ (round up the nearest whole number)
52
B. Complete the table below given the sample data and confidence level. Determine
the confidence coefficient, margin of error, and the confidence interval.
𝑛 = 280, 𝑥̅ = 70, 𝜎 = 20 𝑎𝑛𝑑 95% 𝑐𝑜𝑛𝑓𝑖𝑑𝑒𝑛𝑐𝑒 𝑙𝑒𝑣𝑒𝑙
Lower Confidence Upper Confidence
Confidence Margin of Error Limit Limit
𝝈
coefficient 𝑬 = 𝒛𝜶 ∙ 𝝈 𝝈
𝟐 √𝒏 ̅ − 𝒛𝜶 ∙
𝑳𝑳 = 𝒙 ̅ + 𝒛𝜶 ∙
𝑼𝑳 = 𝒙
𝟐 √𝒏 𝟐 √𝒏

318
How was the review? Did you answer all the items? I believe you got it all correctly.
You are now ready to learn new concepts. Good luck!

What’s New
Priming Activity:
Analyze the steps provided in the example below to solve for the variable x in terms
of y. Fill in the blanks with the correct mathematical expressions in the next part to
derive the formula for computing the appropriate sample size.

3
Example: Given the equation 𝑦 = where 𝑥 ≥ 0
√𝑥
√𝑥
Step 1: multiply both sides of the equation by
𝑦
where √𝑥 ≥ 0 and 𝑦 > 0
√𝑥 √𝑥 3
(𝑦) = ( )
𝑦 𝑦 √𝑥
3
√𝑥 =
𝑦

Step 2: Square both sides of the equation


3
(√𝑥)2 = ( )2
𝑦

3
𝑥 = ( )2
𝑦
Step by step sample size formula derivation:
Step 1: The formula for the Margin of error is_________________
√𝑛
Step 2: Multiply both sides of the equation by , where 𝑛 ≥ 0 and 𝐸 ≠ 0
𝐸
Step 3: Square both sides of the equation______________

How was the activity? Where you able to get the correct expression in each
step? If yes, what helps you solve each required expression? If no, what hinders you?
What difficulty did you encounter during the activity? What mathematical concepts
have you applied to do the activity? The formula you just derived will be used in
computing for appropriate sample size using the length of the interval.
What is It
To make an accurate estimate of the parameter, it is vital to determine the
appropriate size of the sample. The factors that affect the sample size are the
maximum error of estimate, the population standard deviation, and the confidence
level.
You must also consider the table below in computing for an appropriate
sample size.

Level of Confidence of 𝑧𝛼 Table


2

99% 98% 96% 95% 92% 90% 85% 80% 70%


Confidence
Level
0.99 0.98 0.96 0.95 0.92 0.90 0.85 0.80 0.70

𝑧𝛼 2.58 2.33 2.05 1.96 1.75 1.645 1.44 1.28 1.04


2

319
EXAMPLE 1.

A HUMSS student is conducting a research on the awareness of SHS


learners on the names of the Philippine Presidents. Given a margin of error of
4 with a standard deviation of 9 having a 99% confidence level. What should
be the minimum sample size that the student-researchers need to estimate
the population mean?
SOLUTION:
Let us identify the given in the problem:
 Confidence level is 99% then 𝑧𝛼 = 2.58
2
 𝐸=4
 𝜎=9
𝑍𝛼⁄ 𝜎
Use the formula: 𝑛=( 2
)2
𝐸
(2.58)(9)
Substitute the given: 𝑛 = [( )]2
4
Evaluate: 𝑛 ≈ 33.70
Round off to the nearest whole number: 𝑛 = 𝟑𝟒

Therefore, to be 99% confident that the estimate differs from the true mean by 4,
the minimum sample size needed is at least 34.

Note: When determining sample size, we always round off the resulting value to the
next whole number.
EXAMPLE 2.

An Environmentalist is making a survey on the number of households


that applies the principles of Waste Management in a certain community.
Given the standard deviation of 2.6, how large a sample must be selected if
we want to be 95% confident of finding whether the true mean differs from
the sample mean by 1.5 hour?

SOLUTION:
To answer the question, let us identify the given:
 For a 95% confidence level, 𝑧𝛼 = 1.96
2
 𝐸 = 1.5
 𝜎 = 2.6
𝑍𝛼⁄ 𝜎
Use the formula: 𝑛=( 2
)2
𝐸
(1.96)(2.6) 2
Substitute the given: 𝑛 = [ ]
1.5
Evaluate: 𝑛 ≈ 11.54
Round off to the nearest whole number: 𝑛 = 𝟏𝟐
Therefore, to be 95% confident that the estimate differs from the true mean by
1.5, we need a sample size of at least 12.
EXAMPLE 3.

Compute for the appropriate sample size if the margin of error is


6, with a sample standard deviation of 25 and a 90% level of confidence.

320
SOLUTION: Identify the given.
 For a 90% confidence level, 𝑧𝛼 = 1.645
2
 𝐸=6
 𝜎 = 25
𝑍𝛼⁄ 𝜎
Use the formula: 𝑛=( 2
)2
𝐸
(1.645)(25) 2
Substitute the given: 𝑛 = [ ]
6
Evaluate: 𝑛 ≈ 46.98
Round off to the nearest whole number: 𝑛 = 𝟒𝟕

Therefore, to be 90% confident that the estimate differs from the true mean by 6, we
need a sample size of at least 47.
EXAMPLE 4.

Compute for the appropriate sample size if the margin of error is 6,


with a sample standard deviation of 25 and a 92% level of confidence.

SOLUTION: Identify the given data,


 For a 92% confidence level, 𝑧𝛼 = 1.75
2
 𝐸=6
 𝜎 = 25
𝑍𝛼⁄ 𝜎
Use the formula: 𝑛=( 2
)2
𝐸
(1.75)(25) 2
Substitute the given: 𝑛 = [ ]
6
Evaluate: 𝑛 ≈ 53.17
Round off to the nearest whole number: 𝑛 = 𝟓𝟒
Therefore, to be 92% confident that the estimate differs from the true mean by 6, we
need a sample size of at least 54.

What’s More
Independent Activity 26.1

Complete Me

Complete the table below by supplying the needed information and solve for the
appropriate sample size n.
1. Confidence level 99%,
Population standard deviation = 30,
margin of error = 2.5

2. Confidence level 90%,


Population standard deviation σ = 27,
margin of error = 7.5

3. Confidence level 95%,

321
Population standard deviation σ = 9,
margin of error = 4.5

4. Confidence level 92%,


Population standard deviation σ = 45,
margin of error = 3

5. Confidence level 98%,


Population standard deviation σ = 30,
margin of error = 2.5

No. Confidence 𝑍𝛼⁄ E 𝜎 n


2
Level
1

Independent Assessment 26.1

Solve Me
Determine the minimum sample size for each of the following:
a. 95% confidence level, E = 4.8 , σ = 30

b. 96% confidence level, E = 5 , σ = 45

c. 99% confidence level, E = 8.5 , σ = 55


Independent Activity 26.2
Solve Me
𝑍𝛼⁄ 𝜎
Using the formula for computing the appropriate sample size, 𝑛 = ( 2 )2 derive the
𝐸
formula for the margin of error (E) and population standard deviation(𝜎).

Independent Assessment 26.2


Solve Me
1. A random sample of size 70 was used with a population standard deviation
of 15. Assuming that there is a 92% confidence level, what is the margin of
error?
𝑛 = __________
 = __________
𝑍𝛼⁄ =__________
2
formula to be used: ________________
solution:

322
2. Compute for the appropriate sample size needed if the margin of error is 15,
with a population standard deviation of 43 and a 98% level of confidence.
𝐸 = __________
 = __________
𝑍𝛼⁄ =__________
2

formula to be used: ________________


solution:

What I Have Learned


Let us summarize what you have learned by completing the following table. Write the
correct answer in a separate sheet of paper.

Write the formulas here.


Margin of Error Minimum Sample Size

𝐸 = ________________ 𝑛 = ________________

where:
𝑛= ________________________
𝐸= ________________________
𝜎= ________________________
𝑍𝛼⁄ = ________________________
2

What I Can Do
MAKE YOUR SURVEY NOW!

Materials needed:
Bond paper, calculator, pen and pencil

Things to do:
6. Make a problem statement anchored in any of the following issues:
a. Public Health like COVID – 19 pandemic
b. Climate change
c. Overpopulation
d. Gender Inequality
e. Bullying
f. Poverty
7. Identify a random variable related to your selected issue. For example, the
number of facemasks used for Public Health issues, volume of waste disposal
for climate change, the number of children in a family for overpopulation, etc.
8. Survey all of your classmates.

323
9. Compute for the population mean, standard deviation, and the margin of
σ
error. Use the formula: E = zα ∙ and a confidence level of 90%, 95%, and
2 √n
99%.
10. Compute for the appropriate sample size using the computed values in
𝑍𝛼⁄ 𝜎
number 4. Use the formula: 𝑛 = ( 2 )2 and a confidence level of 90%, 95%,
𝐸
and 99%.
11. Interpret your results in the context of your problem statement.
12. How important is sampling in the survey? Is sampling necessary in your
survey? Justify your answer.
13. What can you suggest solving the problem stated in your output?

SURVEY CRITERIA

Accuracy of the solution 50%

Clarity and content of the problem 25%

Originality and creativity 25%

TOTAL 100%

Assessment
Direction: Read and analyze each item carefully. Choose the letter of the best answer.
Write the chosen letter on a separate sheet of paper.

1. Which of the following coefficients are the 95% confidence coefficients?


𝐴. ±1.65 C. ±2.40
𝐵. ±1.96 D. ±2.58
2. When alpha is 0.01, confidence level is equal to _________.
A. 10% C. 90%
B. 95% D. 99%
3. Which of the following values illustrates confidence coefficients?
A. 99% C. 0.05
B. 1.96 D. 35.75
4. Confidence interval is also knowns as _____________________.
A. Interval estimate C. interval coefficient
B. confidence coefficient D. alpha
For numbers 5-11, refer to the following given values:
99% confidence level, 𝜎 = 7 and 𝐸 = 2.
5. What is the given confidence level?
A. 99% C. 0.01
B. 0.10 D. 0.05
6. What is 𝛼?
A. 95% C. 0.01
B. 0.10 D. 0.05
7. What are the confidence coefficients?
𝐴. ±1.645 C. ±2.20
𝐵. ±1.96 D. ±2.58
8. Which of the following is the given population standard deviation?
A. 2 C. 7
B. 2.58 D. 99%

324
9. What is the margin of error?
A. 2 C. 7
B. 2.58 D. 99%
10. Compute for the appropriate sample size.
A. 80 C. 82
B. 81 D. 83
11. If the confidence level is lowered to 90% what will be the appropriate sample
size?
A. 34 C. 36
B. 35 D. 37
12. What is the sample size needed if the margin of error is 30 and the standard
deviation of 45 with a 98% confidence level?
A. 16 C. 14
B. 15 D. 13
13. Calculate the sample size of the given data: E = 5,  = 20, with a 95%
confidence level.
A. 65 C. 63
B. 64 D. 62
14. Which of the following is NOT needed in computing the minimum sample size
for the estimation of the population mean?
A. sample mean C. margin of error
B. confidence level D. population standard deviation
15. Which of the following formulas is used to get the sample size of the
population?
𝑍𝛼⁄ 𝜎 𝐸𝜎 2
A. 𝑛 = ( 2
)2 C. 𝑛 = ( )
𝐸 𝑍𝛼⁄
2

𝑍𝛼⁄ 𝐸 𝑍𝛼⁄ 𝐸
B. 𝑛 = ( 2
)2 D. 𝑛 = ( 2
)2
𝜎 𝜎

Additional Activities
To deepen your knowledge on computing for the appropriate sample size using the
length of interval you can visit the following websites,
https://youtu.be/XO9NtHvRQTQ and http://youtu.be/uMILjv6Fs94.

325
Statistics and
Probability
Solving Problems Involving
Sample Size Determination
Statistics and Probability
Solving Problems Involving Sample Size Determination
First Edition, 2020

Republic Act 8293, section 176 states that: No copyright shall subsist in any work of the Government of the
Philippines. However, prior approval of the government agency or office wherein the work is created shall be necessary for
exploitation of such work for profit. Such agency or office may, among other things, impose as a condition the payment of royalties.

Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names, trademarks, etc.) included in this
module are owned by their respective copyright holders. Every effort has been exerted to locate and seek permission to use
these materials from their respective copyright owners. The publisher and authors do not represent nor claim ownership over
them.

Published by the Department of Education


Secretary: Leonor Magtolis Briones
Undersecretary: Diosdado M. San Antonio

Development Team of the Module

Author: Elmer C. Potes, Rolly Q. Suaverdez and Reynante M. Velasco


Editors: Celestina M. Alba and Nestor N. Sandoval
Reviewers: Irlo L. Dumo, Rey Mark R. Queaño and Ma. Theresa B. Yanto
Illustrator: Dhojie N. Yanto
Layout Artist: Edna E. Eclavea
Management Team: Wilfredo E. Cabral, Job S. Zape Jr., Elaine T. Balaogan, Fe M. Ong-ongowan, Catherine P. Talavera,
Gerlie M. Ilagan, Buddy Chester M. Repia, Herbert D. Perez, Lorena S. Walangsumbat, Jee-Ann O. Briones
Celestina M. Alba, Asuncion C. Ilao

Department of Education – Region IV-A CALABARZON

Office Address: Gate 2 Karangalan Village, Barangay San Isidro


Cainta, Rizal 1800
Telefax: 02-8682-5773/8684-4914/8647-7487
E-mail Address: region4a@deped.gov.ph

326
Week

8
What I Need to Know
This module was designed and written with you in mind on how you can solve
problems involving sample size determination. This will also help you analyze real-
life situated problems statistically in terms of relevant questions for you to better
understand them. Your adept at analysis will help you appreciate the richness, and
beauty of Statistics which will motivate you to apply to similar events and create
statistical measures of your own.

Your patience in solving offered problems here in the module will help you
improve your computational skills as it tackles relevant culture-based situated
problems. Your ability to interpret, reason–out, and make a judgment or even
decision out of statistical measures will also be practiced here. The scope of this
module permits it to be used in many different learning situations. The language
used recognizes the diverse vocabulary level of students. The lessons are arranged
to follow the standard sequence of the course. But the order in which you read them
can be changed to correspond with the textbook you are now using.

In previous modules, you learned that sample statistics can estimate the
population parameter. You also learned how to compute for an appropriate sample
size using the length of the interval. This time, you will focus on solving problems
involving sample size determination for you to better estimate the population
parameter.

After going through this module, you are expected to:


1. illustrate the factors which affect the sample size determination;
2. compute for the required sample size using the formula; and
3. solve problems involving sample size determination.

What I Know
Let us see how far you will go and how much you know about this pretest.
Please answer all questions with all your best and utmost honesty so that we can
gauge your learning in the target competency of this lesson. Let’s get started!
NOTE: For students with online connections and want to have a paperless
assessment, please go to the link given by your teacher.
Direction: Choose the letter of the correct answer for each item. Shade/blacken the
circle that corresponds to your chosen answer.
A B C D 1. Which of the following factors is not considered in the
O O O O determination of sample size?
A. margin of error
B. level of confidence
C. confidence interval
D. variance or standard Deviation
A B C D 2. The value of confidence coefficient (  ) of 1.645 is used
z
O O O O 2
in sample size determination if a researcher wants to be
_______% confident that the parameter is within the
estimate.
A 99 B. 95 C. 92 D. 90

327
A B C D 3. How does the level of confidence affect the sample size?
O O O O (Consider that other factors are constant)
A. The higher the level of confidence, the smaller the
sample size required.
B. The lower the level of confidence, the larger the sample
size required.
C. The lower the level of confidence, the smaller the
sample size required.
D. The sample size remains constant in every level of
confidence.
A B C D 4. Which of the following is the correct equation for sample
O O O O size determination?

2
(𝑍𝛼⁄ )+(𝜎)
A. n  x  z 2 ( ) C. 𝑛 = [ 2
]
𝐸
n
2
(𝑍𝛼⁄ )(𝜎)
B. n = 𝑋̅+ E D. 𝑛 = [ 2
]
𝐸

A B C D 5. How does the margin of error and sample size affect one
O O O O another? (Assume that the other factors are constant)
A. A smaller sample size reduces the margin of error.
B. A higher margin of error requires a larger sample.
C. A lower margin of error requires a larger sample size.
D. A small sample size results in a lower margin of error.

A B C D 6. How does standard deviation 𝜎 affect the sample size?


O O O O (Assume that the other factors are constant)
A. The sample size is constant for all values of 𝜎.
B. Lower 𝜎 reduces the required sample size.
C. Lower 𝜎 increases the required sample size.
D. Higher 𝜎 decreases the required sample size.

A B C D 7. A researcher wants his estimate with a minimum margin


O O O O of error. Which of the following is suggested?
A. Reduce the sample size.
B. Use a sample size of 30.
C. Increase the sample size.
D. Use a sample size of less than 25.

A B C D 8. How large is the sample size required so that a researcher


O O O O is 98% confident with the estimate of a sample, of
standard deviation 5, that the error will not be more than
2?
A. 6 B. 34 C. 42 D.
84

A B C D 9. If a 95% level of confidence is applied by the researcher in


O O O O the condition given in item # 8, what sample size shall be
obtained?
A. 8 B. 17 C. 24 D.
34

328
For questions 10 to 15, refer to the situation that follows.

The diameter of the coconuts harvested in a local farm in Quezon Province is


normally distributed with a population standard deviation of 8 mm. At a 95% level
of confidence, a researcher wants to have an estimate that will not be more than
3 mm from the TRUE population average diameter of the coconuts.

A B C D 10. What correct confidence coefficient should be applied to


O O O O determine the required sample size?
A. 2.576 B. 2.33 C. 1.96 D. 1.645

A B C D 11. What is the given population standard deviation from the


O O O O situation?
A. 8 B. 16 C. 24 D. 64

A B C D 12. What is the desired margin of error for the estimate of the
O O O O population parameter?
A. 2 B. 3 C. 4 D. 16

A B C D 13. What minimum sample size shall be used by a researcher


O O O O to meet the expected results? (Round up to the nearest
whole number)
A. 16 B. 28 C. 62 D. 107
A B C D 14. If the researcher wishes to reduce the error of the
O O O O estimate, what should be done?
I Consider a larger sample.
II Reduce the level of confidence
III Increase the level of confidence.
A. I only C. I and II only
B. II only D. I and III only

A B C D 15. What sample size is needed for the situation to have a


O O O O margin of error not more than 1? (Round to the nearest
whole number)
A. 87 B. 102 C. 246 D. 425

How do you find the pre-test? Did you encounter both familiar and unfamiliar
terms, symbols, and notations? Kindly compare your answer in the Answer Key on
the last part of this module.
If you obtain 100% or a perfect score, skip the module and immediately move
to the next module. While in the event you missed a point, please proceed with the
module as it will enrich your knowledge in the normal distribution, especially in
solving problems involving sample size determination
Lesson 1 Solving Problems Involving Sample
Size Determination
You can be a great problem solver. You may not be aware of it but from time
to time, you are dealing with different problems. And every time you are thinking of
any possible solution, you are actually solving it.
Solving problems involving sample size determination is just like any other
problem you are facing every day. In dealing with it, you need to identify all possible
information given on the problem. This is important because the solution depends

329
on the quantities you plug in to your formula. Of course, correct computation is
equally important.
This lesson will help you in analyzing and solving problems that involve the
sample size determination.

What’s In
Let’s have this first for a review.
Use the given information to compute the sample size.
1. 𝑍𝛼⁄ = 2.576, 𝜎 = 12 𝐸=3 n=?
2
2. 𝑍𝛼⁄ = 1.96, 𝜎 = 25 𝐸=2 n=?
2
3. level of confidence = 90% 4. level of confidence = 99%
standard deviation = 23 standard deviation = 15
margin of error = 5 margin of error = 4
sample size = ? sample size = ?
5. Compute for the sample size if the margin of error is 6, with a sample standard
deviation of 20 and a 95% level of confidence.
What’s New
In each of the situations below, identify the values of the following:
A. confidence level C. standard deviation, (𝜎)
B. critical value, (𝑍𝛼⁄ ) D. margin of error, (𝐸)
2
Situation 1
A researcher would like to estimate the mean weight loss of people one year
after using a specific weight loss program. How many people on the weight loss
program must be surveyed in order to be 95% confident that the sample mean weight
loss is within 0.75 pounds of the true population mean? Assume the population
standard deviation is known to be 10.4 pounds.
Situation 2
An investigator wants to estimate the mean birth weight of infants born full-
term to mothers who are 19 years of age and under. The mean birth weight of infants
born full-term to mothers 20 years of age and older is 3,510 grams with a standard
deviation of 385 grams. How many women 19 years of age and under must be
enrolled in the study to ensure that a 90% confidence on interval estimate of the
mean birth weight of their infants has a margin of error not exceeding 100 grams?
Situation 3
A medical practitioner wants to estimate the mean systolic blood pressure in pre-
school children with inborn heart disease. He plans on using a 99% confidence
interval and a margin of error of 5 units. The standard deviation of systolic blood
pressures in pre-school children with inborn heart disease is 20. How many children
should be enrolled in the study?

What is It
In the previous module, you equipped yourself with a formula in determining
the sample size. It is expected that you have learned very well the steps in using the
formula because you will still use it in this module. Also, in the previous lesson, the
values needed in the computation were already given. But since you are dealing with
worded problems in this module, you are going to identify first the given values from

330
the problem before you can use the formula. Hence, it is important to correctly
identify those values.
The Sample Size Determination in Estimating Population Mean
Let us imagine that you own a fast-food company. You are doing a survey to
get an idea of how long customers stay in a restaurant. In the past two years, it was
observed that the standard deviation of the time they stay in a restaurant is 12
minutes. How many people do you need to observe so that the desired margin of error
will not be more than three (3) minutes with a confidence level of 95%?
Let us identify first the given in the problem
Confidence Level = 95%
Critical Value (𝑍𝛼⁄ ) = 1.96
2
Margin of Error (𝐸) =3
Standard Deviation (𝜎) = 12

Then using the formula, you will have,


2
(𝑍𝛼⁄ )(𝜎) (1.96)(12) 2 23.52 2
𝑛= [ 2
] = [ ] = [ ] = [7.84]2 = 61.47
𝐸 3 3

The formula always gives you the minimum number of sample sizes needed to
ensure the specified precision. Hence, if you round down 61.47 to 61 (like what we
usually do in rounding off numbers), it will be less than the minimum. Therefore,
you should always round up whatever the decimal part is. Thus, the required
sample size is 62.
What if you wish to minimize the error to at most 2 (E ≤ 2)? Will it affect the
sample size? Let’s find out. Your given values will be
Confidence Level = 95%
Critical Value (𝑍𝛼⁄ ) = 1.96
2
Margin of Error (𝐸) =2
Standard Deviation (𝜎) = 12

Then using the formula, you will have,


2
(𝑍𝛼⁄ )(𝜎) (1.96)(12) 2 23.52 2
𝑛= [ 2
] = [ ] = [ ] = [11.76] 2 = 138.30
𝐸 2 2

Thus, the sample size required is 139.


Suppose the margin of error is further reduced to at most 1 (E ≤ 1), the
sample size will be
2
(𝑍𝛼⁄ )(𝜎) (1.96)(12) 2 23.52 2
𝑛= [ 2
] = [ ] = [ ] = [23.52] 2 = 553.19 or 554
𝐸 1 1

What does it imply? As the margin of error gets smaller, the sample size gets
larger. Or you can say that one way to minimize the error is by increasing the sample
size. The same variation will be applied if the margin of error increases.
Moreover, if you reduce the level of confidence to 90% it will surely affect the
sample size. Upon changing the level of confidence to 90%, the critical value will
change also.

331
Critical Value (𝑍𝛼⁄ ) = 1.645
2
Margin of Error (𝐸) =3
Standard Deviation (𝜎) = 12
Then using the formula to compute for the sample size, you have
2
(𝑍𝛼⁄ )(𝜎) (1.645)(12) 2 19.74 2
𝑛= [ 2
] = [ ] = [ ] = [6.58]2 = 43.33 or 44.
𝐸 3 3

If we further reduce the level of confidence to 80%, the sample size will
be computed as follows:
Confidence Level = 80%
Critical Value (𝑍𝛼⁄ ) = 1.28
2
Margin of Error (𝐸) =3
Standard Deviation (𝜎) = 12
2
(𝑍𝛼⁄ )(𝜎) (1.28)(12) 2 15.36 2
𝑛= [ 2
] = [ ] = [ ] = [5.128] 2 = 26.21 or 27.
𝐸 3 3

The changes in the level of confidence in the computation above tell you that
as the sample size gets smaller as your level of confidence decreases.
Factors Affecting Sample size Determination
There are three factors that influence the sample size determination; (1) the
level of confidence, (2) the population standard deviation, and (3) the margin of error.
Among these factors, only two are under the researcher’s control.
These are:
• Margin of Error. Since no sample will be perfect, you need to decide how much
error to allow. The confidence interval determines this. The less error you’re willing
to accept, the bigger the sample size needs to be.

• Confidence Level. This determines how certain you want to be that the parameter
falls within your confidence interval. Most common confidence intervals are 90%,
95% or 99%. The more confident you want to be, the bigger the sample size needs to
be.
Table 1:
The table below shows the confidence coefficient (critical values, 𝑍𝛼⁄ )
2
for a different level of confidence.

Level of
Confiden 99.73 99 98 96 95.45 95 90% 80 68.27 50%
ce % % % % % % % %
z 2
3.00
2.5 2.3 2.0 2.00 1.9 1.64 1.2 1.00 0.674
8 3 5 6 5 8 5
Illustrative Example 1
An investigator wants to estimate the caffeine consumption of senior high
school students. How many students would be required to ensure that a 95%
confidence interval estimate for the mean caffeine intake (measured in mg) is within

332
15mg of the true mean? Assume that the standard deviation in caffeine intake is
68mg
Let us identify first the given in the problem

Confidence Level = 95%


Critical Value (𝑍𝛼⁄ ) = 1.96
2
Margin of Error (𝐸) = 15
Standard Deviation (𝜎) = 68

Then using the formula, you will have,

2
(𝑍𝛼⁄ )(𝜎) (1.96)(68) 2 133.28 2
𝑛= [ 2
] = [ ] = [ ] = [8.89]2 = 78.03
𝐸 15 15

Hence, at least 79 students are required to participate to ensure a 95%


level of confidence.
Illustrative Example 2
A nurse at Lucena United Medical Center is interested in estimating the birth
weight of infants. How large a sample must be if she desires to be 98% confident that
the true mean is within 3 ounces of the sample mean? The standard deviation of
birth weights is known to be 6 ounces.
Given in the problem are the following:
Confidence Level = 98%
Critical Value (𝑍𝛼⁄ ) = 2.33
2
Margin of Error (𝐸) =3
Standard Deviation (𝜎) =6

Then using the formula, you will have,


2
(𝑍𝛼⁄ )(𝜎) (2.33)(6) 2 13.98 2
𝑛= [ 2
] = [ ] = [ ] = [4.66]2 = 21.72
𝐸 3 3

Therefore, the nurse needs at least 22 infants to be enrolled in the study to be


98% confident that the true mean weight of newborn infants is within 3 ounces of
the sample mean.

What’s More
Apply the knowledge you obtained from the discussion by solving the given problems
below.
1. A 99% confidence interval has to be created to estimate the mean volume of
contents in the bottles of Honey-Z. Assume that the population standard
deviation is 20 ml. Calculate the sample size needed if the margin of error is no
more than:
a. 6 ml
b. 5 ml
c. 4 ml
d. 3 ml

333
2. How large sample size is required to determine the mean IQ of seventh graders of
Luis Palma National High School within +3 points, population standard deviation
of 15, and with the following confidence level?
a. 98%
b. 95%.
c. 90%.
d. 80%.
What I Have Learned
To summarize what you have learned from this module try completing the following
statements. Write the correct word or phrase in a separate sheet of paper.

1. _________________________ is a process of finding the number of elements to be


drawn from a population to create sub-group called sample, subject for a study.
2. The sample size is determined by the equation ___________________.
3. In the formula/equation for the sample size, n is the required sample size,  is
z 2
the _____________________,  represents the __________________,
and ___________ is the margin of error.
4. The factors which influence the sample size determination are
____________________, ____________________ and ____________________.
5. As the sample size gets larger, the margin of error gets __________________.
6. The level of confidence determines how certain you want to be that the parameter
fall within your confidence interval. The more confident you want to be, the
______________ the sample size needs to be.
What I Can Do
Read the following problems carefully. Apply your skills in solving the following cases.
1. A researcher found that the IQ scores of the ALS students in the Division of Quezon
Province are normally distributed with a mean of 110 and a standard deviation of
10.
a. How many ALS students are needed to test so that the estimate will not be more
than 5 from the population mean with a 99% level of confidence?

b. Due to limited contact with ALS students, the researcher tested a small number
of ALS students. Explain the effect of this small sample on the estimate of the
parameter?
2. A confidence interval estimate has a maximum error of 2.50. If the distribution of
the data where the estimate is obtained is normal with a standard deviation of 9,
a. How many samples are considered in 98% confidence interval estimate for the
mean?
b. What must be done to the sample size to reduce the margin of error of the
confident interval estimate? Support your answer.
Assessment
Direction: Choose the letter of the correct answer for each item. Shade/blacken the
circle that corresponds to your chosen answer.

A B C D 1. The following are the factors to consider in the


O O O O determination on sample size EXCEPT for the __________.
A. Margin of error C. confidence interval
B. Level of confidence D. variance or standard deviation

334
A B C D 2. The value of confidence coefficient ( z ) of 1.96 is used in
O O O O 2
sample size determination if a researcher wants to be
_______% confidence level that the parameter is within the
estimate.
A 99 B. 95 C. 92 D.
90

A B C D 3. How does the level of confidence affect the sample size?


O O O O (Consider that other factors are constant)
A. The higher the level of confidence, the smaller the
sample size required.
B. The lower the level of confidence, the larger the sample
size required.
C. The lower the level of confidence, the smaller the
sample size required.
D. The sample size remains constant in every level of
confidence.

A B C D 4. Which of the following is the correct equation for sample


O O O O size determination?

2
(𝑍𝛼⁄ )(𝜎)
A. n  x  z 2 ( ) C. 𝑛 = [ 2
]
n 𝐸

B. n = X + E D. n = df + 1

A B C D 5. How does the margin of error and sample size affect one
O O O O another? (Assume that the other factors are constant)
A. A smaller sample size reduces the margin of error.
B. A higher margin of error requires a larger sample.
C. A lower margin of error requires a larger sample size.
D. A small sample size results in a lower margin of error.

A B C D 6. How does the standard deviation (𝜎) affect the sample size?
O O O O (Assume that the other factors are constant)
A. The sample size is constant for all values of 𝜎.
B. Lower 𝜎 reduces the required sample size.
C. Lower 𝜎increases the required sample size.
D. Higher 𝜎 decreases the required sample size.

A B C D 7. A researcher wants his/her estimate with a minimum


O O O O margin of error. Which of the following is suggested?
A. reduce the sample size.
B. use a sample size of 30.
C. increase the sample size.
D. use a sample size of less than 25.

A B C D 8. How large is the sample size required so that a researcher


O O O O is 99% confident with the estimate of a sample, of
standard deviation 5, that the error will not be more than
2?
A. 6 B. 13 C. 42 D. 84

335
A B C D 9. If a 90% level of confidence is applied by the researcher
O O O O in the condition given in item # 8, what sample size shall
be obtained?
A. 8 B. 17 C. 27 D.
34

For questions 10 to 15, refer to the situation that follows.

The diameter of the coconuts harvested in a local farm is normally distributed with
a population variance of 64 mm. At a 99% level of confidence, a researcher wants
to have an estimate that will not be more than 2 mm from the TRUE population
average diameter of the coconuts.
A B C D 10. What correct confidence coefficient should be applied to
O O O O determine the required sample size?
A. 2.58 B. 2.33 C. 1.96 D. 1.645

A B C D 11. What is the given population standard deviation from the


O O O O situation?
A. 8 B. 16 C. 24 D. 64

A B C D 12. What is the desired margin of error for the estimate of the
O O O O population parameter?
A. 2 B. 3 C. 4 D. 16

A B C D 13. What sample size shall be used by a researcher to meet the


O O O O expected results? (Round up to the nearest whole number)
A. 16 B. 32 C. 62 D. 107

A B C D 14. If the researcher wishes to reduce the error of the estimate,


O O O O what should be done?
I Consider a larger sample.
II Reduce the level of confidence
III Increase the level of confidence.

A. I only C. I and II only


B. II only D. I and III only

A B C D 15. What sample size is needed for the situation to have a


O O O O margin of error not more than 1? (Round to the nearest
whole number)
A. 87 B. 102 C. 347 D. 427

336
Additional Activities
How well did you perform in this lesson? Are you ready for the next lesson?
If you are, then answer the following situation.
As a researcher, you were hired by MNG Clinic to estimate the average BMI
of the adult population of Lucena City. A previous survey indicates a standard
deviation of 15.50. How many adults should be visited as samples in order to be 95%
confident that the confidence interval estimate will not differ from the true population
mean by more than 3? What type of random technique will you use to get your
sample? State your procedure in conducting the survey.
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
___________________

337
338
ADDITIONAL ACTIVITIES
Value of
Random
Voter Voter Voter Voter Variables
Event
#1 #2 #3 #4 (Number
of Yes
votes)
1 Y Y Y Y 4
2 Y Y Y N 3
3 Y Y N Y 3
4 Y Y N N 2
5 Y N Y Y 3
6 Y N Y N 2
7 Y N N Y 2
What I Know Assessment
8 Y N N N 1
9 N Y Y Y 3 1. C
10 N Y Y N 2 1. A
2. C
11 N Y N Y 2 3. C 2. B
12 N Y N N 1 4. A
13 N N Y Y 2 5. D 3. B
14 N N Y N 1 6. B
15 N N N Y 1 7. C 4. A
16 N N N N 0 8. B
9. A 5. D
10. B
1 case of four head, 3 cases of Three Heads, 6 cases of 11. C 6. A
Two Heads, 4 cases of One Head, 1 case of Zero 12. B
13. C 7. C
14. C
15. C 8. A
 P (X = 4) = 1/16
 P (X = 3 ) = 3/16 9. B
 P (X = 2 ) = 6/16 or 3/8
 P (X = 1 ) = 4/ 16 or ¼ 10. A
 P ( X = 0 ) = 1/16 What I have learned
11. A
A. Complete the
following statements by 12. D
writing the correct word.
1. Discrete 13. C
2.Continuous
3.Random Variable 14. C
4.Continuous
15. C
5.Discrete
B. Complete the table below
Random Variable
1. Number of ringing of
phone
2. Time of the students to
finish the test.
3.Number of complaints
4. Height of the tallest
building
5. Number of mobile Types of What I can do
Random Variable
1. C
1. Discrete
2. C
2. Continuous
3. C
3. Discrete
4. D
4. Continuous
5. D
5. Discrete
Module 1
Answer Key
339
What I Can Do What I Know
1. The time it takes for a child to complete the Rubik’s 1. A
2. D
cube is a 3. A
continuous random variable, since a child could 4. C
5. D
have finished the 6. D
puzzle in 1.2 minutes or 2.5 minutes and so on. 7. D
8. D
9. A
2. The voltage of a certain electric outlet is a 10. C
11. B
continuous random variable 12. D
13. D
14. A
15. A
What’s More What's In
Independent Activity 1 1. S = {H, T}
2. S = {1, 2, 3, 4, 5, 6}
1. Discrete variables are countable while 3. S = {1, 2, 3, 4, 5, 6}
continuous variables are measurable. 4. S = {2, 4, 6, 8, 10, 12, 14, 16}
2. Discrete variable can take on only integer 5. S = {H1, H2, H3, H4, H5, H6,
values whereas T1, T2, T3, T4, T5, T6}
continuous variable can take on any value.
3. Answer may vary
What's New
Independent Assessment 1
Roll a pair of dice = 2, 3, 4, 5, 6, 7, 8, 9, 10,
1. Continuous 11, 12
2. Discrete
3. Continuous Flip a pair of coins repeatedly = 1, 2, 3, 4,
4. Continuous 5…
5. Discrete
Height of individual = answer may vary
Independent Activity 2 Reflection questions 1 - 4. answer may vary
1. Discrete random variable
2. continuous random variable
3. Discrete random variable
Assessment Additional
Independent Assessment 2 Activities
1. B
2. B 1. C
1. Continuous random variable
3. A 2. C
2. Continuous random variable
4. B 3. C
3. Discrete random variable
5. D 4. D
4. Discrete random variable
6. B 5. D
5. Discrete random variable
7. A
6. Discrete random variable
8. C
7. Discrete random variable
9. C
8. Discrete random variable
10. A
9. Continuous random variable
11. B
10. Discrete random variable
12. C
13. C
14. D
15. A
Module 2
340
What's More What's New
Independent Activity 1 Value of the
Random Variable
1.) Step 1: List all simple Possible
events in sample space. X (number of non
Outcomes
- defective
Step 2: Find probability laptops
for each simple event.
DDD 0
Step 3: List NNN 3
possible values for random DDN 1
variable X and identify NND 2
the value for DNN 2
each simple event. NDD 1
Step 4: Find all simple DND 1
events for which X = k, for NDN 2
each possible value k.
2.) Similarity - they can take
on assigned numerical Reflection:
values
1. Answer may vary: you may answer whether easy, average or difficult
Difference- a) random 2. Answer may vary: if your answer is yes, you probably completed the
variables may take on table by following the first example. If not completed, you probably
assigned intervals instead didn’t understand the example
of numbers 3. Answer may vary: if your answer is yes, you probably count the number
b) each of N that occurred in each possible outcomes, otherwise your answer
value of a must be no.
random variable 4. Answer may vary: A Random Variable is a numerical quantity that is
corresponds to a derived from the outcomes of random experiments.
specific 5. Answer may vary:
probability, Possible answer:
which is not the case in Step 1: List all simple events in sample space.
algebraic variables.
Step 2: Find probability for each simple event.
Independent Assessment Step 3: List possible values for random variable X and identify the
1 value
1. 17
for each simple event.
2. 169 Step 4: Find all simple events for which X = k, for each possible value k.
3. 12
4. 104 Additional Activity What’s In Assessment
5. 160 1. S = {BB, BY, YB, YY} 6. Discrete 16. C
7. Discrete 17. D
Independent Activity 2 2. Possible values of a random 8. Continuou 18. A
Value of variable: 0, 1, 2 s 19. D
Random 9. Discrete 20. D
Possible
Variable Y 10. Discrete 21. D
Outcomes
(number of 22. D
green balls) 23. D
What I Can Do
WW 0 24. D
25. A
WG 1 The possible values of the random
26. B
GW 1 variable are 1, 2, 3, and 4 27. B
GG 2 28. S
29. D
Module 3
341
What I Know What's More What I Have Learned
1. C Activity 1 1. Probability
2. B Distribution
3. B A. 1. Not
2. 1
4. C 2. Probability
5. A 3. 0 ≤ P(X) ≤ 1
Distribution
6. B 4. ∑ 𝑃(𝑋) =1 and
3.Not
7. D 0 ≤ P(X) ≤ 1
4.Not
8. A 5. Sample
9. C 5.Probability Distribution
Random variable
10. C
B. 1. Probability Distribution Probability values or
11. D
12. A 2. Not P(X)
13. C Activity 2
14. D X P(x)
15. D 0 1/8
1 3/8
2 3/8
3 1/8 Assessment
Activity 3
What I Can Do X P(x) 6. B
2 1/36 7. C
X P(X) 3 1/18 8. C
20 1 4 1/12 9. B
10 5 1/9 10. A
30 1 6 5/36 11. C
10 7 1/6 12. B
40 1 8 5/36 13. D
5 9 1/9 14. A
41 1 10 1/12
15. D
10 11 1/18
16. C
43 1 12 1/36
Activity 4
17. A
10 18. B
48 1 X P(x)
10 1/5 19. A
10 20. C
1 12 1/10
70
14 1/5
5 15 3/10
90 1
20 1/10
10 25 1/10
Module 4
What I Know What's More
1. B Independent Assessment 2
2. A
3. A Value of Random Variable Y
4. D Possible Outcomes
(number of tails)
5. C
6. B HHHH 0
7. B HHHT 1
8. D HHTH 1
9. A HHTT 2
10. A
11. B
HTHH 1
12. A HTHT 2
13. D HTTH 2
14. D HTTT 3
15. B THHH 1
THHT 2
THTH 2
THTT 3
TTHH 2
TTHT 3
TTTH 3
TTTT 4
342
Additional Activities
1.
Value of the Random Variable X
Possible Outcomes
(No. of Tails)
HHHH 0
HHHT 1
HHTH 1
HTHH 1
THHH 1
HHTT 2
HTHT 2
HTTH 2
THHT 2
THTH 2
TTHH 2
HTTT 3
THTT 3
TTHT 3
TTTH 3
TTTT 4
Probability Distribution
Value of Random Variable X 0 1 2 3 4
P(X) 1/16 4/16 6/16 4/16 1/16
2.
a. Sample Space
{(1,2),(2,1),
(1,3),(3,1),
(2.3),(3,2),
(1,1),(2,2),
(3,3)}
b. 1+1=2
1+2=3
2+1=3
2+2=4
1+3=4
3+1=4
2+3=5
3+2=5
3+3=6
The values of the random variable are 2,3,5 and 6
c.
Sum X 2 3 4 5 6
P(X) 1 2 3 2 1
9 9 9 9 9
343
What I Know
1. B
2. A
3. B
4. D
5. B
What I Can Do 6. C
7. A
a. 8. B
X P(X) 9. D
2 1 10. A
36 11. D
3 2 12. A
13. C
36
14. C
4 3 15. B
36
5 4
36
6 5
36
7 6
36
8 5 What's More
36 Additional Activities Activity I
9 4 1 6. 0.75
36 1. a. 7. 0.80
2
5 8. 0.85
10 3 b.
8 Assessment 9. 0.45
36
3
c. 1. D Activity II
11 2 16
2. D 7
36 3 1.
d. 10
16 3. C 1
12 1 4. B 2.
3 2
e. 1
36 4 5. D 3.
2
2. a. 0.80 6. C
5
b. 7. A Activity III
12
b. 0.65 8. C
7 1. 0.57
c. 9. C
12 c. 0.75 2. 0.80
1
10. B 3. 0.73
d. d. 0.65 11. B
2
1 12. A
e. 13. D
2
14. B
15. A
Module 5
344
Additional Activities
1. Since 𝜇𝐴 = 6 ; 𝜇𝐵 = 10 and 𝜇𝐶 = 4
Therefore 𝜇𝐶 < 𝜇𝐴 < 𝜇𝐵
2. 𝝈𝒂 < 𝝈𝑩 𝒂𝒏𝒅 𝝈𝒂 < 𝝈𝑪
Because 𝝈𝑩 = 𝝈𝑪
Figure C
Figure B
Figure A
What I Know What's More
1. B
2. C
3. A
4. C
5. B
6. A Assessment
7. D
8. A 1. D
9. B 2. C
10. C 3. C
11. B 4. D
12. C 5. C
13. D 6. A
14. B 7. C
15. C 8. D
9. A
10. C
11. D
12. C
13. B
14. C
15. B
Module 6
345
Additional Activities
Results of the What's More Assessment
experiment may vary,
but the mean of the Practice Activity 1 1. C
distribution after rolling 2. C
𝜇𝑥 = 18.2 3. D
a die 50 times is
somewhat equal to the 4. D
mean of the given ∑[𝑥 ∙ 𝑃(𝑥)] = 337.1 5. B
probability distribution 6. C
of rolling a die where the 𝜎 2 = 5.86 7. A
outcome X is the random 8. C
variable. Thus, the 𝜎 = 2.42 9. D
results of the experiment 10. C
may get a mean which is Independent Activity 1 11. D
very close to 3.5 with a 12. D
𝜇𝑥 = 140.75 13. D
variance that is
somewhat equals to 3. 14. C
𝜎 2 = 730.1875
15. D
𝜎 = 26.52 Module 8
What I Know Practice Activity 2
1. B 1. Statement 1
2. A
3. D 2. Statement 1
Additional Activities
4. C
3. Statement 1
5. B 1. 𝑷(𝑿 > 𝟑) = 𝟎. 𝟓𝟐
6. D 4. Statement 2 𝒃. 𝑷(𝑿 ≤ 𝟒) = 𝟎. 𝟗𝟑
7. B
8. B 5. Statement 1 c. μx = 2.76
9. B
10. A Practice Activity 2 σ2 x = .8024
11. B
12. D Answers may vary σx = 0.90
13. D
14. C
2. 𝐚. 𝛍𝐱 = 𝟏. 𝟑𝟒
15. C
What's More 𝐛. 𝛔𝟐 𝐱 = 𝟎. 𝟗𝟐𝟒𝟒
c. σx = 0.96
Independent Assessment 1
𝒂. 𝝁𝒙 = 𝟐𝟎. 𝟑
𝒃. 𝝈𝟐 𝒙 = 𝟏. 𝟖𝟏 Assessment
What I Know
𝒄. 𝝈𝒙 = 𝟏. 𝟑𝟓 1. A
1. D 2. C
2. D Independent Assessment 2 3. D
3. C 4. D
4. D 𝒂. 𝝁𝒙 = 𝟏𝟏. 𝟑 5. D
5. D 6. B
𝒃. 𝝈𝟐 𝒙 = 𝟏. 𝟖𝟏
6. A 7. B
7. D 𝒄. 𝝈𝒙 = 𝟏. 𝟑𝟓 8. A
8. A 9. C
9. A Independent Assessment 3 10. C
10. C 11. C
11. B 𝒂. 𝝁𝒙 = 2 12. A
12. A 13. D
13. D 𝒃. 𝝈𝟐 = 𝟏
𝒙 14. B
14. C 15. A
15. C 𝒄. 𝝈𝒙 = 𝟏
Module 7
346
What's More
Independent Activity 1
𝑥 9,950 4,950 2,450 -50
𝑃(𝑥) 1/1000 1/1000 1/1000 997/1000
1 1 1
𝑃(𝑊) = + + = 3/1000
1000 1000 1000
𝐄(x) = −32.50
Interpretation: If someone were to buy tickets
repeatedly and although he has a chance to win, he
would lose ₱32.50 on average per ticket purchased,
Independent Assessment 1
𝑥 149,850 -150
𝑃(𝑥) 1/2000 1999/2000
1 1999
E(x) = (149850) ( ) + (−150) ( )
2000 2000
= −75
Interpretation: If someone were to buy tickets
repeatedly and although he has a chance to win, he
would lose ₱75.00 on average per ticket purchased.
Independent Activity 2
𝑥 95 -5
𝑃(𝑥) 1/30 29/30
1 29
E(x) = (95) ( ) + (−5) ( ) = −1.67
30 30
Interpretation: If someone were to bet on numbers at
the roulette repeatedly and although he has a chance
to win, he would lose ₱1.67 per bet.
If each number on the roulette has a bet, the net gain Assessment
of the organizer is equal to 30 times ₱5.00 less than
the prize (₱100.00). So, the net gain per game is
₱50.00, and if there will be 100 games, the total net 1. A
gain will be ₱5,000.00.
2. D
What I Know
Independent Assessment 2 3. D
𝑥 20 10 0 -10
1. A 4. C
𝑃(𝑥) 1/8 1/4 1/4 1/8 5. C
2. A
3. C 6. B
𝟏 𝟏 𝟏 7. C
4. C 𝐄(𝐱) = (𝟐𝟎) ( ) + (𝟏𝟎) ( ) + (𝟎) ( )
𝟖 𝟒 𝟒
𝟏 𝟓 8. C
5. C + (−𝟏𝟎) ( ) =
𝟖 𝟒 9. B
6. A = 𝟏. 𝟐𝟓
7. B Independent Activity 3 10. C
𝑥 500 -249,500 11. B
8. C
9. A 𝑃(𝑥) 0.9996 0.0004 12. D
10. B 13. C
11. D 14. C
E(x) = (500)(0.9996) + (−249500)(0.0004) 15. A
12. D = 400
13. C
Interpretation: With a probability of 0.04%, the
14. D insurance company may lose a large amount of Additional Activities
15. C money, but typically gains ₱400.00 per policy sold,
on the average. 1. 𝐸(𝑥) = −23.05
Independent Assessment 3
E(x) = (1000)(0.9997) + (−499000)(0.0003) 2. 𝐸(𝑥) = 4035
= 850
3. 𝐸(𝑥) = 19000
Module 9
347
What I Know What's More
What's More
1. D
2. B Activity 1 Activity 3
3. B 1. True 2. False 3. True 4. True
4. C 5. False 1. 68%
5. A
6. C 2. Php 200.00 to Php
7. A Activity 2 440.00
8. A A
3. Php 140.00 to Php
9. A 68% 95% 99.7%
10. B 500.00
11. A
1 107-163 79-191 51-219
12. A 2 81.5 – 92.5 76 - 98 70.5 – 103.5
Assessment
13. C 3 198 – 228 183 - 243 70.5 – 103.5
14. A 1. C
15. D
4 547 – 587 527 - 607 507 – 627
2. B
5 756 – 814 727 - 843 698 - 872 3. C
4. A
B. 5. A
6. C
7. D
8. A
9. B
10. B
1. 11. D
12. D
13. D
14. C
15. B
2.
3.
4.
5.
Module 10
348
Additional Activities
1. A. 81.50%
B. 97.35%
2. A. 69 to 87
b. 60 to 96
349
What I Know What's More Assessment
1. B Activity 1 1. D
2. B 1. 0.0006 2. B
2. 0.8944
3. A 3. 0.7123 3. C
4. 0. 9987
4. C 4. A
5. 0.9750
5. C 6. 0.0918 5. A
7. 0.0062
6. C 8.. 06141 6.D
9. 0.4483
7. A 10. 0.0250 7. A
8. A 8. B
Activity 2
9. B 9. B
10. C 10. C
11. D 1. 11. A
12. B 12.C
13. A 13. A
14. B 2. 14. A
15. C 15. D
3
4.
5
Module 11
350
What’s New Additional Activities
1. 0.0455
2. 0.8577
3. 0.8770
4. 0.7190
1. 6.
5. 0.8413
B.
1. 0.6587
2. 0.9941
3. 0.0985 2. 7.
4. 0.5960
5. 0.1359
3. 8.
4. 9.
5. 10.
351
Assessment What's More What I Know
1. A Independent Practice 1. B
1. 0.0735
2. D 2. D
2. 0.2986
3. C 3. A
3. 0.0375
4. A 4. C
What’s In 4. 0.1587
5. A 5. D
Answer 5. 0.6247
1. 47.83% 2. 0.3845 3. 106 6. A 6. D
4. 25 5. 1 6. 0.6480
7. B 7. A
8. C Independent Assessment 8. C
9. C 1. 0.0013 9.B
10. C 2. 0.9998 10. A
What’s New
Answer for the shaded area 11. A 3. 97.59% 11. A
1. 0.8944 2. 0.8851 3.
12. C 12. B
0.89602 4. 0.7734 5. 0.8413
6. 0.8186 13. B 13. C
Interpretation may vary.
14. A 14. D
15. D 15. B
Module 13
Additional Activity
a. 33.38 minutes
b. 21.77 minutes
What I can do
SUBJECTS Grade z- scores
Filipino 85 -0.73
English 83 -1.53
Math 90 1.29
Social Science 88 0.48
Science 88 0.48
𝜇 = 86.8 𝜎 = 2.48
Assessment
1. A 4. A 7. B 10. B 13. A
2. D 5. A 8. A 11. B 14. A
3. C 6. A 9. D 12. C 15. D
What’s New What's More
Independent Practice Independent Assessment
1. 1.25 1. -2.5 1. 3.7 hrs
2. 89.9 2. 137 2. -3
3. 0.4 3. 3 3. 33.375 minutes
4. 18.61 4. 2.5 4. ₱25, 682.50
5. 167.2
Module 12
352
Post Assessment What’s More
1. B or D Independent Practice
1. Systematic
2. C
2. Stratified
3. C
3. Simple
4. D
4. Cluster
5. B
5. Simple
6. D
7. C
Independent Assessment
8. A
 C
9. D
 B
10. B
11. C  C
12. D
13. B  D
14. A
15. C  A
What’s New What's In Pre Assessment
I. 1. B  All Filipinos  B
2. A
3. C  109,369,019  C
4. D  Tested individuals  C
II. 1. Simple  131, 786  D
2. Systematic
 No, not all Filipinos have  D
3. Systematic
equal chances of being
4. Stratified  B
selected
5. Cluster
 Beneficiaries of SAP in  D
Brgy. Pagkakaisa
 C
 400 families
 C
 Randomly selected
10. A
beneficiaries
11. A
 50 families 12. B
13. D
 Yes, there is 14. C
randomization wherein 15. A
each beneficiary has an
Module 14
353
What I Have Learned
Post Assessment 6. Parameter
What’s More
7. Greek or uppercase
10. D ∑X
8. µ= Independent Practice 1B
𝑁
11. B µ=3
∑(X−µ)2
12. D 9. 𝜎2 =
𝑁 σ2 = 2
13. C ∑(X−µ)2 σ ≈ 1.41
14. A 10. 𝜎 = √
𝑁 x ≈ 3.67
15. A 11. Statistic 𝑠2 ≈ 2.33
16. C 12. Roman or lowercase s ≈ 1.53
17. D ∑x
13. 𝑥̅ =
18. D 𝑛 Independent Assessment
∑(x−𝑥̅ )2
14. 𝑠2 = µ = 36
𝑛−1
∑(x−𝑥̅ )2 σ2 = 57
10. 𝑠 = √ σ ≈ 7.55
𝑛−1
𝑥̅ = 36
𝑠2 ≈ 68.5
s ≈ 8.28
What’s New What's In Pre Assessment
µ = 84 𝑥̅ = 86  B
σ2 = 38.2 𝑠2 ≈ 39.33  D
σ ≈ 6.18 s ≈ 6.27  A
 C
1. Statistic is a measure that  C
is used to describe the  C
sample.  B
 B
2. Parameter is a measure
 B
that is used to describe the
population.
3. Parameter is for the
population while statistic is
for the sample.
Module 15
354
What’s More
Independent Practice 1A
Independent Assessment 1. 6 4. 210
Sample Sample Means
2. 15 5. 3003
4, 5, 6 5
3. 56
4, 5, 7 5.33
4, 5, 8 5.67
Independent Practice 1B
4, 6, 7 5.67 Sample Sample Means
4, 6, 8 6 1, 2 1.5
4, 7, 8 6.33 1, 3 2
5, 6, 7 6 1, 4 2.5
5, 6, 8 6.33 2, 3 2.5
5, 7, 8 6.67 2, 4 3
6, 7, 8 7 3, 4 3.5
2 Sample
Frequency Probability Sample Probability
10 Means Frequency
Means
5 1 1/10 1.5 1 1/6
5.33 1 1/10 2 1 1/6
1 5.67 2 2/10
10
2.5 2 2/6
6 2 2/10 3 1 1/6
6.33 2 2/10 3.5 1 1/6
6.67 1 1/10
7 1 1/10
0 5 5.33 5.67 6 6.33 6.67 7
What’s In
1. 6
Assessment 2. 15
3. 10 What I know
4. 3
4. D 5. 21
5. B 6. 7.33
22.D
6. B 7. 3.75 23.B
7. A 8. 11.2 24.B
9. 11 25.A
8. D 10. 18.43
9. A 26.C
10.C C. Constructing FD 27.D
Means Frequency 28.A
11.A
13 1 29.B
12.A 13.5 2
13.C 14 3
30.A
14.D 14.5 4 31.D
15.B 15 5 32.C
16.B 15.5 4 33.A
16 3 34.B
17.C 16.5 2
18.C 35.A
17 1
36.C
Module 16
355
What I Can Do
Samples Sample Mean
19 20 15 17 17.75
19 20 15 27 20.25
19 20 15 22 19
19 20 17 27 20.75
19 20 17 22 19.5
19 20 27 22 22
19 15 17 27 19.5
19 15 17 22 18.25
19 15 27 22 20.75
19 17 27 22 21.25
20 15 17 27 19.75
20 15 17 22 18.5
20 15 27 22 21
20 17 27 22 21.5
15 17 27 22 20.25
Sample Mean Frequency Probability
1
17.75 1
15
1
18.25 1
15
1
18.5 1
15
1
19 1
15
2
19.5 2
15
1
19.75 1
15
2
20.25 2
15
2
20.75 2
15
2 1
15 21 1
15
1
21.25 1
15
1 1
21.5 1
15 15
1
22 1
15
0 17.75 18.25 18.5 19 19.5 19.75 20.25 20.75 21
21.25 21.5 22
356
What’s In
A. 𝑥̅ = 83.41, 𝑠 2 = 80.92
Scores (𝒙) f 𝒇𝒙 𝒙𝟐 𝒇𝒙𝟐
61 1 61 3721 3721
68 1 68 4624 4624
71 1 71 5041 5041
72 1 72 5184 5184
75 1 75 5625 5625
81 1 81 6561 6561
82 1 82 6724 6724
83 2 166 6889 13778
84 2 168 7056 14112
86 2 172 7396 14792
88 2 176 7744 15488
89 1 89 7921 7921
90 1 90 8100 8100
91 3 273 8281 24843
95 1 95 9025 9025
96 1 96 9216 9216
22 1835 154755
What I Know
Post Assessment (Pretest)
1. B 1. B
2. B
2. B
3. C
3. C 4. B
4. C 5. A
5. A 6. D
6. B 7. B
7. C 8. B
8. D
Module 17
357
What’s In (cont….)
B. 𝜇 = 3.89, 𝜎 2 = 4.38
𝑿 𝑷(𝑿) X• P(X) (X-μ) (X-μ)^2 (X-μ)^2 •P(X)
1 0.13 0.134 -2.89 8.36 1.121
2 0.23 0.462 -1.89 3.58 0.827
3 0.09 0.27 -0.89 0.80 0.072
4 0.17 0.66 0.11 0.01 0.002
5 0.11 0.56 1.11 1.23 0.137
6 0.07 0.42 2.11 4.44 0.311
7 0.2 1.386 3.11 9.66 1.913
3.89 4.38
358
What’s More (Independent Activity)
19. 𝜇 = 9.4
2a
2b
𝑿
𝑿−µ (𝑿 − µ)𝟐
5 -4.4 19.36
6 -3.4 11.56
9 -0.4 0.16
12 2.6 6.76
15 5.6 31.36
∑(𝑋 − 𝜇)2
69.2
=
20. 𝜎 2 = 13.84
21. 10
22.
Samples Mean
5, 6 5.5
5, 9 7
5, 12 8.5
5, 15 10
6, 9 7.5
6, 12 9
6, 1510.5
9, 1210.5
9, 15 12
12, 1513.5
359
What’s More (Independent Activity cont..)
(5) Sampling Distribution of Sample Means
Sample
Mean Frequency Probability
̅
𝑿 f 𝑷(𝑿̅)
5.5 1 1/10
7 1 1/10
7.5 1 1/10
8.5 1 1/10
9 1 1/10
10 1 1/10
10.5 2 1/5
12 1 1/10
13.5 1 1/10
10
(6a) (7a) (7b) (7c)
̅
𝑿 𝒇 𝑷(𝑿
̅) 𝑿
̅ • 𝑷(𝑿
̅) 𝑿
̅−𝝁 (𝑿
̅ − 𝝁)𝟐 𝑷(𝑿
̅ ) • (𝑿
̅ − 𝝁)𝟐
5.5 1 1/10 0.55 -3.9 15.21 1.52
7 1 1/10 0.70 -2.4 5.76 0.58
7.5 1 1/10 0.75 -1.9 3.61 0.36
8.5 1 1/10 0.85 -0.9 0.81 0.08
9 1 1/10 0.90 -0.4 0.16 0.02
10 1 1/10 1.00 0.6 0.36 0.04
10.5 2 1/5 2.10 1.1 1.21 0.24
12 1 1/10 1.20 2.6 6.76 0.68
13.5 1 1/10 1.35 4.1 16.81 1.68
10 9.40 (6b) (7d) 5.19
6. 𝜇𝑥̅ = 9.40
7. 𝜎𝑥̅2 =5.19
360
What’s More (Independent Assessment)
First Step: Compute the mean of the population 𝜇 = 7.67
Second Step: Compute the variance of the population
a. Subtract each measurement by the computed population mean
b. Square the results obtained in (a) then add. Divide the sum by the
frequency of measurements to get the value of the population variance.
𝜎𝑥̅2 = 13.89
Third Step: Determine the number of possible samples of size n = 2,
using formula of combination. 𝑁𝐶𝑛
15 -number of possible samples of size 2
Fourth Step: List all possible samples and their corresponding means.
Samples Mean
2, 5 3.50
2, 6 4.00
2, 9 5.50
2, 11 6.50
2, 13 7.50
5, 6 5.50
5, 9 7.00
5, 11 8.00
5, 13 9.00
6, 9 7.50
6, 11 8.50
6, 13 9.50
9, 11 10.00
9, 13 11.00
11, 13 12.00
361
What’s More (Independent What’s More
Assessment 1 cont…) (Independent Assessment
Seventh Step: Compute the variance of cont…)
the sampling distribution of the sample Fifth Step: Construct the
means. Follow these steps: sampling distribution of the
sample means.
a. Subtract the population mean from
each sample mean. Sixth Step: Compute the
mean of the sampling
b. Square the difference. distribution of the sample
c. Multiply the results by the means. Follow these steps:
corresponding probability. a. Multiply the sample mean
by the corresponding
d. Add the results.
probability.
e. Verify the computed variance using the b. Add the results 𝜇𝑥̅ =
𝜎2 𝑁−𝑛 7.67
formula 𝜎𝑥2 = 𝑛
• 𝑁−1
𝑷 (𝑿
̅) • ̅
𝑿 𝒇 𝑷(𝑿
̅)
𝑿
̅ • 𝑷(𝑿
̅) ̅
𝑿 -μ (𝑿 ̅ − 𝝁)^2
̅ -μ)^2 (𝑿
3.5 1 0.07
-
4 1 0.07
0.23 4.17 17.361 1.157
5.5 2 0.13
-
0.27 3.67 13.444 0.896 6.5 1 0.07
- 7 1 0.07
0.73 2.17 4.694 0.626 7.5 2 0.13
- 8 1 0.07
0.43 1.17 1.361 0.091 8.5 1 0.07
- 9 1 0.07
0.47 0.67 0.444 0.030 9.5 1 0.07
- 10 1 0.07
1.00 0.17 0.028 0.004 11 1 0.07
0.53 0.33 0.111 0.007
12 1 0.07
0.57 0.83 0.694 0.046
0.60 1.33 1.778 0.119
0.63 1.83 3.361 0.224
0.67 2.33 5.444 0.363
0.73 3.33 11.111 0.741
0.80 4.33 18.778 1.252
7.67 (6b) 5.56
Variance :𝜎𝑥̅2 = 5.56
362
What I Can Do
1. 𝜇 = 7.0, 𝜎 2 = 14.29 𝜎 = 3.78
2. 21
Samples Mean
1, 2, 7, 8, 9 5.40
1, 2, 7, 8, 10 5.60
1, 2, 7, 9, 10 5.80
1, 2, 7, 8, 12 6.00
1, 2, 8, 9, 10 6.00
1, 2, 7, 9, 12 6.20
1, 2, 7, 10, 12 6.40
1, 2, 8, 9, 12 6.40
1, 2, 8, 10, 12 6.60
1, 2, 9, 10, 12 6.80
1, 7, 8, 9, 10 7.00
2, 7, 8, 9, 10 7.20
1, 7, 8, 9, 12 7.40
1, 7, 8, 10, 12 7.60
2, 7, 8, 9, 12 7.60
1, 7, 9, 10, 12 7.80
2, 7, 8, 10, 12 7.80
1, 8, 9, 10, 12 8.00
2, 7, 9, 10, 12 8.00
2, 8, 9, 10, 12 8.20
7, 8, 9, 10, 12 9.20
363
What I Can Do (cont…)
3.
𝑿
̅ • 𝑷(𝑿
̅) 𝑿
̅ -μ (𝑿
̅ -μ)^2 𝑷(𝑿 ̅ -μ)^2
̅ )•( 𝑿
̅
𝑿 𝒇 𝑷(𝑿
̅)
0.26 -1.60 2.56 0.122
5.4 1 0.05
0.27 -1.40 1.96 0.093
5.6 1 0.05
0.28 -1.20 1.44 0.069
5.8 1 0.05
0.57 -1.00 1.00 0.095
6 2 0.10
0.30 -0.80 0.64 0.030
6.2 1 0.05
0.61 -0.60 0.36 0.034
6.4 2 0.10
0.31 -0.40 0.16 0.008
6.6 1 0.05
0.32 -0.20 0.04 0.002
6.8 1 0.05
0.33 0.00 0.00 0.000
7 1 0.05
0.34 0.20 0.04 0.002
7.2 1 0.05
0.35 0.40 0.16 0.008
7.4 1 0.05
0.72 0.60 0.36 0.034
7.6 2 0.10
0.74 0.80 0.64 0.061
7.8 2 0.10
0.76 1.00 1.00 0.095
8 2 0.10
0.39 1.20 1.44 0.069
8.2 1 0.05
0.44 2.20 4.84 0.230
9.2 1 0.05
7.00 0.95
21
4. 𝜇𝑥̅ = 7.0 = 𝜇
𝜎2 𝑁−𝑛 14.29 2
5. 𝜎𝑥̅2 = 𝑛
• 𝑁−1 = 5
• 6 = 0.95
364
What’s In (Con...)
Table 2: Sampling Distribution of the Sample Mean.
Sample Mean Frequency Probability
(𝑿
̅) (𝒇) 𝑷(𝑿
̅)
2.5 1 0.10
4.5 1 0.10
5 2 0.20
5.5 1 0.10
6.5 1 0.10
7 1 0.10
7.5 1 0.10
9 1 0.10
9.5 1 0.10
Total 10 1.00
What’s In
Table 1: List of samples What I Know (Pretest)
of size 2 and the
corresponding sample 1. B 11. B
mean. 2. D 12. A
Samples 3. D 13. A
of Size Sample 4. A 14. A
2 Mean
2, 3 2.5 5. C 15. B
2, 7 4.5 6. B
2, 8 5
7. A
2, 11 6.5
3, 7 5 8. A
3, 8 5.5 9. B
3, 11 7
7.5 10. A
7, 8
7, 11 9
8, 11 9.5
Module 18
365
What’s in (Cont...)
Table 4: Variance of the Sampling Distribution of the Sample Mean
Sample Probability Sample Mean (Sample. Probability•(Sample
Mean – Population. Mean- . Mean-Pop.
Mean Population. Mean)^2
Mean)^2
(𝑿
̅) 𝑷(𝑿
̅) (𝑿
̅ − 𝝁) (𝑿
̅ − 𝝁)𝟐 𝑷(𝑿 ̅ ) • (𝑿
̅ − 𝝁)𝟐
2.5 0.10 -3.70 13.69 1.369
4.5 0.10 -1.70 2.89 0.289
5 0.20 -1.20 1.44 0.288
5.5 0.10 -0.70 0.49 0.049
6.5 0.10 0.30 0.09 0.009
7 0.10 0.80 0.64 0.064
7.5 0.10 1.30 1.69 0.169
9 0.10 2.80 7.84 0.784
9.5 0.10 3.30 10.89 1.089
4.11
Variance of the sampling distribution of the sample mean 𝝈𝒙̅𝟐 = 𝟒. 𝟏𝟏
What’s in (Cont...)
Table 3: Mean of the Sampling Distribution of the
Sample Mean
Sample Probability Sample
Mean Mean •
(𝑿
̅) 𝑷(𝑿
̅) Probability
(𝑿 ̅)
̅ ) • 𝑷(𝑿
2.5 0.10 0.25
4.5 0.10 0.45
5 0.20 1
5.5 0.10 0.55
6.5 0.10 0.65
7 0.10 0.7
7.5 0.10 0.75
9 0.10 0.9
9.5 0.10 0.95
Total 1.00 6.2
Mean of the sampling distribution of the sample mean
𝝁𝒙̅ = 𝟔. 𝟐
366
What’s More (Independent Activity)
Given Answer Standard
Error
Formula
a. 𝑁 = 8, 𝑛 = 3 Known 𝜎𝑥̅
b. 𝜇 = 12, 𝑠 = 3, 𝑛 = 125 Unknown 𝑠𝑥̅
c. 𝑁 = 11, 𝑛 = 5 Known 𝜎𝑥̅
d. 𝑁 = 100 Known 𝜎𝑥̅
𝜇 = 12, 𝜎 = 2
𝜎 2 = 4, 𝑛 = 10
𝑋̅ = 15, , 𝑠 2 = 𝑠 = 1
e. 𝜇 = 137.16 Unknown 𝑠𝑥̅
𝑛 = 15
𝑋̅ = 139.1
𝑠 = 2.23
367
What’s More (Independent Assessment)
Given Answer Standard
Error
Formula
1. 𝑁 = 9, 𝑛 = 5 Known-This is a population data. Although 𝜎𝑥̅
the variance is not given from the situation
,you can still determine the population
variance of the data using the formula, 𝜎 2 =
𝛴(𝑥− µ)2
𝑁
. So, this situation is an example of the
sampling distribution when the variance is
known.
2. 𝜇 = 98, 𝑠 = 2, Unknown- The population variance is not 𝑠𝑥̅
𝑛 = 45 given. The given data are the population
mean, sample standard deviation and sample
size.
3. 𝑁 = 10, 𝑛 = 6 Known-This is a population data. To 𝜎𝑥̅
determine the population variance, you may
𝛴(𝑥− µ)2
use the formula, 𝜎 2 = . So, this
𝑁
situation is an example of the sampling
distribution when the variance is known.
4. 𝜇 = 50.5, Known – Since the SHS teacher used the z- 𝜎𝑥̅
𝜎 2 = 17.64, 𝑛 = 45, distribution, then the population variance
𝑋̅ = 52, 𝑠 = 3.5 should be given in the situation as showed on
the first statement. The population variance
has a value of the square of 4.2. Also, the
sample size is more than 30.
5. 𝜇 = 800, Unknown- The situation is an example of the 𝑠𝑥̅
𝜎 = 100, 𝑛 = 12, sampling distribution of the sample mean for
𝑋̅ = 790 normal population where the variance is not
given.
368
Additional Activity
Answer Standard Error
Formula
1. known 𝜎𝑥̅
2. known 𝜎𝑥̅
3. unknown 𝑠𝑥̅
4. known 𝜎𝑥̅
5. known 𝜎𝑥̅
Post Assessment
4. B 11. D What I Can Do
5. A 12. A
Answer Standard Error
6. C 13. A Formula
7. B 14. A
4. known 𝜎𝑥̅
8. B 15. B
9. B 5. unknown 𝑠𝑥̅
10. B
6. known 𝜎𝑥̅
11. A
7. unknown 𝑠𝑥̅
12. B
10. B 8. unknown 𝑠𝑥̅
369
What’s More continued… What’s More continued…
C. A. 1. 12.45
A. 𝜇𝑋̅ = 4.7 𝜎𝑋̅ = 0.866 A. 24.29
6.4
B. 𝜇𝑋̅ =4 𝜎𝑋̅ = 0.535 B. 𝜎𝑋2̅ =
𝑛
C. 𝜇𝑋̅ =7 𝜎𝑋̅ = 0.612 6.4
= 22 =
D. 𝜇𝑋̅ 𝜎𝑋̅ = 0.452 4
E. 𝜇𝑋̅ = 40 𝜎𝑋̅ =0.467 =1.6
𝜎
C. 𝜎𝑋̅ = 𝑛

9.4
=
√3
=5.43
D. Population Variance = σ2
= (5.6)2
=31.36
Post-Test What’s More Pre-Test
1. A 11. D A. You complete Me 1. C
1. Variance
2. A 12. C 2. Mean 2. A
3. Sample Mean
3. B 13. C 3. C
4. Equal
4. B 14. B 5. Population Mean 4. B
6. Infinite Population
5. D 15. D 7. Finite Population 5. D
8. Standard Deviation
6. D 𝜎2 6. B
9. 𝜎𝑋̅ =
√𝑛
7. C 7. A
8. D 8. B
DECODE:
9. D LET US SERVE THE 9. D
LORD WITH JOY AND
10.D GLADNESS 10.D
Module 19
370
What I can do
Given the X values: 1, 3, 5
For the Population:
∑𝑋 ∑(𝑋 − 𝜇)2 (1 − 3)2 + (3 − 3)2 + (5 − 3)2 8 8
𝜇= =3 𝜎2 = = = ≈ 2.67 𝜎 = ඨ ≈ 1.63
𝑁 𝑁 3 3 3
For the Sampling Distribution of the sample Mean
Observation Sample ̅
𝑿 (𝑿
̅ − 𝝁𝑿̅ ) (𝑿
̅ − 𝝁𝑿̅ )𝟐
1 (1,1) 1 -2 4
2 (1,3) 2 -1 1
3 (1,5) 3 0 0
4 (3,3) 3 0 0
5 (3,1) 2 -1 1
6 (3,5) 4 1 1
7 (5,1) 3 0 0
8 (5,3) 4 1 1
9 (5,5) 5 2 4
N=9 ∑𝑋̅ = 27 ∑(𝑿̅ − 𝝁𝑿̅ )𝟐 =12
𝟐
∑ 𝑋̅ 27 ∑(𝑿̅ − 𝝁 𝑿
̅ ) 12 12
𝜇𝑋̅ = = =3 𝜎𝑋2̅ = = ≈ 1.33 𝜎𝑋̅ = ඨ ≈ 1.15
𝑁 9 𝑁 9 9
𝜎 √8⁄3
We can see that 𝜇𝑋̅ = 𝜇 and 𝜎𝑋̅ = = = √8⁄6 ≈ 1.15
√𝑛 √2
𝑋̅ f Probability
P(𝑿
̅)
1 1 1⁄9
2 2 2⁄9
3 3 3⁄9
4 2 2⁄9
5 1 1⁄9
371
Additional Activities
Given the X values: 3,5,7
For the Population:
∑𝑋 ∑(𝑋 − 𝜇)2 (3 − 5)2 + (5 − 5)2 + (7 − 5)2 8 8
𝜇= =5 𝜎2 = = = ≈ 2.67 𝜎 = ඨ ≈ 1.63
𝑁 𝑁 3 3 3
For the Sampling Distribution of the sample Mean
Observation Sample ̅
𝑿 (𝑿
̅ − 𝝁𝑿̅ ) (𝑿
̅ − 𝝁𝑿̅ )𝟐
1 (3,3) 3 -2 4
2 (3,5) 4 -1 1
3 (3,7) 5 0 0
4 (5,3) 4 -1 1
5 (5,5) 5 0 0
6 (5,7) 6 1 1
7 (7,3) 5 0 0
8 (7,5) 6 1 1
9 (7,7) 7 2 4
N=9 ∑𝑋̅ = 45 ∑(𝑿̅ −
𝟐
𝝁𝑿̅ )=12
∑ 𝑋̅ 45 ∑(𝑿̅ − 𝝁𝑿̅ )𝟐 12 12
𝜇𝑋̅ = = =5 𝜎𝑋2̅ = = ≈ 1.33 𝜎𝑋̅ = ඨ ≈ 1.15
𝑁 9 𝑁 9 9
𝜎 √8⁄3
We can see that 𝜇𝑋̅ = 𝜇 and 𝜎𝑋̅ = = = √8⁄6 ≈ 1.15
√𝑛 √2
𝑋̅ f Probability
P(𝑿
̅)
1 1 1⁄9
2 2 2⁄9
3 3 3⁄9
4 2 2⁄9
5 1 1⁄9
372
What's More
Independent Activity 1
a. less than 49?
Data: n = 169, 𝜎𝑋̅ = standard
error because of x is not single,
Independent Assessment 3 given by
26
a. The mean or expected value of the = 2, 𝜇 = 50
√169
sampling distribution of the mean is 49−50
P(𝑋̅ < 49) = P(z < 2 )
𝜇𝑥̅ = 𝜇 = 80.
𝜎 6.4 = P(z < - 0.5)
The standard error is 𝜎𝑥̅ = = = 1.28
√𝑛 √25
= 0.3085
P(𝑋̅ < 49 ) = 30.85%
5
b. 𝜎𝑥̅ = = 1.25
√16
b. at least 47?
47−50
3.9
P(𝑋̅ ≥ 47) = P(z ≥ 2 )
c. 𝜎𝑥̅ = = 0.87
√20 = P(z ≥ - 1.5)
= 0.0668
= 1 – 0.0668
= 0.9332
P(𝑋̅ ≥ 47) = 93.32%
d. c. between 46 and 48
P(46 < 𝑋̅ <48)
P(𝑋̅ <46) = , P(z < 46),
P (𝑋̅ < 48) = P (z < 48)
46−50
= P(z < )
2
48−50
= P (z< 2
)
= P (z < -2)
= P (z < - 1)
= 0.0228
= 0.1587
P(46 < 𝑋̅ <48) = 0.1587 – 0.0228
= 0.1359
P(46 < 𝑋̅ <48) = 13.59%
Module 20
373
Independent Assessment 1
a. probability that his score is higher than 110?
𝑥− 𝜇 110−98 12
z= = = = 1.5
𝜎 8 8
From the table, the area under the curve for z = 1.5 is 0.9332,
P(x > 110) = 1 – 0.9332 = 0.0678 = 6.78%
b. What is the chance that a random sample of 4 students
will have an average above 110?
𝑥̅ −𝜇 ̅
𝑥 110−98
z= 𝜎 = 8 = 3, from the table, z = 3 is 0.9987
√𝑛 √4
P(𝑥̅ >110) = 1 – 0.9987 = 0.0013 = .13%
Independent Activity 2
a. What is the probability that the mean salary of the
sample is
less than Php 40,500?
𝑥̅ −𝜇 ̅
𝑥 40500−42500
z= 𝜎 = 5600 = -1.79, from the table, z = -1.79 is 0.0367 = 3.67%
√𝑛 √25
b. greater than Php 41,000?
̅
𝑥̅ −𝜇𝑥 41000−42500
z= 𝜎 = 5600 = -1.34, from the table, z = -1.34 is 0.0901, then 1-
√𝑛 √25
0.0901 = 0.9099 = 90.99%
c. between Php 40,400 and Php 45,000
40400−42500 45000−42500
z= 1120
= -1.88 = 0.0301, z = 1120
= 2.23 = 0.9871
0.9871 – 0.0301 = 0.9570 = 95.70%
Independent Assessment 2
1.5−2
a. z = = - 1.25, from the table, z = -1.25 is 0.1056
0.4
P(z< -1.25) = 10.56%
1.5−2
b. z = 0.4 = -2.5, from the table, z = -2.5 is 0.0062
√4
= .62%
374
Additional Activity
1. a. Php 53,200 b) Php 6,750 c) 68.08%
d)
Assessment
𝑥̅ −𝜇 50,000−53,200 1. D
e) Z = 𝜎 = 6,750 = 3.32 2. A
√𝑛 √49
1 – 0.0005 = 99.95% 3. D
2. a) 2, 0.5, 30 4. A
b) 85% 5. B
c) 6. D
7. B
8. A
9. C
10. A
11. C
12. B
13. A
14. A
What I Can Do 15. D
a) 𝜇 = 65, 𝜎 = 9, X = 54
𝑥− 𝜇
P(x < 54) = P(z < )
𝜎
54−65
= P(z )
9
P(x < 54) = P(z < - 1.22)
= 0.1112
What I Know
P(x < 54) = 11.12%
1. B
b) P(x ≥ 80) or P(x > 80) apply 2. C
the same approach in both 3. B
cases 4. C
80−65 5. A
P(z ≥ ) = 1.67 = 0.9525
9 6. A
7. A
1 – 0.9525 = 0. 0475
8. A
P(x ≥ 80) = 4.75% 9. C
10. C
30−32 11. A
c) P(x< 30) = P(z < 5 )
√36 12. D
= -2.4 13. A
P(x < 30) = P(z < - 2.4) 14. A
= 0.0082 = .82% 15. B
375
What I have Learned What I can do
1. small, population variance/ 1. normal
standard deviation distribution
2. William S. Gosset 2. both
3. bell, zero, 1 3. both
4. horizontal axis 4. normal
5. zero distribution
6. degrees of freedom 5. t-distribution
7. lower, thicker 6. both
8. normal distribution 7. t-distribution
9. greater 8. t-distribution
10. v/v-2 9. both
10. both
Assessment What's More What I Know
1. C 1. REAL
1. C
2. C 2. REAL
2. A
3. A 3. FAKE
3. C
4. D 4. REAL
4. B
5. A 5. REAL
5. A
6. C 6. FAKE
6. A
7. C 7. REAL
7. C
8. D 8. FAKE
8. C
9. A 9. REAL
9. A
10. B 10. REAL
10. D
11. A
11. A
12. C
12. C
13. A
13. A
14. B
14. D
15. D
15. C
Module 21
376
What I Know
1. D
What’ More 2. D
Assessment ACTIVITY 1
3. B
1. B
4. C
2. A
5. D
3. D 1.
6. C
4. D
7. C
5. C
8. D
6. C
2. 9. B
7. D
10. C
8. B
11. A
9. A
12. D
10. B
3. 13. A
11. C
14. C
12. D
15. B
13. A
14. D 4.
What I can do
15. D
1. a 1.753
b. 1.701
Additional Activities
5. c. 1.660
1. 0.0779 or 7.79%
ACTIVITY 2 2. 2.552
2. 0.0052 or 0.52%
1. 1.321 3. 1.318
3. 0.9810 or 98.10% 2. 0.700
3. -2.150
4. 1.844
4. -1.108
5. -0.312 5. -0.861
6. 1.740
7. 2.528
8. 2.145
9. -0.684
Module 22
377
What's More
What I Know
Assessment Practice Activity 1
The average percentage of people
1. C
1. B who are satisfied with the
product is 52%
2. B
2. D The lower limit of confidence
interval is 48.02% or 0.4802.
The upper limit of confidence
3. A
3. A
interval is 55.98% or 0.5598.
The confidence interval is
4. B
4. B
(48.02%,55.98%) or
(0.4802,0.5598). 5. C
5. D
6. D
6. C Independent Assessment 1
What is the lower limit? 7. C
7. B 13226.35
8. B
8. B What is the upper limit?
13773.65 9. A
9. B
What is the length of confidence 10. D
10. D interval?
(13 226.35,13773.65)
If you have P13,100.00, is that
amount lies within the
confidence interval?
NO.
Independent Assessment 2
What is the margin of error? 2.2
What is the average time they
read per week? 15.7 hours
Additional Activities
What is the lower limit? 30.26
What is the upper limit? 33.34
What is the confidence interval?
(30.26,33.34)
Based on the interval you get,
is it plausible that the mean
age of all graduate students at
her university is
also 33 years? Yes
Module 23
378
What's More
What I Know Assessment
Independent Activity 1
23.B n=70 9. B
=15
24.D Zc = 1.75 10.B
𝝈
Formula: E =𝒛𝜶⁄𝟐 ( )
25.C √𝒏 11.C
𝟏𝟓
Solution: E = (1.75) ( )
√𝟕𝟎
26.A E = 3.14
12.D
27.A Independent Assessment 1 13.D
28.B n=100 14.C
=25
29.D Zc = 1.64 (90%) 15.C
𝝈
Formula: E = za/2( )
30.A √𝒏 16.B
𝟐𝟓
Solution: E = (1.645) ( )
√𝟏𝟎𝟎
31.A E = 4.11
17.D
32.C 18.D
Independent Activity 2
33.B 19.C
n=300
34.A =84 20.B
Zc = 1.75
35.A 𝝈 21.A
Formula: L = 2𝒛𝜶/𝟐 ( )
√𝒏
36.D 𝟖𝟒 22.A
Solution: L = 2(1.75) ( )
√𝟑𝟎𝟎
37.D L = 16.97 23.A
Independent Assessment 2
n=240
=60
Zc = 1.28
𝝈
Formula: L = 2𝒛𝜶/𝟐 ( )
√𝒏
Additional Activities 𝟔𝟎
Solution: L = 2(1.28) ( )
√𝟐𝟒𝟎
L = 9.91
Independent Activity 3
1. 4.11 n=350
=750
2. 119.17 𝑥̅ =13,500
Zc = 2.58
3. 54.91 < μ < Formulas: a. 𝑥̅ − 𝐸 < 𝜇< 𝑥̅ + 𝐸
𝜎
b. L = 2𝑧𝛼/2( )
√𝑛
55.09 or 750
Solution: a. E=2.58( )
√350
(54.91,55.09) =103.43
The confidence interval is
𝑥̅ − 𝐸 < 𝜇< 𝑥̅ + 𝐸
13,500-103.43<𝜇 < 13500 + 103.43
13,396.57<𝜇 <13603.43
This means that we are 99% confident that the
average price of all cellphones lies between
₱13,396.57 and ₱13603.43
b. L = 2E=2(103.43) =206.86
Independent Assessment 3
n=60
=55
𝑥̅ =150
Zc = 1.96 (95%)
𝜎
Formula: L =2za/2( )
√𝑛
𝟓𝟓
Solution: L = 2(1.96) ( )
√𝟔𝟎
L =27.84
Module 24
379
Independent Assessment 26.2
𝜎
a. 𝑛 = 70, 𝜎 = 15, 𝑍𝛼⁄2 = 1.75, 𝐸=𝑧 ∙ 𝛼 𝐸 = 3.14
2 √𝑛
𝑍𝛼⁄ 𝜎
2
b. E= 15, 𝜎 = 43, 𝑍𝛼⁄2 = 2.33, 𝑛 = ( 𝐸
)2, 𝑛 = 45
What’s More
Independent Activity 26.1
No. Confidence 𝑍𝛼⁄ E 𝜎 n
2
Level
1 99% or 0.99 2.58 2.5 30 959
2 90% or 0.90 1.645 7.5 27 36
3 95% or 0.95 1.96 4.5 9 16
4 92% or 0.92 1.75 3 45 690
5 98% or 0.98 2.33 2.5 30 782
What’s More
What’s New
Independent Assessment 26.1 𝜎
Step 1: 𝐸 = 𝑧𝛼 ∙
2 √𝑛
a. 151 b. 341 c. 279
𝑍𝛼⁄ 𝜎
Independent Activity 26.2 2
Step 2: √𝑛 =
𝐸
𝝈 √𝒏•𝑬 𝑍𝛼⁄ 𝜎
𝑬 = 𝒛𝜶 ∙ 𝝈= 2
𝟐 √𝒏 𝒁𝜶⁄ Step 3: 𝑛 = ( )2
𝟐 𝐸
What’s In Assessment What I Know
A.1. 59.63
2. 4.6 1. B 1. A
2. D 2. D
3. 3.842 3. B 3. C
4. 24 4. A 4. B
5.97 5. A 5. A
B. 6. C 6. A
7. D 7. B
Lower Upper
Margin Confidence Confidence 8. C 8. A
Confidence of Error
𝝈 Limit Limit 9. A 9. B
coefficient 𝑬 = 𝒛𝜶 ∙ 𝝈 𝝈
𝟐 √𝒏 𝑳=𝒙
̅ − 𝒛𝜶 ∙ 𝑳=𝒙
̅ + 𝒛𝜶 ∙
𝟐 √𝒏 𝟐 √𝒏
10. C 10. C
11. A 11. C
1.96 2.34 67.66 72.34 12. D 12. D
13. D 13. D
14. A 14. B
15. A 15. D
Module 25
380
What I have Learned What’s More
1. Sample size determination 1.
2
(𝑍𝛼⁄ )(𝜎) a. 74
2
2. 𝑛= [ ] b. 107
𝐸
c. 167
3. critical value,
d. 296
population standard deviation,
E
4. Level of Confidence,
2.
standard deviation,
a. 136
margin of error
b. 97
5. smaller
c. 68
6. larger/bigger
d. 41
Assessment What’s New What I
Know
1. C Situation 1
a. 95% 1. C
2. B
b. 1.96
2. D
3. C c. 10.4 lbs
d. 0.75 lb s 3. C
4. C
4. D
5. C Situation 2
a. 90% 5. C
6. B b. 1.645
c. 385 grams 6. B
7. C
d. 100 grams 7. C
8. C
Situation 3 8. B
9. B
a. 99% 9. C
10. A b. 2.58
c. 20 10. C
11. A
d. 5 11. A
12. A
12. B
13. D
What’s In 13. B
14. C
1. 107, 2. 601, 3. 58, 4. 94 14. C
15. D
5. 43 15. C
Module 26
381
What I can do
1.a. 27
b. A sample size that is too small reduces the power to estimate the
IQ scores of IQ students in the Division of Quezon. It also
increases the margin of error of the estimate.
2a. 71
b. increases the number of samples. As the sample size gets larger,
the margin of error gets smaller.
Additional Activities
103, The students may choose from different types of random
sampling discussed in the previous modules.
References
Module 1
Pierce, Rod. (3 Feb 2020). "Random Variables". Math Is Fun. Retrieved 24 May 2020
from http://www.mathsisfun.com/data/random-variables.html
Malate, Jose S. Statistics and Probability for Senior High School. Vicarisg Publications
and Trading, Inc, 2017.
Lim, Yvette F., Nocon, R., Nocon, E., Ruivivar, L. Math for Engaged Learning Statistics
and Probability. Sibs Publishing House, Inc. 2016.
Module 2
Belencia, Rene R., Baccay, Elisa S., and Mateo, Efren B. Statistics and Probability.
Rex Book Store, Inc. (RBSI). 2016
Melosantos, Luis Allan B., Janice E. Antonio, Susan S. Robles and Ryan M.
Bruce. Math Connections in the Digital Age. Sibs Publishing House, Inc. 2016
Malate, Jose S. “Statistics and Probability”. Vicarish Publications and Trading,
Inc. (2017). ISBN 978-421-8030-17-6, 17-26.
http://www.stat.yale.edu/Courses/1997-98/101/ranvar.htm,
https://www.mathsisfun.com/data/random-variables.html
Module 3
Belencia, Rene R., Baccay, Elisa S., and Mateo, Efren B. Statistics and
Probability. Manila: Rex Book Store, Inc. (RBSI). 2016.

Melosantos, Luis Allan B., Antonio, Janice E., Robles, Susan S. and
Bruce, Ryan M. Math Connections in the Digital Age. Quezon City: Sibs
Publishing House, Inc. 2016.
http://www.stat.yale.edu/Courses/1997-98/101/ranvar.htm,
https://www.mathsisfun.com/data/random-variables.html
Module 4
Belecina, Rene R., Baccay, Elisa S. and Mateo, Efren B. “Statistics and Probability”.
Manila: Rex Book Store, Inc. (RBSI).2016. pages 9-18
Malate, Jose S. “Statistics and Probability for Senior High School.” Vicarish
Publications and Trading, Inc, 2017. pages 5-9
https://en.wikipedia.org/wiki/Timeline_of_the_COVID-
19_pandemic_in_the_Philippines
Module 5
Belecina,Rene R., Baccay,Elisa S. and Mateo, Efren B. “Statistics And Probability”.
Manila: Rex Book Store, Inc. (RBSI), 2016.
Malate, Jose S. “Statistics and Probability for Senior High School.” Vicarish
Publications and Trading, Inc, 2017.

382
Module 6
Textbooks:
Gordon, Ian. Probability and statistics: Discrete Probability Distributions – A guide
for Teachers (Years 11–12). Education Services Australia, 2013.
Moore, David S. and McCabe, George P. Introduction to the Practice of Statistics -
Second Edition. W.H. Freeman and Company, 1993.
Malate, Jose S. Statistics and Probability. Vicarish Publication and Trading. Inc.,
2018.
Alferez, Merle S. and Duro, Ma. Cecilia A. MSA Statistics and Probability. MSA
Publishing House, 2012
Websites:
https://www.mathsisfun.com/data/random-variables-mean-variance.html
https://www.cabarrus.k12.nc.us/cms/lib/NC01910456/Centricity/Domain/4633
/Random%20Variables%20Worksheet%202.pdf
https://www.coconino.edu/resources/files/pdfs/academics/sabbatical-
reports/kate-kozak/chapter_5.pdf
https://www.youtube.com/watch?v=NH6oiaV4iWA Comparing standard deviations
without calculation.mp4
Module 7
Gordon, Ian. Probability and statistics: Discrete Probability Distributions – A guide
for Teachers (Years 11–12). Education Services Australia, 2013.
Moore, David S. and McCabe, George P. Introduction to the Practice of
Statistics - Second Edition. W.H. Freeman and Company, 1993.
Malate, Jose S. Statistics and Probability. Vicarish Publication and Trading. Inc.,
2018.
Alferez, Merle S. and Duro, Ma. Cecilia A. MSA Statistics and Probability. MSA
Publishing House, 2012.
Websites:
https://ocw.mit.edu/courses/mathematics/18-05-introduction-to-probability-and-
statistics-spring-2014/readings/MIT18_05S14_Reading5a.pdf
https://www.mathsisfun.com/data/random-variables-mean-variance.html
https://www.cabarrus.k12.nc.us/cms/lib/NC01910456/Centricity/Domain/4633
/Random%20Variables%20Worksheet%202.pdf
https://www.coconino.edu/resources/files/pdfs/academics/sabbatical-
reports/kate-kozak/chapter_5.pdf
Module 8
Belecina, Rene R, Baccay, Elisa S., Mateo, Efren B. Statistics and Probability.
First Edition. Manila, Philippines: Rex Book Store. 2016.

Lim, Yvette F., Nocon, Rizaldi C., Nocon, Ederlina G., Ruivivar, Leonor A.

383
Math for Engaged Learning Statistics and Probability. Quezon City: Sibs
Publishing House, Inc. 2016.

Malate, Jose S. Statistics, and Probability for Senior High School. Sta. Ana,
Manila, Philippines: Vicarish Publications and Trading, Inc. 2017.
Module 9
Lim, Yvette F., Nocon, Rizaldi C., Nocon, Ederlina G., Ruivivar, Leonor A. Math for
Engaged Learning Statistics and Probability. Quezon City: Sibs Publishing
House, Inc. 2016.
Malate, Jose S. Statistics and Probability for Senior High School. Sta. Ana Manila:
Vicarish Publications and Trading, Inc. 2017.
Module 10
Books
Mahmot, Millard R. and Mamhot, Alice A. “Statistics and Probability”. Unlimited
Books Library Service & Publishing Inc., 2016.
Malate, Jose S. “Statistics and Probability”. Vicarish Publications and Trading, Inc.,
2017.

Ocampo, Jose M. and Marquez, Wimer G. Senior High Conceptual Math & Beyond,
Statistics and Probability. Brilliant Creations Publishing Inc., 2016.
Websites
http://davidmlane.com/hyperstat/z_table.html
https://www.khanacademy.org/
Module 11
A. Textbooks
Mahmot, Millard R. and Mamhot, Alice A. “Statistics and Probability”. Unlimited
Books Library Service & Publishing Inc., (2016). ISBN 978-971-9654-55-1,
44-50.
Malate, Jose S. “Statistics and Probability”. Vicarish Publications and Trading, Inc.,
2017. ISBN 978-421-8030-17-6, 45-54.
Ocampo, Jose M. and Marquez, Wimer G. Senior High Conceptual Math & Beyond,
Statistics and Probability. Brilliant Creations Publishing Inc., 2016. ISBN 978-
621-8006-27-0, 39-42.
B. Websites:
http://davidmlane.com/hyperstat/z_table.html
https://www.khanacademy.org/
Module 12
Books
Mahmot, Millard R. Ph.D. and Mamhot, Alice A. M.S.. “Statistics and
Probability”. Unlimited Books Library Service & Publishing Inc. (2016),
ISBN 978-971-9654-55-1, 41 -43.

Malate, Jose S. “Statistics and Probability”. Vicarish Publications and Trading,


Inc. (2017). ISBN 978-421-8030-17-6, 17-26.

384
Ocampo, Jose M. PhD and Marquez, Wimer G. MA.. Senior High Conceptual
Math & Beyond, Statistics and Probability. Brilliant Creations
Publishing Inc. (2016), ISBN 978-621-8006-27-0, 31-42.
Websites
http://davidmlane.com/hyperstat/z_table.html
https://www.khanacademy.org/
Module 13
Books
Mahmot, Millard R. and Mamhot, Alice A. Statistics and Probability. Unlimited Books
Library Service & Publishing Inc., 2016.
Malate, Jose S. Statistics and Probability. Vicarish Publications and Trading, Inc.,
(2017).
Ocampo, Jose M. and Marquez, Wimer G. Senior High Conceptual Math & Beyond,
Statistics and Probability. Brilliant Creations Publishing Inc., (2016).
Websites:
http://davidmlane.com/hyperstat/z_table.html
https://www.khanacademy.org/
Module 14
Books
Belecina, Rene S., Baccay, Elisa S., and Mateo, Efren B. Statistics and Probability.
Manila: REX Book Store, 2016.

Mercado, Jesus P. and Orines, Fernando B. Next Century Mathematics Statistics &
Probability. Quezon City: Phoenix Publishing House, 2016.

Blay, Basilia E. Elementary Statistics. Mandaluyong City: Anvil Publishing, Inc.,


2007.
Websites
"COVID-19 Pandemic in the Philippines." Wikipedia. May 24, 2020.
https://en.m.wikipedia.org/wiki/COVID-19_pandemic_in_the_Philippines.

"Philippines Population (LIVE)." Worldometer.


https://www.worldometers.info/world-population/philippines-population/
Module 15
Belecina, Rene S., Baccay, Elisa S., and Mateo, Efren B. Statistics and Probability.
Manila: REX Book Store, 2016.

Mercado, Jesus P. and Orines, Fernando B. Next Century Mathematics Statistics &
Probability. Quezon City: Phoenix Publishing House, 2016.

Houghton Mifflin Harcourt. “Populations, Samples, Parameters, and Statistics”,


2020. Accessed May 22, 2020. https://www.cliffsnotes.com/study-
guides/statistics/sampling/populations-samples-parameters-and-statistics

Statistics How to. “What is a Parameter in Statistics?”, September 12, 2013. Accessed
May 22, 2020. https://www.statisticshowto.com/what-is-a-parameter-
statisticshowto/
Module 16

385
Belecina, Rene S., Baccay, Elisa S., and Mateo, Efren B. Statistics and Probability.
Manila: REX Book Store, 2016.
Mercado, Jesus P. and Orines, Fernando B. Next Century Mathematics Statistics &
Probability. Quezon City: Phoenix Publishing House, 2016.
“The Sampling Distribution of the Sample Mean”. Accessed May 4, 2020.
https://saylordotorg.github.io/text_introductory-statistics/s10-02-the-
sampling-distribution-of-t.html.
The Pennsylvania State University. “Sampling Distribution of the Sample Mean”,
2020. Accessed May 4, 2020. https://online.stat.psu.edu/stat500/lesson
/4/4.1.
University of New England. “Constructing a Sampling Distribution, 2000. Accessed
May 4, 2020. http://www.abc.net.au/triplej/hack/notes/.
Module 17
Belencia, Rene R., Baccay, Elisa S., and Mateo, Efren B. Statistics and Probability.
Manila: Rex Book Store, Inc. (RBSI), 2016.
Triola, Mario F. Elementary Statistics – 11th Edition. Pearson Addison-Wesley, Inc.,
2010.
Hurlburt, Russel T. Comprehending Behavioral Statistics – 2nd Edition. Pacific Grove,
California: Brooks/Cole Publishing Company, 1998.
Module 18
Belecina, R.R., Baccay, E.S., and Mateo, E.B. Statistics and probability. Manila: REX
Book Store: Manila, 2016.
Triola, Mario F. Elementary Statistics, Eleventh Edition. Pearson Addison-Wesley,
Inc., 2010.
Hurlburt, Russel T. Comprehending Behavioral Statistics,.Second Edition. Las Vegas,
Nevada: Brooks/Cole Publishing Company, 1998.
Module 19
Bluman, Allan G. Elementary Statistics: A Step by Step Approach, 6 th ed. McGraw-
Hill Companies, Inc., 2003.
Remoto-Ocampo, Shirlee and Tresvalles, Regina M. Probability, Statisitics, and
Application.
Teaching the essential topics of Statistics and Probability. Makati City: Private
Education Assistance Committee, 2019.
Reyes, Juan Apolinario C. Statistics and Probability. Senior High School core subject
K to12 Basic Education Curriculum. Malabon City: JIMZYVILLE Publications,
2016.
Website:
https://www.pharmaceutical-technology.com/features/coronavirus-affected-
countries-philippines-measures-impact-tourism-economy/
Module 20
Books:
Belecina, R., Baccay, E., Mateo, E. Statistics and Probability. Manila: REX Book
Store, 2016.
Blay, B. Elementary Statistics. Pasig City: ANVIL Publishing, Inc., 2007.

386
Websites:
The Sampling Distribution of the Sample Mean.
https://saylordotorg.github.io/text_introductory-statistics/s10-02-the-
sampling-distribution-of-t.html.
Libretexts. "7.2: The Central Limit Theorem for Sample Means (Averages)." Statistics
LibreTexts. November 10, 2019.
https://stats.libretexts.org/Bookshelves/Introductory_Statistics/Book:_Introd
uctory_Statistics_(OpenStax)/07:_The_Central_Limit_Theorem/7.02:_The_Centr
al_Limit_Theorem_for_Sample_Means_(Averages).

"The Role of Probability." Central Limit Theorem.


http://sphweb.bumc.bu.edu/otlt/MPH-
Modules/BS/BS704_Probability/BS704_Probability12.html.
"Sample Means and the Central Limit Theorem (practice)." Khan Academy.
https://www.khanacademy.org/math/ap-statistics/sampling-distribution-
ap/sampling-distribution-mean/e/sample-means-central-limit-theorem.

Module 21
Book
Malate, Jose S. Statistics and Probability. Vicarish Publications and Trading, Inc.,
2017.
Websites:
https://www.statisticshowto.com/probability-and-statistics/t-distribution/
https://stattrek.com/probability-distributions/t-distribution.aspx
https://stepupanalytics.com/t-test-distribution-and-its-application/
https://www.youtube.com/watch?v=Uv6nGIgZMVw

Module 22
Book
Malate, Jose S. Statistics and Probability. Vicarish Publications and Trading, Inc.,
2017.
Websites:
https://www.statisticshowto.com/probability-and-statistics/t-distribution/
https://stattrek.com/probability-distributions/t-distribution.aspx
https://stattrek.com/online-calculator/t-distribution.aspx
https://m.youtube.com/watch?v=qH7QZoMbB80
https://stepupanalytics.com/t-test-distribution-and-its-application/
Module 23
Baccay, Elisa S., Belecina, Rene S., Mateo, Efren B. Statistics and Probability. Manila:
Rex Book Store Inc., 2016.
Malate, Jose S. Statistics and Probability for Senior High School. Vicarish Publications
and Trading, Inc., 2017.
Websites
https://www.statisticshowto.com/probability-and-statistics/hypothesis-
testing/margin-of-error/
https://mathbitsnotebook.com/Algebra2/Statistics/STMarginPractice.html

387
https://www.khanacademy.org/math/ap-statistics/estimating-confidence-ap/one-
sample-t-interval-mean/e/sample-size-margin-error-one-sample-t-interval-mean
Module 24
Textbooks

Malate, Jose S. Statistics and Probability for Senior High School. Vicarish Publications
and Trading, Inc, 2017.
Websites:
https://www.mathsisfun.com/data/confidence-interval-calculator.html
https://www.insightsassociation.org/article/importance-confidenceintervals
https://online.stat.psu.edu/stat414/node/198/
https://www.nku.edu/~statistics/212_Confidence_Iinterval_using_z.htm

Module 25
Books:
Mahmot, Millard R. and Mamhot, Alice A. Statistics and Probability.
Unlimited Books Library Service & Publishing Inc., 2016.
Malate, Jose S. Statistics and Probability. Vicarish Publications and Trading,
Inc., 2017.
Belecina, Rene R. et.al. Statistics and Probability, 1st Edition. Sampaloc Manila:
Manila: Rex Book Store, Inc., 2016.
Lim, Yvette, et.al. Math for Engaged Learning Statistics and Probability. Quezon
City: SIBS Publishing House, 2016.
Orines, Fernando B. Next Century Mathematics Statistics and Probability. Quezon
City: Phoenix Publishing House, 2016.
Internet Source:
https://www.mathsisfun.com/data/confidence-interval-calculator.html
https://www.insightsassociation.org/article/importance-confidence-intervals
https://youtu.be/XO9NtHvRQTQ
http://youtu.be/uMILjv6Fs94
Module 26
Textbooks
Malate, Jose S. Statistics and Probability. Vicarish Publication and Trading Inc.,
2017.
Febre, Francisco A. Jr. Introduction to Statistic. Phoenix Publishing House Inc.,
2000.
Belecena, Rene R. Statistics and Probability. Manila: Rex Book Store, Inc., 2016.

Websites

https://study.com/learn/sample-size-determination-questions-and-answers html
http://sphweb.bumc.bu.edu/otlt/MPHModules/BS/BS704_Power/BS704_Power3.
html

388
For inquiries or feedback, please write or call:

Department of Education - Bureau of Learning Resources (DepEd-BLR)

Ground Floor, Bonifacio Bldg., DepEd Complex


Meralco Avenue, Pasig City, Philippines 1600

Telefax: (632) 8634-1072; 8634-1054; 8631-4985

Email Address: blr.lrqad@deped.gov.ph * blr.lrpd@deped.gov.ph

389

You might also like