GEC 4 Module 4 Final Term

P age |i
COLLEGE OF ARTS AND SCIENCES (CAS) CENTRAL PHILIPPINES STATE UNIVERSITY (CPSU)
P a g e | ii
This learner’s module is not for sale. No part of this material may be
reproduced in any form or by any means, including photocopying,
scanning, or other electronic means without the prior written permission to
the author or the College of Arts and Sciences, Central Philippines State
University.
LEARNER’S MODULE IN
GEC 4: MATHEMATICS IN THE MODERN WORLD
Second Semester | School Year 2020-2021
P a g e | iii
PHILOSOPHY
Decent affordable education for sustainable productivity
and global competitiveness under an atmosphere of
academic freedom.
CPSU QUALITY POLICY

CPSU commits to deliver quality higher and advanced education through instruction,
research, extension, production and administrative support services.
We shall endeavor to:
• Continually improve its Quality Management System (QMS) at par with

international standards;
• Provide timely, efficient and effective delivery of products and services;
• Satisfy the needs and expectations of the costumers and relevant interested
parties; and
• Uphold applicable statutory, regulatory, organizational and international
standard requirements.
P a g e | iv
STUDY SCHEDULE GEC 4

Week Hours Topic/Learning Outcomes Assessment
Tasks
UNIT 4: Data Management (27 hours)
Lesson 1: Basic Concepts in Statistics
MATHEMATICS IN THE MODERN WORLD

Week 9 ➢ Define basic statistical Activity 32: Who
(April 5 - 9, 2021) concepts such as Am I?
3 hours population and sample,
parameter and statistic, Activity 33.
variable, measurement, Complete me!
data, etc.
➢ Differentiate descriptive
and inferential statistics
➢ Classify variables as
qualitative or quantitative
➢ Define and distinguish the
four levels of measurement
Lesson 2: Data Organization and Presentation
Week 10 ➢ Organize and present data Activity 34. Fill Me!
(April 12 - 16, using tables and graphs
2021) ➢ Apply rules in data Activity 35.
3 hours presentation Organize Me!
Lesson 3: Measures of Central Tendency
Week 11 ➢ Compute and interpret the Activity 36. 3M’s
(April 19 – 23, different measures of Activity 37.
2021) central tendency Problem Solving:
3 hours Measures of
Central Tendency
Lesson 4: Measures of Dispersion
Week 12 ➢ Compute and interpret the Activity 38:
(April 26 – 30, different measures of Examine Me!
2021) dispersion
3 hours Activity 39:
Problem Solving:
Measures of
Dispersion
Lesson 5: Measures of Relative Position
Week 13 ➢ Compute and interpret the Activity 40. What is
(May 3 – 7,2021) different measures of my Rank?
3 hours position
Activity 41:
Problem Solving:
Measures of
Position
P age |v
TABLE OF CONTENTS GEC 4
MATHEMATICS IN THE MODERN WORLD Page

PRELIMINARIES
• Cover Page i
• Disclaimer ii
• CPSU VMGO and Quality Policy iii
• Study Schedule iv
• Table of Contents vi
UNIT 4: DATA MANAGEMENT
Lesson 1 Basic Concepts in Statistics 205
Lesson 2 Data Organization and Presentation 218
Lesson 3 Measures of Central Tendency 238
Lesson 4 Measures of Dispersion 249
Lesson 5 Measures of Relative Position 263
APPENDICES
Appendix A Rubrics 278
P a g e | cci
UNIT
DATA MANAGEMENT
4
An effective decision making is based on
data. Data is everywhere. It is just lurking around the
corner waiting to be discovered and analyzed. From
there, data will tell its story.
The branch of science that deals with the

collection, organization, presentation, analysis, and
interpretation of data is called Statistics.
In the Philippines, the Philippine Statistics

Authority (PSA) serves as the central statistical
authority on primary data collection by conducting
censuses on different sectors of the Philippine
economy such as population, housing, agriculture,
fisheries, and business. It collects, compiles,
analyzes, and publishes statistical information on
economic, social, demographic, political affairs and
general affairs of the people of the Philippines.
In this unit, you will learn the different ways of

organizing and presenting data. Moreover, you will
describe data using the different summary measures
and apply different statistical tools to process
numerical data.
P a g e | 202
UNIT 4: Data Management PRE-TEST

Direction: Read the questions carefully. Shade the letter of the correct answer. If
answer is not provided, shade letter E. Use the answer sheet provided and strictly
no erasures.
1. The following are nominal variables except for:

A. City of residence
B. Number of household members
C. Area code
D. Marital status
For items 2-3, A researcher wanted to investigate the relationship of weekly

allowance to the academic performance of students studying in a certain university
with a total of 2300 enrolled students. To achieve the researcher’s aim, the
researcher interviewed a random sample of 400 students. Results indicated that the
average allowance of the 400 students is PhP 1500.
2. What is the sample?

A. The group of 2300 enrolled students
B. The group of 400 enrolled students
C. The group of 1500 enrolled students
D. The group of 2700 enrolled students
3. Which of the following is a statistic?

A. The average allowance of the 2300 enrolled students is Php 1500
B. The average allowance of the 400 enrolled students is Php 1500
C. The proportion of students whose allowance is Php 1500
D. None of the above
4. Which of the following statements belong to the field of descriptive statistics?

A. A car manufacturer wishes to estimate the average lifetime of batteries
by testing a sample of 50 batteries.
B. A shipping company wishes to estimate the number of passengers
traveling via their ships next year using their data on the number of
passengers in the past three years.
C. A marketing research group wishes to determine the number of families
not eating three times a day in the sample used for their survey.
D. A politician wants to predict his chance of winning in the senatorial
election by conducting a survey on 1200 registered voters.
5. What type of chart is useful in presenting historical data and in showing the
movement of a series over time?
A. Line chart
B. Bar chart
C. Pie chart
D. Pictograph
GEC 4: MATH IN THE MODERN WORLD

P a g e | 203
For items 6 - 8, refer to the table below:
The distribution of the number of mistakes made by 285 students

in a100-item Math Exam
Number of Mistakes Number of Students
7 – 12 94
13 – 18 37
19 – 24 25
25 – 30 43
31 – 36 86
6. What is the class width or class size?

A. 5 C. 7
B. 6 D. 8
7. What is the relative frequency of the class interval 13 – 18?

A. 0.1298 C. 37%
B. 12.98% D. 48%
8. What is the lower class boundary of the class interval 7 – 12?

A. 6.5 C. 7.5
B. 7 D. 12.5
9. It is a measure of the central tendency obtained by getting the middle value in

distribution when the values are arranged in ascending or descending order.
A. Mean C. Weighted Mean
B. Median D. Mode
10. It is most commonly occurring value in a distribution.

A. Mean C. Average
B. Median D. Mode
11. The reseller’s income in Heng De face shield for seven days were, Php
1,220, Php 1,345, Php, 1000, Php 2,500, Php 1,590, Php 1,350 and Php
1540. What is the reseller’s average income?
A. Php 1,506.42 C. Php 1,350
B. B. Php 1,506.43 D. Php 10,545
12. Consider the sample What is the median of the set of data: 13, 19, 18, 20, 16,
9, 10, 7 and 8.
A. 13 C. 5.05
B. 13.33 D. 25.50
13. It is a statistic that measures the dispersion of a dataset relative to its mean
and is calculated as the square root of the variance.
A. range C. variance
B. standard deviation D. none of these
P a g e | 204

14. The highest score in the distribution is 109 while the lowest score is 54. What
is the range?
A. 54 C. 56
B. 55 D. 57
15. Which scenario will give you a coefficient of variation of 40%?

A. Mean of 4 and standard deviation of 16
B. Mean of 4 and standard deviation of 20
C. Mean of 4 and standard deviation of 4
D. Mean of 16 and standard deviation of 4.
16. It is used when one wishes to compare the scatter of one distribution with
another distribution.
A. Measures of Location C. Measures of Position
B. Coefficient of Variation D. Measures of Relative Dispersion
17. It is a value that divide a set of observations into 100 equals parts.
A. Quartiles C. Deciles
B. Percentiles D. Fractiles
18. It is another way of telling whether or not the distribution of the data is
symmetric relative to the median.
A. Boxplot C. Quartiles
B. Z-scores D. Fractiles
For items 19 and 20, consider the life span (in years) of a certain species below.
1.6 1.9 2.2 2.5 2.6 2.6 2.9 3.0 3.0 3.1
3.1 3.1 3.1 3.2 3.2 3.2 3.3 3.3 3.3 3.4
3.4 3.4 3.5 3.5 3.6 3.7 3.7 3.7 3.8 3.8
3.9 3.9 4.1 4.1 4.2 4.3 4.4 4.5 4.7 4.7
19. What is Q1 for the distribution of life spans?

A. 3.1 C. 2.9
B. 3.0 D. 3.2
20. What is the 85th percentile of the 40 life spans?

A. 4.1 C. 4.2
B. 4.5 D. 4.115

P a g e | 205
Basic Concepts in Statistics LESSON 1

(1.5
Biatin, Gretchen G. HOURS)
LEARNING OUTCOMES
• Define basic statistical concepts such as population and

sample, parameter and statistic, variable and
measurement, data, etc.
• Differentiate descriptive and inferential statistics

• Classify variables as qualitative or quantitative
• Define and distinguish the four levels of measurement
INTRODUCTION
Every day we make decisions that maybe personal,

business related, or of some other kind. Usually, these
decisions are made under condition of uncertainty. Many times,
the situation or problems we face in the real world have no
precise or definite solution, with the help of statistical methods
it will enable us make scientific and intelligent decision in such
situation.
The study of statistics covers different things to different
people. Men from PAGASA report daily weather statistics, such
as high and low temperature and amount of rainfall; market
reports give the exact prices of prime commodities and the
analysis on the actual price increase. Mathematics describe the
appropriate statistics for analysing the results of a particular
investigations.
GEC 4: MATH IN THE MODERN WORLD CAS-CPSU

P a g e | 206
UNIT
LESSON BASIC CONCEPTS IN STATISTICS
4
1
Let’s Look Back
Activity 32: Who Am I? Fill in the following bio-data form. Write your answer on your
answer booklet.
Name: Course:
Age: Sex: Year Level:
Date of Birth: Place of Birth:
Height (in cm): Weight (in kg):
Eye Color: Skin Color:
My favorite color is:
My favorite food is:
My favorite number is:
My favorite movie is:
Let’s Analyze and Prepare
Based on the previous activity, answer the following questions. Write your answer on
your answer booklet.
1. How did you find the activity?

2. In a scale of 1 to 10, with 1 as the lowest and 10 as the highest, how well do
you know yourself?
3. Now, ask three (3) members of your household to fill in the same form. In what
characteristics are you similar? different?
Characteristics like course, sex, age, weight, height are called variables. They are
called variables since these characteristics vary from one person to another. Moreover,
all of the information you have filled in the form is considered data. Statistics, as a
branch of science, deals with collection, organization, presentation, analysis, and
interpretation of data. You will know more of the basic concepts in statistics in the next
section.

P a g e | 207
UNIT
LESSON BASIC CONCEPTS IN STATISTICS 4
1
Let’s Discover
Let’s start with the two definition of statistics.
A. Definition of Statistics
o Plural sense – set of numerical figures
e.g. number of students enrolled in every program in CPSU, vital statistics
in a beauty contest
o Singular sense – the branch of science that deals with the collection,
presentation, organization, analysis, and interpretation of data (COPAI)
B. Key Terms
o Population – (denoted by N) the collection of all elements under
consideration
o Sample – (denoted by n) a subset of the population
Example:
A manufacturer of kerosene heaters wants to determine if customers are satisfied
with the performance of their heaters. Toward this goal, 5,000 out of 200,000
customers are contacted and each is asked, "Are you satisfied with the
performance of the kerosene heater you purchased?" Identify the population and
the sample for this situation.
✓ Population: The collection of 200,000 kerosene heater customers.
✓ Sample: The collection of 5,000 kerosene heater customers who were
interviewed.
o Variable – a characteristic or attribute of the elements in a collection that can

assume different values for the different elements.
Example:
✓ Age, Weight, Height
o Experimental unit is the individual or object on which a variable is measured.

Example:
✓ Age (in years) of a freshman student
✓ Weight (in kilograms) of mangoes harvested
✓ Height (in inches) of a sugarcane plant in 3 months

P a g e | 208
UNIT
1
Let’s Discover
o Observation – realized value of a variable
o Data – collection of observations
Example:
Variable Possible observation
a. S = sex of a student Male, Female
b. C = course of a student AB Eng, AB Soc Sci, BS Stat
c. L = amount spent on load per month 𝐿 ≥ 0 pesos
d. N =number of enrolled students per campus 𝑛 = 0, 1, 2, 3, …
e. H = height of a volleyball player h > 0 cm
o Parameter – a summary measure describing specific characteristic of the

population usually denoted by Greek letters: μ (mu), σ (sigma), ρ (rho), λ
(lambda), τ (tau), θ (theta), α (alpha) and β (beta)
e.g. population mean, population variance
o Statistic – a summary measure describing specific characteristic of the sample

e.g. sample mean, sample variance
Example: In order to estimate the true proportion of students at a certain college who
smoke cigarettes, the administration polled a sample of 200 students and determined
that the proportion of students from the sample who smoke cigarettes is 0.12. Identify
the a) population, b) sample, c) parameter, and d) statistic.
a) Population: The set of students at a certain college.

b) Sample: The set of 200 students who were interviewed.
c) Parameter: The population proportion of students in a certain college who smoke
cigarettes.
d) Statistic: (0.12) the proportion of students in the sample who smoke cigarettes.

P a g e | 209
UNIT
1
Let’s Discover
C. Fields of Statistics
1. Statistical Methods of Applied Statistics - procedures and techniques used

in the collection, presentation, analysis and interpretation of data.
2. Statistical Theory of Mathematical Statistics - deals with the development

and exposition of theories that serve as bases of statistical methods.
a. Descriptive Statistics – includes all the techniques used in organizing,
summarizing, and presenting the data on hand.
b. Inferential Statistics – includes all the techniques used in analyzing the
sample data that will lead to generalizations about a population from which
the sample came from
Example:
Descriptive Inferential
1. A bowler wants to find his bowling 1. A bowler wants to estimate his chance
average for the past 12 games. of winning a game based on his current
season averages and the averages of his
opponent.
2. A housewife wants to determine 2. A housewife would like to predict based
the average weekly amount she on last year's grocery bills, the average
spent on groceries in the past three weekly amount she will spend for this year.
months.
3. A politician wants to know the 3. A politician would like to estimate, based
exact number of votes he received in on an opinion poll, his chance of winning in
the last election. the upcoming election.
D. Measurement
Measurement - the process of determining the value or label of a variable based on

what has been observed
Example:
✓ Age (in years)
✓ Weight (in kilograms)
✓ Height (in inches)
✓ Sex (Male or Female)

P a g e | 210
UNIT
1
Let’s Discover
Types of Variables
1. Qualitative variable
o A variable that yields categorical response
o describes the quality or character of something
Example: Eye color, First Name, Favorite Movie
2. Quantitative Variable
o a variable that takes on numerical values representing an amount or
quantity
o describes the amount or number of something
Example: Weight, Height, Number of cars
a. Discrete
• a variable which can assume finite, or, at most, countably infinite
number of values; usually measured by counting or enumeration.
• can assume only certain values, and there are usually gaps between
values.
Example: Number of students enrolled in GEC 4, Number of cars
parked at the front of Admin building
b. Continuous
• a variable which can assume infinitely many values corresponding to a
line interval without gaps, interruptions, or jumps.
• can assume any value within a specified range.
• measurable (measured using a continuous scale such as kilos,
centimeters, grams)
Example: Height in inches, Income in pesos, Weight in kilograms
Levels of Measurement
1. Nominal Level (Classificatory Scale)
The nominal level is the weakest level of measurement where numbers or symbols
are used simply for categorizing subjects into different groups. The categories must
be distinct, non-overlapping and exhaustive
Examples:
Sex M-Male F-Female
Marital status 1-Single 2-Married

3-Widowed 4-Separated

P a g e | 211
UNIT
1
Let’s Discover
2. Ordinal Level (Ranking Scale)

The ordinal level of measurement contains the properties of the nominal level, and in
addition, the numbers assigned to categories of any variable may be ranked or
ordered in some low-to-high-manner. The categories which must be distinct, non-
overlapping and exhaustive can be ordered
Examples:
Teaching Ratings 1-poor 2- fair
3-good 4-excellent
Year level 1 – 1st yr 2 – 2nd yr

3 – 3rd yr 4 – 4th yr
3. Interval Level
The interval level is that which has the properties of the nominal and ordinal levels,
and in addition, the distances between any two numbers on the scale are of known
sizes. An interval scale must have a common and constant unit of measurement.
Furthermore, the unit of measurement is arbitrary and there is no “true zero” point.
Examples:
▪ IQ
▪ Test Result
4. Ratio Level
The ratio level of measurement contains all the properties of the interval level, and in
addition, it has a “true zero” point. This is the strongest level of measurement.
Examples:
▪ Daily Allowance
▪ Weight (in kg)
▪ Age (in years)
▪ Number of correct answers in an exam

P a g e | 212
UNIT
1
Let’s Do It!
I. In the following situations, let’s determine the population, sample, parameter
and statistic.
1. In order to estimate the true proportion of students at a certain college who

smoke cigarettes, the administration polled a sample of 200 students and
determined that the proportion of students from the sample who smoke
cigarettes is 0.12. Identify the a) variable, b) population, c) sample, d)
parameter, and e) statistic.
Solution:
a. Variable: Whether or not a student smoke
b. Population: The set of students at a certain college.
c. Sample: The set of 200 students who were interviewed.
d. Parameter: The population proportion of students in a certain college who
smoke cigarettes.
e. Statistic: (0.12) the proportion of students in the sample who smoke
cigarettes.
2. A politician who is running for the office of mayor of a city with 25,000 registered
voters commissions on a survey. In the survey, 48% of the 200 registered voters
interviewed say they plan to vote for her.
Solution:
a. Variable: Whether or not a voter will vote the running political for Mayor
b. Population: The group of 25,000 registered voters
c. Sample: The group of 200 registered voters who were interviewed
d. Parameter: The percentage of registered voters in the population who plan
to vote for her.
e. Statistic: (48%) The percentage of registered voters in the sample who plan
to vote for her.
II. Let’s determine whether the following statements belong to the field of
descriptive statistics or inferential statistics.
1. A badminton player wants to know his average score for the past 10 games.
Solution: Descriptive statistics (Data is only gathered and summarized)

P a g e | 213
UNIT
1
Let’s Do It!
2. Janine wants to determine the variability of her six exam scores in algebra.
3. Based on last year’s electricity bills, Mrs. Venegas would like to forecast the
average monthly electricity bill she will pay for the next year based on her average
monthly bill in the past year.
Solution: Inferential statistics (Data is analyzed to make a forecast)
4. Elfren “Bata” Reyes wants to estimate his chance of winning in the next world
championship game in Billiards based on his average scores last championship
and the averages of the competing players.
Solution: Inferential statistics (Data is analyzed to come up with an estimate)
5. Ms. Macasaet wants to determine the proportion spent on transportation during

the past four months using the daily records of expenditure that she keeps.
III. Let’s classify the following by a) type of variable and b) level of

measurement. If the variable is quantitative, determine whether it is
discrete or continuous.
Level of
Type of Variable
Measurement
Example:
Degree Program (AB Social Science, AB Qualitative Nominal
English Language, BS Statistics)
Example:
Quantitative,
Weight of women (in kg) before they took Ratio
Continuous
the diet
1. Height (in inches)
2. Sizes of shirts (i.e. XS, S, M, L, XL)
3. Zipcode (i.e. 6111 – Kabankalan)
4. Civil Status
5. Number of siblings
Solution: 1) Quantitative, Continuous, Ratio 2) Qualitative, Ordinal

3) Qualitative, Nominal 4) Qualitative, Nominal
5) Quantitative, Discrete, Ratio

P a g e | 214
UNIT
1
Let’s Try Some More
For you to fully absorb the lesson, let’s have some more examples.
I. In the following situation, let’s determine the variable, population, sample,
parameter and statistic.
Mr. Donaldo Chan, a candidate for Vice Mayor in Orion, Bataan wants to find out
if there is a need to intensify his campaign efforts against his opponents. He
requested the services of a group of students to interview 1,000 of the 3,000
registered voters of Orion, Bataan. The survey results showed that 75% of the
1,000 voters in the sample will vote for him as vice-mayor. Identify the following.
a. Variable: Whether or not a voter will vote for Mr. Chan as Vice Mayor
b. Population: The collection of 3,000 registered voters of Orion, Batangas
c. Sample: The collection of 1,000 registered voters of Orion, Batangas
d. Parameter: The percentage of registered voters in the population who voted
for Mr. Chan as Vice-Mayor
e. Statistic: The percentage of registered voters in the sample who voted for
Mr. Chan as Vice-Mayor
II. Let’s determine whether the following statements belong to the field of
descriptive statistics or inferential statistics.
a. A car manufacturer wishes to estimate the average lifetime of batteries by

testing a sample of 50 batteries. Answer:
b. A shipping company wishes to estimate the number of passengers traveling
via their ships next year using their data on the number of passengers in the
past three years. Answer:
c. A marketing research group wishes to determine the number of families not
eating three times a day in the sample used for their survey.
Answer:
d. A politician wants to determine the total number of votes his rival obtained in
the past election based on his copies of the tally sheet of electoral returns
Answer:
e. A politician wants to determine the total number of votes his rival obtained in
the sample used in the exit poll. Answer:
Solution: a and b uses inferential statistics while c, d, and e uses descriptive statistics.

P a g e | 215
UNIT
1
III. Let’s classify the following by a) type of variable and b) level of

measurement. If the variable is quantitative, determine whether it is
discrete or continuous.
Type of Variable Level of

Measurement
Example:
Weight of women (in kg) before they Quantitative,
Ratio
took the diet Continuous
1. Score in a ten-item quiz
2. Teacher’s performance rating
(Excellent, Very Good, Good,
Satisfactory, Needs Improvement)
3. Total kilos of papaya harvests
4. Brand of cellphone
5. Body Temperature (in Celsius)
Solution: 1) Quantitative, Discrete, Ratio 2) Qualitative, Ordinal

3) Quantitative, Continuous, Ratio 4) Qualitative, Nominal
5) Quantitative, Continuous, Interval
Let’s Have Your Turn
It’s time for you to do this on your own now. Write your answers on the provided answer
booklet
Activity 33. Complete me!
The average weekly allowance of students last year at a public high school was Php
600.00 per week, based on an enrolment of 1,080 students. The third-year students
who did not have this information interviewed 50 students and found their average
weekly allowance last year to be Php 550.00. Identify the following.
Variable:
Population:
Sample:
Parameter:
Statistic:

P a g e | 216
UNIT
1

Descriptive or Inferential?
1. Marie wishes to determine the average amount she spends in online shopping
per month. She checks her purchase history in Shopee in the last six months.
Answer:
2. Ian wants to predict electricity bill in the next month using the data on the
monthly electricity bills in the last six months.
3. Sheena wants to determine the average exam scores of her students in their
summative test.
Answer:
4. Cris wants to determine the total amount spent for cat food in the past five
months.
Answer:
5. Tine wants to predict the number of enrollees for the next school year based on
enrollment data in the last five years.
Answer:
Complete the Table
Classify the following variables by a) type and b) level of measurement. If the variable
is quantitative, determine whether it is discrete or continuous.
Level of
Type of Variable
Measurement
1. Number of text messages sent in
a day
2. Favorite food
3. Academic Performance
(Excellent, Very Satisfactory,
Satisfactory, Needs
Improvement)
4. Time (in hours) spent in
answering modules in a week
5. Shoe size (26, 27, 28, …)
6. Average daily temperature (in
Celsius)
7. Number of oranges in a bag
8. Amount spent (in pesos) on
mobile load per week
9. Favorite pet (cat, dog, bird, etc.)
10. Network carrier (Globe, Smart,
TM,..)

P a g e | 217
UNIT
3
Bibliography
Almeda, J.V., Capistrano, T.G., and Sarte, G. (2010). Elementary Statistics. The UP
Press, Diliman, Quezon City
Aufmann, Richard et.al. (2018). Mathematics in the Modern World 14 th edition.
Amid, Diego. (2005). Fundamentals of Statistics. Lorimar Publishing CO., INC
Everitt, B. S.; Skrondal, A. (2010), The Cambridge Dictionary of Statistics, Cambridge
University Press.

P a g e | 218
Data Organization and Presentation

LESSON 1
Biotin, Gretchen G. & Cawaling, Ma. Janesa (3 HOURS)
LEARNING OUTCOMES
o Organize and present data using tables and graphs.

o Apply rules in data presentation

INTRODUCTION
One of the most important aspects in any statistical

investigation is a manner by which the researcher presents the data.
Knowing our lives should be a great task to do in every day
situation. Even in our daily routine from home to work vice versa we
do the organization to keep the things in place.
Arranging the staff at home or at work make the work
easier and more manageable
Data are a set of facts, and provide a partial picture of reality.
Whether data are being collected with a certain purpose or collected
data are being utilized, questions regarding what information the data
are conveying, how the data can be used, and what must be done to
include more useful information must constantly be kept in mind.
Since most data are available to researchers in a raw format, they must
be summarized, organized, and analyzed to usefully
GEC 4: MATHEMATICS IN THE MODERN WORLD CAS-CPSU

P a g e | 219
UNIT
LESSON Data Organization and Presentation
4
2
Let’s Look Back
ACTIVITY 34. FILL ME! Analyze the following problems and answer the questions
that follow. Write your answers in your answer booklet.
1. Mr. Darcy wanted to know the performance of his students in mathematics. He

gave them short quizzes after every discussion. The data below shows the scores
of twenty students in their 10-item quiz.
5 2 7 10 4
8 4 5 7 6
3 5 8 5 5
4 1 6 3 6
a. How many of the students got scores above 8?

Answer:
b. What percentage of students got scores lower than 5?
Answer:
c. If Mr. Darcy sets the passing score at 7, how many students passed the quiz?
Answer:
2. Ms. Frank wanted to lose weight so she conducts a survey on what physical activity
is effective in reducing weight. She asks 30 of her friends for recommendations.
The data below shows the preferred physical activity of the 30 women.
W T A W G T W W C W T W A T T
W G W W C A W A W W W T W W T
W = walking T = weight training C = cycling G = gardening A = aerobics
a. How many women prefer walking?

Answer:
b. How many women prefer weight training?
Answer:
c. What percentage of women prefer gardening?
Answer:
d. Based on data gathered, what physical activity can you recommend to Ms.
Frank?
Answer:

P a g e | 220
UNIT
4
2
Based on the previous activity. Answer the following questions. Write your answers
on your answer booklet.
1. How did you find the activity?
2. Was it easy to answer the questions? Why or why not?
3. What did you find hard in the previous activity?
In order for you to answer the questions in the previous activity, you must have
organized your data. Just like in conducting a research, researchers must organize the
gathered data in some meaningful way. The easiest way of organizing data is to
construct a frequency distribution table. You will know more about frequency
distribution tables as we go along the next sections.
Let’s Discover
You will learn how to organize data in this section.
A. Raw data and array

✓ Raw data – data in their original form
✓ Array – is an ordered arrangement of data according to magnitude, may it be
from lowest to highest or highest to lowest
B. Frequency distribution
✓ is a way of summarizing data by showing the number of observations that
belong in the different categories or classes
✓ also referred to as grouped data.
A frequency distribution table is a grouping of data into categories showing the

number of observations in each of the non-overlapping classes.

P a g e | 221
UNIT
4
2
Let’s Discover
Two General Forms of Frequency Distribution Table
1. Single-value grouping is a frequency distribution where the classes are the

distinct values of the variable. This is applicable for data with only a few
unique values.
Example:
No. of Children Number of Married Women
0 7
1 8
2 11
3 14
4 8
5 2
2. Grouping by class intervals – is a frequency distribution where the classes

are the intervals.
Example:
Exam Scores No. of Students
50-59 13
60-69 21
70-79 36
80-89 35
90-99 5
Total 110
C. Definition of Terms
• Class interval - range of values that belong in the class or category.

• Class Frequency - the number of observations that belong in a class interval
• Class Limits – the end numbers used to define the class interval.
o Lower class limit (LCL) - is the lower end number
o Upper class limit (UCL) – is the upper end number
• Open class interval is a class interval with no lower-class limit or no upper-
class limit.

P a g e | 222
UNIT
4
2
Let’s Discover
• Class boundaries are the true class limits.

o Lower class boundary (LCB) is halfway between the lower-class limit
of the class and the upper-class limit of the preceding class
o Upper class boundary (UCB) is halfway between the upper-class limit
of the class and the lower-class limit of the next class.
• Class size is the size of the class interval.
- difference between the upper-class boundaries of the class and the
preceding class
- difference between the lower-class boundaries of the next class and
the class
- We can also use the class limits in place of the class boundaries.
• Class mark is the midpoint of a class interval.
- It is the average of the lower-class limit and the upper-class limit or the
average of the lower-class boundary and upper-class boundary of a
class interval.
D. Steps in Constructing a Frequency Distribution
1. Make an array for the given data.
2. Determine the number of classes (K). There are no precise rules concerning
the optimal number of classes but Sturges’ formula can be used as a first
approximation.
Sturges’ formula: K = 1 + 3.322 log n, where n = number of observations
Note: Round off K to the nearest integer.
3. Determine the approximate class size (C). Whenever possible, all classes
should be of the same size.
➢ Solve for the range R = max - min

➢ Compute for C = R÷K, then round it up to the next number.
4. Determine the lowest class limit. The first class must include the smallest
value in the data set.

P a g e | 223
UNIT
4
2
Let’s Discover
5. Determine the upper limit of the lowest class using the formula:
UCL = LCL + C – 1(unit of measure)
Note:
5 unit of measure = 1
5.2 unit of measure = 0.1
5.34 unit of measure = 0.01
6. Determine the class boundaries using the following formula:

LCB = LCL - 1/2 (unit of measure)
UCB = UCL + 1/2 (unit of measure)
7. Determine all class limits by adding the class interval C to the limit of the
previous class.
8. Tally the frequencies for each class. Sum the frequencies and check against
the total number of observations.
Example: The data below shows the midterm exam scores of students in Math.
Construct a grouped frequency distribution for the data.
82 82 83 79 72 71 84 59 77 50
83 82 63 75 50 85 76 79 68 69
79 69 74 53 73 71 50 76 57 81
Solution:
50 53 63 69 72 75 77 79 82 83
50 57 68 71 73 76 79 81 82 84
50 59 69 71 74 76 79 82 83 85
2. Determine the number of classes (K).

◆ K = 1 + 3.322 log n, where n = 30
◆ K = 1 + 3.322 log (30) = 5.91 ≈ 6
We will construct a frequency distribution table with 6 classes.

P a g e | 224
UNIT
4
2
Let’s Discover
3. Determine the approximate class interval (C).
◆ R = max – min = 85 – 50 = 35
◆ C = R ÷ K = 35 ÷ 6 = 5.8 ≈ 6
4. Determine the lowest class limit.

◆ LCL = 50
◆ UCL = LCL + C - 1 (unit of measure)
◆ UCL = 50 + 6 – 1(1) = 55

◆ LCB = LCL – ½ (unit of measure) = 50 – ½(1) = 49.5
◆ UCB = UCL + 1/2 (unit of measure) = 55 + ½(1) = 55.5
7. Determine all class limits and class boundaries by adding the class interval C to the
limit of the previous class.
Class Limits LCB UCB
50 – 55 49.5 55.5
56 – 61 55.5 61.5
62 – 67 61.5 67.5
68 – 73 67.5 73.5
74 – 79 73.5 79.5
80 – 85 79.5 85.5
8. Determine the frequencies for each class. Note that the frequency is the number of
observations that fall into each class. For example, there are 4 observations in the
data array that falls int the 50-55 class interval. Thus, the frequency of the first class
is 4. Sum the frequencies and check against the total number of observations.
Class Limits LCB UCB Frequency
50 – 55 49.5 55.5 4
56 – 61 55.5 61.5 2
62 – 67 61.5 67.5 1
68 – 73 67.5 73.5 7
74 – 79 73.5 79.5 8
80 – 85 79.5 85.5 8
Total 30

P a g e | 225
UNIT
4
2
Let’s Discover
9. Add another column for the class marks. The class mark is found by getting the
average of the class limits.
Class Limits LCB UCB Frequency Class Mark
50 – 55 49.5 55.5 4 52.5
56 – 61 55.5 61.5 2 58.5
62 – 67 61.5 67.5 1 64.5
68 – 73 67.5 73.5 7 70.5
74 – 79 73.5 79.5 8 76.5
80 – 85 79.5 85.5 8 82.5
Total 30
Variations of Frequency Distribution
1. Relative Frequency (RF) Distribution and Relative Frequency Percentage (RFP)

RF = class frequency/ no. of observations
RFP = RF × 100%
2. Cumulative Frequency Distribution (CFD) - shows the accumulated frequencies of

successive classes, beginning at either end of the distribution.
• Greater than CFD ( > CFD) - shows the no. of observations greater than
the LCB
• Less than CFD ( < CFD) - shows the no. of observations less than the
UCB
Class LCB UCB Frequency RF RFP Class >CF <CF
Limits Mark
50 – 55 49.5 55.5 4 0.13 13 52.5 30 4
56 – 61 55.5 61.5 2 0.07 7 58.5 26 6
62 – 67 61.5 67.5 1 0.03 3 64.5 24 7
68 – 73 67.5 73.5 7 0.23 23 70.5 23 14
74 – 79 73.5 79.5 8 0.27 27 76.5 16 22
80 – 85 79.5 85.5 8 0.27 27 82.5 8 30
Total 30 1.00 100
•

P a g e | 226
UNIT
4
2
Let’s Discover
Categorical Frequency Distribution

The categorical frequency distribution is used to organize nominal-level or
ordinal-level type of data. Examples of this type of data are gender, religious affiliation,
nationality, etc.
Steps in Constructing a Categorical Frequency Distribution
1. Construct a table as shown below
Class Frequency Percentage
2. Determine the distinct categories of the variable of interest. Write all the
categories under the ‘Class’ column
3. Determine the corresponding frequencies
4. Determine the percentage by using the formula:
𝒇
𝑷𝒆𝒓𝒄𝒆𝒏𝒕𝒂𝒈𝒆 = × 𝟏𝟎𝟎%
𝒏
where f = frequency of the class and n = total number of observations
Example: Twenty students were given a performance evaluation. The data set is as
follows. Construct a frequency distribution for the data.
High High High Low Average
Average Low Average Average Average
Low Average Average High High
Low Low Average High High
Solution:
1. Construct a table as shown below
2. Determine the distinct categories of the variable of interest.

High
Average
Low

P a g e | 227
UNIT
4
2
Let’s Discover
3. Determine the corresponding frequencies
High 7
Average 8
Low 5
4. Determine the percentage by using the formula:

High 7 35 (7/20)*100
Average 8 40 (8/20)*100
Low 5 25 (5/20)*100
Total 20 100
E. Presentation of Data
After organizing data, the next step is to present data so they can be understood
easily. The most useful method of presenting data is by constructing graphs and charts.
There are a number of ways to plot charts, and each one has a specific purpose.
Graphical Methods for Interval/Ratio Data
1. Histogram is a bar graph that displays the classes on the horizontal axis and
frequencies of the class on the vertical axis. The vertical lines of the bars are
erected at the class boundaries and the height of the bars correspond to the class
frequency.
Steps in Constructing a Histogram

a. Draw and label the x-axis and y-axis.
b. Represent the frequency on the y-axis and the class boundaries on the x-axis.
c. Use the frequency to represent the height and draw the vertical bars.

P a g e | 228
UNIT
4
2
Let’s Discover
2. Frequency Polygon is a line chart that is constructed by plotting the frequencies
at the midpoints and connecting the plotted points by means of a straight line. The
polygon is closed by considering an additional class at each end of the lines that
are brought down to the horizontal axis at the midpoints of the additional classes.
Steps in Constructing a Frequency Polygon

a. Find the midpoints of each class.
b. Draw and label the x-axis and y-axis
c. Represent the frequency on the y-axis and the midpoints on the x-axis
d. Connect adjacent points with line segments. Draw a line back to the x-axis
at the beginning and end of the graph
3. Less than Cumulative Frequency Polygon (Less than Ogive) is a line chart that
is constructed by plotting the cumulative frequencies of the classes. The vertical
axis represents the less than cumulative frequency of the distribution while the
horizontal axis represents the upperclass boundaries of the frequency distribution.
Steps in Constructing a Less than Cumulative Frequency Polygon (Less than

Ogive)
a. Draw and label the x-axis and y-axis.
b. Represent the less than cumulative frequency on the y-axis and the upper
class boundaries on the x-axis.
c. Connect adjacent points with line segments.

P a g e | 229
UNIT
4
2
Let’s Discover
Other Types of Graphs
1. Line Graph – It is a presentation of data that shows a continuous change or

trend. It may show ascending or descending trend.
2. Bar Graph – It uses bars to compare the magnitudes of the different categories
of a variable of interest. It may be drawn vertically or horizontally. A vertical bar
chart is best to used when comparing means or percentage between distinct
categories while a horizontal graph may contain more than 5 categories. A bar
is plotted on either the x-axis or the y-axis
3. Pie Chart - is a type of graph that displays data in a circular graph. The
pieces of the graph are proportional to the fraction of the whole in each
category. In other words, each slice of the pie is relative to the size of that
category in the group as a whole. The entire “pie” represents 100 percent of a
whole, while the pie “slices” represent portions of the whole.

P a g e | 230
UNIT
4
2
Let’s Discover
4. Pictograph – is a type of graph in which each a symbol represents a definite

and uniform value
5. Scatter Plot - scatter plot uses dots to represent individual pieces of data. In
statistics, these plots are useful to see if two variables are related to each other.
For example, a scatter chart can suggest a linear relationship (also
called scatter graphs).
Qualities of a Good Graph (ACSA)

1. Accuracy
◼ should not be deceptive, distorted, misleading, or in any way susceptible to
wrong interpretations
◼ not create any optical illusion
2. Clarity
◼ easily read and understood
◼ should focus on the message it is trying to communicate
◼ able to aid the reader in the interpretation of facts
3. Simplicity
◼ should be simple, straight-forward, not loaded with irrelevant, superfluous, or
trivial symbols and ornamentation

4. Appearance
◼ designed and constructed to attract and hold attention by holding a neat,
dignified, and professional appearance

P a g e | 231
UNIT
4
2
Let’s Do It!
Let’s have some exercises on constructing frequency distribution tables and graphs
1. A marketing research consultant conducted a survey of 40 persons who used to visit

fastfood chains in one morning. The age of persons was recorded to the nearest
year as follows. Construct a frequency distribution and a histogram based on data.
16 40 34 50 22 32 19 47 46 32
29 24 46 33 53 21 40 27 48 31
44 28 35 38 44 44 30 50 29 42
36 47 26 19 55 41 47 33 27 28
Solution:
16 19 19 21 22 24 26 27 27 28
28 29 29 30 31 32 32 33 33 34
35 36 38 40 40 41 42 44 44 44
46 46 47 47 47 48 50 50 53 55
2. Determine the number of classes (K).

◆ K = 1 + 3.322 log n, where n = 40
◆ K = 1 + 3.322 log (40) = 6.32 ≈ 6
We will construct a frequency distribution table with 6 classes.
3. Determine the approximate class size (C).

➢ Solve for the range R = max - min = 55 – 16 = 39
➢ Compute for C = R÷K, then round it up to the next number.
➢ C = 39 ÷ 6 = 6.5. Round it up to 7.
4. The lowest class limit (LCL) will be 16 since the smallest value in the data set
is 16.
◆ UCL = LCL + C - 1 (unit of measure)
◆ UCL = 16 + 7 – 1(1) = 22

P a g e | 232
UNIT
4
2

◆ LCB = LCL – ½ (unit of measure) = 16 – ½(1) = 15.5
◆ UCB = UCL + 1/2 (unit of measure) = 22 + ½(1) = 22.5
7. The next class limits will be determined by adding the class interval C = 7.
8. The frequencies are found by determining the number of observations in each
class interval
9. The relative frequency (RF) and relative frequency percentage (RFP is found
using the formula
RF = class frequency/ no. of observations
RFP = RF × 100%
10. The class marks are found by getting the middle value of the class interval.
That is, get the average of the lower and upper class limits.
11. The <cf is the number of observations that is lesser than the upper class
boundary. For instance, in the first class interval, there are 5 observations
less than 22.5. Moreover, in the second class interval, there are 5 + 8 = 13
observations less than 29.5
The table below presents the completed frequency distribution table.
Class Limits LCB UCB Frequency RF RFP Class Marks <CF
16 – 22 15.5 22.5 5 0.125 12.5 19 5
23 – 29 22.5 29.5 8 0.2 20.0 26 13
30 – 36 29.5 36.5 9 0.225 22.5 33 22
37 – 43 36.5 43.5 5 0.125 12.5 40 27
44 – 50 43.5 50.5 11 0.275 27.5 47 38
51 - 57 50.5 57.5 2 0.05 5.0 54 40
Total 40 1.00 100
To graph the histogram, construct the x-axis and y-axis. Represent the frequencies on
the y-axis and class boundaries on the x-axis. The height of the vertical bars
corresponds to the frequency of each class.

P a g e | 233
UNIT
4
2
2. Mr Luna is working as a librarian at a university. At the end of the week, he is

tasked to make a report on the number of books borrowed at a library. Below is
the list of names of students who borrowed books in the said week. Construct a
bar graph on the number of books borrowed.
Monday Tuesday Wednesday Thursday Friday
Concepcion, Sammy Lustre, Nadz Yap, Kim Go, Borgy Padilla, Anne
Reid, Jay-r Bernardo, Chandria Gonzaga, Allen Lapid, Lita Curtis, Annie
Perez, JV Chiu, Sue Kramer, Ken Revilla, Bea Crawford, Bella
Ramirez, Ken Liz, Zia Binay, Nene Corpuz, Thadeo
Tan, Anna Angara, Sony
Uy, Leo
Diaz, Marie
Verzosa, Kyline
Gabriel, Jose
Solution:
Start with identifying the number of books borrowed per day. There were 3 books
borrowed on Monday, 4 on Tuesday, 9 on Wednesday, 5 on Thursday, and 4 on Friday.
The bar graph would look like this.

Let’s have some more exercises on organizing and presenting data.
1. Samples of forty-two (42) college students are considered for study and were
categorized according to year level. Construct a frequency distribution for the data.
Senior Freshman Freshman Sophomore Freshman Junior Junior
Junior Sophomore Sophomore Freshman Sophomore Freshman Junior
Sophomore Freshman Sophomore Senior Senior Senior Sophomore
Freshman Freshman Senior Sophomore Freshman Sophomore Junior
Sophomore Junior Freshman Freshman Sophomore Junior Junior
Freshman Freshman Sophomore Senior Junior Freshman Freshman

Class Frequency Relative Frequency Percentage
P a g e | 234
UNIT
4
2
Solution:
Our variable of interest is the year level of the students. The categories are freshmen,
sophomore, junior, and senior. The frequency distribution tables is as follows

Freshmen 15 36
Sophomore 12 29
Junior 9 21
Senior 6 14
Total 42 100
2. The number of postpaid cellular phone subscribers for each of the last 12
years is listed below. Use a line graph to represent these figures. Interpret the
result.
Year 2004 2005 2006 2007 2008 2009
No. of Subscribers
3.12 4.10 4.23 3.96 3.87 3.50
(in millions)
Year 2010 2011 2012 2013 2014 2015

No. of Subscribers
4.67 4.99 4.86 4.96 5.01 5.18
(in millions)
Solution:
As reflected in the line chart, there is an increasing trend in the number of subscribers
from 2004 to 2015 although a drop in the number of subscribers is seen from 2007 to
2009.

P a g e | 235
UNIT
4
2

Now, it’s your time to answer. Write your solution in your answer booklet.
Activity 35! Organize Me!
1. The food services department of a cinema is studying the amount spent by an

individual on food and drinks during his visit per day. A sample of 30 individuals
who visited the cinema yesterday revealed they spent the following amounts.
145 122 138 180 155 170 186 165 200 100
142 130 155 195 163 178 185 168 120 220
140 128 150 193 160 175 184 165 115 210
a. Construct an array
b. Determine the number of classes K
c. Determine the class interval C.
d. Determine the LCL, UCL, LCB, and UCB of the first class interval
e. Complete the frequency distribution table.

Class Limits LCB UCB Frequency RF RFP Class Mark <CF
Total
Country No. of Cars Imported

Japan 225,000
P a g e | 236
UNIT
4
2
f. Construct a frequency histogram for the data.
2. The Land Transportation Office (LTO) is interested in the number of brand new
cars imported to the Philippines in 2015. The following data are as follow:
Country No. of Cars Imported
Japan 225,000
South Korea 78,300
USA 120,250
United Kingdom 19,200
Italy 16,750
China 40,500
Total 500,000
Construct a bar chart for given data and interpret.

P a g e | 237
UNIT
4
2
Let’s Extend and Link
How to create Frequency Distribution Tables and Charts using MegaStat

MegaStat is an add-in toolbar in Microsoft Excel that can be used in doing statistical
analysis. It offers various commands such as generation of frequency distribution
tables and testing of hypotheses. Explore the different menus on generating frequency
tables for quantitative and qualitative data, and in creating graphs and charts.
You may visit this link: https://www.youtube.com/watch?v=t_crb_Fp44s
Bibliography
Almeda, J.V., Capistrano, T.G., and Sarte, G. (2010). Elementary Statistics. The
UP Press, Diliman, Quezon City
Aufmann, R. N., et al. (2018). Mathematics in the Modern World (Chapter 4).
Philippines: Rex Book Store
Aufmann, R. N., et al. (2013). Mathematical Excursions (Chapter 13). Brooks/Cole:
Cengage Learning
Sirug, W. (2018). Mathematics in the Modern World. Mindshapers Co. Inc.

P a g e | 238
Measures of Central Tendency

LESSON 3
DAYONO, JENESA A. (3 HOURS)
LEARNING OUTCOMES
• Compute and interpret the different measures of central

tendency
INTRODUCTION

Measures of central tendency are sometimes called
measures of central location. They are also classed as summary
statistics. The mean (often called the average) is most likely the
measure of central tendency that you are most familiar with, but
there are others, such as the median and the mode.
The mean, median and mode of data represent a single

value that represents the data set. Its purpose is to locate the
center of a data set. This lesson discusses three different
measures of central tendency: the mean, median, and the mode.
We will illustrate how to calculate each of these measures for
ungrouped and grouped data.

P a g e | 239
UNIT
LESSON Measures of Central Tendency
4
3
Let’s Look Back
You have learned from your previous study of mathematics how to find the
mean, median and mode of the data. To find out if you are ready to this lesson, do
the following activity. Write your answer in your answer booklet.
Activity 36: 3M’s
A student recorded her scores on weekly math quizzes that were marked out
of a possible 10 points. Her scores were as follows:
8 5 8 5 7 6 7 7 5 7
5 5 6 6 9 8 9 7 9 9
6 8 6 6 7
1. What is the mean of her scores on the weekly math quizzes?
2. What is the median of her scores on the weekly math quizzes?
3. What is the mode of her scores on the weekly math quizzes?
Previously, you were tasked to find the mean, median and mode on the weekly math
quizzes of a student.
1. To solve for the mean, simply used this formula:
, where n is the number of observations in the sample.
Thus,

P a g e | 240
UNIT
4
3
To solve for the median, simply arranged the data in ascending or descending order.
In this data, we will arrange it by ascending order.
5 5 5 5 5 6 6 6 6 6 6 7 7 7 7 7 7 8 8 8 8 9 9 9 9
Rule: If n is odd, the median is the middle ranked.

Thus, Median= 7
To solve for the mode, just find the most frequently occurring value.
Thus, Mode=7
Let’s Discover
A. Mean
The arithmetic mean, often called as the mean, is the most frequently used measure
of central tendency. The mean is the only common measure in which all values play
an equal role, meaning, to determine its values you would need to consider all the
values of any given data set. The mean is appropriate to determine the central
tendency of an interval or ratio data.
The symbol , called “x bar” is used to represent the mean of a sample and the
symbol , called “mu” , is used to denote the mean of a population.
Properties of Mean
1. A set of data has only one mean.
2. Mean can be applied for interval and ration data.
3. All values in the data set are included in computing the mean.
4. The mean is very useful in comparing two or more data sets.
5. Mean is affected by the extreme small or large values on a data set.
6. Mean is most appropriate in symmetrical data.

P a g e | 241
UNIT
4
3
where:
=sample mean
= population mean
x= the value of any particular observation or measurement.
= sum of all x’s
n=total number of values in the sample

N=total number of values in the population
Example 1:
The daily salaries of a sample of eight employees at GMS Inc. are Php550, Php420,
Php560, Php500, Php700, Php670, Php860, Php480. Find the mean daily rate of
employees.
Solution:
The mean daily salary of employees is Php 592.50.
Example 2:
Find the population mean of the ages of 9 middle-management employees of a
certain company. The ages are 53,45,59,48,54,46,51,58, and 55.
Solution:
The mean population age of middle-management employees is 52.11.

P a g e | 242
UNIT
4
3
B. Median
The median is the midpoint of the data array. When the data set is ordered,
whether ascending or descending order, it is called a data array. Median is an
appropriate measure of central tendency for data that are ordinal or above, but is
more valuable in an ordinal type of data.
Properties of Median
1. The median is unique, there is only one median for asset of data.
2. The median is found by arranging the set of data from lowest or highest (or
highest to lowest) and getting the value of the middle observation.
3. Median is not affected by the extreme small or large values.
4. Median can be applied for ordinal, interval and ration data.
5. Median is most appropriate in a skewed data.
To determine the value of median for ungrouped, we need to consider two rules:
1. If n is odd, the median is the middle rank.
2. If n is even, then the median is the average of the two middle ranked values.
Median (Rank Value)=
Example 1: Find the median of the ages of 9 middle-management employees of a

Solution:
Step 1: Arrange the data in order.
45, 46, 48, 5, 53, 54, 55, 58, 59
Step 2: Select the middle rank value.
Step 3: Identify the median in the data set.

45, 46, 48, 5, 53, 54, 55, 58, 59
5th
Hence, the median age is 53 years.

P a g e | 243
UNIT
4
3
Example 2: The daily salaries of a sample of eight employees at GMS Inc. are
Php550, Php420, Php560, Php500, Php700, Php670, Php860, Php480. Find the
median daily rate of employee.
Solution:
Step 1: Arrange the data in order.
Php420, Php480, Php500, Php550, Php560, Php670, Php700, Php860
Step 2: Select the middle rank value.
Step 3: Identify the median in the data set.

Php420, Php480, Php500, Php550, Php560, Php670, Php700, Php860
4.5th
Since the middle point falls between Php550 and Php560, we can determine the
median of the data set by getting the average of the two values.
Therefore, the median daily rate is Php 555.
C. Mode
The mode is the value in a data set that appears most frequently. Like the median
and unlike the mean, extreme values in a data set do not affect the mode. A data
may not contain any mode if none of the values are “most typical”. A data set that has
only one value that occurs the greatest frequency is said to be unimodal. If the data
has two values with the same greatest frequency, both values are considered the
mode and the data set is bimodal. If a data set has more than two modes, then the
data set is said to be multimodal. There are some cases when a data set values
have the same number of frequencies. When this occurs, the data set is said to be
no mode.

P a g e | 244
UNIT
4
3
Properties of Mode
1. The mode is found by locating the most frequently according value.
2. The mode is the easiest average to compute.
3. There can be more than one mode or even no mode in any given data set.
4. Mode is not affected by the extreme small or large values.
5. Mode can be applied for nominal, ordinal, interval and ratio data.
Example 1: The following data represent the total unit sales for Smartphones from a
sample of 10 Communication Centers for the month of August: 15, 17, 10, 12, 13, 10,
14, 10, 8, and 9. Find the mode.
Solution:
The ordered array for these data is 8, 9, 10, 10, 10, 12, 13, 14, 15, 17.\
Because 10 appears 3 times, more tha the other values, therefore the mode is 10.
Example 2. Find the mode of the ages of 9 middle-management employees of a

Solution:
The ordered array for these data is 45, 46, 48, 51, 53, 54, 55, 58, 59.
There is no mode since the data set has the same frequency.

P a g e | 245
UNIT
4
3
Let’s Do It!
Apply the knowledge learned and do the following task:
Solving mean, median and mode
The following are the ages of new recoveries of the COVID-19 patient in Western
Visayas on August 2, 2020. The following ages are: 30, 20, 40, 20, 24, 5, 30, 45.
Solve the mean, median and mode. Interpret the results.
Solution:
The mean age of new recoveries of the COVID-19 patient in western Visayas on
August 2, 2020 is 27 years old.
Arrange the data in ascending order,
5 20 20 24 30 30 40 45
The media age of new recoveries of the COVID-19 patient in western Visayas on
August 2, 2020 is 27 years old.
5 20 20 24 30 30 40 45
The most occurring value is 30, therefore modal age of new recoveries of the COVID-
19 patient in western Visayas on August 2, 2020 is 30 years old.

P a g e | 246
UNIT
4
3
The following are total number of COVID-19 Quarantine Facilities of the 8 provinces
in the Western Visayas. The total numbers are the identified isolation facilities and
other facilities for OFW’s. These are: 17, 23, 72, 6, 144, 41, 6 and 7. (Data Source:
DOH6 Data April 30, 2020). Solve the mean, median and mode. Make an
observation on the results.
See your answers here:
Mean 39.50 The result shows that there were only average of 40
Quarantine Facilities of the 8 provinces in the Western
Visayas.
Median 20 The result is not affected by the extreme values, it shows that
the average (median) number of Quarantine Facilities of the 8
provinces in the Western Visayas is 20.
Mode 6 The modal number of Quarantine Facilities of the 8 provinces
in the Western Visayas is 6.
Activity 37. Problem Solving: Measures of Central Tendency
The data below shows the monthly (February-July, 2020) electricity bill of a particular
household in Rivera Ville, Kabankalan City, Negros Occidental. Solve for the mean,
median mode and interpret the results.
Month February March April May June July
Electricity Bill (in pesos) 2,751 2,636 4,130 3,410 2,910 2,731

P a g e | 247
UNIT
4
3
Let’s Sum It Up
The arithmetic mean, often called as the mean, is the most frequently used
measure of central tendency, median is the midpoint of the data array, and the
mode is the value in a data set that appears most frequently.
The following are properties of mean: (1) A set of data has only one mean’ (2)
Mean can be applied for interval and ration data; (3) All values in the data set are
included in computing the mean; (4) The mean is very useful in comparing two or
more data sets’ (5) Mean is affected by the extreme small or large values on a data
set; (6) Mean is most appropriate in symmetrical data.
The following are properties of median: (1)The median is unique, there is only
one median for asset of data; (2) The median is found by arranging the set of data
from lowest or highest (or highest to lowest) and getting the value of the middle
observation; (3) Median is not affected by the extreme small or large values; (4)
Median can be applied for ordinal, interval and ration data; (5) Median is most
appropriate in a skewed data.
The following are properties of mode: (1) The mode is found by locating the
most frequently according value; (2) The mode is the easiest average to compute; (3)
There can be more than one mode or even no mode in any given data set; (4) Mode
is not affected by the extreme small or large values; (5) Mode can be applied for
nominal, ordinal, interval and ratio data.
Try the following link and do check your manual computation of the previous
exercises using MS Excel.
Calculating The Standard Deviation, Mean, Median, Mode, Range, & Variance Using
Excel https://www.youtube.com/watch?v=k17_euuiTKw

P a g e | 248
UNIT
4
3
Bibliography
Book
Sirug, W.S. (2018). Mathematics in the Modern World. A CHED General Education
Curriculum Compliant. Mindshapers Co., Inc.
Internet
"Mean, Median, Mode Practice Problems - Rio Salado College."
https://www.riosalado.edu/web/oer/wrkdev100-
20011_inter_0000_v1/m5/pdf/m5_l1_mean_median_mode_practice_probs.pdf.
Accessed 12 Aug. 2020.
"Measures of Central Tendency: Mean, Median, Mode & Range."
https://www.lcps.org/cms/lib4/VA01000195/Centricity/Domain/10878/9.4%20Notes%
20and%20Practice%20MMMR%20and%20MCT.pdf. Accessed 12 Aug. 2020.
"STATISTICS Unit 1 Measures of Central Tendency TEXT."
http://www.cimt.org.uk/cmmss/S1/Text.pdf. Accessed 12 Aug. 2020.

P a g e | 249
Measures of Dispersion
LESSON 4
LEARNING OUTCOMES
• Compute and interpret the different measures of

dispersion
INTRODUCTION

Another important characteristic of a data set is how it is
distributed, or how far each element is from some measure of
central tendency (average). There are several ways to measure
the variability of the data. Although the most common and most
important is the standard deviation, which provides an average
distance for each element from the mean, several others are also
important, and are hence discussed here.
The measures of central tendency we studied in the last
section (mean, median and mode) of a set of data can be
deceiving. Using only mean, median and mode to make a decision
may lead to mistakes. Standard deviation is a number that
describes how bunched up or spread out a group of data is.
Knowing the standard deviation of a group of data may help you
make more informed decisions.
Hence, this module will introduce you to the concepts of range,

variance and standard deviation.

P a g e | 250
UNIT
LESSON Measures of Dispersion
4
4
Let’s Look Back
Activity 38: Examine Me!
Take a look on the figure below. What differences have you seen on the two curves?
Write three observations on the space provided
COVID-19 and Flattening of the Curve
My observations:
1.
2.
3.

P a g e | 251
UNIT
4
4
If you examine the curve, the major difference that you can see between them
is the peakedness of the curve. The curve on the left is more peaked than the one on
the right. But what does this tells us?
Note that the government has set protocols to minimize the spread of the
COVID- 19 virus. Based on the figure earlier, if we do not take steps to minimize the
spread of the virus, we will have more cases in just a short period of time. On the
other hand, if we follow health protocols, we will reach the peak slowly over a period
of time.
We will relate this with the measures of dispersion which will be discussed on
the next section.
Let’s Discover
Measures of Dispersion
Measures of dispersion indicate the extent to which individual items in a
series are scattered about an average. It is used to determine the extent of the
scatter so that steps may be taken to control the existing variation. It is also used as
measure of reliability of the average value.
A. Range
Probably the simplest and easiest way to determine measure of dispersion is
the range. The range of a set of measurement is the difference between the largest
and the smallest values. There are two advantages of range: (i) it is easy to compute
and (ii) it is easy to understand. On the other hand, it also has two disadvantages, it
can be distorted by a single extreme value (or outlier) and only two values are used
in the calculation.
Example 1:
The daily rates of a sample of eight employees at GMS Inc. are Php550, Php420,
Php560, Php500, Php700, Php670, Php860, Php480. Find the range.
Solution:
Range (R)= maximum value- minimum value
Php860- Php420= Php440

P a g e | 252
UNIT
4
4
Example 2. The IQ score of the Morales’ family are 108,112,127,116, and 113. Find
the range.
Solution:
Range (R)= maximum value- minimum value
B. Variance and Standard Deviation

One of the most widely used measures of dispersion is the standard
deviation. The more spread apart the data, the higher the deviation. Standard
deviation is calculated as the square root of variance. In finance, standard deviation
is applied to the annual rate of return of an investment to measure the investment’s
volatility. Standard deviation is also known as historical volatility and is used by
investors as a gauge for the amount of expected volatility.
A measure of the dispersion of a set of data points around their mean value.
Variance is a mathematical expectation of the average squared deviations from the
mean.
Sample Variance and Sample Standard Deviation for Ungrouped Data
where: sample variance

sample standard deviation
the value of any particular observation or measurement
sample mean
sample population
Example 1. The daily rates of a sample of eight employees at GMS Inc. are Php550,
Php420, Php560, Php500, Php700, Php670, Php860, Php480. Find the variance
and standard deviation.
Solution:
Step 1: Compute the mean of the data set.

P a g e | 253
UNIT
4
4
Step 2: Subtract the mean from each of the value in the data set.
550 -42.5
420 -172.5
560 -32.5
500 -92.5
700 107.5
670 77.5
860 267.5
480 -112.5
Step 3: Square the , then get the sum
550 -42.5 1,806.25

420 -172.5 29,756.25
560 -32.5 1,056.25
500 -92.5 8,556.25
700 107.5 11,556.25
670 77.5 6,006.25
860 267.5 71,556.25
480 -112.5 12,656.25
Step 4: Solve for variance and the standard deviation.
Hence, the variance is Php 20, 421.43 and the standard deviation id Php 142.90.

P a g e | 254
UNIT
4
4
Alternative solution can also be obtain using this formula:
Example 2: Consider the following two sets of data of engine life. Find the standard
deviation for each engine manufacturer.
A. 500, 500, 500, 1500, 1500, 1500 B. 925, 925, 925, 975, 975, 975
Solution: Company A
500 -500
500 -500
500 -500
1500 500
1500 500
1500 500
Step 3: Square the , then get the sum.
500 -500 250,000

500 -500 250,000
500 -500 250,000
1500 500 250,000
1500 500 250,000
1500 500 250,000

P a g e | 255
UNIT
4
4
Solution: Company B
925, 925, 925, 975, 975, 975
925 -25
925 -25
925 -25
975 25
975 25
975 25
Step 3: Square the , then get the sum.
925 -25 625

925 -25 625
925 -25 625
975 25 625
975 25 625
975 25 625

P a g e | 256
UNIT
4
4
Even though company A’s engines last 50 hours longer on average of

Company B, but looking at the respective standard deviation, Company A
has a higher deviation (the data spread widely) compared to Company
B thus, Company B’s engines may be the better choice. Of course there
are many other factors, such as cost that need to go into the decision. The standard
deviation show a difference that the mean just can’t show.
We can also use the concept of relative dispersion in these data to compare
the scatter of one distribution with another.
Relative Dispersion
Measures of Relative dispersion are unitless and are used when one wishes to
compare the scatter of one distribution with another distribution.
Coefficient Variation
The coefficient of variation, CV, also known as relative dispersion, is the ratio of the
standard deviation to the mean and is usually expressed in percentage. It is
computed as:
For Company A: For Company B:
By looking at the coefficient variation of two companies, Company A has a

large variability compared to Company B. Thus company B data are spread not too
far from the mean.

P a g e | 257
UNIT
4
4
Population Variance and Population Standard Deviation
where: population variance

population standard deviation
the value of any particular observation or measurement
population mean
population
Example 1. The monthly income of the five research directors of Recoletos schools
are: Php55,000, Php59,500, Php62,500, Php57,000, and Php61,000. Find the
variance and standard deviation.
Solution:
55,000 -4,000 16,000,000

59,500 500 250,000
62,500 3,500 12,250,000
57,000 -2,000 4,000,000
61,000 2,000 4,000,000
Hence, the population variance is Php730,000 and the population standard

deviation is Php2,701.85.

P a g e | 258
UNIT
4
4
Let’s Do It
Apply the knowledge learned and do the following activity.

A. The following are the ages of new recoveries of the COVID-19 patient in Western
Calculate the range, variance and standard deviation.
Solution:
Range = 45-5=40
For variance and standard deviation:

Compute the mean of the data set.
Step 2 & 3. Subtract the mean from each of the value in the data set and square the
, then get the sum.
30 3.25 10.56
20 -6.75 45.56
40 13.25 175.56
20 -6.75 45.56
24 -2.75 7.56
5 -21.75 473.06
30 3.25 10.56
45 18.25 333.06
Hence, the variance is 157.36 and the standard deviation is 12.54

P a g e | 259
UNIT
4
4
B. At the Steakhouse Grill, steaks are cut into 12‐ounce portions when they are
ordered by a customer. For the past month, the manager of Steakhouse Grill has
been trying to determine which cook can mostly accurately cut portion of the meat.
The manager collected the mean of sample meat portions for a few weeks for both of
his cooks.
The results are summarized in the table below.
Week 1 Week 2 Week 3 Week 4
Opie 12.5 12.3 11.8 11.2
Maynard 12.0 12.2 11.8 12.4
1. Compute the mean portion size for each cook.

b) Compute the standard deviations for each cook.
c) The manager needs to choose a cook to cut portions of meat throughout the week.
Who should he choose? Why?
Solution (Opie):
Step 2 & 3: Subtract the mean from each of the value in the data set and Square the
, then get the sum.
12.5 0.55 0.3025

12.3 0.35 0.1225
11.8 -0.15 0.0225
11.2 -0.75 0.5625
Thus, the mean cut of Opie is 11.95 oz and his standard deviation is 0.58.

P a g e | 260
UNIT
4
4
Solution (Maynard):
Step 2 & 3: Subtract the mean from each of the value in the data set and Square the
, then get the sum.
12 -0.1 0.01
12.2 0.1 0.01
11.8 -0.3 0.09
12.4 0.3 0.09
Thus, the mean cut of Maynard is 12.10 oz and his standard deviation is 0.26.
Despite the fact that Opie’s mean of 11.95 is closer to 12 ounce than
Maynard’s mean which is 12.10, their standard deviation would speak rightfully for
whom among the cook will be chosen by the manager to cut portions of meat
throughout the week.
The manager needs to choose Maynard because standard deviation of
Marnard’s cut is smaller compare to Opie. The smaller the value of standard
deviation, the closer it is to the mean. Thus, the cut of Maynard is closer to the
required cut of the Steakhouse Grill which is 12 ounce. It is true then that using only
the mean, one’s decisions may lead to mistakes.

P a g e | 261
UNIT
4
4
A. The following are total number of COVID-19 Quarantine Facilities of the 8

provinces in the Western Visayas. The total numbers are the identified isolation
facilities and other facilities for OFW’s. These are: 17, 23, 72, 6, 144, 41, 6 and 7.
(Data Source:DOH6 Data April 30, 2020). Calculate the range, variance and standard
deviation.
B. If a cereal box is supposed to contain 20 oz. of cereal then consumers expect 20

oz. What will consumers feel if the cereal box is underfilling? How will happen to the
investors of the company if the cereal box is overfilling?
Answers:
A. Range: 138 Variance: 2,294 Standard Deviation: 47.90
B. The amount varies too much from box to box- sometimes under filling, sometimes
overfilling. The manufacturer will soon be in trouble with consumer groups (if under
filling) or the investors in the company will lose potential profit (if overfilling).
Activity 39: Problem Solving: Measures of Dispersion
A. The data below shows the monthly (February-July, 2020) electricity bill of a
particular household in Rivera Ville, Kabankalan City, Negros Occidental. What is the
variance and standard deviation of the data? Show your complete solution in your
answer booklet.
B. Working hours for a week were recorded by the two teachers who were working at
home due to COVID-19 pandemic. The required working hours for a permanent
employee is 8 hours. Consider the following data and compute for the variation.
Which among the two teachers were working productively?
Teacher A: 8.5 8 7.25 7.25 9
Teacher B: 7.75 8.25 7.75 8.25 8

P a g e | 262
UNIT
LESSON Measures of Dispersion 4
4
Let’s Sum It Up
Measures of dispersion indicate the extent to which individual items in a series are
scattered about an average.
The simplest and easiest way to determine measures of dispersion is the range. The
range of a set of measurement is the difference between the largest and the smallest
values.
One of the most widely used measures of dispersion is the standard deviation.
Standard deviation is calculated as the square root of variance. The more spread
apart the data, the higher the deviation.
When one wishes to compare the scatter of one distribution with another distribution,
measures of relative dispersion (unitless) are used.
The coefficient of variation, CV, also known as relative dispersion, is the ratio of the
standard deviation to the mean and is usually expressed in percentage.
Try the link and do check your manual computation of the previous exercises using
MS Excel.
Calculating The Standard Deviation, Mean, Median, Mode, Range, & Variance Using
Excel https://www.youtube.com/watch?v=k17_euuiTKw
Bibliography
Book
Sirug, W.S. 2018. Mathematics in the Modern World. A CHED General Education
Zamora, C.O. et, al. 2003. Elementary Statistics. Text/Workbook.
"Measures of Central Tendency: Mean, Median, Mode & Range."
https://www.lcps.org/cms/lib4/VA01000195/Centricity/Domain/10878/9.4%20Notes%
20and%20Practice%20MMMR%20and%20MCT.pdf. Accessed 12 Aug. 2020.
https://web.gccaz.edu/~JOHWD63181/MAT142/chapter_3/problems/section%203.4.p
df

P a g e | 263
Measures of Relative Position

LESSON 5
LEARNING OUTCOMES
• Compute and interpret different measures of position
INTRODUCTION

When you take an exam, what is often as important as your
actual score is the way your score compares to other students’
scores. If you made a 70 but the average score is 85 you did
relatively poor. If you made a 70 but the average score was only
55, then you did relatively well. The significance of one observed
value in a data set often depends on how that value compares to
the other values in that data set. This is why measures of relative
position or rank of data points in particular data sets are useful.
Hence, this module will introduce you to the following
lessons:
• Measures of Relative Position

- Quartiles, Percentiles, z-scores, Boxplots (Box-and-
Whisker Plot)

P a g e | 264
UNIT
LESSON Measures of Relative Position
4
5
Let’s Look Back
Activity 40. What is my Rank?

Examine the infographic below.
Based from the activity 40, answer the questions below. Write your answer in your
answer booklet.
1. What country ranks first in terms of number of cases for COVID-19?

2. In terms of recoveries, who tops among the ten countries?
3. In terms of death, which country falls at the middle rank?
4. Among ten countries, which country has the least number of deaths?
5. Enumerate those countries which death records falls below 100?

P a g e | 265
UNIT
4
5
Let’s Discover
Measures of Relative Position

Measures of location/position are values below which a specified fraction or
percentage of the observation in a given set must fall.
A. Quartiles
Quartiles are values that divide the array into 4 equal parts. Thus,
𝑄1 , read as first quartile, is the value below which 25% of the values fall
𝑄2, read as second quartile, is the value below which 50% of the values fall
𝑄3 , read as third quartile, is the value below which 75% of the values fall
𝑘(𝑁 + 1)
𝑄𝑘 =
4
where: 𝑄𝑘 =Quartile
𝑁 = population
𝑘 = quartile location
Example 1: Find the first, second and third quartiles of the ages of 9 middle-
management employees of a certain company. The ages are
53,45,59,48,54,46,51,58, and 55.
Solution:
Step 1: Arrange the data in order from lowest to highest.
45, 46, 48, 51, 53, 54, 55, 58, 59
Step 2: Select the first, second and third quartiles using the formula.
1(𝑁 + 1) 1(9 + 1) 10
𝑄1 = = = = 2.5
4 4 4
2(𝑁 + 1) 2(9 + 1) 20
𝑄2 = = = =5
4 4 4
3(𝑁 + 1) 3(9 + 1) 30
𝑄3 = = = = 7.5
4 4 4

P a g e | 266
UNIT
4
5
Step 3: Identify the first, second, and third quartile values in the data set.
45, 46, 48, 51, 53, 54, 55, 58, 59
2.5th 5th 7.5th
Since the 2.5th falls between 46 and 48; and 7.5 th falls between 55 and 58 we
can determine the first and third quartiles of the data set by getting the average of the
two values.
46+48 94 55+58 113
𝑄1 = 2
= 2
= 47 𝑄3 = 2
= 2
= 56.5
Therefore, the first quartile is 47, which is interpreted as 25% of the ages are
below 47; the second quartile is 53, which is interpreted as 50% of the ages are
below 53; and the third quartile is 56.5, which is interpreted as 75% of the ages are
below 56.5
B. Percentiles
Percentiles are values that divide a set of observations in an array into 100 equal
parts. Thus,
𝑃1 , read as first percentile, is the value below which is 1% of the values fall
𝑃2 , read as second percentile, is the value below which is 2% of the values fall
⋮
𝑃99 , read as ninety-nine percentile, is the value below which is 99% of the values fall
𝑘(𝑁 + 1)
𝑃𝑘 =
100
where: 𝑃𝑘 =Percentile
𝑁 = population
𝑘 = percentile location
Example 1: Find the 64th and 80th percentiles of the ages of 9 middle-management
employees of a certain company. The ages are 53,45,59,48,54,46,51,58, and 55.
45, 46, 48, 51, 53, 54, 55, 58, 59

P a g e | 267
UNIT
4
5
Step 2: Select the 64th and 80th percentiles using the formula.
𝑘(𝑁 + 1) 64(9 + 1) 640
𝑃64 = = = = 6.4
100 100 100
In this case, to find the 6.4 th position, subtract 6th location from 7th location (55-54=1),
then multiply the difference to the decimal part of the 6.4 th location (1*.4=.0.4). Add
the product to the the number in the 6 th position (54+.4=54.4)
𝑘(𝑁 + 1) 80(9 + 1) 800
𝑃80 = = = =8
100 100 100
Step 3: Identify the 64th and 80th percentiles value in the data set.
45, 46, 48, 51, 53, 54, 55, 58, 59
8th
Therefore, 64th percentile is 54.4, which is interpreted as 64% of the ages are
below 54.4; and the 80th percentile is 58, which is interpreted as 80% of the ages are
below 58.
C. z-scores
z-score is used to know the position of one observation relative to others in a
set of data. Let say, we want to know a score of a student of 42 compared to the
scores of the other students in the class based from a quiz on a total of 50 points.
The mean and the standard deviation of the scores can be used to compute a z-
score, which will measure a relative standing of a measurement in a data set.
A z-score measures the distance between an observation and the mean,
measured in units of standard deviation. The following formulas show how to
compute the z-score for a data value x in a population and in a sample.
𝑥−𝜇 𝑥−𝑥
𝑧= 𝜎
(for population) 𝑧= 𝑠
(for sample)
Example 1: Ruel got a grade of 75% in English and a grade of 90% in History. The
mean grade in English is 65% and the standard deviation is 10% whereas in History,
the mean grade is 80% and the standard deviation is 20%. In which subject did Ruel
perform better?
Solution:
90−80 75−65
𝑧ℎ𝑖𝑠𝑡𝑜𝑟𝑦 = = 0.5 𝑧𝑒𝑛𝑔𝑙𝑖𝑠ℎ = = 1.0
20 10

P a g e | 268
UNIT
4
5
Ruel’s grade in History has a z-score of 0.5 and is closer than the mean
(average) while his English grade has a z-score of 1.0 and is farther way from the
mean, therefore, Ruel performed better in English.
Example 2. The monthly expenditures of a large group of households are normally
distributed with a mean of Php 48, 700 and a standard deviation of Php 10,400. What
is the z-value of monthly expenditures of Php59,400 and Php38,300?
Solution:
59,400−48,700
𝐹𝑜𝑟 𝑥 = 59,400 𝑧= 10,400
= 1.00
38,300−48,700
𝐹𝑜𝑟 𝑥 = 38,300 𝑧= = −1.00
10,400
The z of 1.00 indicates that a monthly expenditure of Php 59, 400 for
households is one standard deviation above the mean, and a z of -1.00 shows that a
Php 38,300 monthly expenditure is one standard deviation below the mean. Note that
both household monthly expenditures (Php 59,400 and Php 38,300) are the same
distance (Php 10,400) from the mean.
D. Box-and-Whisker Plot
John Wilder Turkey (1915-2000) introduced the boxplot in the 1970’s. A
boxplot (or box-and-whisker plot) is graph of a data set obtained by drawing a
horizontal line from the minimum data value to first quartile (𝑄1 ), drawing a horizontal
line to the third quartile (𝑄3 ) to the maximum data value, and drawing a box whose
vertical line passes through 𝑄1 and 𝑄3 with a vertical line inside the box passing
through the median or second quartile (𝑄2 ).
The boxplot is another way of telling or not the distribution of data is symmetric
relative to the mean.
The boxplot will give the following information:
1. If the median is near the center of the box, the distribution is approximately
symmetric.
2. If the median falls to the right of the center of the box, the distribution is
negatively skewed.
3. If the median falls to the left of the center of the box, the distribution is
positively skewed.
4. If the lines are about the same length, the distribution is approximately
symmetric.
5. If the left line is larger than the right line, the distribution is negatively skewed.
6. If the right line is larger than the left line, the distribution is positively skewed.

P a g e | 269
UNIT
4
5
Example 1: Construct a boxplot of the ages of 9 middle-management employees of a

certain company. The ages are 53,45,59,48,54,46,51,58, and 55. What can you say
about the distribution of the data set?
Solution:
Step 1: Determine the 𝑄1 , Median, and 𝑄3 of the given data set. Recall that 𝑄1 =47,
Median(𝑄2 ) =53, and 𝑄3 =56.5.
Step 2: Locate the lowest value,𝑄1 , the median, 𝑄3 , and the highest value on the
scale.
Step 3: Draw a box around 𝑄1 and 𝑄2 , draw a vertical line through the median, and
connect the upper and lower values, as shown below.

P a g e | 270
UNIT
4
5
Note: The following are the plot values used in the data
Minimum value 45
𝑄1 47
Median or the 𝑄2 53
𝑄3 56.5
Maximum Value 59
The data set of the distribution is negatively skewed, since the median falls to
the right to the center of the box.
Let’s Do It
Apply the knowledge learned and do the following activity.

A. The following are the ages of new recoveries of the COVID-19 patient in Western
1. Solve for the 𝑄1 , and 𝑃25 . Compare the results.
2. Construct a boxplot. What can say about the distribution of the data set?
Solution #1:
5, 20, 20, 24, 30, 30 ,40, 45
Step 2: Select the first quartile and 25 th percentile.

1(𝑁 + 1) 1(8 + 1) 9
𝑄1 = = = = 2.25
4 4 4
25(𝑁 + 1) 25(8 + 1) 225

𝑃25 = = = = 2.25
100 100 100
Step 3: Identify the first quartile and 25 th percentile values in the data set.

P a g e | 271
UNIT
4
5
If you noticed, 𝑄1 , and 𝑃25 have the same rank value in the data set. Thus the first
quartile and the 25th percentile of the data set is 20 which means that 25% of the
ages of the recoveries of the COVID-19 patient in Western Visayas on August 2,
2020 falls below 20 years old.
Solution #2:
Recall how to find values for the following:
Minimum value 5
𝑄1 20
Median or the 𝑄2 27
𝑄3 37.5
Maximum Value 45
The median falls to the left of the center of the box, so therefore distribution is
positively skewed.

P a g e | 272
UNIT
4
5
B. Two men are on a weight-reducing diet. The first belongs to an age/body type
group for which the mean weight is 145 pounds with a standard deviation of 15
pounds. The second belongs to an age/body type of group for which the mean is 165
pounds with a standard deviation of 20 pounds. If their respective weights are 178
and 204 pounds, who is more definitely overweight for his age/body type group?
Solution:
𝑥−𝑥
𝑧=
𝑠
178−145 204−165
𝑧1 = = 2.2 𝑧2 = = 1.95
15 20
The overweight for age/body group type is the one whose weight is 178 pounds.

A. The following are total number of COVID-19 Quarantine Facilities of the 8
provinces in the Western Visayas. The total number are the identified isolation
facilities and other facilities for OFW’s. These are: 17, 23, 72, 6, 144, 41, 6 and 7.
(Data Source:DOH6 Data April 30, 2020).
1. Solve for the 𝑄1 , 𝑄2 , and 𝑄3 .
2. Construct a boxplot. What can say about the distribution of the data set?
Solve in your own and check your answers below:
𝑄1 = 6.25
𝑄2 = 20
𝑄3 = 64.25

P a g e | 273
UNIT
4
5
B. The mean grade of the students who took Computer programming is 77.1 and
standard deviation equal to 8.03. Carl got a grade of 64 and he was worried because
the university has the policy that a student will fail if he obtains a grade which falls
beyond two (2) standard deviations below the mean. Will Carl fail the course?
Answers:
64−77.1
𝑧= = −1.63. This z-score is not beyond 2 standard deviations below the
8.03
mean. Therefore, Carl will not fail the course.
Activity 41: Problem Solving

A. The data below shows the monthly (February-July, 2020) electricity bill of a
particular household in Rivera Ville, Kabankalan City, Negros Occidental. Show your
complete solution in your answer booklet.

1. What is the first, second and third quartile of the given data?
2. Construct a box-and-whisker plot. What can you say about the distribution of
the data set.
B. Different typing skills are required for secretaries depending on whether one is
working in a law office, an accounting firm, or for a mathematical research group at a
major university. In order to evaluate candidates for these positions, an agency
administers 3 distinct standardized typing samples. A time penalty has been
incorporated into the scoring of each sample based on the number of typing errors.
The mean and standard deviation for each test, together with the scores achieved by
Mary, an applicant, are given in the following table.
Sample Mary’s Score Mean Std.Dev.
Law 141 sec 180 sec 30 sec
Accounting 7 min 10 min 2 min
Math Research 33 min 26 min 5 min
Who among the secretarial candidates perform best? Why?

P a g e | 274
UNIT
4
5
Let’s Sum It Up
Measures of Position are values below which a specified fraction or percentage of

the observations in a given set must fall.
Quartiles are values that divide the array into 4 equal parts while percentiles are
values that divide a set of observations in an array into 100 equal parts.
z-score on the other hand is used to know the position of one observation relative to
others in a set of data. A z-score measures the distance between an observation and
the mean, measured in units of standard deviation.
A boxplot (or box-and-whisker plot) is graph of a data set obtained by drawing a
horizontal line from the minimum data value to first quartile (𝑄1 ), drawing a horizontal
line to the third quartile (𝑄3 ) to the maximum data value, and drawing a box whose
vertical line passes through 𝑄1 and 𝑄3 with a vertical line inside the box passing
through the median or second quartile (𝑄2 ).
Try the following links and do check your manual computation of the previous
exercises using MS Excel.
How To Find Quartiles and Construct a Box Plot in Excel

https://www.youtube.com/watch?v=9K7u5KPTNc8
Bibliography
Mamhot, M.R. et al. 2013. Statistics for General Education (A Keypad Approach).
Purelybooks Trading and Publishing Corp.
Sirug, W.S. 2018. Mathematics in the Modern World. A CHED General Education
Zamora, C.O. et, al. 2003. Elementary Statistics. Text/Workbook.

P a g e | 275
UNIT 4: Data Management POST-TEST
Direction: Read the questions carefully. Shade the letter of the correct answer. If
answer is not provided, shade letter E. Use the answer sheet provided on your
answer booklet and strictly no erasures.
1. Which of the following statements belong to the field of descriptive statistics?

A. A car manufacturer wishes to estimate the average lifetime of batteries
by testing a sample of 50 batteries.
B. A shipping company wishes to estimate the number of passengers
traveling via their ships next year using their data on the number of
passengers in the past three years.
C. A marketing research group wishes to determine the number of families
not eating three times a day in the sample used for their survey.
D. A politician wants to predict his chance of winning in the senatorial
election by conducting a survey on 1200 registered voters.
2. What type of chart is useful in presenting historical data and in showing the
movement of a series over time?
A. Line chart
B. Bar chart
C. Pie chart
D. Pictograph
3. The following are nominal variables except for:

A. City of residence
B. Number of household members
C. Area code
D. Marital status
For items 4-5, A researcher wanted to investigate the relationship of weekly

allowance to the academic performance of students studying in a certain university
with a total of 2300 enrolled students. To achieve the researcher’s aim, the
researcher interviewed a random sample of 400 students. Results indicated that the
average allowance of the 400 students is PhP 1500.
4. What is the sample?

A. The group of 2300 enrolled students
B. The group of 400 enrolled students
C. The group of 1500 enrolled students
D. The group of 2700 enrolled students
5. Which of the following is a statistic?

A. The average allowance of the 2300 enrolled students is Php 1500
B. The average allowance of the 400 enrolled students is Php 1500
C. The proportion of students whose allowance is Php 1500
D. None of the above

P a g e | 276
6. It is a measure of the central tendency obtained by getting the middle value in

distribution when the values are arranged in ascending or descending order.
A. Mean C. Weighted Mean
B. Median D. Mode
7. It is most commonly occurring value in a distribution.

A. Mean C. Average
B. Median D. Mode
For items 8 - 10, refer to the table below:
The distribution of the number of mistakes made by 285 students

in a100-item Math Exam
Number of Mistakes Number of Students
7 – 12 94
13 – 18 37
19 – 24 25
25 – 30 43
31 – 36 86
8. What is the class width or class size?

A. 5 C. 7
B. 6 D. 8
9. What is the relative frequency of the class interval 13 – 18?

A. 0.1298 C. 37%
B. 12.98% D. 48%
10. What is the lower class boundary of the class interval 7 – 12?
A. 6.5 C. 7.5
B. 7 D. 12.5
11. The reseller’s income in Heng De face shield for seven days were, Php
1,220, Php 1,345, Php, 1000, Php 2,500, Php 1,590, Php 1,350 and Php
1540. What is the reseller’s average income?
A. Php 1,506.42 C. Php 1,350
B. B. Php 1,506.43 D. Php 10,545
12. It is a statistic that measures the dispersion of a dataset relative to its mean
and is calculated as the square root of the variance.
A. range C. variance
B. standard deviation D. none of these
13. Consider the sample What is the median of the set of data: 13, 19, 18, 20, 16,
9, 10, 7 and 8.
A. 13 C. 5.05
B. 13.33 D. 25.50
P a g e | 277

14. The highest score in the distribution is 109 while the lowest score is 54. What
is the range?
A. 54 C. 56
B. 55 D. 57
15. Which scenario will give you a coefficient of variation of 40%?

A. Mean of 4 and standard deviation of 16
B. Mean of 4 and standard deviation of 20
C. Mean of 4 and standard deviation of 4
D. Mean of 16 and standard deviation of 4.
16. It is another way of telling whether or not the distribution of the data is
symmetric relative to the median.
A. Boxplot C. Quartiles
B. Z-scores D. Fractiles
For items 17 and 18, consider the life span (in years) of a certain species below.
1.6 1.9 2.2 2.5 2.6 2.6 2.9 3.0 3.0 3.1
3.1 3.1 3.1 3.2 3.2 3.2 3.3 3.3 3.3 3.4
3.4 3.4 3.5 3.5 3.6 3.7 3.7 3.7 3.8 3.8
3.9 3.9 4.1 4.1 4.2 4.3 4.4 4.5 4.7 4.7
17. What is Q1 for the distribution of life spans?

A. 3.1 C. 2.9
B. 3.0 D. 3.2
18. What is the 85th percentile of the 40 life spans?

A. 4.1 C. 4.2
B. 4.5 D. 4.115
19. It is used when one wishes to compare the scatter of one distribution with
another distribution.
A. Measures of Location C. Measures of Position
B. Coefficient of Variation D. Measures of Relative Dispersion
20. It is a value that divide a set of observations into 100 equals parts.
A. Quartiles C. Deciles
B. Percentiles D. Fractiles

P a g e | 278
Appendix A
Rubrics
A. Essay Writing
1 2 3 4 5
CONTENT /IDEAS Writing is Writing Writes Writing is Writing is

limited in does not related, purposef confident
communicat clearly quality ul and and clearly
ing communicat paragraphs, focused. focused. It
knowledge. e with little or Piece holds the
Length is knowledge. no details. contains reader’s
not The reader some attention.
adequate is left with details. Releva
for questions. nt
developmen details
t. enrich
writing.
ORGANIZATION Writing is Writing is Uses Writing Writing
brief and confused and correct includes a includes a
underdevelo loosely writing strong strong,
ped with organized. format. beginning, beginning,
very weak Transitions Incorporate middle, and middle,
transitions are weak sa end, with and end
and closure. and closure coherent some with clear
is closure. transition transitions
ineffective. s and and a
good focused
closure. closure.
VOCABULARY Language Shows Uses a Purposeful Effectiv
/ WORD is trite, some use variety of use of word e and
CHOICE vague or of varied word choice. engagin
flat. word choice to g use of
choice. make word
writing choice.
interesting.
CONVENTIONS Inconsistent Occasional Maintains Consistent Uses
agreement errors agreement agreement consistent
between between between between agreement
parts of parts of parts of parts of between
speech. speech. speech. speech. parts of
Many errors Some errors Few errors in Uses correct speech.
in in mechanics. punctuation, No errors
mechanics. mechanics. Applies capitalization in
Limited Some basic grade , etc. mechanics
evidence of evidence of level Consistent .
spelling spelling spelling. use of Creative
strategies. strategies. spelling and
strategies. effective
use of
spelling
strategies.
P a g e | 279
B. Construction of Bar Graph
Criteria 1 2 3
Labels and Title Title is appropriate Title is somewhat Title is missing or
to the topic of the appropriate to the inappropriate. Little
graph. All areas of topic of the graph. or no labeling. X
the graph are Most areas of the and Y axis are not
labeled and well graph are labeled indicated correctly.
defined including and well defined
the X and Y axis. including the X and
Y axis.
Accuracy Graph shows a Graph shows a Graph cannot be
clear, defined connection to the X read but some data
connection to the X and Y axis. Most is present. The
and Y axis. Each bars are the length of the bars is
bar is the appropriate length. inappropriate.
appropriate length.
Visual Appearance Color is used in a Color is used. The Little or no color
meaningful way. graph, title and used. The graph,
The graph, title and labels are arranged title and/or labels
labels are arranged appropriately. Text are not arranged
appropriately. Text is readable. appropriately. Text
is easy to read. is difficult to read.
Retrieved from: https://www.uen.org/rubric/previewRubric.html?id=192
C. Frequency Table
3 2 1 0
Class Limits All class limits One value is Two to three Four or more
are correct incorrect values are values are
incorrect incorrect
Class All class One value is Two to three Four or more
Boundaries boundaries are incorrect values are values are
correct incorrect incorrect
Frequencies All frequencies One value is Two to three Four or more
incorrect incorrect
Relative All relative One value is Two to three Four or more
Frequencies frequencies are incorrect values are values are
Class Marks All class marks One value is Two to three Four or more
incorrect incorrect
Cumulative All cumulative One value is Two to three Four or more
Frequencies frequencies are incorrect values are values are
P a g e | 280
D. Frequency Histogram
5 4 3 2 1
Axes Both X and Both X and Both X and Both X and Both X and Y
Y Axes are Y axes are Y are Y axes are axes are not
scaled scaled scaled, but not scaled scaled or values
effectively based on not based evenly. not present
based on the data correctly
data set set on data
collected. X collected. X set. X and
and Y are and Y are Y are not
scaled scaled scaled
evenly evenly evenly
throughout throughout throughout
both axes both axes both axes
Titles/Labels The graph The graph The graph The graph Most titles and
is complete is complete is is labels are poor
and labeled and labeled complete complete or not
correctly correctly and but not present/missing.
throughout throughout labeled completely
the graph. the graph. throughout labeled.
This This the graph.
includes a includes a This
title created title created includes a
appropriate somewhat title
to the data appropriate created for
collected. to the data the data
collected. collected.
Histogram All bars are All bars are All bars Bars are Bars are noy
connected connected are not connected and
throughout throughout connected connected none of the data
its entirety. most of the throughout and not all correlated to the
Bars are axes. Bars some of data frequency table.
correctly are the axes. correlates
correlated correctly Bars are to the
to the data correlated somewhat frequency
collected to the data correlated table.
and collected to the data
displayed and collected
in the displayed and
frequency in the displayed
table. frequency in the
table. frequency
table
Retrieved from:
https://www.rcampus.com/rubricshowc.cfm?sp=yes&code=VX5BA74&
P a g e | 281
E. Summary Measures
3 2 1 0
Measures of The mean, The mean, The mean, The mean,
Central median, and median, and median, and median, and
Tendency mode have all mode have mode have mode have not
been computed been calculated been calculated been calculated
with no errors with 75% but are not
accuracy correct
Measures of The range, The range, The range, The range,
Variation variance, variance, variance, variance,
standard standard standard standard
deviation, and deviation, and deviation, and deviation, and
coefficient of coefficient of coefficient of coefficient of
variation have variation have variation have variation have
all been been calculated been calculated not been
computed with with 75% but are not calculated
no errors accuracy correct
Measures of The percentiles The percentiles The percentiles The percentiles
Position and quartiles and quartiles and quartiles and quartiles
have all been have been have been have not been
computed with calculated with calculated but calculated
no errors 75% accuracy are not correct
Retrieved from: https://www.rcampus.com/rubricshowc.cfm?sp=yes&code=N8C347&
F. Box-and-Whisker Plot
5 4 3 2
Five Number Summary All values One value Two to Four or more
➢ Minimum value are correct is incorrect three values values are
➢ Lower quartile are incorrect
➢ Median incorrect
➢ Upper quartile
➢ Maximum value
Box-and-whisker plot All One Two to Four or more
Plots potted correctly, box components component three components
is placed around upper and are present is missing components are missing
lower quartiles, line for are missing
median, whiskers are
drawn.
Analyze data Thoroughly Complete, Partially Incomplete
Describe the relationship complete, clear, complete; explanation;
between the two variables clear, and accurate gives not
accurate explanation response; understandable
explanation explanation or insufficient.
may be
unclear or
lack detail
Retrieved from: https://www.rcampus.com/rubricshowc.cfm?sp=yes&code=H8A6XB&

GEC 4 Module 4 Final Term

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

GEC 4 Module 4 Final Term

Uploaded by

Copyright:

Available Formats

P age |i

CPSU QUALITY POLICY

• Continually improve its Quality Management System (QMS) at par with

STUDY SCHEDULE GEC 4

MATHEMATICS IN THE MODERN WORLD

TABLE OF CONTENTS GEC 4

MATHEMATICS IN THE MODERN WORLD Page

The branch of science that deals with the

In the Philippines, the Philippine Statistics

In this unit, you will learn the different ways of

UNIT 4: Data Management PRE-TEST

1. The following are nominal variables except for:

For items 2-3, A researcher wanted to investigate the relationship of weekly

2. What is the sample?

3. Which of the following is a statistic?

4. Which of the following statements belong to the field of descriptive statistics?

GEC 4: MATH IN THE MODERN WORLD

UNIT 4: Data Management PRE-TEST

For items 6 - 8, refer to the table below:

The distribution of the number of mistakes made by 285 students

6. What is the class width or class size?

7. What is the relative frequency of the class interval 13 – 18?

8. What is the lower class boundary of the class interval 7 – 12?

9. It is a measure of the central tendency obtained by getting the middle value in

10. It is most commonly occurring value in a distribution.

UNIT 4: Data Management PRE-TEST

15. Which scenario will give you a coefficient of variation of 40%?

19. What is Q1 for the distribution of life spans?

20. What is the 85th percentile of the 40 life spans?

GEC 4: MATH IN THE MODERN WORLD

Basic Concepts in Statistics LESSON 1

• Define basic statistical concepts such as population and

UNIT 4: DATA MANAGEMENT

Every day we make decisions that maybe personal,

GEC 4: MATH IN THE MODERN WORLD CAS-CPSU

Let’s Look Back

Let’s Analyze and Prepare

1. How did you find the activity?

GEC 4: MATH IN THE MODERN WORLD CAS-CPSU

Let’s start with the two definition of statistics.

o Variable – a characteristic or attribute of the elements in a collection that can

o Experimental unit is the individual or object on which a variable is measured.

GEC 4: MATH IN THE MODERN WORLD CAS-CPSU

o Observation – realized value of a variable

o Data – collection of observations

o Parameter – a summary measure describing specific characteristic of the

o Statistic – a summary measure describing specific characteristic of the sample

a) Population: The set of students at a certain college.

GEC 4: MATH IN THE MODERN WORLD CAS-CPSU

1. Statistical Methods of Applied Statistics - procedures and techniques used

2. Statistical Theory of Mathematical Statistics - deals with the development

Measurement - the process of determining the value or label of a variable based on

GEC 4: MATH IN THE MODERN WORLD CAS-CPSU

Marital status 1-Single 2-Married

GEC 4: MATH IN THE MODERN WORLD CAS-CPSU

2. Ordinal Level (Ranking Scale)

Year level 1 – 1st yr 2 – 2nd yr

GEC 4: MATH IN THE MODERN WORLD CAS-CPSU

1. In order to estimate the true proportion of students at a certain college who

GEC 4: MATH IN THE MODERN WORLD CAS-CPSU

5. Ms. Macasaet wants to determine the proportion spent on transportation during

III. Let’s classify the following by a) type of variable and b) level of