Professional Documents
Culture Documents
COLLEGE OF ARTS AND SCIENCES (CAS) CENTRAL PHILIPPINES STATE UNIVERSITY (CPSU)
P a g e | ii
This learner’s module is not for sale. No part of this material may be
reproduced in any form or by any means, including photocopying,
scanning, or other electronic means without the prior written permission to
the author or the College of Arts and Sciences, Central Philippines State
University.
LEARNER’S MODULE IN
GEC 4: MATHEMATICS IN THE MODERN WORLD
Second Semester | School Year 2020-2021
COLLEGE OF ARTS AND SCIENCES (CAS) CENTRAL PHILIPPINES STATE UNIVERSITY (CPSU)
P a g e | iii
PHILOSOPHY
Decent affordable education for sustainable productivity
and global competitiveness under an atmosphere of
academic freedom.
COLLEGE OF ARTS AND SCIENCES (CAS) CENTRAL PHILIPPINES STATE UNIVERSITY (CPSU)
P a g e | iv
COLLEGE OF ARTS AND SCIENCES (CAS) CENTRAL PHILIPPINES STATE UNIVERSITY (CPSU)
P age |v
• Cover Page i
• Disclaimer ii
• CPSU VMGO and Quality Policy iii
• Study Schedule iv
• Table of Contents vi
UNIT 4: DATA MANAGEMENT
Lesson 1 Basic Concepts in Statistics 205
Lesson 2 Data Organization and Presentation 218
Lesson 3 Measures of Central Tendency 238
Lesson 4 Measures of Dispersion 249
Lesson 5 Measures of Relative Position 263
APPENDICES
Appendix A Rubrics 278
COLLEGE OF ARTS AND SCIENCES (CAS) CENTRAL PHILIPPINES STATE UNIVERSITY (CPSU)
P a g e | cci
UNIT
DATA MANAGEMENT
4
An effective decision making is based on
data. Data is everywhere. It is just lurking around the
corner waiting to be discovered and analyzed. From
there, data will tell its story.
COLLEGE OF ARTS AND SCIENCES (CAS) CENTRAL PHILIPPINES STATE UNIVERSITY (CPSU)
P a g e | 202
5. What type of chart is useful in presenting historical data and in showing the
movement of a series over time?
A. Line chart
B. Bar chart
C. Pie chart
D. Pictograph
11. The reseller’s income in Heng De face shield for seven days were, Php
1,220, Php 1,345, Php, 1000, Php 2,500, Php 1,590, Php 1,350 and Php
1540. What is the reseller’s average income?
A. Php 1,506.42 C. Php 1,350
B. B. Php 1,506.43 D. Php 10,545
12. Consider the sample What is the median of the set of data: 13, 19, 18, 20, 16,
9, 10, 7 and 8.
A. 13 C. 5.05
B. 13.33 D. 25.50
13. It is a statistic that measures the dispersion of a dataset relative to its mean
and is calculated as the square root of the variance.
A. range C. variance
B. standard deviation D. none of these
GEC 4: MATH IN THE MODERN WORLD
P a g e | 204
16. It is used when one wishes to compare the scatter of one distribution with
another distribution.
A. Measures of Location C. Measures of Position
B. Coefficient of Variation D. Measures of Relative Dispersion
17. It is a value that divide a set of observations into 100 equals parts.
A. Quartiles C. Deciles
B. Percentiles D. Fractiles
18. It is another way of telling whether or not the distribution of the data is
symmetric relative to the median.
A. Boxplot C. Quartiles
B. Z-scores D. Fractiles
For items 19 and 20, consider the life span (in years) of a certain species below.
1.6 1.9 2.2 2.5 2.6 2.6 2.9 3.0 3.0 3.1
3.1 3.1 3.1 3.2 3.2 3.2 3.3 3.3 3.3 3.4
3.4 3.4 3.5 3.5 3.6 3.7 3.7 3.7 3.8 3.8
3.9 3.9 4.1 4.1 4.2 4.3 4.4 4.5 4.7 4.7
LEARNING OUTCOMES
INTRODUCTION
UNIT
LESSON BASIC CONCEPTS IN STATISTICS
4
1
Activity 32: Who Am I? Fill in the following bio-data form. Write your answer on your
answer booklet.
Name: Course:
Age: Sex: Year Level:
Date of Birth: Place of Birth:
Height (in cm): Weight (in kg):
Eye Color: Skin Color:
My favorite color is:
My favorite food is:
My favorite number is:
My favorite movie is:
Based on the previous activity, answer the following questions. Write your answer on
your answer booklet.
Characteristics like course, sex, age, weight, height are called variables. They are
called variables since these characteristics vary from one person to another. Moreover,
all of the information you have filled in the form is considered data. Statistics, as a
branch of science, deals with collection, organization, presentation, analysis, and
interpretation of data. You will know more of the basic concepts in statistics in the next
section.
Let’s Discover
A. Definition of Statistics
o Plural sense – set of numerical figures
e.g. number of students enrolled in every program in CPSU, vital statistics
in a beauty contest
o Singular sense – the branch of science that deals with the collection,
presentation, organization, analysis, and interpretation of data (COPAI)
B. Key Terms
o Population – (denoted by N) the collection of all elements under
consideration
o Sample – (denoted by n) a subset of the population
Example:
A manufacturer of kerosene heaters wants to determine if customers are satisfied
with the performance of their heaters. Toward this goal, 5,000 out of 200,000
customers are contacted and each is asked, "Are you satisfied with the
performance of the kerosene heater you purchased?" Identify the population and
the sample for this situation.
✓ Population: The collection of 200,000 kerosene heater customers.
✓ Sample: The collection of 5,000 kerosene heater customers who were
interviewed.
UNIT
LESSON BASIC CONCEPTS IN STATISTICS 4
1
Let’s Discover
Example:
Variable Possible observation
a. S = sex of a student Male, Female
b. C = course of a student AB Eng, AB Soc Sci, BS Stat
c. L = amount spent on load per month 𝐿 ≥ 0 pesos
d. N =number of enrolled students per campus 𝑛 = 0, 1, 2, 3, …
e. H = height of a volleyball player h > 0 cm
Example: In order to estimate the true proportion of students at a certain college who
smoke cigarettes, the administration polled a sample of 200 students and determined
that the proportion of students from the sample who smoke cigarettes is 0.12. Identify
the a) population, b) sample, c) parameter, and d) statistic.
UNIT
LESSON BASIC CONCEPTS IN STATISTICS 4
1
Let’s Discover
C. Fields of Statistics
Example:
Descriptive Inferential
1. A bowler wants to find his bowling 1. A bowler wants to estimate his chance
average for the past 12 games. of winning a game based on his current
season averages and the averages of his
opponent.
2. A housewife wants to determine 2. A housewife would like to predict based
the average weekly amount she on last year's grocery bills, the average
spent on groceries in the past three weekly amount she will spend for this year.
months.
3. A politician wants to know the 3. A politician would like to estimate, based
exact number of votes he received in on an opinion poll, his chance of winning in
the last election. the upcoming election.
D. Measurement
Example:
✓ Age (in years)
✓ Weight (in kilograms)
✓ Height (in inches)
✓ Sex (Male or Female)
UNIT
LESSON BASIC CONCEPTS IN STATISTICS 4
1
Let’s Discover
Types of Variables
1. Qualitative variable
o A variable that yields categorical response
o describes the quality or character of something
Example: Eye color, First Name, Favorite Movie
2. Quantitative Variable
o a variable that takes on numerical values representing an amount or
quantity
o describes the amount or number of something
Example: Weight, Height, Number of cars
a. Discrete
• a variable which can assume finite, or, at most, countably infinite
number of values; usually measured by counting or enumeration.
• can assume only certain values, and there are usually gaps between
values.
Example: Number of students enrolled in GEC 4, Number of cars
parked at the front of Admin building
b. Continuous
• a variable which can assume infinitely many values corresponding to a
line interval without gaps, interruptions, or jumps.
• can assume any value within a specified range.
• measurable (measured using a continuous scale such as kilos,
centimeters, grams)
Example: Height in inches, Income in pesos, Weight in kilograms
Levels of Measurement
1. Nominal Level (Classificatory Scale)
The nominal level is the weakest level of measurement where numbers or symbols
are used simply for categorizing subjects into different groups. The categories must
be distinct, non-overlapping and exhaustive
Examples:
Sex M-Male F-Female
UNIT
LESSON BASIC CONCEPTS IN STATISTICS 4
1
Let’s Discover
Examples:
Teaching Ratings 1-poor 2- fair
3-good 4-excellent
3. Interval Level
The interval level is that which has the properties of the nominal and ordinal levels,
and in addition, the distances between any two numbers on the scale are of known
sizes. An interval scale must have a common and constant unit of measurement.
Furthermore, the unit of measurement is arbitrary and there is no “true zero” point.
Examples:
▪ IQ
▪ Test Result
4. Ratio Level
The ratio level of measurement contains all the properties of the interval level, and in
addition, it has a “true zero” point. This is the strongest level of measurement.
Examples:
▪ Daily Allowance
▪ Weight (in kg)
▪ Age (in years)
▪ Number of correct answers in an exam
UNIT
LESSON BASIC CONCEPTS IN STATISTICS 4
1
Let’s Do It!
I. In the following situations, let’s determine the population, sample, parameter
and statistic.
Solution:
a. Variable: Whether or not a student smoke
b. Population: The set of students at a certain college.
c. Sample: The set of 200 students who were interviewed.
d. Parameter: The population proportion of students in a certain college who
smoke cigarettes.
e. Statistic: (0.12) the proportion of students in the sample who smoke
cigarettes.
2. A politician who is running for the office of mayor of a city with 25,000 registered
voters commissions on a survey. In the survey, 48% of the 200 registered voters
interviewed say they plan to vote for her.
Solution:
a. Variable: Whether or not a voter will vote the running political for Mayor
b. Population: The group of 25,000 registered voters
c. Sample: The group of 200 registered voters who were interviewed
d. Parameter: The percentage of registered voters in the population who plan
to vote for her.
e. Statistic: (48%) The percentage of registered voters in the sample who plan
to vote for her.
II. Let’s determine whether the following statements belong to the field of
descriptive statistics or inferential statistics.
1. A badminton player wants to know his average score for the past 10 games.
Solution: Descriptive statistics (Data is only gathered and summarized)
UNIT
LESSON BASIC CONCEPTS IN STATISTICS 4
1
Let’s Do It!
2. Janine wants to determine the variability of her six exam scores in algebra.
Solution: Descriptive statistics (Data is only gathered and summarized)
3. Based on last year’s electricity bills, Mrs. Venegas would like to forecast the
average monthly electricity bill she will pay for the next year based on her average
monthly bill in the past year.
Solution: Inferential statistics (Data is analyzed to make a forecast)
4. Elfren “Bata” Reyes wants to estimate his chance of winning in the next world
championship game in Billiards based on his average scores last championship
and the averages of the competing players.
Solution: Inferential statistics (Data is analyzed to come up with an estimate)
Level of
Type of Variable
Measurement
Example:
Degree Program (AB Social Science, AB Qualitative Nominal
English Language, BS Statistics)
Example:
Quantitative,
Weight of women (in kg) before they took Ratio
Continuous
the diet
1. Height (in inches)
2. Sizes of shirts (i.e. XS, S, M, L, XL)
3. Zipcode (i.e. 6111 – Kabankalan)
4. Civil Status
5. Number of siblings
UNIT
LESSON BASIC CONCEPTS IN STATISTICS 4
1
For you to fully absorb the lesson, let’s have some more examples.
I. In the following situation, let’s determine the variable, population, sample,
parameter and statistic.
Mr. Donaldo Chan, a candidate for Vice Mayor in Orion, Bataan wants to find out
if there is a need to intensify his campaign efforts against his opponents. He
requested the services of a group of students to interview 1,000 of the 3,000
registered voters of Orion, Bataan. The survey results showed that 75% of the
1,000 voters in the sample will vote for him as vice-mayor. Identify the following.
a. Variable: Whether or not a voter will vote for Mr. Chan as Vice Mayor
b. Population: The collection of 3,000 registered voters of Orion, Batangas
c. Sample: The collection of 1,000 registered voters of Orion, Batangas
d. Parameter: The percentage of registered voters in the population who voted
for Mr. Chan as Vice-Mayor
e. Statistic: The percentage of registered voters in the sample who voted for
Mr. Chan as Vice-Mayor
II. Let’s determine whether the following statements belong to the field of
descriptive statistics or inferential statistics.
Solution: a and b uses inferential statistics while c, d, and e uses descriptive statistics.
UNIT
LESSON BASIC CONCEPTS IN STATISTICS 4
1
It’s time for you to do this on your own now. Write your answers on the provided answer
booklet
The average weekly allowance of students last year at a public high school was Php
600.00 per week, based on an enrolment of 1,080 students. The third-year students
who did not have this information interviewed 50 students and found their average
weekly allowance last year to be Php 550.00. Identify the following.
Variable:
Population:
Sample:
Parameter:
Statistic:
UNIT
LESSON BASIC CONCEPTS IN STATISTICS 4
1
1. Marie wishes to determine the average amount she spends in online shopping
per month. She checks her purchase history in Shopee in the last six months.
Answer:
2. Ian wants to predict electricity bill in the next month using the data on the
monthly electricity bills in the last six months.
3. Sheena wants to determine the average exam scores of her students in their
summative test.
Answer:
4. Cris wants to determine the total amount spent for cat food in the past five
months.
Answer:
5. Tine wants to predict the number of enrollees for the next school year based on
enrollment data in the last five years.
Answer:
Classify the following variables by a) type and b) level of measurement. If the variable
is quantitative, determine whether it is discrete or continuous.
Level of
Type of Variable
Measurement
1. Number of text messages sent in
a day
2. Favorite food
3. Academic Performance
(Excellent, Very Satisfactory,
Satisfactory, Needs
Improvement)
4. Time (in hours) spent in
answering modules in a week
5. Shoe size (26, 27, 28, …)
6. Average daily temperature (in
Celsius)
7. Number of oranges in a bag
8. Amount spent (in pesos) on
mobile load per week
9. Favorite pet (cat, dog, bird, etc.)
10. Network carrier (Globe, Smart,
TM,..)
UNIT
LESSON BASIC CONCEPTS IN STATISTICS 4
3
Bibliography
Almeda, J.V., Capistrano, T.G., and Sarte, G. (2010). Elementary Statistics. The UP
Press, Diliman, Quezon City
Aufmann, Richard et.al. (2018). Mathematics in the Modern World 14 th edition.
Amid, Diego. (2005). Fundamentals of Statistics. Lorimar Publishing CO., INC
Everitt, B. S.; Skrondal, A. (2010), The Cambridge Dictionary of Statistics, Cambridge
University Press.
LEARNING OUTCOMES
UNIT
LESSON Data Organization and Presentation
4
2
ACTIVITY 34. FILL ME! Analyze the following problems and answer the questions
that follow. Write your answers in your answer booklet.
2. Ms. Frank wanted to lose weight so she conducts a survey on what physical activity
is effective in reducing weight. She asks 30 of her friends for recommendations.
The data below shows the preferred physical activity of the 30 women.
W T A W G T W W C W T W A T T
W G W W C A W A W W W T W W T
W = walking T = weight training C = cycling G = gardening A = aerobics
UNIT
LESSON Data Organization and Presentation
4
2
Based on the previous activity. Answer the following questions. Write your answers
on your answer booklet.
In order for you to answer the questions in the previous activity, you must have
organized your data. Just like in conducting a research, researchers must organize the
gathered data in some meaningful way. The easiest way of organizing data is to
construct a frequency distribution table. You will know more about frequency
distribution tables as we go along the next sections.
Let’s Discover
B. Frequency distribution
✓ is a way of summarizing data by showing the number of observations that
belong in the different categories or classes
✓ also referred to as grouped data.
UNIT
LESSON Data Organization and Presentation
4
2
Let’s Discover
C. Definition of Terms
UNIT
LESSON Data Organization and Presentation
4
2
Let’s Discover
2. Determine the number of classes (K). There are no precise rules concerning
the optimal number of classes but Sturges’ formula can be used as a first
approximation.
3. Determine the approximate class size (C). Whenever possible, all classes
should be of the same size.
4. Determine the lowest class limit. The first class must include the smallest
value in the data set.
UNIT
LESSON Data Organization and Presentation
4
2
Let’s Discover
5. Determine the upper limit of the lowest class using the formula:
UCL = LCL + C – 1(unit of measure)
Note:
5 unit of measure = 1
5.2 unit of measure = 0.1
5.34 unit of measure = 0.01
7. Determine all class limits by adding the class interval C to the limit of the
previous class.
8. Tally the frequencies for each class. Sum the frequencies and check against
the total number of observations.
Example: The data below shows the midterm exam scores of students in Math.
Construct a grouped frequency distribution for the data.
82 82 83 79 72 71 84 59 77 50
83 82 63 75 50 85 76 79 68 69
79 69 74 53 73 71 50 76 57 81
Solution:
1. Make an array for the given data.
50 53 63 69 72 75 77 79 82 83
50 57 68 71 73 76 79 81 82 84
50 59 69 71 74 76 79 82 83 85
UNIT
LESSON Data Organization and Presentation
4
2
Let’s Discover
3. Determine the approximate class interval (C).
◆ R = max – min = 85 – 50 = 35
◆ C = R ÷ K = 35 ÷ 6 = 5.8 ≈ 6
5. Determine the upper limit of the lowest class using the formula:
◆ UCL = LCL + C - 1 (unit of measure)
◆ UCL = 50 + 6 – 1(1) = 55
7. Determine all class limits and class boundaries by adding the class interval C to the
limit of the previous class.
Class Limits LCB UCB
50 – 55 49.5 55.5
56 – 61 55.5 61.5
62 – 67 61.5 67.5
68 – 73 67.5 73.5
74 – 79 73.5 79.5
80 – 85 79.5 85.5
8. Determine the frequencies for each class. Note that the frequency is the number of
observations that fall into each class. For example, there are 4 observations in the
data array that falls int the 50-55 class interval. Thus, the frequency of the first class
is 4. Sum the frequencies and check against the total number of observations.
Class Limits LCB UCB Frequency
50 – 55 49.5 55.5 4
56 – 61 55.5 61.5 2
62 – 67 61.5 67.5 1
68 – 73 67.5 73.5 7
74 – 79 73.5 79.5 8
80 – 85 79.5 85.5 8
Total 30
UNIT
LESSON Data Organization and Presentation
4
2
Let’s Discover
9. Add another column for the class marks. The class mark is found by getting the
average of the class limits.
Class Limits LCB UCB Frequency Class Mark
50 – 55 49.5 55.5 4 52.5
56 – 61 55.5 61.5 2 58.5
62 – 67 61.5 67.5 1 64.5
68 – 73 67.5 73.5 7 70.5
74 – 79 73.5 79.5 8 76.5
80 – 85 79.5 85.5 8 82.5
Total 30
UNIT
LESSON Data Organization and Presentation
4
2
Let’s Discover
2. Determine the distinct categories of the variable of interest. Write all the
categories under the ‘Class’ column
3. Determine the corresponding frequencies
4. Determine the percentage by using the formula:
𝒇
𝑷𝒆𝒓𝒄𝒆𝒏𝒕𝒂𝒈𝒆 = × 𝟏𝟎𝟎%
𝒏
where f = frequency of the class and n = total number of observations
Example: Twenty students were given a performance evaluation. The data set is as
follows. Construct a frequency distribution for the data.
High High High Low Average
Average Low Average Average Average
Low Average Average High High
Low Low Average High High
Solution:
1. Construct a table as shown below
Class Frequency Percentage
UNIT
LESSON Data Organization and Presentation
4
2
Let’s Discover
3. Determine the corresponding frequencies
Class Frequency Percentage
High 7
Average 8
Low 5
E. Presentation of Data
After organizing data, the next step is to present data so they can be understood
easily. The most useful method of presenting data is by constructing graphs and charts.
There are a number of ways to plot charts, and each one has a specific purpose.
1. Histogram is a bar graph that displays the classes on the horizontal axis and
frequencies of the class on the vertical axis. The vertical lines of the bars are
erected at the class boundaries and the height of the bars correspond to the class
frequency.
UNIT
LESSON Data Organization and Presentation
4
2
Let’s Discover
2. Frequency Polygon is a line chart that is constructed by plotting the frequencies
at the midpoints and connecting the plotted points by means of a straight line. The
polygon is closed by considering an additional class at each end of the lines that
are brought down to the horizontal axis at the midpoints of the additional classes.
3. Less than Cumulative Frequency Polygon (Less than Ogive) is a line chart that
is constructed by plotting the cumulative frequencies of the classes. The vertical
axis represents the less than cumulative frequency of the distribution while the
horizontal axis represents the upperclass boundaries of the frequency distribution.
UNIT
LESSON Data Organization and Presentation
4
2
Let’s Discover
Other Types of Graphs
2. Bar Graph – It uses bars to compare the magnitudes of the different categories
of a variable of interest. It may be drawn vertically or horizontally. A vertical bar
chart is best to used when comparing means or percentage between distinct
categories while a horizontal graph may contain more than 5 categories. A bar
is plotted on either the x-axis or the y-axis
3. Pie Chart - is a type of graph that displays data in a circular graph. The
pieces of the graph are proportional to the fraction of the whole in each
category. In other words, each slice of the pie is relative to the size of that
category in the group as a whole. The entire “pie” represents 100 percent of a
whole, while the pie “slices” represent portions of the whole.
UNIT
LESSON Data Organization and Presentation
4
2
Let’s Discover
5. Scatter Plot - scatter plot uses dots to represent individual pieces of data. In
statistics, these plots are useful to see if two variables are related to each other.
For example, a scatter chart can suggest a linear relationship (also
called scatter graphs).
wrong interpretations
◼ not create any optical illusion
2. Clarity
◼ easily read and understood
3. Simplicity
◼ should be simple, straight-forward, not loaded with irrelevant, superfluous, or
UNIT
LESSON Data Organization and Presentation
4
2
Let’s Do It!
Let’s have some exercises on constructing frequency distribution tables and graphs
Solution:
1. Make an array for the given data.
16 19 19 21 22 24 26 27 27 28
28 29 29 30 31 32 32 33 33 34
35 36 38 40 40 41 42 44 44 44
46 46 47 47 47 48 50 50 53 55
4. The lowest class limit (LCL) will be 16 since the smallest value in the data set
is 16.
5. Determine the upper limit of the lowest class using the formula:
◆ UCL = LCL + C - 1 (unit of measure)
◆ UCL = 16 + 7 – 1(1) = 22
UNIT
LESSON Data Organization and Presentation
4
2
7. The next class limits will be determined by adding the class interval C = 7.
8. The frequencies are found by determining the number of observations in each
class interval
9. The relative frequency (RF) and relative frequency percentage (RFP is found
using the formula
RF = class frequency/ no. of observations
RFP = RF × 100%
10. The class marks are found by getting the middle value of the class interval.
That is, get the average of the lower and upper class limits.
11. The <cf is the number of observations that is lesser than the upper class
boundary. For instance, in the first class interval, there are 5 observations
less than 22.5. Moreover, in the second class interval, there are 5 + 8 = 13
observations less than 29.5
The table below presents the completed frequency distribution table.
Class Limits LCB UCB Frequency RF RFP Class Marks <CF
16 – 22 15.5 22.5 5 0.125 12.5 19 5
23 – 29 22.5 29.5 8 0.2 20.0 26 13
30 – 36 29.5 36.5 9 0.225 22.5 33 22
37 – 43 36.5 43.5 5 0.125 12.5 40 27
44 – 50 43.5 50.5 11 0.275 27.5 47 38
51 - 57 50.5 57.5 2 0.05 5.0 54 40
Total 40 1.00 100
To graph the histogram, construct the x-axis and y-axis. Represent the frequencies on
the y-axis and class boundaries on the x-axis. The height of the vertical bars
corresponds to the frequency of each class.
UNIT
LESSON Data Organization and Presentation
4
2
Solution:
Start with identifying the number of books borrowed per day. There were 3 books
borrowed on Monday, 4 on Tuesday, 9 on Wednesday, 5 on Thursday, and 4 on Friday.
The bar graph would look like this.
1. Samples of forty-two (42) college students are considered for study and were
categorized according to year level. Construct a frequency distribution for the data.
Senior Freshman Freshman Sophomore Freshman Junior Junior
Junior Sophomore Sophomore Freshman Sophomore Freshman Junior
Sophomore Freshman Sophomore Senior Senior Senior Sophomore
Freshman Freshman Senior Sophomore Freshman Sophomore Junior
Sophomore Junior Freshman Freshman Sophomore Junior Junior
Freshman Freshman Sophomore Senior Junior Freshman Freshman
UNIT
LESSON Data Organization and Presentation
4
2
Solution:
Our variable of interest is the year level of the students. The categories are freshmen,
sophomore, junior, and senior. The frequency distribution tables is as follows
2. The number of postpaid cellular phone subscribers for each of the last 12
years is listed below. Use a line graph to represent these figures. Interpret the
result.
Year 2004 2005 2006 2007 2008 2009
No. of Subscribers
3.12 4.10 4.23 3.96 3.87 3.50
(in millions)
Solution:
As reflected in the line chart, there is an increasing trend in the number of subscribers
from 2004 to 2015 although a drop in the number of subscribers is seen from 2007 to
2009.
UNIT
LESSON Data Organization and Presentation
4
2
a. Construct an array
d. Determine the LCL, UCL, LCB, and UCB of the first class interval
Total
UNIT
LESSON Data Organization and Presentation
4
2
2. The Land Transportation Office (LTO) is interested in the number of brand new
cars imported to the Philippines in 2015. The following data are as follow:
Country No. of Cars Imported
Japan 225,000
South Korea 78,300
USA 120,250
United Kingdom 19,200
Italy 16,750
China 40,500
Total 500,000
UNIT
LESSON Data Organization and Presentation
4
2
Bibliography
Almeda, J.V., Capistrano, T.G., and Sarte, G. (2010). Elementary Statistics. The
UP Press, Diliman, Quezon City
Aufmann, R. N., et al. (2018). Mathematics in the Modern World (Chapter 4).
Philippines: Rex Book Store
Aufmann, R. N., et al. (2013). Mathematical Excursions (Chapter 13). Brooks/Cole:
Cengage Learning
Sirug, W. (2018). Mathematics in the Modern World. Mindshapers Co. Inc.
LEARNING OUTCOMES
INTRODUCTION
UNIT
LESSON Measures of Central Tendency
4
3
You have learned from your previous study of mathematics how to find the
mean, median and mode of the data. To find out if you are ready to this lesson, do
the following activity. Write your answer in your answer booklet.
A student recorded her scores on weekly math quizzes that were marked out
of a possible 10 points. Her scores were as follows:
8 5 8 5 7 6 7 7 5 7
5 5 6 6 9 8 9 7 9 9
6 8 6 6 7
Previously, you were tasked to find the mean, median and mode on the weekly math
quizzes of a student.
1. To solve for the mean, simply used this formula:
Thus,
UNIT
LESSON Measures of Central Tendency
4
3
To solve for the median, simply arranged the data in ascending or descending order.
In this data, we will arrange it by ascending order.
5 5 5 5 5 6 6 6 6 6 6 7 7 7 7 7 7 8 8 8 8 9 9 9 9
To solve for the mode, just find the most frequently occurring value.
Thus, Mode=7
Let’s Discover
A. Mean
The arithmetic mean, often called as the mean, is the most frequently used measure
of central tendency. The mean is the only common measure in which all values play
an equal role, meaning, to determine its values you would need to consider all the
values of any given data set. The mean is appropriate to determine the central
tendency of an interval or ratio data.
The symbol , called “x bar” is used to represent the mean of a sample and the
symbol , called “mu” , is used to denote the mean of a population.
Properties of Mean
1. A set of data has only one mean.
2. Mean can be applied for interval and ration data.
3. All values in the data set are included in computing the mean.
4. The mean is very useful in comparing two or more data sets.
5. Mean is affected by the extreme small or large values on a data set.
6. Mean is most appropriate in symmetrical data.
UNIT
LESSON Measures of Central Tendency
4
3
where:
=sample mean
= population mean
x= the value of any particular observation or measurement.
= sum of all x’s
Example 1:
The daily salaries of a sample of eight employees at GMS Inc. are Php550, Php420,
Php560, Php500, Php700, Php670, Php860, Php480. Find the mean daily rate of
employees.
Solution:
Example 2:
Find the population mean of the ages of 9 middle-management employees of a
certain company. The ages are 53,45,59,48,54,46,51,58, and 55.
Solution:
UNIT
LESSON Measures of Central Tendency
4
3
B. Median
The median is the midpoint of the data array. When the data set is ordered,
whether ascending or descending order, it is called a data array. Median is an
appropriate measure of central tendency for data that are ordinal or above, but is
more valuable in an ordinal type of data.
Properties of Median
1. The median is unique, there is only one median for asset of data.
2. The median is found by arranging the set of data from lowest or highest (or
highest to lowest) and getting the value of the middle observation.
3. Median is not affected by the extreme small or large values.
4. Median can be applied for ordinal, interval and ration data.
5. Median is most appropriate in a skewed data.
To determine the value of median for ungrouped, we need to consider two rules:
1. If n is odd, the median is the middle rank.
2. If n is even, then the median is the average of the two middle ranked values.
5th
Hence, the median age is 53 years.
UNIT
LESSON Measures of Central Tendency
4
3
Example 2: The daily salaries of a sample of eight employees at GMS Inc. are
Php550, Php420, Php560, Php500, Php700, Php670, Php860, Php480. Find the
median daily rate of employee.
Solution:
Step 1: Arrange the data in order.
Php420, Php480, Php500, Php550, Php560, Php670, Php700, Php860
4.5th
Since the middle point falls between Php550 and Php560, we can determine the
median of the data set by getting the average of the two values.
C. Mode
The mode is the value in a data set that appears most frequently. Like the median
and unlike the mean, extreme values in a data set do not affect the mode. A data
may not contain any mode if none of the values are “most typical”. A data set that has
only one value that occurs the greatest frequency is said to be unimodal. If the data
has two values with the same greatest frequency, both values are considered the
mode and the data set is bimodal. If a data set has more than two modes, then the
data set is said to be multimodal. There are some cases when a data set values
have the same number of frequencies. When this occurs, the data set is said to be
no mode.
UNIT
LESSON Measures of Central Tendency
4
3
Properties of Mode
1. The mode is found by locating the most frequently according value.
2. The mode is the easiest average to compute.
3. There can be more than one mode or even no mode in any given data set.
4. Mode is not affected by the extreme small or large values.
5. Mode can be applied for nominal, ordinal, interval and ratio data.
Example 1: The following data represent the total unit sales for Smartphones from a
sample of 10 Communication Centers for the month of August: 15, 17, 10, 12, 13, 10,
14, 10, 8, and 9. Find the mode.
Solution:
The ordered array for these data is 8, 9, 10, 10, 10, 12, 13, 14, 15, 17.\
Because 10 appears 3 times, more tha the other values, therefore the mode is 10.
UNIT
LESSON Measures of Central Tendency
4
3
Let’s Do It!
The following are the ages of new recoveries of the COVID-19 patient in Western
Visayas on August 2, 2020. The following ages are: 30, 20, 40, 20, 24, 5, 30, 45.
Solve the mean, median and mode. Interpret the results.
Solution:
The mean age of new recoveries of the COVID-19 patient in western Visayas on
August 2, 2020 is 27 years old.
5 20 20 24 30 30 40 45
The media age of new recoveries of the COVID-19 patient in western Visayas on
August 2, 2020 is 27 years old.
5 20 20 24 30 30 40 45
The most occurring value is 30, therefore modal age of new recoveries of the COVID-
19 patient in western Visayas on August 2, 2020 is 30 years old.
UNIT
LESSON Measures of Central Tendency
4
3
The following are total number of COVID-19 Quarantine Facilities of the 8 provinces
in the Western Visayas. The total numbers are the identified isolation facilities and
other facilities for OFW’s. These are: 17, 23, 72, 6, 144, 41, 6 and 7. (Data Source:
DOH6 Data April 30, 2020). Solve the mean, median and mode. Make an
observation on the results.
Mean 39.50 The result shows that there were only average of 40
Quarantine Facilities of the 8 provinces in the Western
Visayas.
Median 20 The result is not affected by the extreme values, it shows that
the average (median) number of Quarantine Facilities of the 8
provinces in the Western Visayas is 20.
Mode 6 The modal number of Quarantine Facilities of the 8 provinces
in the Western Visayas is 6.
The data below shows the monthly (February-July, 2020) electricity bill of a particular
household in Rivera Ville, Kabankalan City, Negros Occidental. Solve for the mean,
median mode and interpret the results.
Electricity Bill (in pesos) 2,751 2,636 4,130 3,410 2,910 2,731
UNIT
LESSON Measures of Central Tendency
4
3
Let’s Sum It Up
The arithmetic mean, often called as the mean, is the most frequently used
measure of central tendency, median is the midpoint of the data array, and the
mode is the value in a data set that appears most frequently.
The following are properties of mean: (1) A set of data has only one mean’ (2)
Mean can be applied for interval and ration data; (3) All values in the data set are
included in computing the mean; (4) The mean is very useful in comparing two or
more data sets’ (5) Mean is affected by the extreme small or large values on a data
set; (6) Mean is most appropriate in symmetrical data.
The following are properties of median: (1)The median is unique, there is only
one median for asset of data; (2) The median is found by arranging the set of data
from lowest or highest (or highest to lowest) and getting the value of the middle
observation; (3) Median is not affected by the extreme small or large values; (4)
Median can be applied for ordinal, interval and ration data; (5) Median is most
appropriate in a skewed data.
The following are properties of mode: (1) The mode is found by locating the
most frequently according value; (2) The mode is the easiest average to compute; (3)
There can be more than one mode or even no mode in any given data set; (4) Mode
is not affected by the extreme small or large values; (5) Mode can be applied for
nominal, ordinal, interval and ratio data.
Try the following link and do check your manual computation of the previous
exercises using MS Excel.
Calculating The Standard Deviation, Mean, Median, Mode, Range, & Variance Using
Excel https://www.youtube.com/watch?v=k17_euuiTKw
UNIT
LESSON Measures of Central Tendency
4
3
Bibliography
Book
Sirug, W.S. (2018). Mathematics in the Modern World. A CHED General Education
Curriculum Compliant. Mindshapers Co., Inc.
Internet
"Mean, Median, Mode Practice Problems - Rio Salado College."
https://www.riosalado.edu/web/oer/wrkdev100-
20011_inter_0000_v1/m5/pdf/m5_l1_mean_median_mode_practice_probs.pdf.
Accessed 12 Aug. 2020.
"Measures of Central Tendency: Mean, Median, Mode & Range."
https://www.lcps.org/cms/lib4/VA01000195/Centricity/Domain/10878/9.4%20Notes%
20and%20Practice%20MMMR%20and%20MCT.pdf. Accessed 12 Aug. 2020.
"STATISTICS Unit 1 Measures of Central Tendency TEXT."
http://www.cimt.org.uk/cmmss/S1/Text.pdf. Accessed 12 Aug. 2020.
Measures of Dispersion
LESSON 4
DAYONO, JENESA A. (3 HOURS)
LEARNING OUTCOMES
INTRODUCTION
UNIT
LESSON Measures of Dispersion
4
4
Take a look on the figure below. What differences have you seen on the two curves?
Write three observations on the space provided
My observations:
1.
2.
3.
UNIT
LESSON Measures of Dispersion
4
4
If you examine the curve, the major difference that you can see between them
is the peakedness of the curve. The curve on the left is more peaked than the one on
the right. But what does this tells us?
Note that the government has set protocols to minimize the spread of the
COVID- 19 virus. Based on the figure earlier, if we do not take steps to minimize the
spread of the virus, we will have more cases in just a short period of time. On the
other hand, if we follow health protocols, we will reach the peak slowly over a period
of time.
We will relate this with the measures of dispersion which will be discussed on
the next section.
Let’s Discover
Measures of Dispersion
Measures of dispersion indicate the extent to which individual items in a
series are scattered about an average. It is used to determine the extent of the
scatter so that steps may be taken to control the existing variation. It is also used as
measure of reliability of the average value.
A. Range
Probably the simplest and easiest way to determine measure of dispersion is
the range. The range of a set of measurement is the difference between the largest
and the smallest values. There are two advantages of range: (i) it is easy to compute
and (ii) it is easy to understand. On the other hand, it also has two disadvantages, it
can be distorted by a single extreme value (or outlier) and only two values are used
in the calculation.
Example 1:
The daily rates of a sample of eight employees at GMS Inc. are Php550, Php420,
Php560, Php500, Php700, Php670, Php860, Php480. Find the range.
Solution:
Range (R)= maximum value- minimum value
Php860- Php420= Php440
UNIT
LESSON Measures of Dispersion
4
4
Example 2. The IQ score of the Morales’ family are 108,112,127,116, and 113. Find
the range.
Solution:
Range (R)= maximum value- minimum value
Example 1. The daily rates of a sample of eight employees at GMS Inc. are Php550,
Php420, Php560, Php500, Php700, Php670, Php860, Php480. Find the variance
and standard deviation.
Solution:
Step 1: Compute the mean of the data set.
UNIT
LESSON Measures of Dispersion
4
4
Step 2: Subtract the mean from each of the value in the data set.
550 -42.5
420 -172.5
560 -32.5
500 -92.5
700 107.5
670 77.5
860 267.5
480 -112.5
Hence, the variance is Php 20, 421.43 and the standard deviation id Php 142.90.
UNIT
LESSON Measures of Dispersion
4
4
Example 2: Consider the following two sets of data of engine life. Find the standard
deviation for each engine manufacturer.
A. 500, 500, 500, 1500, 1500, 1500 B. 925, 925, 925, 975, 975, 975
Solution: Company A
Step 1: Compute the mean of the data set.
Step 2: Subtract the mean from each of the value in the data set.
500 -500
500 -500
500 -500
1500 500
1500 500
1500 500
UNIT
LESSON Measures of Dispersion
4
4
Solution: Company B
925, 925, 925, 975, 975, 975
Step 2: Subtract the mean from each of the value in the data set.
925 -25
925 -25
925 -25
975 25
975 25
975 25
UNIT
LESSON Measures of Dispersion
4
4
Relative Dispersion
Measures of Relative dispersion are unitless and are used when one wishes to
compare the scatter of one distribution with another distribution.
Coefficient Variation
The coefficient of variation, CV, also known as relative dispersion, is the ratio of the
standard deviation to the mean and is usually expressed in percentage. It is
computed as:
UNIT
LESSON Measures of Dispersion
4
4
Example 1. The monthly income of the five research directors of Recoletos schools
are: Php55,000, Php59,500, Php62,500, Php57,000, and Php61,000. Find the
variance and standard deviation.
Solution:
UNIT
LESSON Measures of Dispersion
4
4
Let’s Do It
Step 2 & 3. Subtract the mean from each of the value in the data set and square the
, then get the sum.
30 3.25 10.56
20 -6.75 45.56
40 13.25 175.56
20 -6.75 45.56
24 -2.75 7.56
5 -21.75 473.06
30 3.25 10.56
45 18.25 333.06
UNIT
LESSON Measures of Dispersion
4
4
B. At the Steakhouse Grill, steaks are cut into 12‐ounce portions when they are
ordered by a customer. For the past month, the manager of Steakhouse Grill has
been trying to determine which cook can mostly accurately cut portion of the meat.
The manager collected the mean of sample meat portions for a few weeks for both of
his cooks.
The results are summarized in the table below.
Week 1 Week 2 Week 3 Week 4
Opie 12.5 12.3 11.8 11.2
Maynard 12.0 12.2 11.8 12.4
Step 2 & 3: Subtract the mean from each of the value in the data set and Square the
, then get the sum.
Thus, the mean cut of Opie is 11.95 oz and his standard deviation is 0.58.
UNIT
LESSON Measures of Dispersion
4
4
Solution (Maynard):
Step 1: Compute the mean of the data set.
Step 2 & 3: Subtract the mean from each of the value in the data set and Square the
, then get the sum.
12 -0.1 0.01
12.2 0.1 0.01
Thus, the mean cut of Maynard is 12.10 oz and his standard deviation is 0.26.
Despite the fact that Opie’s mean of 11.95 is closer to 12 ounce than
Maynard’s mean which is 12.10, their standard deviation would speak rightfully for
whom among the cook will be chosen by the manager to cut portions of meat
throughout the week.
The manager needs to choose Maynard because standard deviation of
Marnard’s cut is smaller compare to Opie. The smaller the value of standard
deviation, the closer it is to the mean. Thus, the cut of Maynard is closer to the
required cut of the Steakhouse Grill which is 12 ounce. It is true then that using only
the mean, one’s decisions may lead to mistakes.
UNIT
LESSON Measures of Dispersion
4
4
A. The data below shows the monthly (February-July, 2020) electricity bill of a
particular household in Rivera Ville, Kabankalan City, Negros Occidental. What is the
variance and standard deviation of the data? Show your complete solution in your
answer booklet.
Electricity Bill (in pesos) 2,751 2,636 4,130 3,410 2,910 2,731
B. Working hours for a week were recorded by the two teachers who were working at
home due to COVID-19 pandemic. The required working hours for a permanent
employee is 8 hours. Consider the following data and compute for the variation.
Which among the two teachers were working productively?
Teacher A: 8.5 8 7.25 7.25 9
Teacher B: 7.75 8.25 7.75 8.25 8
UNIT
LESSON Measures of Dispersion 4
4
Let’s Sum It Up
Measures of dispersion indicate the extent to which individual items in a series are
scattered about an average.
The simplest and easiest way to determine measures of dispersion is the range. The
range of a set of measurement is the difference between the largest and the smallest
values.
One of the most widely used measures of dispersion is the standard deviation.
Standard deviation is calculated as the square root of variance. The more spread
apart the data, the higher the deviation.
When one wishes to compare the scatter of one distribution with another distribution,
measures of relative dispersion (unitless) are used.
The coefficient of variation, CV, also known as relative dispersion, is the ratio of the
standard deviation to the mean and is usually expressed in percentage.
Try the link and do check your manual computation of the previous exercises using
MS Excel.
Calculating The Standard Deviation, Mean, Median, Mode, Range, & Variance Using
Excel https://www.youtube.com/watch?v=k17_euuiTKw
Bibliography
Book
Sirug, W.S. 2018. Mathematics in the Modern World. A CHED General Education
Curriculum Compliant. Mindshapers Co., Inc.
Zamora, C.O. et, al. 2003. Elementary Statistics. Text/Workbook.
"Measures of Central Tendency: Mean, Median, Mode & Range."
https://www.lcps.org/cms/lib4/VA01000195/Centricity/Domain/10878/9.4%20Notes%
20and%20Practice%20MMMR%20and%20MCT.pdf. Accessed 12 Aug. 2020.
https://web.gccaz.edu/~JOHWD63181/MAT142/chapter_3/problems/section%203.4.p
df
LEARNING OUTCOMES
INTRODUCTION
UNIT
LESSON Measures of Relative Position
4
5
Based from the activity 40, answer the questions below. Write your answer in your
answer booklet.
UNIT
LESSON Measures of Relative Position
4
5
Let’s Discover
Example 1: Find the first, second and third quartiles of the ages of 9 middle-
management employees of a certain company. The ages are
53,45,59,48,54,46,51,58, and 55.
Solution:
Step 1: Arrange the data in order from lowest to highest.
45, 46, 48, 51, 53, 54, 55, 58, 59
Step 2: Select the first, second and third quartiles using the formula.
1(𝑁 + 1) 1(9 + 1) 10
𝑄1 = = = = 2.5
4 4 4
2(𝑁 + 1) 2(9 + 1) 20
𝑄2 = = = =5
4 4 4
3(𝑁 + 1) 3(9 + 1) 30
𝑄3 = = = = 7.5
4 4 4
UNIT
LESSON Measures of Relative Position
4
5
Step 3: Identify the first, second, and third quartile values in the data set.
45, 46, 48, 51, 53, 54, 55, 58, 59
2.5th 5th 7.5th
Since the 2.5th falls between 46 and 48; and 7.5 th falls between 55 and 58 we
can determine the first and third quartiles of the data set by getting the average of the
two values.
46+48 94 55+58 113
𝑄1 = 2
= 2
= 47 𝑄3 = 2
= 2
= 56.5
Therefore, the first quartile is 47, which is interpreted as 25% of the ages are
below 47; the second quartile is 53, which is interpreted as 50% of the ages are
below 53; and the third quartile is 56.5, which is interpreted as 75% of the ages are
below 56.5
B. Percentiles
Percentiles are values that divide a set of observations in an array into 100 equal
parts. Thus,
𝑃1 , read as first percentile, is the value below which is 1% of the values fall
𝑃2 , read as second percentile, is the value below which is 2% of the values fall
⋮
𝑃99 , read as ninety-nine percentile, is the value below which is 99% of the values fall
𝑘(𝑁 + 1)
𝑃𝑘 =
100
where: 𝑃𝑘 =Percentile
𝑁 = population
𝑘 = percentile location
Example 1: Find the 64th and 80th percentiles of the ages of 9 middle-management
employees of a certain company. The ages are 53,45,59,48,54,46,51,58, and 55.
Step 1: Arrange the data in order from lowest to highest.
45, 46, 48, 51, 53, 54, 55, 58, 59
UNIT
LESSON Measures of Relative Position
4
5
Step 2: Select the 64th and 80th percentiles using the formula.
𝑘(𝑁 + 1) 64(9 + 1) 640
𝑃64 = = = = 6.4
100 100 100
In this case, to find the 6.4 th position, subtract 6th location from 7th location (55-54=1),
then multiply the difference to the decimal part of the 6.4 th location (1*.4=.0.4). Add
the product to the the number in the 6 th position (54+.4=54.4)
𝑘(𝑁 + 1) 80(9 + 1) 800
𝑃80 = = = =8
100 100 100
Step 3: Identify the 64th and 80th percentiles value in the data set.
45, 46, 48, 51, 53, 54, 55, 58, 59
8th
Therefore, 64th percentile is 54.4, which is interpreted as 64% of the ages are
below 54.4; and the 80th percentile is 58, which is interpreted as 80% of the ages are
below 58.
C. z-scores
z-score is used to know the position of one observation relative to others in a
set of data. Let say, we want to know a score of a student of 42 compared to the
scores of the other students in the class based from a quiz on a total of 50 points.
The mean and the standard deviation of the scores can be used to compute a z-
score, which will measure a relative standing of a measurement in a data set.
A z-score measures the distance between an observation and the mean,
measured in units of standard deviation. The following formulas show how to
compute the z-score for a data value x in a population and in a sample.
𝑥−𝜇 𝑥−𝑥
𝑧= 𝜎
(for population) 𝑧= 𝑠
(for sample)
Example 1: Ruel got a grade of 75% in English and a grade of 90% in History. The
mean grade in English is 65% and the standard deviation is 10% whereas in History,
the mean grade is 80% and the standard deviation is 20%. In which subject did Ruel
perform better?
Solution:
90−80 75−65
𝑧ℎ𝑖𝑠𝑡𝑜𝑟𝑦 = = 0.5 𝑧𝑒𝑛𝑔𝑙𝑖𝑠ℎ = = 1.0
20 10
UNIT
LESSON Measures of Relative Position
4
5
Ruel’s grade in History has a z-score of 0.5 and is closer than the mean
(average) while his English grade has a z-score of 1.0 and is farther way from the
mean, therefore, Ruel performed better in English.
Example 2. The monthly expenditures of a large group of households are normally
distributed with a mean of Php 48, 700 and a standard deviation of Php 10,400. What
is the z-value of monthly expenditures of Php59,400 and Php38,300?
Solution:
59,400−48,700
𝐹𝑜𝑟 𝑥 = 59,400 𝑧= 10,400
= 1.00
38,300−48,700
𝐹𝑜𝑟 𝑥 = 38,300 𝑧= = −1.00
10,400
The z of 1.00 indicates that a monthly expenditure of Php 59, 400 for
households is one standard deviation above the mean, and a z of -1.00 shows that a
Php 38,300 monthly expenditure is one standard deviation below the mean. Note that
both household monthly expenditures (Php 59,400 and Php 38,300) are the same
distance (Php 10,400) from the mean.
D. Box-and-Whisker Plot
John Wilder Turkey (1915-2000) introduced the boxplot in the 1970’s. A
boxplot (or box-and-whisker plot) is graph of a data set obtained by drawing a
horizontal line from the minimum data value to first quartile (𝑄1 ), drawing a horizontal
line to the third quartile (𝑄3 ) to the maximum data value, and drawing a box whose
vertical line passes through 𝑄1 and 𝑄3 with a vertical line inside the box passing
through the median or second quartile (𝑄2 ).
The boxplot is another way of telling or not the distribution of data is symmetric
relative to the mean.
The boxplot will give the following information:
1. If the median is near the center of the box, the distribution is approximately
symmetric.
2. If the median falls to the right of the center of the box, the distribution is
negatively skewed.
3. If the median falls to the left of the center of the box, the distribution is
positively skewed.
4. If the lines are about the same length, the distribution is approximately
symmetric.
5. If the left line is larger than the right line, the distribution is negatively skewed.
6. If the right line is larger than the left line, the distribution is positively skewed.
UNIT
LESSON Measures of Relative Position
4
5
Solution:
Step 1: Determine the 𝑄1 , Median, and 𝑄3 of the given data set. Recall that 𝑄1 =47,
Median(𝑄2 ) =53, and 𝑄3 =56.5.
Step 2: Locate the lowest value,𝑄1 , the median, 𝑄3 , and the highest value on the
scale.
Step 3: Draw a box around 𝑄1 and 𝑄2 , draw a vertical line through the median, and
connect the upper and lower values, as shown below.
UNIT
LESSON Measures of Relative Position
4
5
Note: The following are the plot values used in the data
Minimum value 45
𝑄1 47
Median or the 𝑄2 53
𝑄3 56.5
Maximum Value 59
The data set of the distribution is negatively skewed, since the median falls to
the right to the center of the box.
Let’s Do It
Solution #1:
Step 1: Arrange the data in order from lowest to highest.
5, 20, 20, 24, 30, 30 ,40, 45
UNIT
LESSON Measures of Relative Position
4
5
If you noticed, 𝑄1 , and 𝑃25 have the same rank value in the data set. Thus the first
quartile and the 25th percentile of the data set is 20 which means that 25% of the
ages of the recoveries of the COVID-19 patient in Western Visayas on August 2,
2020 falls below 20 years old.
Solution #2:
Recall how to find values for the following:
Minimum value 5
𝑄1 20
Median or the 𝑄2 27
𝑄3 37.5
Maximum Value 45
The median falls to the left of the center of the box, so therefore distribution is
positively skewed.
UNIT
LESSON Measures of Relative Position
4
5
B. Two men are on a weight-reducing diet. The first belongs to an age/body type
group for which the mean weight is 145 pounds with a standard deviation of 15
pounds. The second belongs to an age/body type of group for which the mean is 165
pounds with a standard deviation of 20 pounds. If their respective weights are 178
and 204 pounds, who is more definitely overweight for his age/body type group?
Solution:
𝑥−𝑥
𝑧=
𝑠
178−145 204−165
𝑧1 = = 2.2 𝑧2 = = 1.95
15 20
The overweight for age/body group type is the one whose weight is 178 pounds.
UNIT
LESSON Measures of Relative Position
4
5
B. The mean grade of the students who took Computer programming is 77.1 and
standard deviation equal to 8.03. Carl got a grade of 64 and he was worried because
the university has the policy that a student will fail if he obtains a grade which falls
beyond two (2) standard deviations below the mean. Will Carl fail the course?
Answers:
64−77.1
𝑧= = −1.63. This z-score is not beyond 2 standard deviations below the
8.03
mean. Therefore, Carl will not fail the course.
1. What is the first, second and third quartile of the given data?
2. Construct a box-and-whisker plot. What can you say about the distribution of
the data set.
B. Different typing skills are required for secretaries depending on whether one is
working in a law office, an accounting firm, or for a mathematical research group at a
major university. In order to evaluate candidates for these positions, an agency
administers 3 distinct standardized typing samples. A time penalty has been
incorporated into the scoring of each sample based on the number of typing errors.
The mean and standard deviation for each test, together with the scores achieved by
Mary, an applicant, are given in the following table.
Sample Mary’s Score Mean Std.Dev.
Law 141 sec 180 sec 30 sec
Accounting 7 min 10 min 2 min
Math Research 33 min 26 min 5 min
Who among the secretarial candidates perform best? Why?
UNIT
LESSON Measures of Relative Position
4
5
Let’s Sum It Up
Try the following links and do check your manual computation of the previous
exercises using MS Excel.
Bibliography
Mamhot, M.R. et al. 2013. Statistics for General Education (A Keypad Approach).
Purelybooks Trading and Publishing Corp.
Sirug, W.S. 2018. Mathematics in the Modern World. A CHED General Education
Curriculum Compliant. Mindshapers Co., Inc.
Zamora, C.O. et, al. 2003. Elementary Statistics. Text/Workbook.
Direction: Read the questions carefully. Shade the letter of the correct answer. If
answer is not provided, shade letter E. Use the answer sheet provided on your
answer booklet and strictly no erasures.
2. What type of chart is useful in presenting historical data and in showing the
movement of a series over time?
A. Line chart
B. Bar chart
C. Pie chart
D. Pictograph
10. What is the lower class boundary of the class interval 7 – 12?
A. 6.5 C. 7.5
B. 7 D. 12.5
11. The reseller’s income in Heng De face shield for seven days were, Php
1,220, Php 1,345, Php, 1000, Php 2,500, Php 1,590, Php 1,350 and Php
1540. What is the reseller’s average income?
A. Php 1,506.42 C. Php 1,350
B. B. Php 1,506.43 D. Php 10,545
12. It is a statistic that measures the dispersion of a dataset relative to its mean
and is calculated as the square root of the variance.
A. range C. variance
B. standard deviation D. none of these
13. Consider the sample What is the median of the set of data: 13, 19, 18, 20, 16,
9, 10, 7 and 8.
A. 13 C. 5.05
B. 13.33 D. 25.50
GEC 4: MATH IN THE MODERN WORLD
P a g e | 277
16. It is another way of telling whether or not the distribution of the data is
symmetric relative to the median.
A. Boxplot C. Quartiles
B. Z-scores D. Fractiles
For items 17 and 18, consider the life span (in years) of a certain species below.
1.6 1.9 2.2 2.5 2.6 2.6 2.9 3.0 3.0 3.1
3.1 3.1 3.1 3.2 3.2 3.2 3.3 3.3 3.3 3.4
3.4 3.4 3.5 3.5 3.6 3.7 3.7 3.7 3.8 3.8
3.9 3.9 4.1 4.1 4.2 4.3 4.4 4.5 4.7 4.7
19. It is used when one wishes to compare the scatter of one distribution with
another distribution.
A. Measures of Location C. Measures of Position
B. Coefficient of Variation D. Measures of Relative Dispersion
20. It is a value that divide a set of observations into 100 equals parts.
A. Quartiles C. Deciles
B. Percentiles D. Fractiles
Appendix A
Rubrics
A. Essay Writing
1 2 3 4 5
Criteria 1 2 3
Labels and Title Title is appropriate Title is somewhat Title is missing or
to the topic of the appropriate to the inappropriate. Little
graph. All areas of topic of the graph. or no labeling. X
the graph are Most areas of the and Y axis are not
labeled and well graph are labeled indicated correctly.
defined including and well defined
the X and Y axis. including the X and
Y axis.
Accuracy Graph shows a Graph shows a Graph cannot be
clear, defined connection to the X read but some data
connection to the X and Y axis. Most is present. The
and Y axis. Each bars are the length of the bars is
bar is the appropriate length. inappropriate.
appropriate length.
Visual Appearance Color is used in a Color is used. The Little or no color
meaningful way. graph, title and used. The graph,
The graph, title and labels are arranged title and/or labels
labels are arranged appropriately. Text are not arranged
appropriately. Text is readable. appropriately. Text
is easy to read. is difficult to read.
C. Frequency Table
3 2 1 0
Class Limits All class limits One value is Two to three Four or more
are correct incorrect values are values are
incorrect incorrect
Class All class One value is Two to three Four or more
Boundaries boundaries are incorrect values are values are
correct incorrect incorrect
Frequencies All frequencies One value is Two to three Four or more
are correct incorrect values are values are
incorrect incorrect
Relative All relative One value is Two to three Four or more
Frequencies frequencies are incorrect values are values are
correct incorrect incorrect
Class Marks All class marks One value is Two to three Four or more
are correct incorrect values are values are
incorrect incorrect
Cumulative All cumulative One value is Two to three Four or more
Frequencies frequencies are incorrect values are values are
correct incorrect incorrect
P a g e | 280
D. Frequency Histogram
5 4 3 2 1
Axes Both X and Both X and Both X and Both X and Both X and Y
Y Axes are Y axes are Y are Y axes are axes are not
scaled scaled scaled, but not scaled scaled or values
effectively based on not based evenly. not present
based on the data correctly
data set set on data
collected. X collected. X set. X and
and Y are and Y are Y are not
scaled scaled scaled
evenly evenly evenly
throughout throughout throughout
both axes both axes both axes
Titles/Labels The graph The graph The graph The graph Most titles and
is complete is complete is is labels are poor
and labeled and labeled complete complete or not
correctly correctly and but not present/missing.
throughout throughout labeled completely
the graph. the graph. throughout labeled.
This This the graph.
includes a includes a This
title created title created includes a
appropriate somewhat title
to the data appropriate created for
collected. to the data the data
collected. collected.
Histogram All bars are All bars are All bars Bars are Bars are noy
connected connected are not connected and
throughout throughout connected connected none of the data
its entirety. most of the throughout and not all correlated to the
Bars are axes. Bars some of data frequency table.
correctly are the axes. correlates
correlated correctly Bars are to the
to the data correlated somewhat frequency
collected to the data correlated table.
and collected to the data
displayed and collected
in the displayed and
frequency in the displayed
table. frequency in the
table. frequency
table
Retrieved from:
https://www.rcampus.com/rubricshowc.cfm?sp=yes&code=VX5BA74&
P a g e | 281
E. Summary Measures
3 2 1 0
Measures of The mean, The mean, The mean, The mean,
Central median, and median, and median, and median, and
Tendency mode have all mode have mode have mode have not
been computed been calculated been calculated been calculated
with no errors with 75% but are not
accuracy correct
Measures of The range, The range, The range, The range,
Variation variance, variance, variance, variance,
standard standard standard standard
deviation, and deviation, and deviation, and deviation, and
coefficient of coefficient of coefficient of coefficient of
variation have variation have variation have variation have
all been been calculated been calculated not been
computed with with 75% but are not calculated
no errors accuracy correct
Measures of The percentiles The percentiles The percentiles The percentiles
Position and quartiles and quartiles and quartiles and quartiles
have all been have been have been have not been
computed with calculated with calculated but calculated
no errors 75% accuracy are not correct
Retrieved from: https://www.rcampus.com/rubricshowc.cfm?sp=yes&code=N8C347&
F. Box-and-Whisker Plot
5 4 3 2
Five Number Summary All values One value Two to Four or more
➢ Minimum value are correct is incorrect three values values are
➢ Lower quartile are incorrect
➢ Median incorrect
➢ Upper quartile
➢ Maximum value
Box-and-whisker plot All One Two to Four or more
Plots potted correctly, box components component three components
is placed around upper and are present is missing components are missing
lower quartiles, line for are missing
median, whiskers are
drawn.
Analyze data Thoroughly Complete, Partially Incomplete
Describe the relationship complete, clear, complete; explanation;
between the two variables clear, and accurate gives not
accurate explanation response; understandable
explanation explanation or insufficient.
may be
unclear or
lack detail
Retrieved from: https://www.rcampus.com/rubricshowc.cfm?sp=yes&code=H8A6XB&