Professional Documents
Culture Documents
LESSON 1
GROUPED DATA
In the previous lesson, we discussed the way of determining the mean of a set of data. As you
notice, the given set of data in the previous lesson are not organized in a manner like we covered in Module
2. In other words, the data are ungrouped. In this lesson, we will discuss the methods of calculating the
mean of a set of grouped data. That is, we will calculate the mean of a set of data presented in frequency
distribution table.
x=
∑ fx
x
Where:
𝒇 represents the frequency column of the distribution.
𝒙 represents the midpoint column of the distribution.
𝒇𝒙 represents the “frequency times midpoint” column of the distribution.
𝒏 is the total frequency.
Comments:
∑ 𝑓𝑥 is a quantity that is obtained by adding all the entries of 𝑓𝑥 column.
Recall the formula of midpoint of each class interval. It was discussed at Module 2 – Lesson 3.
The value of 𝑛 can be obtained by simply adding all the entries of 𝑓column.
EXAMPLE 1
Consider the following set of data.
Classes 𝒇 𝒙 𝒇𝒙
11 – 22 3 16.5 49.5
23 – 34 5 28.5 142.5
This is the “frequency
35 – 46 11 40.5 445.5 times the midpoint”
47 – 58 19 52.5 997.5 column. For example,
3 ∙16.5=49.5
59 – 70 14 64.5 903
71 – 82 6 76.5 459
83 – 94 2 88.5 177
𝑛 = 60
STEP 3. To get the value of ∑ 𝑓𝑥, get the sum of all the entries of the 𝑓𝑥 column.
Classes 𝒇 𝒙 𝒇𝒙
11 – 22 3 16.5 49.5
23 – 34 5 28.5 142.5
35 – 46 11 40.5 445.5
47 – 58 19 52.5 997.5
59 – 70 14 64.5 903
71 – 82 6 76.5 459
83 – 94 2 88.5 177
𝑛 = 60 ∑ 𝑓𝑥 = 3174
STEP 4. Substitute the pertinent values to the formula to get the mean.
∑ 𝑓𝑥 3174
𝑥̅ = = = 52.9
𝑛 60
STEP 2. Construct the 𝑓𝑥 column. To get the value of ∑ 𝑓𝑥, we will simply get the sum of all the entries of 𝑓𝑥
column.
Class Intervals 𝑓 𝑥 𝑓𝑥
11 – 15 1 13 13
16 – 20 2 18 36 This is the
21 – 25 5 23 115 “frequency times
midpoint” column.
26 – 30 11 28 308 For example,
31 – 35 12 33 396 5 ∙23=115
36 – 40 11 38 418
41 – 45 5 43 215
46 – 50 1 48 48
TOTAL 𝑛 = 48 ∑ 𝑓𝑥 = 1549
x=
∑ f x = 1549 =32. 27
n 48
x=x a +( )∑ fd
n
c
Where:
𝒙𝒂 is the assumed mean
𝒇 represents the frequency of every class interval.
𝒅 represents the unit deviation
𝒄 is the class width
𝒏 is the total frequency
Comments:
o The unit deviation method is just the alternative method of computing the mean of a set of
grouped data. When applied correctly, this method always gives the result as the midpoint
method.
o This method minimizes the use of large numbers in the computation of the mean.
o The application of this method always begins with your free choice of class interval.
o The assumed mean 𝑥𝑎 is the midpoint of the class interval of your choice.
To see that this method gives the result as the midpoint method, let use the distribution presented in Example 1.
EXAMPLE 3
Consider the following set of data.
STEP 1. As stated in my comments, we will choose any class interval (as in it depends on you). For the sake
of discussion, I will choose the interval 𝟒𝟕 – 𝟓8.
Class Intervals 𝒇
11 – 22 3
23 – 34 5
35 – 46 11
47 – 58 19
It is the chosen class 59 – 70 14
interval. It entirely 71 – 82 6
depends on your
83 – 94 2
preference.
𝑛 = 60
The midpoint of the chosen class interval is the assumed mean. Since the midpoint of 47 − 58 is
52.5, we have 𝒙𝒂 = 𝟓𝟐. 𝟓.
Furthermore, the see that the class width of each class interval in the distribution is 12. So, 𝒄 = 𝟏𝟐.
STEP 2. Construct a column for the unit deviation. It is represented by 𝑑. The 𝑑 of the chosen interval is zero.
Class Intervals 𝒇 𝒅
With respect to column 𝑑, let 11 us –
start
22 with zero. We designate
3 consecutive negative integers (-1, -2, -3, …) to
the class intervals before our chosen intervals. On the other hand, we designate consecutive positive integers
23 – 34 5
(1, 2, 3, …) to the class intervals after our chosen class intervals.
35 – 46 11
Class 47 – 58 𝒇 19 𝒅 0
Intervals 59 – 70 14
11 – 22 3 -3 Starting from zero, negative
71 – 82 6
consecutiveWe designate zero to
23 – 34 integers are designed
83 – 94 5 2 -2 the chosen class
to the intervals before our chosen
35 – 46 11 𝑛 = 60-1 interval
class interval
47 – 58 19 0
59 – 70 14 1
71 – 82 6 2 Starting from zero, positive
consecutive integers are designed
83 – 94 2 3 to the intervals after our chosen
𝑛 = 60 class interval
Construct the 𝑓𝑑 column. This column contains the product between the frequency and their corresponding 𝑑.
To get the value of ∑ 𝑓𝑑, we will get the sum of all the entries of the 𝑓𝑑 column.
Class 𝒇 𝒅 𝒇𝒅
Intervals
11 – 22 3 -3 -9
23 – 34 5 -2 -10 This is the fd column. For
35 – 46 11 -1 -11 Example 3 ∙ (−3 )=−9
47 – 58 19 0 0
59 – 70 14 1 14
71 – 82 6 2 12
83 – 94 2 3 6
𝑛 = 60 ∑ 𝑓𝑑 = 2
x=x a +(∑ )n
fd
c=52 . 5+ ( )
2
60
( 12 )=52 . 9
Comments:
o The value of 𝑥𝑎 and ∑ 𝑓𝑑 depends on your choice of class interval. Their values may vary.
o Still, regardless of your choice of class interval, you will get the same mean.
To see that we will get the same result regardless of your choice of the class interval, let us use the same
distribution and then apply the unit deviation method. This time we choose different classs intervals.
EXAMPLE 4
Consider
WEIGHTEDthe following
MEAN set of data used in Example 3
The weighted mean of a set of grouped data can be calculated using the formula:
Frequency Distribution of the Examination Scores of Sixty Students in a Statistics Class
Scores ∑ wx
x= Number of Students
11 – 22 ∑w 3
23 – 34 5
Where: 35 – 46 11
𝒙 represents the item value 47 – 58 19
59 –
w represents the weight associated to 𝒙70 14
71 – 82 6
83 – 94 2
EXAMPLE 1
Use unit deviation. This time let’s use different class interval.
Suppose we are interested in computing the weighted mean grade of the student.
x=
∑ wx
∑x
3 ( 2.0 ) +3 ( 3.0 ) +5 (1.25 )+ 1 ( 3.0 ) +2 ( 2.5 ) +3(2.5)
x=
3+3+5+1+2+3
36.75
x=
17
x=2.16
. Republic of the Philippines
SOUTHERN LUZON STATE UNIVERSITY
LABORATORY SCHOOL
Lucban, Quezon
49 84 91 67 38 57 82 29 47 38
54 52 43 67 65 50 18 58 48 39
16 65 35 71 73
Compute the mean of the set of data above. Use the two formulas discussed in Lesson 1 of this module.
a. Using the set of data in #1, construct a frequency distribution with 6 classes. Recall the method of
constructing frequency distribution table.
b. Using the constructed frequency distribution table in #2, calculate the mean of the grouped data. Use
any of the two methods discussed in this lesson.
c. Compare the result that you obtained in #1 and #3. Based on the result, what can you say about the
formula of the mean of a set of grouped data?
DO IT YOURSELF
Consider the following set of grouped data. Calculate the mean using (a) midpoint method and (b) unit
deviation method. Show your solution.
2. Frequency Distribution of the NCEE Scores of First Year College Students of ABC College.
3. Calculate the mean of the distribution used in Example 2 using unit deviation method.
4. The final grades of a student in six subjects where he was enrolled were taken and are shown below.