3 Measures of Central Tendency
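Sigma operator …

Properties of summation

• $\sum_{i=1}^{n} kX_i = k\sum_{i=1}^{n} X_i$, where $k$ is any constant.
• $\sum_{i=1}^{n} (a + bX_i) = na + b\sum_{i=1}^{n} X_i$, where $a$ and $b$ are any constants.
• $\sum_{i=1}^{n} (X_i \pm Y_i) = \sum_{i=1}^{n} X_i \pm \sum_{i=1}^{n} Y_i$
• $\sum_{i=1}^{n} X_iY_i = X_1Y_1 + X_2Y_2 + \dots + X_nY_n$

Example: Consider the following data and find the required values.

i    1    2    3    4    5
X    5    7    7    6    8
Y    6    7    8    7    8

c) $\sum_{i=1}^{5} 10 = 5 \times 10 = 50$

d) $\left(\sum_{i=1}^{5} X_i\right)^2 = 33^2 = 1089$

e) $\sum_{i=1}^{5} X_i^2 = 5^2 + 7^2 + 7^2 + 6^2 + 8^2 = 223$

Note that d ≠ e: the square of the sum is not the sum of the squares.

f) $\left(\sum_{i=1}^{5} X_i\right)\left(\sum_{i=1}^{5} Y_i\right) = 33 \times 36 = 1188$

g) $\sum_{i=1}^{5} X_iY_i = (5)(6) + (7)(7) + (7)(8) + (6)(7) + (8)(8) = 241$

h) $\sum_{i=1}^{5} (X_i + Y_i) = (5+6) + (7+7) + (7+8) + (6+7) + (8+8) = 33 + 36 = 69$

i) $\sum_{i=1}^{5} (X_i - Y_i) = (5-6) + (7-7) + (7-8) + (6-7) + (8-8) = 33 - 36 = -3$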
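The summation results in the example can be checked numerically. The following is a minimal Python sketch, assuming the data X = 5, 7, 7, 6, 8 and Y = 6, 7, 8, 7, 8 from the example; the variable names and the constants `k`, `a`, `b` are illustrative choices, not part of the original slides.

```python
# Data from the summation example.
X = [5, 7, 7, 6, 8]
Y = [6, 7, 8, 7, 8]
n = len(X)

sum_x = sum(X)                                    # sum of X
sum_y = sum(Y)                                    # sum of Y
sum_const = sum(10 for _ in range(n))             # c) a constant summed n times
square_of_sum = sum_x ** 2                        # d) (sum of X) squared
sum_of_squares = sum(x ** 2 for x in X)           # e) sum of X squared
product_of_sums = sum_x * sum_y                   # f) (sum of X)(sum of Y)
sum_of_sums = sum(x + y for x, y in zip(X, Y))    # h) sum of (X + Y)
sum_of_diffs = sum(x - y for x, y in zip(X, Y))   # i) sum of (X - Y)

# Property checks with arbitrary (hypothetical) constants.
k, a, b = 3, 2, 4
assert sum(k * x for x in X) == k * sum_x
assert sum(a + b * x for x in X) == n * a + b * sum_x

print(sum_const, square_of_sum, sum_of_squares, product_of_sums,
      sum_of_sums, sum_of_diffs)
```

Comparing `square_of_sum` with `sum_of_squares` makes the d ≠ e point concrete: the two quantities differ for this data.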
Stat 2161 SB
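If each value $X_i$ occurs $f_i$ times, then the mean will be:

$$\bar{X} = \frac{\sum f_i X_i}{\sum f_i}, \quad \text{where } \sum f_i = n$$

X       f       fX
4       2       8
5       3       15
6       1       6
7       5       35
8       7       56
9       4       36
10      2       20
11      1       11
Total   25      187

$\sum f_i = 25$, $\sum f_i X_i = 187$

$$\bar{X} = \frac{\sum f_i X_i}{\sum f_i} = \frac{187}{25} \approx 7.5$$

Conclusion
On average, it takes about seven and a half minutes to type the test paragraphs.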
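The mean of a frequency distribution can be computed directly from the value and frequency columns. This is a minimal sketch using the typing-time table; the function name `frequency_mean` is an illustrative choice, not something defined in the slides.

```python
def frequency_mean(values, freqs):
    """Mean of a frequency distribution: sum(f * x) / sum(f)."""
    total_f = sum(freqs)
    if total_f == 0:
        raise ValueError("total frequency must be positive")
    total_fx = sum(f * x for x, f in zip(values, freqs))
    return total_fx / total_f

# Typing times (minutes) and their frequencies from the table.
values = [4, 5, 6, 7, 8, 9, 10, 11]
freqs = [2, 3, 1, 5, 7, 4, 2, 1]

mean_time = frequency_mean(values, freqs)
print(round(mean_time, 2))  # 187 / 25 = 7.48
```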
$$\bar{X}_W = \frac{\sum WX}{\sum W} = \frac{3(4) + 3(2) + 4(3) + 2(1)}{3 + 3 + 4 + 2} = \frac{32}{12} \approx 2.7$$

2.7 is the grade point average (GPA) for Abebe.
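The GPA calculation is a weighted mean with credit hours as weights. The sketch below assumes the pairing read off the formula above (weights 3, 3, 4, 2 with grade points 4, 2, 3, 1); the function name `weighted_mean` is hypothetical.

```python
def weighted_mean(values, weights):
    """Weighted mean: sum(w * x) / sum(w)."""
    total_w = sum(weights)
    if total_w == 0:
        raise ValueError("weights must not sum to zero")
    return sum(w * x for x, w in zip(values, weights)) / total_w

grade_points = [4, 2, 3, 1]   # X: grade points per course
credits = [3, 3, 4, 2]        # W: credit hours per course

gpa = weighted_mean(grade_points, credits)
print(round(gpa, 1))  # 32 / 12, which rounds to 2.7
```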
The median for a grouped frequency distribution can be calculated by using

$$MD = L_m + \left(\frac{\frac{N}{2} - cf}{f_m}\right) w$$

where
• $L_m$ = lower boundary of the median class
• $N$ = the total number of observations
• $cf$ = cumulative frequency of the class immediately preceding the median class
• $f_m$ = frequency of the median class
• $w$ = class width

Consider the record high temperature data.

Step 1: Construct the cumulative frequency distribution.

Class boundaries    Frequency (f)    Cumulative frequency
99.5–104.5          2                2
104.5–109.5         8                10   (cf)
109.5–114.5         18               28   (median class: L_m = 109.5, f_m = 18)
114.5–119.5         13               41
119.5–124.5         7                48
124.5–129.5         1                49
129.5–134.5         1                50
                    N = 50

Step 2: Find the halfway point: N/2 = 50/2 = 25.

Step 3: Locate the median class by using the cumulative frequency distribution. The class 109.5–114.5 contains the 25th value (the median), so it is the median class.
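The three steps and the formula can be combined in a short function. This is a sketch under the assumption that classes are given as ordered (lower, upper) boundary pairs; the name `grouped_median` is illustrative and does not come from the slides.

```python
def grouped_median(boundaries, freqs):
    """Median of a grouped frequency distribution.

    boundaries: ordered list of (lower, upper) class boundaries.
    freqs: frequency of each class, in the same order.
    """
    n = sum(freqs)
    if n == 0:
        raise ValueError("distribution has no observations")
    half = n / 2                      # Step 2: halfway point N/2
    cf = 0                            # cumulative frequency before the class
    for (lower, upper), f in zip(boundaries, freqs):
        if cf + f >= half:            # Step 3: this is the median class
            w = upper - lower         # class width
            return lower + (half - cf) / f * w
        cf += f
    raise ValueError("median class not found")

boundaries = [(99.5, 104.5), (104.5, 109.5), (109.5, 114.5), (114.5, 119.5),
              (119.5, 124.5), (124.5, 129.5), (129.5, 134.5)]
freqs = [2, 8, 18, 13, 7, 1, 1]

md = grouped_median(boundaries, freqs)
print(round(md, 2))  # 109.5 + (25 - 10) / 18 * 5
```

For the temperature data the function selects the class 109.5–114.5 with cf = 10 and f_m = 18, matching the annotations in the table.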
• Examples

1) Find the mode for the following data set.
18.0, 14.0, 34.5, 10.0, 11.3, 10.0, 12.4, 10.0

Soln
It is helpful, although not necessary, to arrange the data in order.
10.0, 10.0, 10.0, 11.3, 12.4, 14.0, 18.0, 34.5
Since 10.0 occurs three times (a frequency larger than that of any other number), the mode is 10.0.

2) Six children were tested to see how long they could remain silent. The times, in minutes, are recorded below. Find the mode.
2, 3, 5, 7, 8, 10

Soln
Since each value occurs only once, there is no mode.
Note: Do not say that the mode is zero. That would be incorrect, because in some data zero can be an actual value.

3) Eleven cleaners were tested to see how long they take to clean a lecture hall. The data, in minutes, are shown below. Find the mode.
15, 18, 18, 18, 20, 22, 24, 24, 24, 26, 26

Soln
Since 18 and 24 both occur three times, the modes are 18 and 24. This data set is said to be bimodal.

4) Find the modal class for the frequency distribution of our temperature data.

Class boundaries    Frequency (f)
99.5–104.5          2
104.5–109.5         8
109.5–114.5         18
114.5–119.5         13
119.5–124.5         7
124.5–129.5         1
129.5–134.5         1

Soln
The modal class is 109.5–114.5, since it has the largest frequency.

The mode is the only measure of central tendency that can be used when the data are nominal or categorical.
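The four mode examples can be reproduced with a small helper. This sketch follows the convention used in the examples above, returning an empty list when every value occurs only once; the names `modes` and `modal_class` are illustrative, and treating an all-unique data set as having no mode is the slides' convention rather than a universal one.

```python
from collections import Counter

def modes(data):
    """Return all modes in sorted order, or [] if no value repeats."""
    if not data:
        return []
    counts = Counter(data)
    top = max(counts.values())
    if top == 1:
        return []  # each value occurs once: no mode (not "zero")
    return sorted(v for v, c in counts.items() if c == top)

def modal_class(boundaries, freqs):
    """Return the class with the largest frequency."""
    best = max(range(len(freqs)), key=lambda i: freqs[i])
    return boundaries[best]

example1 = [18.0, 14.0, 34.5, 10.0, 11.3, 10.0, 12.4, 10.0]
example2 = [2, 3, 5, 7, 8, 10]
example3 = [15, 18, 18, 18, 20, 22, 24, 24, 24, 26, 26]

boundaries = [(99.5, 104.5), (104.5, 109.5), (109.5, 114.5), (114.5, 119.5),
              (119.5, 124.5), (124.5, 129.5), (129.5, 134.5)]
freqs = [2, 8, 18, 13, 7, 1, 1]

print(modes(example1), modes(example2), modes(example3))
print(modal_class(boundaries, freqs))
```

Python's standard `statistics.multimode` returns every value when all are unique, which is why a custom helper is used here to match the "no mode" convention.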
3.4 Properties and uses of measures of central tendency

Researchers and statisticians must know which measure of CT is being used and when to use each measure of CT. These are summarized below.

The Mean
• The mean is found by using all the values of the data.
• The mean varies less than the median or mode when samples are taken from the same population and all three measures are computed for these samples.
• The mean is used in computing other statistics, such as the variance.
• The mean for the data set is unique and not necessarily one of the data values.
• The mean cannot be computed for the data in a frequency distribution that has an open-ended class.
• The mean is affected by extremely high or low values, called outliers, and may not be the appropriate average to use in these situations.

The Median
• The median is used to find the center or middle value of a data set.
• The median is used when it is necessary to find out whether the data values fall into the upper half or lower half of the distribution.
• The median is used for an open-ended distribution.
• The median is affected less than the mean by extremely high or extremely low values.

The Mode
• The mode is used when the most typical case is desired.
• The mode is the easiest average to compute.
• The mode can be used when the data are nominal, such as religious preference, gender or political affiliation.
• The mode is not always unique. A data set can have more than one mode, or the mode may not exist for a data set.
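The different sensitivity of the mean and the median to outliers can be seen numerically. The sketch below reuses the data from mode example 1 and treats 34.5 as the extreme value; choosing that point as the outlier is an assumption made here for illustration.

```python
from statistics import mean, median

# Data from mode example 1; 34.5 is much larger than the other values.
times = [18.0, 14.0, 34.5, 10.0, 11.3, 10.0, 12.4, 10.0]
without_outlier = [t for t in times if t != 34.5]

mean_shift = mean(times) - mean(without_outlier)
median_shift = median(times) - median(without_outlier)

print(f"mean:   {mean(times):.2f} vs {mean(without_outlier):.2f}")
print(f"median: {median(times):.2f} vs {median(without_outlier):.2f}")
print(f"shift in mean {mean_shift:.2f}, shift in median {median_shift:.2f}")
```

Removing the single extreme value moves the mean by several times as much as it moves the median, which is the behaviour described in the bullets above.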
In a positively skewed or right-skewed distribution, the majority of the data values fall to the left of the mean and cluster at the lower end of the distribution; the "tail" is to the right. Also, the mean is to the right of the median, and the mode is to the left of the median.

Example:
In an examination, if most of the students did poorly, their scores would tend to cluster on the left side of the distribution. A few high scores would constitute the tail of the distribution, which would be on the right side.

• In a symmetric distribution, the data values are evenly distributed on both sides of the mean. In addition, when the distribution is unimodal, the mean, median and mode are the same and are all at the center of the distribution. Examples: IQ scores and heights of mothers.

• When the majority of the data values fall to the right of the mean and cluster at the upper end of the distribution, with the tail to the left, the distribution is said to be negatively skewed or left-skewed. Also, the mean is to the left of the median and the mode is to the right of the median. For example, if the majority of students score very high on an examination, their scores will tend to cluster on the right side of the distribution.

• When a distribution is extremely skewed, the median rather than the mean is a more appropriate measure of CT.
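The relationship between the mean and the median described above suggests a rough check for the direction of skew. The sketch below is only a heuristic, not a formal skewness measure; the function name `skew_direction`, the tolerance `tol`, and the `poor_exam` scores are hypothetical and introduced purely for illustration, while the typing-time data come from the frequency table earlier in the chapter.

```python
from statistics import mean, median

def skew_direction(data, tol=1e-9):
    """Rough heuristic: compare the mean with the median."""
    m, md = mean(data), median(data)
    if abs(m - md) <= tol:
        return "approximately symmetric"
    return "positively skewed" if m > md else "negatively skewed"

# Typing-time data expanded from the frequency table (X with frequency f).
values = [4, 5, 6, 7, 8, 9, 10, 11]
freqs = [2, 3, 1, 5, 7, 4, 2, 1]
typing_times = [x for x, f in zip(values, freqs) for _ in range(f)]

# Hypothetical exam scores (not from the source): most low, a few high.
poor_exam = [20, 25, 25, 30, 35, 40, 90, 95]

print(skew_direction(typing_times))
print(skew_direction(poor_exam))
print(skew_direction([1, 2, 3, 4, 5]))
```

For the hypothetical `poor_exam` scores the few high values pull the mean above the median, matching the examination example in the text.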