Professional Documents
Culture Documents
Handnote - Chapter-4 - Measures-Of-Dispersio
Handnote - Chapter-4 - Measures-Of-Dispersio
Measures of Dispersion
Structure
4.1 Measures of dispersion
4.2 Definition of Dispersion
4.3 Objectives of Measuring Dispersion
4.4 Characteristics of a good measure of dispersion
4.5 Types of measures of dispersion
4.6 Range and coefficient of range
4.7 Quartile deviation and coefficient of quartile deviation
4.8 Average deviation and coefficient of average deviation
4.9 Standard deviation and coefficient of variation
4.10 Mathematical properties of standard deviation
4.11 Correcting incorrect value of Standard Deviation
4.12 Differences between Mean Deviation and Standard Deviation
4.13 Relation between Measures of Dispersion
4.14 Solved problems
Student I Student II
68 85
75 90
65 80
67 25
70 65
Both have got a total of 345 and an average of 69 each. The fact is that the second
student has failed in one paper. When the averages alone are considered, the two
students are equal. But first student has less variation than second student. Less
variation is a desirable characteristic.
It is clear from above that dispersion (also known as scatter, spread or variation)
measures the extent to which the items vary from some central value. Since measures
of dispersion give an average of the differences of various items from an average, they
are also called averages of the second order. An average is more meaningful when it is
examined in the light of dispersion.
On the other hand relative measures of dispersion are free from the units of
measurements of the observations. A measure of relative dispersion/variation is the
ratio of a measure of absolute variation to an average. It is sometimes called a
coefficient of variation because “coefficient” means a pure number.
The various absolute and relative measures of dispersion are listed below:
Range: This is the simplest possible measure of dispersion and is defined as the
difference between the largest and smallest values of the variable.
In symbols, Range = L – S.
Method 1:
L = Upper boundary of the highest class.
S = Lower boundary of the lowest class.
Method 2:
L = Mid-value of the highest class.
S = Mid-value of the lowest class.
Co-efficient of Range: The relative measure corresponding to range, called the
coefficient of range, is obtained by applying the following formula:
7, 9, 6, 8, 11, 10, 4
Here,
Largest value, L=11
Smallest value, S = 4.
Range = L – S = 11- 4 = 7
Demerits:
1. It is very much affected by the extreme items.
2. It is based on only two extreme observations.
3. It cannot be calculated from open-end class intervals.
4. It is not suitable for mathematical treatment.
5. It is a very rarely used measure.
Quartile Deviation (Q.D): Quartile Deviation is half of the difference between the first
and third quartiles. Hence, it is called Semi Inter Quartile Range.
In Symbols,
Among the quartiles Q1, Q2 and Q3, the range Q3 - Q1 is called inter quartile range and
, Semi inter quartile range.
Example: For the data given below, find the quartile deviation and coefficient of quartile
deviation.
Class 351 – 500 501 – 650 651 – 800 801 – 950 951 – 1100
Frequency 48 189 88 47 28
We know,
Therefore,
= 500.5 + 41.27
= 541.77
Again,
Location of Q3 = size of 3N/4 observation
= size of (3 x 400)/4 observation
= size of 300th observation
= 650.5 + 107.39
= 757.89
Thus,
Merits:
1. It is Simple to understand.
2. It is easy to calculate.
3. It is not affected by extreme values.
4. It can be calculated for data with open end classes also.
Demerits:
1. It is not based on all the items. It is based on two positional values Q1 and Q3
and ignores the extreme 50% of the items.
2. It is not amenable to further mathematical treatment.
3. It is affected by sampling fluctuations.
∑| |
(i) For ungrouped data, A.D(Me) =
∑ | |
(ii) For grouped data, A.D(Me) =
If the deviations are taken from mean, the formula for average deviation is written as
follows:
∑| ̅|
(i) For ungrouped data, A.D( ̅ ) =
∑ | ̅|
(ii) For grouped data, A.D( ̅ ) =
Example: Calculate average deviation from mean and its coefficient from the
following data.
Income
0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70
(in Tk.)
Number of
6 8 10 12 7 4 3
persons
Solution: Table: Calculation of average deviation from mean
Therefore,
∑ | ̅|
( ̅)
( ̅)
Merits:
1. It is simple to understand and easy to compute.
2. It is rigidly defined.
3. It is based on all items of the series.
4. It is not much affected by the fluctuations of sampling.
5. It is less affected by the extreme items.
6. It is flexible, because it can be calculated from any average.
7. It is better measure of comparison.
Demerits:
1. It is not a very accurate measure of dispersion.
2. It is not suitable for further mathematical calculation.
3. It is rarely used. It is not as popular as standard deviation.
4. Algebraic positive and negative signs are ignored. It is mathematically
unsound and illogical.
Variance =
(i) For ungrouped data: If the variable x assumes n values x1, x2,… xn , then the
standard deviation is given by
∑( ̅)
√
∑ ∑
√ ( )
We know,
∑
̅
∑( ̅)
√
(ii) For grouped data: The standard deviation for grouped data is obtained from
the following formula
∑ ( ̅)
√
Short-Cut method:
∑ ∑
√ ( )
Where,
x = the mid-point of individual class
f = the frequency of individual class
N = the sum of the frequencies or total frequencies.
A = any value in x
c = class interval
̅
Note: Coefficient of variation is used in such problems where we want to compare the
variability of two or more than two groups. That groups for which the Coefficient of
variation is greater is said to be more variable or conversely less consistent, less
uniform, less stable or less homogeneous.
On the other hand, the groups for which the Coefficient of variation is smaller is said to
be less variable or conversely more consistent, more uniform, more stable or more
homogeneous.
Example: Calculate mean and standard deviation from the following data.
We know,
∑
̅
∑ ∑
√ ( )
√ ( )
√
√
Example: In two factories A and B located in the same industrial area, the average
weekly wages (in Tk.) and the standard deviations are as follows:
( )
̅
( )
̅
Since C.V. of factory B is greater than C.V of factory A, factory B has greater
variability in individual wages.
Merits:
1. It is rigidly defined
2. Its value is always definite and based on all the observations and the actual
signs of deviations are used.
3. As it is based on arithmetic mean, it has all the merits of arithmetic mean.
4. It is the most important and widely used measure of dispersion.
5. It is possible for further algebraic treatment.
6. It is less affected by the fluctuations of sampling and hence stable.
7. It is the basis for measuring the coefficient of correlation and sampling.
Demerits:
1. It is not easy to understand.
2. It is difficult to calculate.
3. It gives more weight to extreme values because the values are squared up.
4. As it is an absolute measure of variability, it cannot be used for the purpose of
comparison.
where, d1 = | ̅ ̅ | and d2 = | ̅ ̅ |
2. The standard deviation of the first n natural numbers can be found by the
following formula:
√ ( )
3. The sum of the squared deviations of the individual observations from the
arithmetic mean is always minimum. In other words, the sum of the squared
deviations taken from any value other than the arithmetic mean will be higher.
Again,
∑
√ ( ̅)
∑
( ̅)
∑
( )
√ ( )
√
√
Range = L – S.
where, L = Largest value.
S = Smallest value.
In continuous series,
L = Upper boundary of the highest class.
S = Lower boundary of the lowest class.
Therefore, Range = L – S = 36 – 4 = 32
We know,
Class 4–8 8 – 12 12 – 16 16 – 20 20 – 24 24 – 28 28 – 32 32 – 36 36 – 40
Frequency 6 10 18 30 14 12 10 6 2
Range = L – S = 40 – 4 = 36
(ii) Table for calculation:
Class Frequency, f Cumulative frequency, cf
4–8 6 6
8 – 12 10 16
12 – 16 18 34
16 – 20 30 64
20 – 24 14 78
24 – 28 12 90
28 – 32 10 100
32 – 36 6 106
36 – 40 2 108
Total N = 108
We know,
1st quartile,
3rd quartile,
Therefore,
∑ | |
• For grouped data, A.D(Me) =
∑ | ̅|
• For grouped data, A.D( ̅ ) =
Income
0 – 10 10 – 20 20 – 30 30 – 40 40 – 50 50 – 60 60 – 70
(in Tk.)
Number of
6 8 10 12 7 4 3
persons
0 – 10 6 5 –3 –18 26 156
10 – 20 8 15 –2 –16 16 128
20 – 30 10 25 –1 –10 6 60
30 – 40 12 35 = A 0 0 4 48
40 – 50 7 45 1 7 14 98
50 – 60 4 55 2 8 24 96
60 – 70 3 65 3 9 34 102
∑ = ∑ | ̅|
N = 50
–20 =688
Arithmetic mean
∑
̅
Therefore,
∑ | ̅|
( ̅)
( ̅)
∑
̅
∑| ̅|
̅
Rule # 3: Determination of Standard Deviation (S.D) and Coefficient of Variation (C.V)
∑( ̅) ∑
• For ungrouped data: S.D, √ √ ( ̅)
Short-Cut method, √∑ (
∑
)
where, A = the assumed mean or any value in x
d=x–A
∑ ( ̅)
• For grouped data: S.D, √
Short-Cut method, √∑ (
∑
)
Where,
x = the mid-point of individual class
f = the frequency of individual class
N = the sum of the frequencies or total frequencies.
A = any value in x
c = class interval
Problem-5: The table below gives the marks obtained by 10 students in Business
Statistics. Calculate standard deviation.
Student No.: 1 2 3 4 5 6 7 8 9 10
Marks: 43 48 65 57 31 60 37 48 78 59
∑ ∑
√ ( )
√ ( )
√
√
∑ ∑
√ ( )
√ ( )
√
√
We know,
∑
̅
∑ ∑
√ ( )
√ ( )
√
√
( )
Problem-8: A purchasing agent obtained samples of 60 watt bulbs from two companies.
He had the samples tested in his own laboratory for length of life with the following
results:
Samples from
Length of life (in hours)
Company A Company B
1700 – 1900 10 3
1900 – 2100 16 40
2100 – 2300 20 12
2300 – 2500 8 3
2500 – 2700 6 2
(i) Which company’s bulbs do you think better in terms of average life?
(ii) If the prices of both types are same, which company’s bulbs would you buy
and why?
∑
̅
∑ ∑
√ ( )
√ ( )
√
√
∑
̅
∑ ∑
√ ( )
√ ( )
√
√
(i) Since average length of life is greater in case of company A, hence bulbs of
company A are better.
(ii) Coefficient of variation is less for company B. Hence if prices are same, we
will prefer to buy company B’s bulbs because their burning hours are more
uniform.
Problem-9: In two factories A and B located in the same industrial area, the average
weekly wages (in Tk.) and the standard deviations are as follows:
Factory Average Standard Deviation Number of workers
A 34.5 5 476
B 28.5 4.5 524
and
( )
̅
Since C.V. of factory B is greater than C.V of factory A, factory B has greater
variability in individual wages.
Problem-10: Prices of a particular commodity in five years in two cities are given below:
Price in city A 20 22 19 23 16
Price in city B 10 20 18 12 15
For city A:
∑
̅
∑( ̅)
√
( )
̅
For city B:
∑
̅
∑( ̅)
√
( )
̅
City A had more stable prices than City B, because the coefficient of variation is less in
City A.
( ) ( )
Combined standard, √ √
where, d1 = | ̅ ̅ | => ( ̅ ̅ )
d2 = | ̅ ̅ | => ( ̅ ̅ )
̅ ̅
̅
( ) ( )
√
( ̅ ̅ ) ( ) ( )
( ̅ ̅ ) ( )
Therefore,
( ) ( )
√
( ) ( )
√
Problem-12: Mean and standard deviation of 100 items are found to be 30 and 10. If
two observations are wrongly taken as 31 and 71 instead of 13 and 17 at the time of
calculation, find the correct mean and correct standard deviation.
Corrected mean,
∑
̅
Again,
∑
√ ( ̅)
∑
( ̅)
∑
( )
√ ( )
√
√
Exercise-4
Theoretical
Q.1 What do you mean by measures of dispersion? Write down the characteristics of
a good measure of dispersion.
Q.2 Discuss different types of measures of dispersion.
Q.3 Define range and coefficient of range. Write down the merits and demerits of
range.
Q.4 Define quartile deviation and coefficient of quartile deviation. Write down the
merits and demerits of quartile deviation.
Q.5 Define average deviation and coefficient of average deviation. Write down the
merits and demerits of average deviation.
Q.6 Define standard deviation and coefficient of variation. Write down the merits and
demerits of standard deviation.
Q.7 Write down the mathematical properties of standard deviation.
Q.8 Write down the differences between Mean Deviation and Standard Deviation.
Q.9 Why is the standard derivation the most widely used measure of dispersion?
State the limitations of Range. [NU BBA 2015]
Q.10 Distinguish between Absolute and relative measure of dispersion.
[NU BBA 2014]
Q.11 What is the standard deviation? Why standard deviation is most widely used as a
measure of variations? [NU BBA 2013]
Q.12 Explain how measure of central tendency & measure of variation
complement each other in describing mass data.
Q.13 What are the requisites of a good measure of dispersion? Why is the
standard deviation usually chosen as a measure of variation? Explain how
measure of central tendency and measure of variation complement each other in
describing mass of data.
Mathematical
Problem-1: Compute inter quartile range, semi-inter quartile range/quartile deviation and
its coefficient from the following data:
Marks 0 – 20 20 – 40 40 – 60 60 – 80 80 – 100 100 – 120
No. of
12 20 10 5 7 8
Students
[54.5; 27.25; .5369]
Problem-2: Compute quartile deviation and its coefficient from the following data:
Mid-value 14 16 18 20 22 24 26
Frequency 12 14 50 60 32 20 10
Problem-3: Calculate mean deviation and its coefficient from the following data:
Class 120 – 140 140 – 160 160 – 180 180 – 200 200 – 220
Frequency 5 8 10 6 4
[167.575; 19.90; 0.118]
Problem-4: Calculate average deviation from median and its coefficient from the
following data.
Sales
10 – 20 20 – 30 30 – 40 40 – 50 50 – 60
(in thousand Tk.)
Number of Shops 3 6 11 3 2
[33.18; 8.1096; 0.2444]
Problem-5: You are given the data pertaining to kilowatt hours of electricity consumed
by 100 persons in Dhaka.
Calculate (i) the mean, (ii) the standard deviation and (iii) the range within which middle
50% of the consumers fall.
[25.9; 10.96; 16.4]
Problem-9: Calculate average deviation from mean and its coefficient from the following
data.
Profits
10 – 20 20 – 30 30 – 40 40 – 50 50 – 60
(in thousand Tk.)
Number of companies 3 6 11 3 2
[33; 8.16; 0.247]
Problem-12: The mean and standard deviation of a set of 100 observations were
worked out as 40 and 5 respectively by a computer which by mistake took the value 50
in place of 40 for one of the observation. Find the correct mean and variance.[39.9;
23.99]
Problem-13: The mean and standard deviation of 20 items are found to be 10 and 2
respectively. At the time of checking it was found that one item 8 was incorrect.
Calculate the mean and standard deviation if (i) the wrong item is omitted, (ii) it is
replaced by 12. [10.1;2.26;10.2;2]
Problem-14: The profits (inTk. Lakhs) earned by 100 companies during 2014-2015 are
shown below:
Profits(Tk.lakhs) No. of companies
20-30 4
30-40 8
40-50 18
50-60 30
60-70 15
70-80 10
80-90 8
90-100 7
Problem-16: For a group of 50 male workers, the mean and standard deviation of their
daily wages are 80 tk. And 10 tk. Respectively. For a group of 40 female workers, these
are 70 tk. And 9 tk. Respectively. Find combined mean and standard deviation of daily
wages.[75.56; 10.78]
What is the average life of each refrigerator? Which model has greater
uniformity?[5.12;6.16;54.9%;36.2%]
Problem-18: For two firms A and B belonging to the same industry, the following details
are available.
Firm A Firm B
No of Employees 100 200
Average wage monthly Tk.2400 Tk. 1800
S.D. Tk.60 Tk.80
Problem-20: In a survey, data on daily wages paid to workers of two factories A and B
are as follows:
No. of workers
Daily wages(in Takas)
Factory- A Factory- B
20-30 15 25
30-40 30 40
40-50 44 60
50-60 60 35
60-70 30 20
70-80 14 15
80-90 7 5
Find out:
a) Which factory pays higher average wages? By how much?
b) Wages of which have greater variability.
c) Average monthly wage and standard deviation of all the workers in both the
factories
Problem-21: The price of a share in Dhaka & Chittagong stock exchange during the
last 10 months are recorded below:
Problem-23: From the following data show that M.D > Range
( -5 13 12 15 0 16 )
Problem-24: An organization has two units A and B. an analysis 0f weekly wages paid
to workers gave the following results:-
Unit A Unit B
Number of wages earners 600 500
Standard deviation Tk. 100 Tk. 81
Average weekly wages Tk.1750 Tk.1860
(a) Which units pay larger amount as weekly wages?
(b) In which unit there is greater variability in wage distribution?
[5.71;4.35]
(c) Find the combined average wage & combined standard deviation
of wages of the whole organization. [1800;106.94]
Problem-25: An analysis of production rejects resulted in the following figures:
No. of rejects per
21 – 25 26 – 30 31 – 35 36 – 40 41 – 45 46 – 50 51 – 55
operator
No. of operator 5 15 28 42 15 12 3
Calculate mean and standard deviation
Problem-26: In a survey, data on daily wages paid to workers of a factory are as
follows:
Daily Wages (Rs.) No. of workers
20-30 15
30-40 30
40-50 44
50-60 60
60-70 30
70-80 14
80-90 7
Problem-27: Given below are the monthly household incomes (in TK.) of 10 families:
10648, 17416, 6517, 13555, 14821, 9226, 15936, 11800, 18527, 12222.
Compute the range and quartile deviation.
Problem-28: The following distribution shows the turnover of the branches of a group of
multiple-shops on March 2009
Turnover (in taka lakhs) No of Shops
5 and under 10 8
10 and under 15 18
15 and under 20 42
20 and under 25 62
25 and under 30 30
30 and under 35 10
35 and over 4
Problem-29: For a group of 50 male workers, the mean and standard deviation of their
daily wages are 54.1 tk. And 8 tk. Respectively. For a group of 100 female workers,
these are 50.3 tk. And 7 tk. Respectively. Find combined mean and standard deviation
of daily wages. [51.57;7.56]
Problem-30: The mean and standard derivation of a set of 100 observations were
found out as 50 and 10 respectively which by mistake took the values 60 in place of 50
for one of the observations. Find the correct mean and variance. (NU BBA 2015)
[49.9; 89.99]
Problem-31: Mean and standard deviation of 100 items are found to be 40 and 10. If at
the time of calculation two items are wrongly taken by 30 and 48 instead of 35 and 45.
Find the correct mean and standard deviation. (NUBBA 2014) [40.02;9.94]
Problem-32: The following are the scores of batsman A and B for 10 innings:-
Innings 1 2 3 4 5 6 7 8 9 10
A 40 58 45 60 50 45 70 60 45 75
B 80 40 25 90 10 65 12 70 25 90
Find the average and standard deviation of the weekly salary of all 65 employees in the
company. Which location shows greater variability? (NU BBA 2013) [320;48.69]
Problem-34: The prices of a tea company share in Dhaka & Chittagong Stoke
Exchange during the last 10 months are recorded below:-
Month Dhaka Chittagong Month Dhaka Chittagong
January 105 108 June 127 125
February 120 117 July 109 125
March 115 120 August 110 120
April 118 130 September 104 110
May 130 100 October 112 135
(i) Determine the mean share price for each of the markets.
(ii) Determine the standard deviation of the price of share in each market.
(iii) In which market the share price is more stable?
(NU BBA 2012) [115;119;7.24;8.48]
Problem-35: For two factories A and b belonging to the same industry, the following
details are available:-
Factory A Factory B
No. of employees 100 80
Average wage monthly Tk. 4600 Tk. 4900
S.D Tk. 500 Tk. 400
(i) Which factory A or B pays larger amount as monthly wages?
(ii) Which factory shows greater variability in the distribution of wages?
(iii) What is the mean and standard deviation of all the workers in two factory taken
together? (NU BBA 2011) [460000;392000;4733.33;484.15]
(i) Which company’s lamp do you think are better in terms of average life?
(ii) If prices of both types are the same which company’s lamps would you buy
and why?(NU BBA 2015)[1109.67; 1050; 16.40%; 11.85%]