Professional Documents
Culture Documents
All the three series A, B, and C, have the same size (n=9) and the same
mean, mode and median, i.e., 15. Thus, if we are given that the
mean, mode or median of a series of 9 observations is 15, we cannot
determine if we are talking of the series A, B, or C.
In fact any series of 9 items which total 135 will give mean 15.
Thus, we may have a large number of series with entirely different
structures and compositions but having the same mean. The measures
of central tendencies (i.e. mean, mode or median) indicate the
general magnitude of the data and locate only the center of a
distribution of measures. They do not establish the degree of
variability or the spread out or scatter of the individual items and their
deviation from (or the difference with) the mean.
Definition of dispersion
Example: Find the range and the coefficient of range of the following prices of
shares of ABC company ltd.
Day: Monday Tuesday Wednesday Thursday Friday
Saturday
Prices KShs: 200 210 208 160 200 250
Solution:
Range = Xm – Xo where Xm = 250, Xo = 160.
= 250 – 160 = Ksh90.
Coefficient of range = S = (Xm – Xo )/(Xm + Xo)= 250 – 160 = 0.22
250 + 160
1.The Range…….Example
Weight (Kg) 60 - 62 63 - 65 66 - 68 69 - 71 72 - 74
Number of 5 18 42 27 8
Students
1.The Range……. Solution
In this case, the upper true limit of the highest class 70-79
is Xm = 79.5 and the lower true limit of the lowest class
20-29 is Xo = 19.5
1. Range does not change even the least even if all other, in between, terms
and variables are changed.
2. It is not based on all the observations of the series. It only takes the highest
and the lowest scores in to account.
3. For open-end intervals, range is indeterminate because lower and upper
limits of first and last interval are not given.
4. It is affected very greatly by fluctuations in sampling. Its value is never
stable. In a class where normally the height of students ranges from 150 cm to
180 cm, if a dwarf, whose height is 90 cm is admitted, the range would shoot
up from 90 cm to 180 cm. .
5. Range does not present the series and dispersion truly. Asymmetrical and
symmetrical distribution can have the same range but not the same dispersion.
It is of limited accuracy and should be used with caution. .
Uses of Range: In spite of these limitations, the
range has its applications in a number of fields;
Solution
Arrange data in ascending order: 8, 10, 12, 14, 16, 18, 20.
2.Quartile Deviation and its
Coefficient- Ungrouped Data…..
Here, n = 7, the first and third quartiles are:
2.Quartile Deviation and its
Coefficient- Discrete Series
2.Quartile Deviation and its
Coefficient- Continuous Series
2.Quartile Deviation and its
Coefficient- Continuous Series…….
Activity:
It is because both the dispersions do not take all individual scores into
account. We can overcome some of the serious shortcomings of range
and quartile deviation by using another dispersion called Average
deviation or Mean deviation.
“Average deviation is the arithmetic mean of all the deviations of
different scores from the mean value of the scores without the
regard for sign of the deviation.”
Average deviation is the arithmetic mean of the deviations of a
series computed from some measure of central tendency (mean,
median or mode), all the deviations being considered positive.
In other words the average of the deviations of all the values from
the arithmetic mean is known as mean deviation or average
deviation.
3. The Average Deviation (A.D.)…..
Here we have
Computation of AD from grouped data:
discrete Series
Calculate the Mean Deviation for the following data:
Solution:
To calculate MD of the given distribution, we construct
the following table:
Solution:
Solution:
Computation of AD from grouped data:
Continuous Series
Find the mean deviation for the following frequency
distribution:
Marks 0 – 10 10 – 20 20 – 30 30 – 40 40- 50
No. of 6 28 51 11 4
Students
Solution
Solution
Merits of A.D.
The range just takes in to account only the highest score and
the lowest score.
The quartile deviation takes into account only the middle
50% of scores and in case of average deviation we ignore the
signs.
Therefore in order to overcome all these difficulties we use
another measure of dispersion called Standard Deviation. It is
commonly used in experimental research as it is the most stable
index of variability.
Symbolically it is wrote as σ (Greek small letter sigma).
4. The Standard Deviation (SD)….
Standard deviation is the square root of the average value of the squared
deviations of the scores from their arithmetical mean. The SD is computed
by summing the squared deviation of each measure from the mean,
divided by the number of cases and extracting the square root.
To be clearer, we should note here that in computing the SD we square all
the deviations separately, find their sum, divide the sum by the total
number of scores and then find the square root of the mean of the
squared deviation. That is why it is also called the ‘root mean square
deviation’.
The square of standard deviation is known as Variance (σ2). It is referred
to as the mean square deviation. It is also called as the second moment
dispersion.
The Standard Deviation (SD)……..
Steps in the calculation of SD from
Ungrouped Data
1. First of all, calculate the mean of the data.
2. Find out deviation from the mean for each variable of the series.
3. Square each deviation from the mean computed in step 2
4. Find the sum of the squared deviations computed in step 3
4. Divide the sum of the squared deviations by n, the no. of
observations given in the series. – At this point, You have
computed the VARIANCE OF THE SERIES.
5. Take the positive square root of the quotient you have computed
in step 4 and the result would be the standard deviation of the
series.
Computation of SD from Ungrouped Data
Here also, the first step is to find the mean M, for which we have to take the
mid-points of the c.i’s denoted by X’ and find the product fX.’. Mean is given
by ∑fx’/N.
The second step is to find the deviations of the mid-points of class intervals X’
from the mean i.e. X’- M denoted by d.
The third step is to square the deviations and find the product of the squared
deviations and the corresponding frequency.
Solution
When two sets of scores have been combined into a single lot, it is
possible to calculate the σ of the total distribution from the σ’ s of
the two component distributions.
The formula is:
Combined Standard Deviation (σcomb)
where σ1 , = SD of distribution 1
σ2 = SD of distribution 2
d1 = (M1 – Mcomb)
d2 = (M2 – Mcomb)
N1 = No. of cases in distribution 1.
N2 = No. of cases in distribution 2.
Combined Standard Deviation (σcomb)
Figure 1. Sketches showing general position of mean, median, and mode in a population.
A Skewed Left b Skewed Right
Long tail point’s left Long tail points right
Types of Skewness
There are certain tests to know whether skewness does or does not exist
in a frequency distribution.
They are:
1. In a skewed distribution, values of mean, median and mode would not
coincide. The values of mean and mode are pulled away and the value of
median will be at the centre. In this distribution, mean-Mode = 2/3
(Median - Mode).
2. Quartiles will not be equidistant from median.
3. When the asymmetrical distribution is drawn on the graph paper, it
will not give a bell shaped curve.
4. Sum of the positive deviations from the median is not equal to sum of
negative deviations.
5. Frequencies are not equal at points of equal deviations from the
mode
Nature of Skewness
Bowley’s Skewness =
(Q1+Q3–2Q2) / (Q3-Q1).
Where Q1 is the lower quartile, Q3 is the
upper quartile, and Q2 is the median.
6. Karl Pearson’s Measure of Skewness
90-100 5 5
100 -110 8 13
110 -120 10 23
120 -130 15 38
130 -140 9 47
140 -150 7 54
150 -160 6 60