Professional Documents
Culture Documents
MEASURES OF VARIBILITY
A measure of variability is a summary statistic that represents the amount of
dispersion in a dataset. The measures of variability define how far away the data
points tend to fall from the center. Statisticians use summary measures to describe the
amount of variability or spread in a set of data. The most common measures of
variability are the range, variance, and standard deviation.
RANGE
Range is represented by R
The range is the difference between the highest value (HV) minus the
lowest value (LV) in the data set.
In symbol. R = HV – LV
Example:
The retail prices of rice per kilo of the five stores are listed below.
P 42, P38, P40, P45, P35
Find the range.
Solution:
From the list of prices, the highest value, HV = 45
and the lowest value, LV = 35.
R= HV – LV
R = 45 – 35
R = 10
VARIANCE
σ2 =
N
where σ2 is the population variance, μ is the population mean, Xi is the ith element from
the population, and N is the number of elements in the population.
Observations from a simple random sample can be used to estimate the variance
of a population. For this purpose, sample variance is defined by slightly different formula,
and uses a slightly different notation:
S2 =
N−1
Example:
The numbers of calls per day received by the secretary of an
office were recorded. as follows.
15, 8, 12, 25, 6, 18
Find the variance and standard deviation.
Solution:
The steps involved in solving the are shown in the table.
S2 =
N−1
242
S2 =
6−1
S2 = 48.4
SD = 48.4 = 6.96
To make the mean and standard deviation more meaningful, we are going
to use them for the computation of the coefficient of variation. The coefficient of
variation (CV) is a measure of relative variability. It is the ratio of the standard
deviation to the mean (average). The formula is
standard deviation
CV = 100%
mean
The coefficient of variation (CV) is particularly important to measure the
degree of homogeneity of data. CV is used to analyze series of values and is a
measurement of precision. The smaller the variation between a data set the
greater the precision. Basically CV <10 is very good , 10% - 20% is good
6.96
CV =
14
100%
CV = 49.71%
Since CV = 49.71% greater than 20%, the data is considered
heterogeneous. It means that scores in a data set are widely scattered or
spread.
Example:
The retail prices of rice per kilo of the five stores are listed below.
P 42, P38, P40, P45, P35
The mean, = 40 and SD=3.81.
Find the coefficient of variation and discuss the homogeneity of
the data.
Solution:
3.81
CV =
40
100%
CV = 9.53%
Since CV = 9.53% less than 20%, the data is considered homogeneous. It
means that scores in a data set are closer to the mean.
Computation of variance and standard deviation using e cel.
Step 1. Open the e cel and encode the scores or numerical observations.
SD = 3.81
Variance = 14.5