STATISTICS
The word statistics refers to quantitative information or to
a method of dealing with quantitative information.
Objectives of Averaging
To get a single value that describes the characteristics of the entire
data.
Useful to extract and summarize the characteristics of the entire data set.
To facilitate comparison.
Since average represents the entire data set, it is possible to make
comparison between two or more data sets. E.g. performance of a
salesperson based on average sales over two months or two years.
It becomes the base for computing other measures such as
dispersion, skewness, kurtosis etc.
Characteristics of a good average
It should be easy to understand.
It should be simple to compute.
It should be based on all the observations.
It should be rigidly defined.
It should have sampling stability.
It should be capable of further algebraic treatment.
It should not be unduly affected by the presence of
extreme values.
Measures of Central Tendency
Mathematical Averages
Arithmetic mean (simple or weighted)
Geometric mean
Harmonic mean
Averages of Position
Median
Quartiles
Deciles
Percentiles
Mode
ARITHMETIC MEAN
The arithmetic mean (AM) of a set of observations is their sum,
divided by the number of observations.
It is generally denoted by x̄ or AM. The population mean is denoted by μ.
Bilal Textiles (Pvt.) Ltd
BTL was registered as a private limited company with an authorized capital of Rs. 100000 million in 1975. Operations officially started in
September 1976. Its products and services are categorized as yarn manufacturing and weaving, garments, etc. It now exports to countries in
Europe, America, Africa, Asia and the Middle East. It won the “Best Textile Export” award in 2003 for “sustained growth” over 2000-03. Let's
take the sales data of this company for statistical analysis.
Mean= 8631.138889
Computing the Mean from a Frequency Distribution
Consider the following distribution:
X f
30 2
29 3
28 5
27 3
26 2
How would you compute the mean?
$$\bar{X} = \frac{\sum_{i=1}^{N} f_i X_i}{\sum_{i=1}^{N} f_i}$$
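As a minimal sketch, the mean of the frequency distribution tabulated earlier (X = 30, 29, 28, 27, 26 with f = 2, 3, 5, 3, 2) can be computed by weighting each value by its frequency. The function name `frequency_mean` is a hypothetical helper, not something from the slides.

```python
# Values (X) and their frequencies (f) from the distribution on the slide.
values = [30, 29, 28, 27, 26]
freqs = [2, 3, 5, 3, 2]


def frequency_mean(xs, fs):
    """Return sum(f_i * x_i) / sum(f_i) for paired values and frequencies."""
    weighted_total = sum(f * x for x, f in zip(xs, fs))
    return weighted_total / sum(fs)


mean = frequency_mean(values, freqs)
print(mean)  # 420 / 15 = 28.0
```

The numerator here is 420 and the total frequency is 15, so the mean is 28.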
Merits and Demerits of Arithmetic Mean
Merits
Calculation of AM is simple
Calculation is based on all observations and hence it
can be regarded as representative of the given data
It is capable of being treated mathematically and
hence, is widely used in statistical analysis
It represents center of gravity of the distribution
because it balances the magnitudes of observations
which are greater and less than it
It gives good basis of comparison of two or more
distributions
Demerits
It can neither be determined by inspection nor by
graphical location
Arithmetic mean cannot be computed for qualitative
data
It is affected too much by extreme observations and
hence does not adequately represent data consisting of
some extreme observations
AM cannot be computed when class intervals have
open ends
Simple arithmetic mean gives greater importance to
larger values and lesser importance to smaller values
Weighted Arithmetic Mean
Example 1: An examination was held to decide the award of a scholarship.
The weights of various subjects are different. The marks obtained by 3 students are
given below:
Subject               Weight   Student A   Student B   Student C
Mathematics           4        60          57          62
Physics               3        62          61          67
Chemistry             2        55          53          60
English               1        67          77          49
Total marks                    244         248         238
Total weighted marks           603         594         618
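The weighted mean for each student can be sketched as below, using the weights and marks from Example 1. The dictionary layout and the helper name `weighted_mean` are illustrative choices, not part of the original slides.

```python
# Subject weights and marks taken from Example 1.
weights = {"Mathematics": 4, "Physics": 3, "Chemistry": 2, "English": 1}
marks = {
    "A": {"Mathematics": 60, "Physics": 62, "Chemistry": 55, "English": 67},
    "B": {"Mathematics": 57, "Physics": 61, "Chemistry": 53, "English": 77},
    "C": {"Mathematics": 62, "Physics": 67, "Chemistry": 60, "English": 49},
}


def weighted_mean(scores, weights):
    """Return sum(w * x) / sum(w) over all subjects."""
    weighted_total = sum(weights[s] * scores[s] for s in weights)
    return weighted_total / sum(weights.values())


def simple_mean(scores):
    """Return the unweighted arithmetic mean of the marks."""
    return sum(scores.values()) / len(scores)


for student, scores in marks.items():
    print(student, simple_mean(scores), weighted_mean(scores, weights))
```

With these figures the simple means are 61, 62 and 59.5 for A, B and C, while the weighted means are 60.3, 59.4 and 61.8. Student B leads on the simple mean, but student C leads once the subject weights are applied.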
GEOMETRIC MEAN
Geometric mean (GM) is the nth root of the product of n items of a series.
In many business and economic problems we deal
with quantities that change over a period of time. In
such cases if we aim to know the average rate of
change, we consider geometric mean rather than
arithmetic mean
Example 1: If the population of a country has been
growing at rates of 3%, 2.5%, 2.8%, 2% and 1.9%
respectively over the last five years, what has been the
average growth rate for the period?
In this case, we need to calculate the geometric mean
rather than the arithmetic mean
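One way to work this example is sketched below. It assumes the usual compounding interpretation, which the slide does not spell out: each yearly rate is converted to a growth factor (1 + rate), the geometric mean of the factors is taken, and 1 is subtracted at the end.

```python
import math

# Yearly population growth rates from the example, in percent.
rates_percent = [3.0, 2.5, 2.8, 2.0, 1.9]

# Assumption: growth compounds, so average the factors 1.03, 1.025, ...
factors = [1 + r / 100 for r in rates_percent]

# Geometric mean = nth root of the product of the n factors.
gm_factor = math.prod(factors) ** (1 / len(factors))

avg_growth_percent = (gm_factor - 1) * 100
print(round(avg_growth_percent, 2))
```

Under this assumption the average growth rate comes out at roughly 2.44 percent per year, marginally below the arithmetic mean of the five rates.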
Example 2: The following table gives the annual rate
of growth of sales of a company in the last five years.
Calculate the average growth rate over these five
years.
Year    Growth rate    Sales at the end of the year
Solution to the example
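The average annual growth rate = GM
$$GM = (X_1 \times X_2 \times X_3 \times X_4 \times X_5)^{1/5} = 5.9 \text{ percent}$$
Simplified solution:
$$\log(GM) = \frac{1}{n}\sum_{i=1}^{n} \log X_i$$
$$GM = \operatorname{antilog}\left\{\frac{1}{n}\sum_{i=1}^{n} \log X_i\right\}$$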
Uses, Merits and Demerits of GM
Uses
GM is highly useful in averaging ratios, percentages,
and rates of increase between two periods
GM is important for construction of index numbers
Merits
The value of GM is not much affected by extreme
observations and is computed by taking all observations
Useful in studying economic and social data
Demerits
GM cannot be computed if any item in the series is
negative or zero
Difficult to calculate
HARMONIC MEAN
Based on the reciprocal of the numbers averaged.
Defined as the reciprocal of the arithmetic mean of the reciprocals of the
individual observations.
It can be written as
$$HM = \frac{n}{\sum_{i=1}^{n} \frac{1}{x_i}}$$
Applications of Harmonic Mean
Useful for computing average rates
e.g. Average rate of increase of profits or average
speed at which any journey has been performed.
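The average-speed case can be illustrated with a small sketch. The speeds are hypothetical, and the example assumes each leg of the journey covers the same distance, which is the situation in which the harmonic mean gives the true average speed.

```python
def harmonic_mean(values):
    """Return n divided by the sum of reciprocals of the values."""
    return len(values) / sum(1 / v for v in values)


# Hypothetical journey: out at 40 km/h, back at 60 km/h over the same distance.
speeds = [40.0, 60.0]

avg_speed = harmonic_mean(speeds)
print(round(avg_speed, 2))  # about 48 km/h, below the arithmetic mean of 50
```

The arithmetic mean of 50 km/h overstates the result because more time is spent travelling at the slower speed.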
Bilal Textiles (Pvt.) Ltd
Sales turnover for year 1989-2006
Year    Sales in million Rs. (x)    1/x
$$\text{Harmonic Mean} = \frac{n}{\sum \frac{1}{x}} = \frac{18}{0.0061473} = 2928.1148$$
Merits and Demerits of Harmonic Mean
Merits
It is based on all observations of the series
It is suitable in case of series having wide dispersion
Demerits
Difficult to calculate
It is not often used for analyzing business problems
Relationship between AM, GM and HM
If all values are equal then AM = GM = HM
If the values are positive and not all equal, then AM > GM > HM
If the observations take the values $a, ar, ar^2, ar^3, \dots, ar^n$,
then $(GM)^2 = AM \times HM$
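These relationships can be checked numerically. The sketch below uses a hypothetical geometric series with a = 2, r = 3 and n = 4, and the mean functions from Python's standard `statistics` module.

```python
import math
from statistics import fmean, geometric_mean, harmonic_mean

# Hypothetical geometric series a, ar, ..., ar^n with a = 2, r = 3, n = 4.
a, r, n = 2, 3, 4
values = [a * r**k for k in range(n + 1)]  # [2, 6, 18, 54, 162]

am = fmean(values)
gm = geometric_mean(values)
hm = harmonic_mean(values)

# For positive, unequal values the three means are strictly ordered.
print(am > gm > hm)

# For a geometric series, GM squared equals AM times HM (up to rounding).
print(math.isclose(gm**2, am * hm))
```

Floating-point rounding means the second check is done with `math.isclose` rather than exact equality.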
Median
The median may be defined as the middle or central value of
the variable when values are arranged in the order of
magnitude.
In other words, median is defined as that value of the variable
that divides the group into two equal parts, one part comprising
all values greater and the other all values lesser than the
median.
To measure the qualitative characteristics of data, other
measures of central tendency, namely median and mode are
used.
Positional averages, as the name indicates, mainly focus on the
position of the value of an observation in the data set.
Averages of Position - Median
Median – Median may be defined as the middle
value in the data set when the elements are arranged
in sequential order (either ascending or descending)
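Median for ungrouped data:
If the number of observations (n) is odd, then
Median = size or value of the $\left(\frac{n+1}{2}\right)$th observation
If the number of observations is even, then
Median = mean of the $\left(\frac{n}{2}\right)$th and $\left(\frac{n}{2}+1\right)$th observations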
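A minimal sketch of the ungrouped-data rule: sort the values, then take the ((n+1)/2)th observation when n is odd, or the mean of the two middle observations when n is even. The sample lists are hypothetical.

```python
def median(values):
    """Return the median of an ungrouped list of numbers."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd n: the ((n + 1) / 2)th observation (index mid, zero-based).
        return ordered[mid]
    # Even n: mean of the (n / 2)th and (n / 2 + 1)th observations.
    return (ordered[mid - 1] + ordered[mid]) / 2


print(median([7, 3, 9, 5, 1]))  # odd n: sorted is 1, 3, 5, 7, 9
print(median([7, 3, 9, 5]))     # even n: sorted is 3, 5, 7, 9
```

The first call returns the middle value 5, and the second returns 6.0, the mean of 5 and 7.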
Bilal Textiles (Pvt.) Ltd
SALES TURNOVER FROM 1989-2006
Median:
Total observations (n) = 18
Median = (4525.8 + 5170.8)/2 = 4848.3
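Median for grouped data
$$\text{Median} = l + \frac{\frac{N}{2} - cf}{f} \times h$$
where l is the lower limit of the median class, N is the total frequency, cf is the cumulative frequency of the class before the median class, f is the frequency of the median class and h is the class width.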
Exercise 2: A survey was conducted to determine the
age in years of 120 automobiles. The result of such a
survey is given in the table below. What is the median
age of the autos?
Age of auto (years)   0–4   4–8   8–12   12–16   16–20
No. of autos          13    29    48     22      8
Solution:
Age of auto   No. of autos   Cumulative frequency
0–4           13             13
4–8           29             42
8–12          48             90    (median class)
12–16         22             112
16–20         8              120
Total         120
Median = 8 + ((60 − 42)/48) × 4 = 8 + 1.5 = 9.5
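The grouped-data calculation for Exercise 2 can be sketched as follows. The function name `grouped_median` is hypothetical, and the code assumes the standard interpolation within the median class: lower limit plus (N/2 minus the cumulative frequency before the class), divided by the class frequency, times the class width.

```python
# Class intervals (lower, upper) and frequencies from Exercise 2.
classes = [(0, 4), (4, 8), (8, 12), (12, 16), (16, 20)]
freqs = [13, 29, 48, 22, 8]


def grouped_median(classes, freqs):
    """Interpolate the median inside the class that contains the N/2-th item."""
    half = sum(freqs) / 2
    cumulative = 0  # cumulative frequency before the current class
    for (lower, upper), f in zip(classes, freqs):
        if cumulative + f >= half:
            # This is the median class: interpolate within it.
            return lower + (half - cumulative) / f * (upper - lower)
        cumulative += f
    raise ValueError("empty distribution")


print(grouped_median(classes, freqs))  # 8 + (60 - 42) / 48 * 4 = 9.5
```

Here N/2 = 60 falls in the 8–12 class, whose preceding cumulative frequency is 42, giving a median age of 9.5 years.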
Partition Values – Quartiles, Deciles, Percentiles
Quartiles: The values of observations in a data set,
when arranged in an ordered sequence, can be
divided into four equal parts, or quarters, using three
quartiles viz. Q1, Q2 and Q3. The first quartile Q1
divides the distribution in such a way that 25 percent
of the observations have a value less than Q1 and 75
percent of the values are more than Q1.
[Figure: ordered data divided into four equal parts by Q1, Q2 and Q3]
Deciles: The values of observations in a data set,
when arranged in an ordered sequence, can be divided
into ten equal parts, using nine deciles (D1, D2, …,
D9)
Exercise 1: The following is the distribution of weekly
wages of 600 workers in a factory.
Weekly wages (Rs.)   No. of workers   Weekly wages (Rs.)   No. of workers
Find the
525 –29
600 th
percentile
65 and 95th percentile
Find the median
Relationship Between Median, Quartiles, Deciles, Percentiles
Q1 = P25
Q2 = D5 = P50 = Median
Q3 = P75
Q4 = P100
D2 = P20
D9 = P90
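These identities can be checked on sample data with `statistics.quantiles` from Python's standard library. The data set (the integers 1 to 99) is hypothetical, and the function's default "exclusive" method is only one of several conventions for locating partition values, so other tools may give slightly different cut points on other data.

```python
from statistics import median, quantiles

# Hypothetical ordered data: the integers 1 to 99.
data = list(range(1, 100))

quartiles = quantiles(data, n=4)      # 3 cut points: Q1, Q2, Q3
deciles = quantiles(data, n=10)       # 9 cut points: D1 ... D9
percentiles = quantiles(data, n=100)  # 99 cut points: P1 ... P99

# List index k - 1 holds the k-th cut point.
print(quartiles[0] == percentiles[24])                             # Q1 = P25
print(quartiles[1] == deciles[4] == percentiles[49] == median(data))  # Q2 = D5 = P50
print(quartiles[2] == percentiles[74])                             # Q3 = P75
print(deciles[1] == percentiles[19])                               # D2 = P20
print(deciles[8] == percentiles[89])                               # D9 = P90
```

`quantiles` returns only the interior cut points, so Q4 = P100 (the largest observation) does not appear in its output.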
Advantages & Disadvantages of Median
Merits
Extreme values (outliers) do not affect the median as
strongly as they do the mean.
Useful when comparing sets of data.
It is unique - there is only one answer.
Demerits
Not as popular as the mean.
It tells us only one value, the one in the middle.
It is not well suited to algebraic treatment.
MODE
Averages of Position - Mode
Mode: Mode is that value of an observation which
occurs most frequently in the data set, i.e. the point or
class mark with the highest frequency.
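As a small illustration with hypothetical data, `statistics.multimode` from Python's standard library returns every value tied for the highest frequency, which makes it easy to see when a data set has one mode, several modes, or no value that repeats at all.

```python
from statistics import multimode

# One value occurs most often: a single mode.
single = multimode([2, 3, 3, 5, 7, 3, 5])

# Two values tie for the highest frequency: the data set is bimodal.
double = multimode([1, 1, 2, 2, 3])

# No value repeats: every value ties, so no single mode stands out.
no_repeat = multimode([4, 5, 6])

print(single, double, no_repeat)
```

The results are returned in the order the values first appear in the data.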
Bilal Textiles (Pvt.) Ltd
SALES TURNOVER FROM 1989-2006
Mode = 0
Advantages & Disadvantages of Mode
Merits
Extreme values (outliers) do not affect the mode as
strongly as they do the mean and median.
Very simple measure of central tendency.
It can be located graphically, with the help of a histogram.
Demerits
It is a vague and uncertain measure of central tendency.
It is not as popular as the mean and median.
It is not well suited to algebraic treatment.
When there is more than one mode, it is difficult to identify
the actual central tendency of the values.
RELATIONSHIP BETWEEN MEAN, MEDIAN & MODE
Distributions in which the values of the mean, median and mode
coincide are symmetrical.
Distributions in which the values of the mean, median and mode are
not equal are asymmetrical, or skewed.
In a moderately skewed distribution, the distance between the mean and median is approximately
one-third of the distance between the mean and mode.
According to Karl Pearson:
Mean − Median = (1/3)(Mean − Mode)
Mode = 3 Median − 2 Mean
Median = (2 Mean + Mode)/3
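Pearson's empirical relationship can be sketched as two small helpers. The function names and the sample figures (mean 50, median 48) are hypothetical, and the relationship is only an approximation for real data.

```python
def empirical_mode(mean, median):
    """Estimate the mode as 3 * median - 2 * mean."""
    return 3 * median - 2 * mean


def empirical_median(mean, mode):
    """Estimate the median as (2 * mean + mode) / 3."""
    return (2 * mean + mode) / 3


# Hypothetical summary figures.
mean, median = 50.0, 48.0

mode = empirical_mode(mean, median)
print(mode)                          # 3 * 48 - 2 * 50 = 44.0
print(empirical_median(mean, mode))  # (100 + 44) / 3 = 48.0
print(mean - median, (mean - mode) / 3)  # both equal 2.0
```

The three forms are algebraic rearrangements of one another, so any two of the mean, median and mode determine the third under this approximation.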
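Relationship between Mean, Median and Mode as Graphical Analysis
[Figure: three frequency curves. Symmetrical curve: Mean = Median = Mode. Skewed curves: one with Mode, Median, Mean from left to right, the other with Mean, Median, Mode from left to right.]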
Choosing Measures of Central Tendency
Measures Best Uses
Mean Interval or ratio data
Near normal distribution