You are on page 1of 10

Practical Number: 1

One of the major measure of the quality of service provide by an organization is the speed with
which it responds to customer complaints. An internal service provider, had undergone a major
improvement by recruiting well trained installation crews, supervisors and office staffs. The business
objective of the company was to reduce the time between when the complaint it received and when
it is resolved. During a recent month, the company received 50 complaints concerning internet
installation. The data from the 50 complaints, collected by ISP, represent the number of house
between the receipt and the resolution of the complaint:
27,4,52,30,22,36,26,20,23,33,68,165,32,29,28,29,26,25,1,14,13,13,10,5,19,126,110,110,29,61,35,94,
31,26,5,12,4,54,5,35,137,31,27,152,2,123,81,74,27,11
a) Computer mean, median, mode, first quartile, third quartile
b) Compute the range, variance, standard deviation, coefficient of variation.
c) Compute the Skewness and kurtosis and comment on the result.
d) Construct the box and whisker plot. Are the data skewed? If so, how?

Working Expression:
Arithmetic mean:
The ratio of sum of all the observation to the total number of observation is called arithmetic mean.
It is denoted by X̅ and given by:

Mean (X̅)¿
∑X where n= total number of observation
n

Median:
The middle most value of the data which arrange in ascending or descending order is known as
median. It is denoted by Md and given by (for individual series)
th
n+1
Md¿ value of ( ) item
2
Mode:
Mode is the value whose repetition is maximum. In individual series mode is the value whose
repetition is maximum.

Quartile:
Quartile are the value which divide the distribution in to four equal parts and each part equal to
25%. There are three quartiles, known as lower quartile (Q1), median (Md) and upper quartile (Q3)
For individual series:

Size of Qi¿ i ( n+14 ) ,where i=1 , 2 ,3


Range:
The difference between the largest and smallest values in a given data is called range. It is denoted
by R and computed as follows.
R=L−S Where L=Largest item∧S=Smallest item .

Standard deviation:
It is square root of arithmetic mean of the squares of the deviations of the set of the values
measured from their mean. It denoted by  and is given by,

√1
n
Ʃ ¿¿ 
n √ −
n ( )
Ʃ X 2 ƩX 2

Where, X =numeric value∧n=number of observation 

Variance:

The square of the standard deviation is known as variance and is denoted by  and is given by:
Variance¿ σ 2

Coefficient of variation:
The relative measure based on the standard deviation and defined as the ratio of the standard deviation
to the mean express in percentage as known as coefficient of variation. It is denoted by C.V. and the
formula is
σ
C.V. = ×100 %
X
Skewness:
It a descriptive statistical measure used to measure the shape of the curve or to measure the
direction of variation.
mean−mode
Skewness¿
s.d

Kurtosis:
It is used to measure the flatness and peakedness of the curve drawn from the given distribution.
This is useful to test the normality of the curve. It is generally denoted by K,
1
(Q3−Q1 )
K 2
¿
P 90−P10

CALCULATION:

Using MS-excel
Mean 43.02
Standard Error 5.930446519
Median 28.5
Mode 27
First quartile(Q1) 13.75
Third quartile(Q3) 55.75
Standard Deviation 41.93458949
Sample Variance 1758.509796
Kurtosis 1.308902846
Skewness 1.487902647
Range 164
Minimum 1
Maximum 165
C.V 97.46%
Five number summary: 1, 13.75, 28.5, 55.75, 165,

Box and whishker plot

S=1 L=165

Q1=13.75 Md= 28.5 Q3=55.75

0 10 20 30 40 50 60

RESULT:
a. The arithmetic mean, median, mode, first quartile and third quartile are 43.04, 28.5, 27,
13.75 and 165 respectively
b. The range, standard deviation, variance and C.V. of the given distribution are 164, 41.93,
1758 and 97.46%
c. Skewness=1.48 and kurtosis (k)= 1.30
Conclusion:
Here the data is positively skewed and leptokurtic .And from box and whisker plots, we can see that
the length of right whisker is longer than length of left whisker, so the given distribution is positively
skewed.
Practical No: 2
Prepare a histogram from the following grouped frequency distribution of marks obtained by
students and locate the mode. Also draw the Ogive curve and locate the median.
Marks 50-60 60-70 70-80 80-90 90-100 100-110 110-120
No of students 10 15 25 40 15 10 5

Working expression

Histogram:
Histogram is the graphical presentation of frequency distribution with continuous classes. It is a
series of rectangles, each proportional in width to the range of values within a class and proportional
in height to the number of items falling in the class.
Steps involved in construction of histogram
 Draw X-axis and Y-axis.
 Take class boundaries on X-axis and frequencies on Y-axis
 Construct joint rectangles. The resulting figure is the required histogram.

Cumulative frequency curve (Ogive curve)


Cumulative frequency curve is a graphic presentation of frequency distribution. Considering variable
values in the X-axis and their corresponding cumulative frequency in the vertical Y-axis. This curve is
known as Ogive. Cumulative frequency curves are helpful to locate the partition values like median,
quartile, percentile etc. There are two type of cumulative frequency curve, which are as follows.
 Less than cumulative frequency curve
 More than cumulative frequency curve

Construction of Histogram

Histogram
45
40
35
30
25
20
15
10
5
0 Mode
50-60 60-70 70-80 80-90 90-100 100-110 110-120
Construction of Ogive curve
140

120

100

80
frequency

60 less than cf
more than cf

40

20

Median
0
40 50 60 70 80 90 100 110 120 130
marks

Result:
Hence, Histogram is constructed and mode is located using MS-Excel. Also cumulative frequency curve is
constructed with the help of MS-Excel and median is located.
Practical number: 3
Find the Skewness and kurtosis of the following distribution by the method of moment and
interpret the result.
Class interval 0-10 10-20 20-30 30-40
Frequency 1 4 3 2

Working Expression

Moment:
Moments are the arithmetic mean of different powers of the deviations of the given
observation from the chosen value. If the deviation are taken from the exact arithmetic
mean of the series, then the moments from these deviation is known as central moments. If
the deviations are taken from some arbitrary value, then the moments from these deviation
is known as raw moments. There are two types of moments
 Raw moments
 Central moments
Raw moments:
It is denoted by μr' and defined as
For individual series:
n n
1 1
μr = ∑ (x i− A) = ∑ d r , r =1, 2 , 3 , 4
' r
n i=1 n i=1
Where n=number of observation, A=assumed mean, d=x −A
For discrete series:
n n
1 1
μr' = ∑ f ( x i− A)r = ∑ f d r ,r =1 ,2 , 3 , 4
N i=1 N i=1
where N=Ʃf , A=assumed mean , d=x− A
For continuous series:
n n
1 1
μr' = ∑ (x i− A)r =hr × ∑ fd r
N i=1 N i=1
x− A
Where,d= and h= class interval
h
Central moment:
It is denoted by μr and defined as
For individual series
n
1
μr = ∑ (x−x)r , r=1 , 2 ,3 , 4
n i=1
For frequency distribution
n
1
μr = ∑ f (x−x )r , r=1 , 2 ,3 , 4
n i=1
Relationship between central moments and raw moments
' '
μ1=μ1 −μ 1=0
' '2
μ2=μ 2−μ 1
' ' ' '3
μ3=μ 3−3 μ 2 μ1 +2 μ1
' ' ' ' '2 '4
μ4 =μ4 −4 μ 3 μ1 +6 μ2 μ 1 −3 μ 1

Measures of Skewness and kurtosis based on moments


Skewness based on moment:
Karl Pearson defined the following coefficient of Skewness based on moments. These
coefficients are also known as moment coefficient of Skewness.


2 2
μ3 μ3 μ3
β 1= 3
∧γ 1= 3
= 3/ 2
μ 2 μ μ2 2
Interpretation
 β 1=0∧γ 1=0, the distribution is symmetrical
 β 1> 0∧γ 1 >0 , the distribution is positively skewed.
 μ3 <0∧γ 1< 0, the distribution is negatively skewed.
Kurtosis based on moment:
μ4
β 2= 2 ∧γ 2=β 2−3
μ2
Interpretation
 β 2=3∧γ 1=0 , The distribution is normal or mesokurtic.
 β 2> 3∧γ 1 <0 , The distribution is leptokurtic.
 β 2< 3∧γ 1 <0 , The distribution is platykurtic.

Calculation:
Calculation table
mid value X −25 fd fd
2
fd
3
d= fd
4

C.I f (X) 10
0-10 1 5 -2 -2 4 -8 16
10-20 4 15 -1 -4 4 -4 4
20-30 3 25 0 0 0 0 0
30-40 2 35 1 2 2 2 2
Total 10 -4 10 -10 22

Here, N=10 , ∑ fd =−4 , ∑ f d 2=10 , ∑ f d 3 =−10 , ∑ f d 4 =22 ,h=10


First four central moment are:
n
1 1
μ'1=h1 × ∑ fd 1=¿ ¿ 10× × (−4 )=−4
N i =1 10
n
1 1
μ =h × ∑ fd 2 =¿ ¿ 10 ×
' 2 2
×10=100
2
N i=1 10
n
1 1
μ =h × ∑ fd 3=¿ ¿ 10 ×
' 3 3
3 × (−10 )=−1000
N i=1 10
n
1 1
μ'4 =h 4 × ∑ fd 4=¿ ¿ 10 ×
4
×22=22000
N i=1 10

First four central moments will be


' '
μ1=μ1 −μ 1=0
' '2 2
μ2=μ 2−μ 1 =100−(−4 ) =84
' ' ' '3
μ3=μ 3−3 μ 2 μ1 +2 μ1
3
¿−1000−3 ×100 × (−4 ) +2 (−4 )
¿ 72
' ' ' ' '2 '4
μ4 =μ4 −4 μ 3 μ1 +6 μ2 μ 1 −3 μ 1
2 4
¿ 22000−4 (−1000 ) × (−4 )+ 6 ×100 × (−4 ) −3(−4)
¿ 14832

μ3 72
Moment coefficient of Skewness,γ 1= 3/ 2
=
¿¿
μ2
¿ 0.0935
μ 4 14832
Moment coefficient of kurtosis, β 2= 2 = 2
μ2 (84)
¿ 2.102

Result:
Skewness (γ 1 )=0.0935 and kurtosis ( β 2 )=2.102 using MS-Excel

Conclusion:
 Skewness(γ 1 )=0.0935> 0 , The frequency distribution is positively skewed.
 Kurtosis ( β 2 )=2.102<3, the distribution is platykurtic.
Practical Number: 4
Construct the grouped frequency distribution for the following distribution. Calculate the mean,
median, standard deviation.
20,30,90,40,10,70,40,50,40,10,40,10,40,10,40,10,70,10,60,50,10,40,30,50,30,10,70,20,40,40,20,60,
20,30,70,20,30,10,50

Working Expression:
Arithmetic mean:
The ratio of sum of all the observation to the total number of observation is called arithmetic mean.
It is denoted by X̅ and given by:

Mean (X̅)¿
∑f X where n= total number of observation
n

Median:
The middle most value of the data which arrange in ascending or descending order is known as
median. It is denoted by Md and given by (for continuous series)
th
n
Position of Md¿ value of ( ) item
2
N
−CF ×h
2
Md=L+
f

Standard deviation:
It is square root of arithmetic mean of the squares of the deviations of the set of the values
measured from their mean. It denoted by  and is given by,


√1
n
Ʃ ¿¿ 

n

n ( )
Ʃ X 2 ƩX 2

Where, X =numeric value∧n=number of observation 

Calculation:

You might also like