0 ratings0% found this document useful (0 votes) 515 views57 pagesBasic Statistics
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content,
claim it here.
Available Formats
Download as PDF or read online on Scribd
Pea:
MODULE-4
Basic STATISTICS
91. statistics and its Related Terms
vaisties. An aggregate of facts which are affecetd by a number of
ee and which are expressed numerically to some resonable extent of
| ec) and which are collected in a systematic manner for a specific
purpose is called statistics.
‘The subject in which we study the characteristics of the facts is also
fnown as Statistics .
‘iiusiration. Let we collect the facts and figures of the car accident
ten place in Kolkata during last 15 years. Then this collection is a
satistics of ‘car accident in Kolkata during last 15 years’.
Variable, Variable is a symbol e.g. x,...that can assume any prescribed
value,
Ifa variable assume only one value then it is called constant.
Mlustration. (i) Let W’ = number of members in a family of India. Then
Nean assume any of the values 1, 2,3, ... .. So, here N is a variable.
(i) Let = number of prime ministers in India .Then we see X
‘Ssumes only one value,1.So X is a constant.
(ii) Let_ = weight of a person of city. Then we see H can assume
tte values 45 kg, 46.12 kg, 80.0015 kg etc.So here H is a variable.
Dis a . +
a and Continuous Variable. A variable that can theoretically
‘ume any value between two given values is called a continuous variable.
Yatiable which is not continuous is called a discrete variable.
in tl ‘ . +, + fe ge a .
oe illustration the variable cited in (i) and (ii) are discrete
i the variable in (iii) is continuous.
a Observations, Ti
ay observation, §
cal data os
le al data or statisti
‘tration,
Tea tion.
he values assumed by a variable are known as
‘ometimes, in statistics, these are regarded as
ical observation.
; let Y= marks obtained by the students in Mathematics in a
Theda cay p © 2 Vétlable which can assume the datas 5,90,0,81 ete
r F © presented as
Sle 008 70 65¥
GINEERING MATHEMATICS. 1,
Kk: In fact variable,data ete.can be defined in different way.In this
text we keep this definition thinking of the relevant concerned readers
9.2. Frequency Distribution.
The number of occurence of an observation or data of a
ailled the frequency of that data,
Frequency
variable
Illustration. Let x be the marks obtained by 30 students.Let x assumes
the values
30 35 31 32 34 31
30 34 42 30 57 68
42 val 20 15 10 51
57 51 51 52 51 80
51 57 20 a 35 32
Here we see the data 30 occurs three times.So frequency of 30 is 3.
Similarly the frequency of the data 80,71,68 are 1,2,1 respectively.
Simple Frequency Distribution, The simple frequency distribution of
4 variable is the statistical table where the observations (assumed by the
Variable ) are arranged in order of magnitude and the frequency of each
observation is shown side by side,
Illustration, Let x be a variable which takes the value :
3 4 5 3 6 4
4 2; 2 S 6 1
3 4 5 3 2 1
Then the frequency distribution of x is
x! 1 2 3 4 5 6 Total
fii 2 2 5 4 3 2 18
The table can also be shown in column-wise.
Grouped Frequency Distribution,
When a large number of datas are available we cannot grasp their
characteristic only by placing them individually in a table . In these cases
Wwe group the observations into a number of suitable intervals.In a table
Tvals are shown and’ the frequency of the
(or statistical table) these inter
observations included in each interval are shown side by side. This table
is called Grouped frequency distribution,BASIC STATISTICS a
CE
Illustration. The datas below give the marks secured by 70 candidates
ina certain examination ;
2 3h: 85" 52° 64 74 89 53. 42 7
2 35 43 .'67. 16 35 46 26 1..32' 40
243 3.38. 41- 63. 71 28 32. «45 54
15 18 a 73 86 50 39 55 47 12
44 58 67 85 39 40 50 65.:. 72 69
5763 5 SOOT 137. 24 54 82 49
St 54.68 29° «3444 58 62 59 65
Here we see there is a large number of the observations which are
almost distinct.We group the datas into the intervals 0-10,11-21,22-32,.... We
see the data 7,5 are included in the interval 0-10.So the frequency (called
class-frequency)of the class interval 0-10 is 2. In this way we have the
following grouped frequency distribution :
Marks secured : 0-10 11-21 22-32 33-43 44-54 55-65 66-76 77-87 88-98
Frequency :2 4 8 4 IS 12 10 4 1
Note. Frequency distribution is nothing but quantitative classification.
Terms associated with Grouped Frequency Distribution.
(1) Class interval : The group of datas into a number of suitable
intervals are called class interval. In the previous example 0-10, 11-21
etc. are class intervals.
(2) Class limits ; The two extreme values specifying a class interval
are called class limit, In the previous example the lower class limit (LCL)
and the upper class limit (UCL) of the class 22-32 are respectively 22
and 32.
(3) Class Boundaries : The class boundaries of a class are defined as
Lower Class Boundary (LCB) = LCL of the class 4
where d = LCL of the class -UCL of the previous class.
Upper class Boundary (UCB) = UCL of the class +4
where d = LCL of the next class
example LCB of the class 22-32 is
22-21
2
UCL of the class. In the previous
22-
= 21-5; UCB of the class 22-32 is 32-298 ENGIN!
ING MATI
MATICS - 11,
(4) Cla
= L(LCL+UCL) of the class.
=
Mark or Mid Value : Class Mark of a class
In the previous example, class Mark of the class 66-76 is
3 (65+76)=71,
(5) Width of a class : Width of a class = (UCB-LCB) of the class. In
the previous example, width of the class 22-32 is 32-5—21-5=11.
Cummulative Frequency.
For a simple frequency distribution the total frequency of the
observations lesser or equal to an observation is called "less (<) than
type " cummulative frequency of the observation.
For a grouped frequency distribution the total frequency of observations
lesser or equal to the observation in a class is called the "less (s) than
type" cummulative frequency of the observation,
Illustration.
(i) In the simple frequency distribution
x22 4 9 aa
coo 6 4 1
the “less than (s) type" cummulative frequene
Y Of 9 is 34644 =13;
the "greater than (2) type"
cummulative frequency of 9 is 4+1=5.
(ii) In the grouped frequency distribution
Class > 0-4 5-9 10-14 15-19
Frequency : 3 6 4 I
the "less than (<) type" cummulative frequency of the class 19~14 or
against the upper boundary 14.5 of this class is 34.644 = 13; the "greater
than type(2)" cummulative frequency of the class 1014 or against the
ECB 9.5 of this class is 44125,
Measure of Central tendency
9.3. Mean,
A typical value which may or may not be among the datas assumed
by a variable is considered 8 a representative of all the datas. For example
among the datas 2,5,6.1,8,9,7 the value 7 can be treated as thatentative. Generally this representative-value tends to lie centrally within
: . This value is measured by different way. Following is
srithmetic Mean. The Arithmetic mean (A.M) or briefly the mean of
‘n is defined as
4 |" {f the datas have the frequency shown in the following table
F Aaviable (x): oy X ho aay
“Frequency (fi):, fi to fen
; an Pall F id
then theit A.M is ¥ yeu tH frtutintn) =F Vi
; iat
where N =f, + fateotn =
>
if; where
Mlustration, If x: 2 4 1 3 Total
foo: 3 2 4 1 10
- bethe frequency distribution of a variable x then its A.M,
1
jp Ax dtax 21 x443x1)
m1 1
Rs (6484443)=— x21 = 21
io! 84443) 10%
Norem 1. If the two variables x and y are related by the equation
= tw
Pe then F= AS where ¢ and d are any number.
Prof. We consider the frequency distribution of x :
a x 2 Xz ow
ff ff
Since the values of x are changed to those of y so the frequency
tion of » would be
V3 we se Jy Total
fypininh, oN|
300 ENGINEERING MATHEMATICS. 114
where y; =
Now the A.M of y, ¥
is
“Nd fi
1
(fim - fie) = Na
1 on 1 c
CT er ee? ee
Note. The above theorem is very much helpful to determine the A.M of
the variable assuming large data. This is shown in the following Illustration
Illustration. We are given the following Grouped frequency distribution :
Weight in gms(x): 110-119 120-129 130-139 140-149 150-159
Frequency: 5 7 1 20.) .16
Weight in gms(x): 160-169 170-179 180-189
Frequency: 10 7 3
To find the A.M we construct the following table :
Calculation of Mean
Class-interval] Midpoint [Frequency | x, -154.5 see ivi
@) | w {
110-119 114.5 5 40
120-129 124.5 7 -30
130-139 134.5 12 -20
140-149 144.5 20 -10
150-159 154.5 16 0
160-169 164.5 10 10
170-179 174.5 7 20
180-189 184.5 3 30
Total : 80 Ee 2301
aI
3 or, ¥=148.
ot, -0.65 =
Thus the required A.M, x =148.
Theorem. 2. (On Mean of Composite Group)
Let F¥zerF be the A.M, of r groups containig mm,
observations respectively. Then their combined mean or composite mean
is given by
| myXy tgp Heese X,
ny tn to,
| x=
Proof. Omitted.
Illustration. Suppose the mean wage of 60 labourers in morning shift
is Rs 80 and the mean wage of 40 labourers working in evening shift is
Rs 70. Then the mean wage of all labourers (of both shift)
_ m¥,+m¥, _ 60x804+40x70 _ 76
mtn 60+40
: Note. There are other type of means viz., Geometric mean and Harmonic
mean. But study of these is beyond the scope of the book.
54, Median,
Median of a set of observations is the middle most value when the
observations are arranged in increasing or decreasing order of magnitude.
Thus to find the median of a set of observation it is necessrary to
al the observations in order of magnitude.
‘leulation of Median.
sp ileulation of median may be confusing for
q Ponvition; for grouped frequency distributio
, dure of Calculation of Median in the following three cases :
even / odd number of
n. So we classify the302
INEERING MATHEMATIC:
1. (For simple distribution i.e. without having any frequency).
Arrange the given 1 number of observations in ascending / descendiny
order of magnitude.
(i) If nis odd then
Median =”*! th observation of the arranged set.
2
Gi) If» is even then
Median =3 G th observation (+1) th observation}.
Mlustration. (i) If we are required to find the median of the set {5, 8,
7, 20, 13, 3, 11} then we first arrange the datas in increasing order i.e. 3,
5, 7, 8, 11, 13, 20. Here the number of observations, = 7 (odd).
So its median = a th = 4th observation =g.
(ii) If we are required to find the median of the set {5, 4, 7, 3, 21,
12}, we first arrange the datas in ascending order of magnitude {3, 4, 5,
7, 12, 21}. Here the number of observation, n =6 (even).
So its median = es th observation (g + ') th observation}
=5 {3rd observation +4 th observation} =
(5+7)=6.
Case 2. (For Simple Frequency Distribution).
Arrange the observations in ascending order of magnitude, Construct
+1
"less (<) than type" cumulative frequency, Calculate “+! where Wis
total frequency. Then Median = the observation corresponding to the
cummulative frequency Y+! or next heigher (it Yt! is not a
cummulative frequency ).
Illustration,
Consider the following frequency distribution
Marks : 30 40 50 60 70 80
No of students : 8 15: 23 16 8 5
The "less (S) than type" Cummulative Frequency.pasic STATISTICS an
—_. SS
Marks
(s)
a)
Frequeney
Cummutative Frequeney
(fi) (less(S) than type)
Q) (3)
30 8 8
40 IS 23
50 23 46
60 16 62
10 8 70
80. 5 a5)
Total
Here
We see there is no cummulative frequency.'38" in column 3. Next higher
figure than 38 in column 3 is 46,
», Required Median is the observation corresponding to the
cummulative frequency 46 = 50.
Note: N+!
2
may be fraction. The procedure in same is that ca
Case 3. (For Grouped Frequency Distribution)
Here also construct the "less (S)_ than type" cummulative frequency
against class boundaries, Calculate z where N is total frequency. Find
the Median-Class, i.e. the class corresponding to the cummulative
N N
frequency FZ or next higher (if Sis not a cummulative frequency).
Nig
Then Median = lntt 2 xi
m
where /,, = lower boundary of Median-Class
N =Total frequency
a F = Cummulative frequency of the class preceeding to the Median-
ass,304 ENGINEERING MATHEMATICS - lA
EE
Jin = frequency of median class.
i= width of the median class.
Mlustration,
Consider the following grouped frequency distribution :
interval Frequency
130-134 5;
135-139 : 15
140-144 28
145-149 24
150-154 x 17
155-159 10
160-164. 1
The "less than type" Cummulative frequency against Class boundaries.
lass-interval] Frequency | Upper Class-Boundary | Cummulative
Frequency (< type)
(4)
5
20"
48
72
89
99
130-134
135-139
140-144
145-149
150-154
155-159
160-164
* It means there are 20 observations which are less or equal to 139.5.
N a
Now, == 50. There is no cummulative frequency 50 in the 4th
column of the above table. In the 4th column the next higher figure is 72.
‘This corresponds to the median class 145-149,BASIC STATISTICS 305
———————
Therefore, /,, = lower boundary of the median class = 144-5.
F = the cummulative frequency of the class
preceeding the median class = 48 «
Sn = frequency of the median class = 24.
i= width of the median class =5.
2 og, 50-48
4
So the Median = J, +
Note. There is another formula to find the median called "interpolation
formula”.
9.5. Mode,
The observation having maximum frequency is called mode.
Calculation of Mode.
Case 1. For a simple frequency distribution the mode is calculated by
simply method of inspection.
Ilustration. For the frequency distribution
10 20 30 40 50 70
2 2 1
x
Frequency: 2 2 3)
Here we see the observation 30 has heighest frequency 3. So the mode
is 30,
Case 2. (For Grouped Frequency Distribution).
For a grouped frequency distribution mode can be calculated for
frequeney distribution having unique class with heighest frequency and
With equal class width. First find the modal-class i.e. the class having
heighest frequency. Then
Mode = Jy + fi-fo x;
2fi-h-h
where J,, = lower class boundary of
Sy = frequency of the modal class
fo = frequency of the class preceeding the modal class
Sy = frequency of the class succeeding the modal class
i= width of each class (note that this is same for each class).
the modal class ©
I EM2A20
KeIllustr: ne
Consider the grouped frequency distribution.
Here we see every class has same width which is 10 and only class
40-49 has heighest frequency. So the modal class is 40-49. As we know
+c hy;
2n-ho-h
where /,, = lower class boundary of the class 40-49 =39.5
J, = frequency of the class "40 — 49"=20.
So = frequency of the class preceeding the class 40-49 =18
Jy = frequency of the class succeeding 40-49 =12
i= width of each class = 10.
20-18
«. Mode = 39-5+—————__ x 10 = 41.5,
40-18-12 10-41
Relation among Mean, Median and Mode : For a distribution having
single Mode the relation is Mean - Mode = 3(Mean - Median)
Mode =/,,
Note. For a symetrical distribution mean, median and mode coincide.
Measure of Dispersion
9.6. Variance and Standard Deviation.
As we have stated in the previous article A.M represents the entire set
of datas.But the degree to which the datas tend to spread about the AM
is to be measured.It is usually measured by variance or standard deviation
which are discussed below :ra
asic STAT 307
—_—_—__——
Variance . 7
The mean of the squares of the differences of the observations (or
datas) assumed by a variable from their arithmetic mean (A.M) is called
variance of the variable.
Standard Deviation.
The positive square root of variance is called Standard Deviation (s.d)
Thus (i) if x, %2,....x,be the datas then their variance,
i 2 1a =)\2
Var(x)=—{(x =3 swuect(Uq EJ }= 3° (x; —¥)° and the
n
Niet
standard deviation, o, = +
(ii) if the datas have the frequency shown in the following table
Variable Q@X) som x xe
Frequency (fi): fi Sr we Sn
then the Variance,
Var(s) = [fils -3) + Ales
1
ay Dil where N = f, + fot...+f, and the standard
ist
deviation,o, = + LS ste -%).
Nia
If the datas have grouped frequency distribution then x,will be the
mid -value of each class-interval and f, would be the corresponding
Srequency.
Illustration : Let x be a variable which assumes the datas:
20,85,120,60,40.Then 3g (20+ 85+120+ 604-40) = 65. To find the
Variance and standard deviation we go through the following table:
6100308 ENGINEERING MATI
ATICS- A,
5
Here ¥°(x; - 65) = 6100
a
3, 2
Then Var(x) Sd (01-65) = *6100= 1220.
=I
and the S.d, o = 41220 = 34.93.
Theorem 2. If x;,x2
Proof. Left as an exercise.
Note, The above theorem can be extended for frequency distribution
2
pelt -| Lin) where N=fit+frt.tf,.
ia
Theorem 3. If the two variables x and y are related by the equation
ial
_x-¢
ye
also.There Var(x)=
then o, = 2+ where d is a positive number.
Proof. Beyond the scope of this text.
Note. The above theorem is very much helpful to determine the s.d of
the variable assuming large data.This is shown in the following illustration. In
practice, we use the result in Th-2 to find the variance / §.d.
Illustration. (i) Let we be
frequency distribution :
x + 240.12 240.13 240.15 240.16 240.17 240.21 240.22
Frequency: 2 2 4 Ie 92 onde y Ay
given the following observation having the
To find the S.d we go through the following table ;BASIC STATISTICS 309
1 1
Here, N=10, > fiy; =-2 and Yo fry? =114
iat
i=l
2 2
Now, vary) = Bvt (Bes) -14-(2) =1136
N 10 10
= 1136 =337
Since y, == 24016 og 5 _ Ss
017 2-01
or, 337 = Se
01
(ii) Consider the following grouped frequency distribution :
Value : 90-99 80-89 70-79 60-69 50-59 40-49 30-39
Frequency : 2 12 22 20 14 4 1
To find the variance and standard deviation of this grouped frequency
distribution we construct the following table :
Here N=75, fii =27, D fi? = 127.
2
1 1 1
Now, Var(y) = wolfe -(EE4%) =75%
«. the standard deviation of y, oy= 1-56 = 1-23.
Now since y == 64
: ; therefore g, = 2+
7 10
or, o, =1-23x10=12-3,ATICS- HA.
310 ENGINEERING MATHE!
9.7. Significance of measure of central tendency and standard
deviation.
The measure of central tendency represents the set of all observations/
datas of a variable and standard deviation shows the measure of
consistency of the datas. In fact to compare the consistency of datas of
two variables we are to find the Coefficient of Variation (C.V) of the two
datas, The variable having greater c.v is lesser consistent. Coefficient of
variation of a variable
_ Standard deviation , 1 yy
Mean
For example : Let x= runs scored by batsman A in 20 innings and
= runs scored by batsman B in 35 innings. Then x will get 20 datas
and y will get 35 datas.
From these data let the AM, x=90ands.do, =12 whereas the AM,
80ands.dc, =2.
Then since x > y, average runs scored by batsman A is 90 which is
greater than that of B. It seems performance of batsman A is better than
that of B. But the C.V of x= Zx100-Z 1100-135 and the C.V of
g, 2 1
erg} = 9 100=2> which is lesser than that of A. We may
conclude batsman B is more consistant i.e, reliable than A.
9.8. Moments. The r-th moment of the values (datas) x,,),-7+-+-2,
about a number A is defined as
ry_l¢ a
4 f(a, -4y' +(- 4" + sees +x, - A" }== D4 - A)
n n
If the datas have the frequency shown in the following table
datas (x) Saat Xyoreee Xy
frequency (f)): Ko A
then their r-th moment about A is
a ly ,
afta -A A +0 AY fy tot (%, - AY SBE AG - A)
where N=fit+htoott, -2iiPATISTICS 31
rouped
quency distribution , x,,x,, will be taken
gs the mid-value of each class interval,
Example. Find the second moment about 10 for the following Grouped
frequency distribution.
Annual Sales : 0-20 20-50 50-100 100-250 250-500 Total
(Rs.'000)
No. of Firms : 20 50 69 30 25 194
The mid value of the class intervals are
0 , 35 > 75 : 1 375
., the second moment of ‘Annual Sales of Firms’ about 10
= lio ~10)? 20 + (35~-10)?50 + (75 -10)?69 + (175 -10)?30
+(375-10)?25}
= 1716817
9.9. Central moments and Raw moments
Any moment about the A.M of the datas is called central moment. r th
order central moment is denoted by m, or 11,
Any moment about 0 is called Raw moment
Example. Find the third central moment and Raw moments of x from
the following distribution
x : 2 4 1 3) Total
f , 3 2 4 1 10
The arithmetic mean x =2+1
“the third central moment of x
1
= yp ((Q-2-? x34 4-2-1)? x24 (1-21) x44 3-2-4) x}
50-912
The third raw moment of x
at pase x24 x443? x1} =18-3312 ENGINEERING MATHEMATICS- 1A
Theorems
(1) Ist central moment = 0
Ist central moment
Proof, 2nd central moment = SOG 3)? f, = variance by definition.
7
9.10. Relations between central moment and any moment
If m, is r-th central moment and m,' is rth moment about any number
then (i) m, =m} —m?
(ii) my = mi, ~ 3mm} + 2m?
Git) mg = my - Ami} + 6mm)? —3mit
Proof
:@) m=
DAG x)
Si{e-4-@-ay
a
ON
anlis Bacio rs| hit?
=H Loi es 9? 26 Ayla, A+ 4)"}
1¢ = 1 1-- n.
yy fii AP ~Ae- ATLA - A) +O ADA
my -2x-Aym/+(k-A4P (I) DH EN
Me:
Now,
Sih
f=}
AS fo AS fa = LS Heya =m!
Nae Naas: WES Oo
2. from (1), m, = my! -2m,'m/ +m)? =m, -m,?
Proof of (ii) and (iii) are kept beyond the scope of the book.4
BASIC STATISTICS 313
WE
GC- de is independent of i
or, m (9) =m, (2) -)
9.11. Skewness and Kurtosis
The skewness and kurtosis of a frequency distribution are
(skewness) = %
o
and y,(kurtosis) = a3 respectively
where m,,m, are ard and 4th central moments and o is the standard
deviation of the distribution.
Note : Since skewness and kurtosis ure ratio of two quantities having
same unit so they have not unit, they are pure number.
Example. Following is the frequency distribution of a variable x :
Xt 112-45 117-45 122-45 127-45 132-45 137-45 142-45 Total
af: 5 15 20 35 10 10 5 100
Find its skewness and kurtosisSolution. Calculation of moments
27-45
x, Sy fe, -127-45|
i iP 5
12-45} 5 -15 =
li7-45] 1S} ~10 ~2
122.45] 20 -5 =]
27-45] 35 0 0 0 0 0 0
132-45] 10 5 1 10} 10 10 10
137-45} 10 10 2 20] 40 80 160
5 15 3 IS] 45 135 405
42 45]
Total [1003 = 7 -20] 20 | 50 | 1240
The raw moments i.e. moments of. y about 0 are
1
== =—~x-29 2-1
m/(y) yet 100 :
= 229 =1t
100 *
sol 1
=—x-59=-1
i009 “>
2
m ED foi = ppt mo=
‘, the central moments of 'y are
2
my) = my'(y) = mor=B-(-1] =H =2416
ims) = m3'()~ 3my'(»)mn'(y) + 2 {n'a}
1332 _ 201
=o >+
Saas =0-
225 125, 359BASIC STATISTICS 315
j——_—_—
\4
|
mg 9) = mg (9) = 4m, Cy)! (9) + 6my(y){m/ 9) = 3{m( )|
2-4-2) (1eot (2) (4)
5 2 5 5 oy. 5)
_ 82 4,66 3 _ 7827 _ 15 5939
5 10 125 625 625
—127-45
Since }
:. the central moments of x are given by
m= or, my(x) =25%2-16 = 54
m(y) = wo or, m,(x) = 1250-804 =100-5
m9) = me or, my(x) = 625x12-5232 = 7827
Now we know the variance o? = m,(x)=54
2.0, = V54 = 7-348 ) ae
m(x) __100-
3 = == 9.253
<. Skewness of x, 7(*) o (7-348)
7827
and Kurtosis of x 79(x) =) _ aa =-0-316
9.12, Significance of Skewness
Skewness shows the extent of symmetry of the frequency diagram
of a variable. Below we draw the frequency diagram of the variables
| x,y,z and w respectively.
|
|
Frequency diagram of x
XX, xh Xq Xs Xe XY
Note that this frequency diagram is symmetric about the frequency
Js. This type of frequency distribution will have skewness 0. The curve
Fis fitted to show the symmetry.
OH316 ENGINEERING MATHEMATICS - 14,
Frequency diagram of y
Note that this diagram is not symmetric. It has positive value of
skewness as this has asymmetry towards +ve direction of X axis. The
curve f is fitted to show this asymmetry.
Frequency diagram of z
Note this diagram also has asymmetry towards right side. Its
asymmetry is more than that of y. Its measure of skewness will be greater
than that of y. It is also positively skewed,
Frequency diagram of w
i,BASIC STATISTICS 317
_e..,aKRwr°KeFrOO——__
Note that this frequency diagram has asymmetry towards left side. Its
skewness will be negative.
9.13. Other Formula For finding Skewness
Skewness can also be measured with the help of following theorems
whose proofs are kept beyond the scope of the book.
Mcan — Mode
Standard deviation
3 (Mean ~ Median)
Standard deviation
Theorem 1. Skewness =
Theorem 2. Skewness =
9.14. Significance of Kurtosis
Kurtosis shows the peakedness of the frequency diagram of a variable.
Below we draw the frequency diagram of the variables x, y, z and w
respectively |
Frequency diagram of x
In this frequency diagram the greatest frequence is f,. The curve
J is fitted to show this peakedness.
Frequency diagram of yae
3s ENGINEERING MATHEMATICS. 1},
eee
We the peakedness of the frequency diagram is less than that of
x. Then the kurtosis of y will be less than the kurtosis of x.
The kurt normal variate is 0. It is seen that the peakedness of
normal distribution is 3.
So, if the peakedness of a distribution is less then 3 then its kurtosis
type of distribution is known as Platykurtic
will be negative. Thi
distribution.
If the peakedness of a distribution is greater than 3 then its kurtosis
will be posotive. This type of distribution is known as Leptokurtie
distribution
If the peakedness of a distribution is 3 then its kurtosis is 0. This type
of distribution is known as Mesokurtie distribution.
9.15. Illustrative Examples.
Example. 1. Find the mean from the following data :
Daily wages (Rs) : 29 30-34 35-39 40-44
No. of workers : 16 28 14 12
Iris given that the total wage for 10 workers earning Rs 45 and more is
Rs 600.
First we are to work out the mean for the rest part without the last
class.
For that we construct the following table :
class mid point frequency
interval
Here N= 70,5 fi, = 4839
a 4
1, ¥=3745p=37-<=x5 = 33-57.
ie, ¥ iy 3505 73557
Rs, 33-57% 70 = Rs. 2349-9.
So the total wage of 70 workers =
Thus the wage of total 80 workers is
Rs (2349-94600) =Rs 2949-9.
« p. 2949-9
., Mean wage is Rs [5 = Rs 36-87.
Example. 2. The A.M calculated from the following frequency distribution
is known to be 72.5. Find the value of x :
= 30-39 40-49 50-59 60-69 70-79 80-89 90-99
Frequency : 2 3. nN 0 x B% 7
We first construct the following table to find A.M using the given datas :
classes
Classes Mid pt | Frequency Vi
(A)
30-39 2 4 8
40-49 3 3 id
50-59 i 2 32
60-69 20 -1 -20
10-79 x 9 ce
80-89 1 i
= ae 200_
¥=145+109=745-Gyy-EERING MATHEMATI
Ba ENGIN CS-n,
ce
By the given condition we have
145-a = ns or, 67+x=100 * ¥=33-
67+
Thus the rnisein frequency x is 33.
Example. 3. Average marks obtained by a class of 70 students was found
to be 65. Later it was found that the marks of one student was wrongly
recorded as 85 in place of 58. Find the corrected mean.
70
Wrongly calculated mean -dn =65
or, dn= 4550
um of wrong observations = 4550-
.. sum of corrected observations = 4550-85458 = 4523 -
4523
70
Ex. 4. Following is a frequency distribution lacking two class frequency.
Find them if the mean is 7.74.
value : RS 5-7 7-9 9-11 I-13 Total
frequency : 320 — 57 25 200
Let the two missing frequencies be f and /, respectively. We
construct ie following table :
Cilass-Mark
a 12
- So the corrected mean =
= 64-61.
300
eae rr
Now, 324+ fi+57+f.+25=200
or, fit f= w
Cee Rey Lah or, 7. Tha F- x(88446f; +108)
or, 3f, +5f2 = 332
oi ye (2).
Solving (1) and (2) we get <4 Sad
frequencies are 49 and 37, A=49, fi=37 so the two missins| aot ane
BASIC STATISTICS ait
paste STAT
Example. 5. Zio variables x and y are related by 3x+4
is 3. Find A. M. of y.
2 . 21-
From the given relation we get ¥= eis 7
21-3 _21-3x3_ 12
‘4 ea a
Example. 6. Tivo variables x andy are related by x=2y+5- The median
of is 25. Find the median of y.
From the given relation we have y=
Here if x increases y also does so.
Medianofx-5 _ 25-5 _
So the median of Y= =~ 10,
Example. 7. The number of observations of two groups are in the ratio
2:1 and their A. M. are 8 and 128 respectively. Find the A. M. of the
combined group.
Let the number of observations of the two groups be 2k,k. 3 =8,
mi tm, _ 2kx8+K x128
my +m 2k+k
¥=128. The combined A. M., ¥=
or, Fate ae:
Example. 8. Find the median of the following frequency distribution :
to 10 15 20 25 30, 35, 40
-, -- - age; 21) 17 8
equency distribution table is given below :
The cumulative fr
EM-2,4.91ENGINEERING MATHEMATICS.
ert,
Here N =119
N+l_ 6g.
Nl
greater than 7 Is 73 and the
So the cumulative frequency just
value of x corresponding to c.f 73 is 25. Hence the median is 25,
Example. 9. Calculate mean, median and hence find the approxima
value of the mode from the following frequency distributions :
Height (inches) : 60-63 64-67 68-71 72-75 76-79 80-83
No of students 8 3 18 6 16 8
To find mean we first construct the following table :
frequency] c.f
(f)
Mid pt
(x)
ge Dh _ -16
Ja Sta = 02
so 712
oF, x, =73544y,
z ia *4¥=135+4(-02712) - 79.4150
.. mean of the given distribution —
- Mos tion = 72.4159.
So the median class is 72-75
by = 15, F =29, f,, =6
and §=755-715=4BASIC STATISTICS 323
———
Np
e. Median = Im+ 7 xP 27154 8 y= 7183-
6
Using the relation Mean-Mode =3 (Mean-Median) we have
Node = 72.4152 - 7183) or, 72.4152-mode = 1.7556
. Mode = 70,6596
Example. 10. Calculate the mode of the following data :
1, 12, 5, 8, 12, 13, 8, 1, 4, 8, 7, 8,5.
Let us arrange the given variaties with corresponding frequencies as
given below
}
1 2
4 1
3 2
7 1
8 4
12 2
13 1
As the variate 8 ocurs 4 times which is maximum, so,
mode =.
Example. 11. Calculate the mode from the following distribution :
class + 10-15 15-20 20-25 25-30 30-35
frequency: 6 9 i 7 7
Here the greatest frequency 11 lies in the class 20-25. Hence modal
Class is 20-25.
= lower class boundary of the modal class
=20.
J, = frequency of the class =11
fo = frequency of the procceding the modal class
9
fy = frequency of the class succeding the modal class = 7.
i= Width of each class =6-Ex.12, The median and mode of the following frequency distribution are
known to be 27 and 26 respectively, Find the values of « and B :
10-20 20-30 30-40 40-50
class interval : 0-10
a 20 12 B
frequency: 3
Since mode = 26 , so it lies within the class 20-30.
fi-
Mode = /,, +—/— 0 — 10
2h-fo-hr
=204 20-2 x19,
28
20- 10(20 -
204 1020-8) 9g 4, 1020-2)
28-4 28-a
a=8.
So the total frequency is N =43+B.
But media = 27, so it lies within the class 20-30.
Then we construct the following frequency table :
x
Now median = /,, + i xi
nt
43+8
7 sil 21+8
* 97=204+—2-———x 10% 4 =Tor, 21+P=28 -. B=7.
20
bggentes|
BASIC STATISTICS 325
Se en oe ie ee Be
Ex. 13. Find the variance and standard deviation of the following frequency
distribution :
Weight (in kg) : 36-40 41-45 46-50 51-55 56-60 61-65 66-70
Noofpersons: 14 26 40 33 50 37 25
We construct the following table :
class Mid pt Wr
interval (x)
36-40 38 126
41-45 43 104
46-50 48 40
51-55 533 0
56-60 58 50
61-65 63 148
66-70 68 225
Total 693
Here N =225,5 fy; = 65, fy; = 693
. Var(y)= LD fyi -(EEAn)
2
1 65
= 395 * () = 2996.
+. SD of y, 0 = ¥2996 = 1.731
53 o
x
Since y, 180 Sy
5
6, =1731x5=8655,
Example. 14. The mean and standard deviation of marks of 70 students
Were found to be 65 and 5.2 respectively. Later it was detected that the
marks of one student was wrongly recorded as 85 instead of 58. Obtain
the correct s.d.326 GINEERING MATHEMATICS- 1A |
ns
Let x1,x2,43+" be the marks.
The “incorrect 7x," = 65x 70=4550- |
<. the "correct Sx; "= 4550-85458 = 4523 -
.. the ‘correct mean’
pai }
We know (sd) => |
|
in the "incorrect )° x; = 70{(5-2)° +(65)"} = 297642-8- : j
‘Then the correct Yix7 = 297642-8-85" +587 = 293781-8-
293781-8 \
Hence the correct (s.4)" = oy = 22-43. |
\
So the correct s.d= 22-43 =4-74. |
Example. 15. Tine factories A, B, C producing similar products are such {
that the mean daily wage of workers of factory A is Rs. 100 with a s.d of
Rs. 10, whe factory B, the mean wage is Rs. 150 and s.d is Rs. 12 |
and in factory C, the mean wage is Rs. 150 and s.d is Rs. 10. Which }
factory is most consis
ie
4 tent in respect of the daily wage of their workers 2 |
ce The sd of workers of factory A per anit-mean -— =0-1- |
|
The s.d of workers of factory B per unit-mean
10
The s.d of workers of factory C per unit-mean = 755
=0-07,
This shows that the daily wage of the workers of factory C is most \
consistant.
Variability is highest for factory 4. ‘
Example 16. Calculate the first four central moments, the skewness
kurtosis for the following distribution of 1083 cases of entric fever.
seeLg
BASIC STATISTICS
327
‘Age (Years) No. of cases | Age (Years) No. of eases
Under 5 33 30-35 3)
5-10 143 35-40 37
10-15 252 40 - 45 20
15-20 244 45 - 50 12
20-25 165 50-55 5
25-30 107 55 - 60 2
Solution.
Calculation of moments
Class [Mid- | f; |x, -32-5 pes | AM Ve | fe | fot
ntervall Value
712 |42768
17875 |89375
16128 }64512
6588 |19764
+1320 | 2640
-107 | 107
0 0
37 | 37
160 320
324 | 972
320 | 1280
25 251
48055 P23025
Eyer ELDFThe moments of y about 0 are
' 1 1 =-2-
mm’) 5 DIN = Fogg T= zat
r 1 22!
m= yD 1083
Tal 1
m() =ylir = pg 18055 = 4372
«12113 =11-185
m (9) = LD Soi - a x 223025 = 205-932
, the central moments of y are
m,(y)=0
my(y) = my (9) {mon} =11-185—(-2-721)? =3-781
3
mms(9) = m5'(9)~ 3m (9m!) +2{ m0}
= 44-372 —3x11-185x(-2-721) +2x(-2-721)° = 6-639
aye rok (he ye kya
img) = ma (1) — Ams (ym, (y) + Gin ( {mi os} s afm 0}
= 205-932 4x (44-372) x (-2-721) + 6 x11 -185x (-2-721)"
\ -3(-2-721)*
= 55-408
32-5
5
:. the central moments of x are given by m,(x)=0
Since y, =
img) = 222, my(x) = 253-781 = 94-525
my(y) -30) omy (x) = 125 x 6-639 = 829-875
m= a or, my(x) = 625 x 55-408 = 34630-00
The Variance of x, 0? = m,(x) =94-525
94-525 = 9-722BASIC STAT
329
Skewness of x,
Kurtosis of x, 72(8) = P= 3 = 0-876
(9-722)4
Example 17. The first four moments of a distribution about 5 are 2, 10,
40 and 218. Find the first four central moments and moments about 0.
Solution. Let /m, and m}. are central moments and moments about 5
respectively,
Using the relation between m, and m!, we have
0-2? =6
my — Im)
my = my —3my'm! + 2m)? =40-3x10x2+2%2 =-4
mg = my — Am m/ + 6m, m,? —3m|"*
= 218-4x 40x 2+6x10x2? -3x2* =90
which are first four central moments.
Let 1, fp. /45,44 be the moments about 0.
Now, first moment about 5 =2
on LY f(x, -9)=2
N
ie
or, yee
or, x-S-bxN=2 x57
N
12 1¢
Now, = BAe —O Ls
Using relation between central moments and moments about any
number we have
my = fy HP 0F,6= Hy - Tt = 55
my = fy —3 ppb, +2)
Of, ~4= py -3x55X7+2%7> OF, Hy = 465
23,4
and my = ply —4bshh + 6b 4 ~ 34|
330 ENC RING MATHEMATICS - 14
ren RE
OF, 90 fly 4% 405%74-6%55x 7? 3x74
Of, fy = 4143
which are the first four moments about 0.
Example 18. The first three moments of a distribution about 7, calculated
from 9 datas, are 0.2, 19-4
standard deviation and the third moment about origin.
Soluition, Let yn," be the moments about 7.
and 41.0 respectively. Find the mean,
som =0+2, mJ =19-4, my =-41-0
1s iS ig
A=) (x, -7) = 0-2 or, =~ -=)7=0-2
ou ) or, 52" om
=7-2
am!
OF, x-—x7x9=0-2
From relation between central moments and moment about any number
we have
my = my, ~m/? =19-4— (0-2)? =19-36
«. the variance =19-36 +, §.d=J19-36 =4-4
my = my — 31mg! + 2m)
=-41-3x19-4x0-24+2x (0-2) =-52-624
Let #1+#2+43 be the first three moments about 0.
Now, 4 =EEU -0)!=¥=7-2
Using relation between m, and j1, we get
My = fly ~ Hy OF, 19-36 py - (7-2). py = 71-2
and m, = fy ~ 3ptyy, + 244)
01, ~52-624 = pty -3x71-2%7-242%(7-2)
2 My = 738-8
Example 19. The A.M of a distribution is 5. The second and the third
central moments are 20 and 140 respectively. Find third moment about
10.
Solution. Let m, be the central moments.
J:m, = 0, my = 20 and m, = 140Fae
STATISTICS 33
pas
Let m,’ be moments about 10.
Gee
Then ™, = 240% 710)
ial
2¥-dxt0n=5-10=-5
n
Using the relation between m, and m,’ we get
my = my —m,? Of, 20= my! -(-5)? sm, =45
and mn, = ms. —3my'm) + 2m?
ot, 140 = my 3x 45x (-5) + 2x (-5)°
= my +675-250 +. =-285
Example 20. The distribution of a variable x has coefficient of variation
=5, variance =4 and measure of skewness =0-5. Find the mean and
mode of the distribution.
Solution. We know, CV =2-x100
Mean
I
ose 100 2. Mean =22108— 40
Mean
< Mean - Mode 40 — Mode
We know, Skewness = aD or, 0-5= =e
Mode = 40-1=39
Example 21. Prove that the second order moment of a variable is minimum
about the mean of the variable. _
Solution. Let x be the variable and x be its mean.
Let a be any real number.
Then (x,-a)? ={(5, -+G-a)}
=3) $2(x, -E-4)+ (e-a)
= (3,
“3 (,-ay =S 4-37 426 - aS (4) - +L O~a?
fal & ial ia
= Ce ye)
pre2G-0-tE6; ~a)+ on - ay
1g ig
am ~ayY =—
nia) =e1S = 47
=~ 35-3) +28 - ate
iat
= 1S, —x)? +2(¢-a)(x-x)+(z-a)?
nia
or, 2nd order moment about a
= 2nd order moment about x+(x-a)?
> 2nd order moment about y
Hence proved.
Exercises 9
1, Find the A.M of the variable assuming the datas 5, 8, 3, 10, 12.
2. Find the A.M of the following frequency distribution :
x sHig* 15) "aGeEs
Frequency: 4 3 1 2
3. The scores of a cricketer playing six matches are 84, 91, 72, 68,
87 and 78. Find the arithmetic mean (A.M) of the scores.
4. Ten measurements of the volume of a cone were recorded by an
engineer as 3.88, 4.09, 3.97, 4.02, 3.95, 4.03, 3.92, 3.98 and 4.06 c.c.
Find the A.M of the measurements.
5. Find the A.M of the following frequency distribution :
x : 462 480 498 516 534 552 570 588 606 624
fice 98 75 56..42. 30 21 15357 Meee 6 2
6. Following is the frequency distribution for the number of minutes
per week spent watching TV by 400 secior citizens.
Viewing Time
(minute) : 300-399 400-499 500-599 600-699 700-799
Number of
Citizens eo: 14 46 58 76 68
800-899 900-999 1000-1099 1100-1199
62 48 22 6
Find the mean TV viewing time for the 400 secior citizens per week.7, Four groups of catt! cons
reported weight are 140, 15.
weight of all the c
sting of 18, 10, 20 and 15 cattles,
» 148 and 162 Kg respectively. Find the mean
[Hint : It is a frequency distribution like
x : 140 153 4g 162
Frequency: 18 10 20 5
8. Average marks in Engineering Mathematies in a Class-test of 45
students is 62%, On scruitiny it is detected that the marks of two students
were erroncously recorded as 25 and 72 instead of 52 for both of them.
What should be the correct mean,
9. A teacher teaches two sections of Mathematies class. Section A has
25 students and their average on the first test was 82. Section B has 15
students and their average on-this test is 74, Find the average on this test
if the teacher combines the scores for both the classes.
10. The mean weight of a lot of beams is 60 kg. The mean weight
of black beam in the lot is 70 kg and that of white beam is 55 kg. Find
the proportion of black-beams and white-beams in the lot. If we have an”
additional information that there are 150 beams in the lot altogether, then
obtain the number of black-beams and the number of white beams in the
class.
11. Fifty students took up a class test carrying a total of 10 marks,
The result of those who passed the test is given below :
Marks 3 4 5 6 7 8 9
No. of students : 8 10 9 6 4 3
If the average marks for the 50 students were 5.16, find out the
average marks of the failed-students.
12. Following is a frequency distribution having two missing
frequencies, The mean of this distribution is 1.46. Find the missing
frequeney :
x :0 1 2 3 4 5 Total
Frequency 146 — — 25 10 Siri 200
13. The A. M. of the following frequency distribution of marks for
4 group of 60 sudents is 30.5. Find the missing values :
x g 10 20 — 40 50
of : 8 10 20 15 1a4 ENG! ING MATHEMATICS - 1A,
14, Find the mean of the variable x assuming following values :
(i) 12,50 with frequency as its values.
(ii) the first n natural numbers. [VB.U. Tech 2004)
(ii) P35? (Qn)?
(iv) 24,620,
(vy) 23,4,6°(2n)?
(vi) 1,2,-+7 with frequency 1?,27,...n? respectively.
15. Suppose that the blood pressure for nine randomly selected
individuals are +
118.6, 127.4, 122.0, 133.2, 108.3, 138.4, 113.7, 130.0, 131.5.
Find (i) the median,
(ii) If the valucs are rounded off to nearest 5 mm. Hg, what is
the sample of the values and what is the median.
(iii) If the second persons blood pressure is 127.6 rather than
127.4 how does this change the median of the actual values and of the
rounded values ?
16: In a batch of 15 students, 5 students failed in a test. The marks
of 10 students who passed were : -
90, 60, 70, 80, 80, 90, 60, 50, 40, 70.
Find the median of the marks of all 15 students.
17. The following data relate to the sizes of shoes sold a shop. Find
the median and mode size of shoes :
Size 19.0 85 80 7.5 7.0 65 6.0 5.5 5.0 4.5
Frequency: 1 4 Il 23 40 60 30 15 5 2
18. (i) The frequency distribution of house rent for 30 families in certain
2
locality is given below :
Rent: 1800-2000 2000-2200 2200-2400 2400-2700 2700-3000 3000-3500
No-of
Familics: 4 7 10 5 2 2
Find the Median.
(ii) Form an ordinary frequency table from the following data :
Marks : Below 10 Below 20 Below 30 Below 40 Below 50
No.of students : 3 8 17 20 22
Hence find the mean and median. [W.B.U. Tech 2005 ]BASIC STAT
FICS
19. Following is
é lo grouped frequency distribution having am
frequency. The median of the distribution is 127.5, Find the missing
frequency
Class-interval: 100-109 110-119 120-129 130-139 140-149. 150-159
Frequency: 5 7 = 8 4 6
20. Following is a grouped frequency distribution of expenditure of
1000 families. The mean and median of the distribution are both Rs. 87.50.
Find the missing frequency :
penditure + 40-59 60-79 80-99 100-J19 120-139
No. of Family: 50 _ 500 _ 50
21. Show that the combined A. M. of two groups lies between the
arithmetic group means,
22. If the height in cm of ten students are 63, 65, 66, 65, 64, 65,
65, 61, 67, 68, find the modal height. {W.B.U.Tech 2004]
23. Among drivers the number of accidents in which each was
involved during a 5-year period was recorded : 2, 0, 1, 0, 1, 3, 1.
Determine the mode.
24. (i) Find the mode of the following frequency distribution :
Class interval: 1500-1700 1700-1900 1900-2100 2100-2300 2300-2500
Frequency : 25 30 37 27 uN
(ii) Find mean, median and mode from the following frequency
distribution
Clessinterval: 300-600600-1000 1000-1800 1800-28002800-3300 3300-3600
3600-4500
Frequency: 10 pt) 30 Py wo 5
5
25, Find the standard deviation of the observations :
(i) 5, 18, 10, 15, 3, 7, 6 and 12
(ii) 5, 7, 1,2, 63
(iii) 3.2, 4.6, 2.8, 5.2, 44 /
26. Find the s.d and variance for the following frequency distribution :
x : 2 5 9 10
i 5 3 6 8 4336 ENGINEERING MATHEMATICS - IA
en EO
27. Find the s.d and variance of the variable x having following
frequency distribution :
x : 945 845 745 64.5 545 44.5 34.5
fo 2 12 22 20 14 4 1
28. Find the mean and_s.d of the following distribution :
Class-interval : 4-6 6-8 8-10 10-12 12-14 14-16
Frequency : 13 HI 182 105 19 7
29. Find the standard deviation of the set of numbers in the arithmetic
progresion 4, 10, 16, 22, 28,...154.
30. Find the standard deviation for the frequency distribution of life
time (in hours) of 80 light bulbs :
Life-hours :500-600 600-700 700-800 800-900 900-1000 1000-1100
Frequency: 6 12 25 18 14 5
31. The mean and standard deviation of marks of 70 students were
found to be 65 and 5.2 respectively. Later it was detected that the value
85 was recorded wrongly and therefore it was removed from the data
set. Then find the mean and s.d for the remaining 69 students.
32, Following are the maximum daily temperature (°C) recorded in
a week in Kolkata :
38, 40, 36, 35, 30, 32, 34.
Using transformation property determine the s.d of maximum daily
temperaturae in Farenheit scale.
[Hint : The relation between °C and °F is £- . iS lL
33. Find the standard deviation for the distribution of duration of
telephone calls in a telephone booth given below :
10-109
110-209
210-309
310-409
410-509
510-609
610-709BASIC STATISTICS asi
eee
34. The marks obtained by 10 students are
70 65 68 70 75 73 80 70 83 86.
Find the variance.
35. (a) The s.d of first » positive integers is 2. Find n.
(b) The s.d of first even positive integers is 5. Find n.
(©) The s.d of first m odd positive integers is 8. Find n.
36. The A. M. and variance of 20 observations were calculated by a
student as 20 and 5 respectively, But while calculating an item 13 was
mis-read as 30. Find the correct A. M. and variance.
37. Find the mean and standard deviation from the following grouped
frequency distribution :
Weight —:35.0-39.9 40.0-44.9 45.0-49.9 50.0-54.95 5.0-59.9
60.0-64.9 65.0-69.9
Frequency: 5 16 30 23 17
8 1
38. Compute the s.d for the following frequency distribution on average
daily sales (is Rs.) of 80 salesmen of a departmental store :
Class: 50-59 60-69 70-79 80-89 90-99 100-109 110-119
Frequency: 6 9 15 25 13 a 5
[.B.U.Tech 2004)
39, Compute the arithmetic mean and standard deviation for the
following data :
Score 1:45 67 89 10-11 12-13 14-15
Frequency: 4 10 20 15 8 3
UB.U.Tech 2002)
40.Find the standard deviation from the following frequency distribution
(W.B.U.Tech.2008}
EM-24-22
a338 NEERING MATHEMAT HA
——————
41. The mean and s.d of the income of teachers in two schools are
given below :
Mean income — s.d
School 4 8000 1200
School B 9000 1350
Compare the variability of the incomes of the teachers in two schools,
42. Following are the scores of two batsmen, A and B in ten innings
A: 8 723 MS 36 19 7 12 6 M9 , 29
Bie ps3 st 42aitg 12H 1 48s =; Sdn sea 47 vie 26-4) 37: 0
Who is the more consistent player.
43. For a class of students the height has a distribution with mean
162 cm, s.d 10 cm and weight has mean 57 kg, s.d 8 kg . Compare the
variability aspect of the distribution of height and weight.
44, Thew scores of two batsman A and B in 10 innings are
A: 19 31 48 53 67 90 10 62 40 80
B : 32 28 47 6 71 39 10 60 96 14
Find which batsman is more consistent in scoring.
[Hint : Find x=50,0 =24-4 », CVof x=49]
Similarly find y= 46,0, =25-5 and CV of y=55]
45. The mean life in days and standard deviation for two types of
electric bulbs are given below :
Mean life in days s.d in days
Typel : 310 9
Type Il 260 4
compare the relative variability of life of the type of bulbs.
46. You are given the distribution of wages in two factories X and Y-
Wages (Rs): 50-100 100-150 150-200 200-250 250-300 300-350
No. of X : 6 ll 18 32 27 i
workers Y : 2 9 29 54 1 5
State in which factory the wages are more variable.BASIC STATISTICS ao)
47. Find the first three central moments of the following frequency
distribution :
Yearly income
in lakh
[No. of families 28 [292 389 [212 30 Tas | 2 | 1000
6-9 19-12 12-15] 15-18 18-21 ]21-24 | Total
48. Find the first four central moments and the skewness, kurtosis
for the variable x having the following frequency distribution :
x 3 21-24 25-28 29.32 33-36 37-40 41-44
of: 40 90 199 110 50 20 -
49. Find mean, mode and standard deviation for the body weight of
the children having following frequency distribution. Hence find the
measure of skewness :
Body weight : 14-5 15-5 16-5 17-5 18-5 19-5 20-5 21-5
No. of children: 35 40 48 100 125 87 43 22
50. Find the skewness of the following distribution :
Wages (x) : 55-58 58-61 61-64 64-67 67-70
No. of workers; 12.17 23 18 ul
51. Find the skewness and kurtosis for the following distribution :
x : 45 14.5245 34.5 44.5 54.5 64.5 74.5 84.5 94.5
fv Veles als 217 hg eg ge oy
52. From the following distribution, find the skewness and explain its
significance.
Weekly : 70-80 80-90 90-100 100-110 110-120 120-130 130-140 140-150
wages (in Rs)
No.of : 12 18 35. 42 50 45 20 8
employees
53. (a) Find skewnes based on mean-median from the following
frequency distribution.
Marks No. of Students ‘Marks No. of Students
Above 0 100 Above 50 50
"10 98 "60 35
"20 95 "70 23
" 30 90 " 80 13
" 40 80 "90 5340 ENGINEERING MATHEMATICS- 1A,
(b) For a distribution if the skewness is 0-42, A.M is 86, Median is
80 find coefficient of variation.
54. If m,' and m, are the second moment about 10 and the AM
respectively then prove that im,’ =m, +4? where k =x-10
55. The first four moments of a distribution about 3 are 2, 10, 40 and
218. Find the moments about the origin and the moments about mean.
56. Find out kurtosis’ to the following data :
class interval : 0-10 10-20 20-30 30-40
frequency: 1 3 4 2
comment on the nature of the distribution .
57. If a=", where m, is central moment and ois standard
o’
deviation, prove that a, = 0 and a,
58. If m, is rth central moment and @, is the rth central moment of
xox
, where oF is S.D of x, then show a, =m,/o" .
z=
o
59. The first three moments of a distribution about 2 are 1, 22 and
10. Find its mean, s.d and the skewness.
60. The first three moments of a distribution about 2 are respectively
1, 16 and —40. Prove that the mean is 3, the variance is 15 and the third
central moment is -86. also find the first three moments about 0.
61. The variance of a symmetrical distribution is 25, What must be
the value of the fourth central moments so that the distribution is (i) lepto
kurtic (ii) mesokurtic (iii) platykurtic.
62. Calculate the first four central moments for the following data and
examine for the nature of the distribution :
xr ol 2 3 4 5 6 7 8 9
fi 6 1300 25 30 22 9 5 2
63. Find the second, third and fourth moments about mean of the
following frequency distribution. Hence find the skewness and kurtosis
of the distribution.341
imits :
100-104-9 105-109-9 110-114.9 115-119-9 120-124-9
frequency + i 1B 25 25 30
64. The first four moments about the mean of a distribution are 0,
2-5, 0-7 and 18.75. Find the skewness and kurtosis of the distribution.
65. The first. four moments of a distribution about 5 are respectively
2, 20, 40 and 50. Obtain, as for as possible, the character of the
distribution in terms of skewness and kurtosis,
66. The following data are given to the manager of Good Year Tyres
company. Is the distribution platykurtic 2
N= 100, 3° fix; = 50, D fix? = 1967-2, ¥ fix? = 2925-8,
D fix =86650-2
67. A survey was conducted by a tophy manufacturing company to
enquire the maximum price at which persons would be willing to buy their
product. The following table gives the stated price (in Rs.) by 100 persons:
Price + 1-80-1-90 1-90-2-00 2-00-2-10 2-10-2-20 2-20-2-30
No. of persons : ao 29 18 27 15
Calculate the skewness and interpret it.
68. Find skewness for the frequency distrirbution given below.
Weekly wages No. of Weekly wages No. of workers
(Rs) workers (Rs)
23-27 2 48-52 16
Bee 6 53-57 2
pe 9 58-62 6
Set 14 63-67 2
43-47 32 6872
69. For a variable, coefficient of skewness =-0.375, Mean =62, -
Median = 65. Find the variance. :
70. Find the coefficient of variation of a variable whose median = 17.4,
Mode =15-3 and skewness =0-35-342 ENGINEERING MATHEMATICS - lA
eee
71. The mean, median and the coefficient of variation of the weekly
wages of a group of workers are respectively Rs 45, Rs 42 and 40. Find
the (i) mode (ii) variance (iii) skewness, for this distribution.
72. The distribution of the wages of the Govt. employee in a country
is such that its first two moments about 4 are respectively1.5 and 2.7.
The median of this distribution is 2-1. What will be the shape of the
frequency diagram of the distribution.
73. For a group of 10 items, x, = 452, x? = 24,270 and the Mode
= 43-7. Find the skewness.
74. For a moderately skewed distribution, mean is 160, mode is 157
and standard deviation is 50.
Find (i) coefficient of variation
(ii) skewness
(iii) median.
75. Ina distribution, Mean = 65, Median = 70 and skewness — -0-6-
Find Mode and the coefficient of variation.
76. Given mean =50,CV =40%, skewness =~-0-4. Find s.d, Mode
and Median.
77. Consider the following distribution ;
Distribution of x Distribution of y
Mean 100 90
Median 90 80
s.d 10 10
Find whether (i) distribution of x has
distribution of y. (ii) both the distri
skewness,
same degree of the variation as
bution have the same degree of
78. The mean of a certain distributi
ion is 50, its s.d is 15 and skewness
is _). Find the median,
79. The skewness of a distribution is 0.32. Its standard deviation and
mean are respectively 6-5 and 29-6. Find the mode and median.
80. Ina frequency distribution the cv =5, sd
=2 and skewness
=0-5. Find the mean and mode of the distribution,pasic STATISTICS 343
—_—_——
Answers
17:6 25-7 3.80 4.3-98 5.501 6.715 7. 150
8. 62:16 9. 79 10. 1:2;50,100 11, 2-1 12, 76, 38 13. 30
—2ntl ntl 4n? 1 .
14.) 37 GS Git) —S— Civ) 1 () Ant +1?
3n(n+1)
2(2n +1)
15. 127,6;130 16.60 17. Bothare 6-5 18. (i) 2280
19.10 20.250, 150 22.65 231.24. (i) 1982-40
(ii) mean = 1765, median = 1533-33, mode = 1070
25. (i) 4-87 (ii) 2-16 (iii) 0-90 26. 2-82,7-95 27. 12-3
28.9+12,1-93 29.45. 30. 131-29 hours 31. 64-71,4-64
32. 5-69 34, 42-8 35. (a) 7(b)4(c)5 36. 19-15, (4-67)7
37. Mean = 50-4, s.d=6-71 38. 15-41 40, Rs.16-87
41. Same 42. B is consistent but A is better because mean of A is
higher than B.
43. The variability is more for weight distribution.
44. batsman A 45. Type II more variable 46. factory X
47. 0,9-4,17-4
48. 31.30,23-04,26-11, 1496-68 s.k=0-24, kurtosis = 0-18
49. 18-07,18-40,—0-19 51-0-713,0-787 50. ~-048
52. 0.332; Negatively skewed 53. (a) 0-574 (b) 50
54, -0-197, --7455 55. 2, 10, 40, 218; 0,6,4, 90
59. 7-2,4-4 60. 3, 24, 76
61. (i) my > 1875 (ii) my =1875 (ili) my <1875
62. 0,2-49, 0-7, 18-33; the distribution is almost normal
63. 0, 38-09, - 110-672, 3229-7057;0-4708; — 0-774
64. 0-031, mesokurtik344 ENGINEERING MATHEMATICS - 11,
ee
65. negatively skewed; playtykurtic 66. Platykurtic
68. 0-057 69. 5.76 70 .0.49
71. (i) 36 (ii) 324 (iii) 0-5
72. The frequency curve is asymmetrical, has a longer tail on te right
hand side.
73. 0-08 74. (i) 31-25 (ii) 0-06 (iii) 159
75. 80, 38-46 76, 20, 58, 52-67
77. (i) distribution of y is more variable than that of x (il) Yes, both
2
have same degree of skewness 78. 0.55
79. 27-52, 28-9 80.400, 399
MULTIPLE CHOICE QUESTIONS
1, The frequency distribution of a variable x is
Kye 8 2 3 1 5
ho: 1 2 3 1
Then its mean is 16
(a) 2 ) >
1 iti
Oz @ 16
2. If the A.M of 2, 6, x, 5, 7 be 4, then the value of x is
(a) -20 (b) 5 () 4 (d) 0.
3. The A.M of 2,4,6,-++2n is
@) n+l (b) n(n +1)
n+l i n(n+1)
OFZ @O->Z
4, The A.M of 7,x-2,10,x+3 is 9. The value of x is
@)o (b)9 (©) 18 (d) 2x +18.
5. The mean of the following distribution is :
Marks; : 20-39 40-59 60-79 80-99
No of students : 10 12 8 10
(a) 68.5 (b) 39.5 (c) 58.5 (d) 60BASIC STATISTICS ‘ M5
Se
6. The standard derivation of a frequency distribution is given by
© PREA-3 Heals
© fyze-7 @ eels)
7. The variance of the following distribution is :
x; 7 -l 0 3 4
fi : 3 2 1 4
(a) 56.8 (b) .568
(©) 5.68 (a) 2.383
8. For the datas x,,x,,--,x,, variance of x is given by
Pew! of ic yao 1 1
@ (ae -(tz5] (b) ola -(55)
1 1
© yb% oe (@) None.
9. The standard derivation of the following observation is 5,7, 1, 2, 6, 3.
(a) 4.66 (b) 2.16
(© 1.47 (4) none.
10. The mode of the following data is : 2, 1,3, 2, 1,5,2, 2, 1, 6,4, 21,3
(as (02
@©3 @i.
11. The median of the following distribution is : 7, 9, 5, 3, 10, 15, 21,
19,17
(a) 15 (b) 9
(c) 10 (@) 17.
12. The median of the following distribution is : 10, 13, 9, 7, 37, 16, 27, 32
(a) 16 (b) 14.5
(© 13 (a) 15.5.oo!
Mo ENGINEERING MATHEMATICS - 11,
— Ss
13, The relations between mean, median and mode is
(a) Mode = 3 Median -2 Mean
()) Mode = 3 Median +2 Mean
(c) Mode = 2 Median —3 Mean
(d) Mode = 2 Median +3 Mean,
14. For a moderately asymmetric distribution median = 27, mean = 26
then mode = 0,
(a) 133 (b) -24
(©) 29 (d) 29.
1S, The A.M of 20 data calculated to be 89.4. Later the data 78 is
replaced by 87, The A.M of the datas after replacement is
(a) 89 (b) 85
(c) 89.85 (d) none of these.
-M of six datas is 0.3. AM of these six datas together with another
16.
data is 0.5, Then the seventh data is
(a) 17 (b) 1
(c)2 (d) 0.3.
17. If 10y, =x, -85 and F=-0523 then ¢=
(a) 80 (b) 79.77
(c) 78.77 (d) 77.77.
18. A.M of a group of 5 observation is 240; that of a group of 2
observation is 100. If the two group are merged into one then the A.M of
the merged group is
(a) 250 (b) 125
(c) 200 (d) none of these.
19. A.M of a group of n observation is 540; that of a group of m
observation is 460. If the A.M of the merged group is 520 then zm =
@) 21 (b) 22
(©) 31 () none of these.
20. In a simple frequency distribution of 200 items one frequency
against a data is missing; but the mean of all datas is known as 7740. The
missing frequency is
(a) 49 (b)50
(c) 48 (d) none of these.
alBASIC STATISTICS a
21. Two set of dai i} and {y,} are related by y
the median of the first set is 130 then the median of the second set is
(a) 124.5 (b) 130.5
(c) 140.5 (d) none of these.
22. The mode of the frequency distribution
ix . 0 1 2 3
tf i 8 24 36 10
1S.
(a) 0 (b) 1
(2 (d) 3.
23. The mode of the frequency distribution
n 3 0 1 2 3 4
f 3 23 24 21 24 20
are
(a) 0 (b) 1
(©) 2 (d) 3.
24, The A.M of 100 observation is 2.5. So the A.M of 50 of these
observation is more than 2.5
(a) True (b) False.
25, The median of 100 observation is 2.5. So 50 of these observation is
2.5 or more
(a) True (b) False.
26. If you remove the largest data from a group of different datas, the
AM of the remaining datas always changes
(a) True (b) False.
27. If the relation between the two group of observations {x,} and
(y,} is 3x, +4y, =21 and if ¥=3 then pa
9
(a)1 0) 4
(c) 3 (d) none of these.ee
348 ENGINEERING MATHEMATIC;
A
28. If the relation between two set of observations {x} and {y,} ig
-¥, +5 and median of x is 25 then the median of y is
(a) 20 (b) 10
(©) 12.5 (d) none of these.
29. If the relations between two set of observations {x;} and Di} is
2y, ~ 6x; = 6 and mode of the Ist set is 21 then the mode of the second set
is
(a) 13 (b) 29
(©) 55 (@) none of these.
‘30. The number of observations of two groups of datas are in the ratio
2:1 and their A.M are 8 and 128 respectively then the AM of the combined
group is,
(a) 88 (b) 453
(0) 48 (@) none of these.
31, The mean of the observations 1,2,3,-
4H with frequencies
1,2?,3? yn? Tespectively is
3n(n+1) * n(n +1)
© 2en+1) ©) 20n+1)
3n(n +1)
() Qnal (d) none of these
32. The AM of the frequency distribution
x : 5 6 7 ee 14
I E 5 6 7 8 14
is
(a) 10 (b) 1037
() 12
(@) none of these.
* upto nth term is
50/19" -1)_5
(@) 7 (10" -1)-2 © S10" ~1)-3
33. The AM of the datas 5,55,555,-+
1
(©-3,(10"-1)349
the two groups {x,} and {y} are related by
30 and if s.d of the se
-800
‘cond group is 2.6257 then the s.d of the
first group is
(a) 131.29
(c) 134
35. The sd of
3.16. Then the s.d of th
(b) 135.16
(d) none of these.
imum daily temperatures in centrigrade scale is
lose of in Farenheit scale is
(a) 5 (b) 7.1 (c) 5.69 (d) 6.69
C_F-32
(Hints : >= 9 iSthe relation between Centrigrade and Farenheit
scale).
36. 2x; + y; =3 is the relation between two sets of datas {x;} and
{y,}. If o, =3 then o, =
(a) 3 (b) 4 (©) 6 (d) none of these.
37. The standard deviation of the datas -5, -10, -12, -19, -20 is a
positive number
(a) True (b) False.
38. The s.d of n number of observations X1XQ4°5X, be s then the s.d
Of -x,- x24, x, Is —s
(a) True (b) False.
39. The variance of first n natural numbers is
r
(a) w-1 (0) =
7 n-1
n
OD @ Fz
40. The variance of the frequency distribution
x : 1 2 3 eee n
Si : 1 2 3 co nis
i ie (4241)
o {n+2Xn—) ye
(a= 2)(n=1) (d) none of these.
@“y ‘SF
350 ENGINEERING MATHEMATICS- 114
—_——
41. The first moments about 4 of the set of numbers 2, 4, 6, 8 is
(ayo (b) 1 (c) 2 (d) -2
42. The second moment about 4 of the set of numbers 2, 4, 6, 8 is
(a) 0 (b) 4 (c) 6 (d)7
43. The third moment about of 4 of the set of numbers 2, 4, 6, 8 is
(a) 1 (b) 4 (©) 16 (d) 12
44. The first four moments of a distribution are 1, 4, 10 and 46
respectively. Then the third central moment is
@)1 (b) 3 (4 (d) -1
45. Kurtosis reveals the shape of the distribution at the top :
(a) True (b) False
46. The two halves of an asymmetrical distribution are mirror images
of each other
(a) True () False
"47, Two distributions, with same mean, s.d and skewness, must have
same peakedness
(a) True (b) False
48. If the mean and the mode of a given distribution are equal then
skewness is
(@)o (b) -1 (1 (d)
49. Skewness is positive when
(a) mean < mode (b) mean = mode
(c) mean > mode (d) for any mode
50. If mean, mode and s.d are 41, 45 and 8 Tespectively then skewness =
@) 0-5 (b)I (©) 5 (d) -0-5
51. The skewness can not exceed
(a) -3 (b) 0 4 (d) 3
52. If kurtosis has a value less than 3 the distribution is called
(a) leptokurtic
(c) normal
(b) mesokurtic
(d) platykurtic »>
BASIC STATISTICS
aS Ee ee
Answers
1b
8.a
15. c
22. ¢
29.
36.
43..¢
50. d
2d
ab
16.a
23.bd
30. ¢
37.a
44.b
51.4
3.4
10.
17.
24.
31.
38.
45.
52.
Sa Gene
4.b
Ic
18. ¢
25.a
32.b
39.d
46.b
5.c
12.b
19.c
26.a
33.b d
40.0
47.b
6.a
13.
20.
27.
34,
41.
48,
a
a
ie
14.
21.
28.
42.
49.
351