STATISTICS
1. INTRODUCTION :
An average or a central value of a statistical series is the value of the variable which describes the characteristics of the entire distribution.
The following are the five measures of central tendency.
(1) Arithmetic Mean
(2) Geometric Mean
(3) Harmonic Mean
(4) Median
(5) Mode

2. ARITHMETIC MEAN :
Arithmetic mean is the most important among the mathematical means.
According to Horace Secrist,
“The arithmetic mean is the amount secured by dividing the sum of values of the items in a series by their number”.

(1) Simple arithmetic mean in individual series (Ungrouped data)
(i) Direct method : If the series in this case be x1, x2, x3, ..., xn, then the arithmetic mean $\bar{x}$ is given by
$\bar{x} = \frac{x_1 + x_2 + \dots + x_n}{n} = \frac{1}{n}\sum_{i=1}^{n} x_i$
(ii) Short cut method : Arithmetic mean $\bar{x} = A + \frac{\sum d}{n}$,
where A = assumed mean, d = deviation from assumed mean = x - A, where x is the individual item, $\sum d$ = sum of deviations and n = number of items.

(2) Simple arithmetic mean in continuous series (Grouped data)
(i) Direct method : If the terms of the given series be x1, x2, ..., xn and the corresponding frequencies be f1, f2, f3, ..., fn, then the arithmetic mean $\bar{x}$ is given by
$\bar{x} = \frac{f_1 x_1 + f_2 x_2 + \dots + f_n x_n}{f_1 + f_2 + \dots + f_n} = \frac{\sum_{i=1}^{n} f_i x_i}{\sum_{i=1}^{n} f_i}$
(ii) Short cut method : Arithmetic mean $\bar{x} = A + \frac{\sum f (x - A)}{\sum f}$
where A = assumed mean, f = frequency and x - A = deviation of each item from the assumed mean.

(3) Properties of arithmetic mean
(i) Algebraic sum of the deviations of a set of values from their arithmetic mean is zero. If xi/fi, i = 1, 2, ..., n is the frequency distribution, then
$\sum_{i=1}^{n} f_i (x_i - \bar{x}) = 0$, $\bar{x}$ being the mean of the distribution.
(ii) The sum of the squares of the deviations of a set of values is minimum when taken about the mean.
(iii) Mean of the composite series : If $\bar{x}_i$ (i = 1, 2, ..., k) are the means of k component series of sizes $n_i$ (i = 1, 2, ..., k) respectively, then the mean $\bar{x}$ of the composite series obtained on combining the component series is given by
$\bar{x} = \frac{n_1 \bar{x}_1 + n_2 \bar{x}_2 + \dots + n_k \bar{x}_k}{n_1 + n_2 + \dots + n_k} = \frac{\sum_{i=1}^{k} n_i \bar{x}_i}{\sum_{i=1}^{k} n_i}$

3. GEOMETRIC MEAN :
If x1, x2, x3, ..., xn are n values of a variate x, none of them being zero, then the geometric mean (G.M.) is given by
G.M. $= (x_1 \cdot x_2 \cdot x_3 \cdots x_n)^{1/n}$, so that
$\log(\text{G.M.}) = \frac{1}{n}\left(\log x_1 + \log x_2 + \dots + \log x_n\right)$.
In case of a frequency distribution, the G.M. of n values x1, x2, ..., xn of a variate x occurring with frequencies f1, f2, ..., fn is given by
G.M. $= \left(x_1^{f_1} \cdot x_2^{f_2} \cdots x_n^{f_n}\right)^{1/N}$, where N = f1 + f2 + ... + fn.
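The direct and short-cut formulas for the arithmetic mean, the composite mean and the logarithmic form of the geometric mean can be checked numerically. The following is a minimal sketch, not part of the original notes: the function names are illustrative assumptions, and the data are the frequency table of Example 1 (with y = 8), the class of Example 2, and the numbers 4, 8, 16.

```python
import math


def mean_direct(xs, fs=None):
    """Direct method: sum(f * x) / sum(f); every frequency is 1 if fs is omitted."""
    fs = fs if fs is not None else [1] * len(xs)
    return sum(f * x for f, x in zip(fs, xs)) / sum(fs)


def mean_shortcut(xs, A, fs=None):
    """Short-cut method: A + sum(f * (x - A)) / sum(f) for an assumed mean A."""
    fs = fs if fs is not None else [1] * len(xs)
    return A + sum(f * (x - A) for f, x in zip(fs, xs)) / sum(fs)


def geometric_mean(xs, fs=None):
    """G.M. through logarithms: exp(sum(f * log x) / N); needs every x > 0."""
    fs = fs if fs is not None else [1] * len(xs)
    return math.exp(sum(f * math.log(x) for f, x in zip(fs, xs)) / sum(fs))


def composite_mean(sizes, means):
    """Mean of the composite series: sum(n_i * mean_i) / sum(n_i)."""
    return sum(n * m for n, m in zip(sizes, means)) / sum(sizes)


xs = [1, 2, 3, 4, 5]   # variate values of Example 1
fs = [4, 5, 8, 1, 2]   # frequencies of Example 1 with y = 8

print(mean_direct(xs, fs))                 # direct method
print(mean_shortcut(xs, A=3, fs=fs))       # short-cut method, assumed mean 3
print(geometric_mean([4, 8, 16]))          # cube root of 4 * 8 * 16
print(composite_mean([70, 30], [75, 65]))  # boys and girls of Example 2
```

Both mean functions agree for any choice of the assumed mean A, which is why the short-cut method is safe to use with whatever value makes the deviations small.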
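then H.M. $= \frac{f_1 + f_2 + f_3 + \dots + f_n}{\frac{f_1}{x_1} + \frac{f_2}{x_2} + \dots + \frac{f_n}{x_n}} = \frac{N}{\sum_{i=1}^{n} \frac{f_i}{x_i}}$

Note : A.M. gives more weightage to larger values whereas G.M. gives more weightage to smaller values.

5. MEDIAN :
Median is defined as the value of an item or observation above or below which lies an equal number of observations, i.e., the median is the central value of the set of observations provided all the observations are arranged in ascending or descending order.

(1) Calculation of median
(i) Individual series : If the data is raw, arrange it in ascending or descending order. Let n be the number of observations.
If n is odd, Median = value of the $\left(\frac{n+1}{2}\right)^{th}$ item.
If n is even, Median $= \frac{1}{2}\left[\text{value of the } \left(\frac{n}{2}\right)^{th} \text{ item} + \text{value of the } \left(\frac{n}{2} + 1\right)^{th} \text{ item}\right]$
(ii) Discrete series : In this case, we first find the cumulative frequencies of the variables arranged in ascending or descending order and the median is given by
Median $= \left(\frac{n+1}{2}\right)^{th}$ observation, where n is the total (cumulative) frequency.
(iii) For grouped or continuous distributions : In this case, the following formula can be used
Median $= l + \frac{\frac{N}{2} - C}{f} \times i$, where l = lower limit of the median class, f = frequency of the median class, i = width of the median class, N = total frequency and C = cumulative frequency of the class preceding the median class.
Equivalently, Median $= u - \frac{\frac{N}{2} - C}{f} \times i$, where u = upper limit of the median class and C is now the cumulative frequency of the classes above the median class (frequencies cumulated from the top).

As the median divides a distribution into two equal parts, similarly the quartiles, quintiles, deciles and percentiles divide the distribution respectively into 4, 5, 10 and 100 equal parts. The j th quartile is given by
$Q_j = l + \frac{j\frac{N}{4} - C}{f} \times i$; j = 1, 2, 3. Q1 is the lower quartile, Q2 is the median and Q3 is called the upper quartile.

(2) Lower quartile
(i) Discrete series : $Q_1$ = size of the $\left(\frac{n+1}{4}\right)^{th}$ item
(ii) Continuous series : $Q_1 = l + \frac{\frac{N}{4} - C}{f} \times i$

(3) Upper quartile
(i) Discrete series : $Q_3$ = size of the $\left[\frac{3(n+1)}{4}\right]^{th}$ item
(ii) Continuous series : $Q_3 = l + \frac{\frac{3N}{4} - C}{f} \times i$

The k th percentile is given by $P_k = l + \frac{\frac{Nk}{100} - C}{f} \times i$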
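The quartile and percentile formulas for a continuous series all have the same shape, l + ((pN - C)/f) × i, with p = 1/2 for the median, p = j/4 for the quartiles and p = k/100 for the percentiles. The sketch below applies that shape to the marks table of Example 9; `grouped_quantile` is a hypothetical helper name, and it assumes that l is the lower limit, f the frequency and i the width of the class containing the (pN)-th item, with C the cumulative frequency of the preceding classes.

```python
def grouped_quantile(classes, freqs, p):
    """Return l + ((p*N - C) / f) * i for the class that contains the (p*N)-th item.

    classes: list of (lower, upper) class limits in ascending order
    freqs:   class frequencies, in the same order
    p:       fraction of the distribution lying below the quantile (0 < p <= 1)
    """
    N = sum(freqs)
    target = p * N
    C = 0  # cumulative frequency of the classes before the current one
    for (lower, upper), f in zip(classes, freqs):
        if f > 0 and C + f >= target:
            return lower + (target - C) / f * (upper - lower)
        C += f
    raise ValueError("p must satisfy 0 < p <= 1")


# Marks table of Example 9
classes = [(0, 10), (10, 20), (20, 30), (30, 40),
           (40, 50), (50, 60), (60, 70), (70, 80)]
freqs = [2, 18, 30, 45, 35, 20, 6, 3]

q1 = grouped_quantile(classes, freqs, 0.25)      # lower quartile Q1
median = grouped_quantile(classes, freqs, 0.50)  # Q2, the median
q3 = grouped_quantile(classes, freqs, 0.75)      # upper quartile Q3
print(round(q1, 2), round(median, 2), round(q3, 2))
```

For this table the median falls in the class 30-40, in agreement with the cumulative frequency column of Example 9.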
(i) Mean deviation from ungrouped data (or individual series)
N = $\sum f$ = the total frequency
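Short cut method
(i) $\sigma = \sqrt{\frac{\sum f d^2}{N} - \left(\frac{\sum f d}{N}\right)^2}$ (ii) $\sigma = \sqrt{\frac{\sum d^2}{N} - \left(\frac{\sum d}{N}\right)^2}$
where, d = x - A = deviation from the assumed mean A
f = frequency of the item
N = $\sum f$ = sum of frequencies

(4) Square deviation
(i) Root mean square deviation
$S = \sqrt{\frac{1}{N}\sum_{i=1}^{n} f_i (x_i - A)^2}$
where A is any arbitrary number and S is called the root mean square deviation.
(ii) Relation between S.D. and root mean square deviation : If $\sigma$ be the standard deviation and S be the root mean square deviation, then $S^2 = \sigma^2 + d^2$, where $d = \bar{x} - A$.

Coefficient of variation $= \frac{\text{S.D.}}{\bar{x}} \times 100 = \frac{\sigma}{\bar{x}} \times 100$.

Variance of the combined series : If $n_1, n_2$ are the sizes, $\bar{x}_1, \bar{x}_2$ the means and $\sigma_1, \sigma_2$ the standard deviations of two series, then
$\sigma^2 = \frac{n_1(\sigma_1^2 + d_1^2) + n_2(\sigma_2^2 + d_2^2)}{n_1 + n_2}$
where $d_1 = \bar{x}_1 - \bar{x}$, $d_2 = \bar{x}_2 - \bar{x}$ and $\bar{x} = \frac{n_1 \bar{x}_1 + n_2 \bar{x}_2}{n_1 + n_2}$.

Range is widely used in statistical series relating to quality control in production.
Standard deviation $\le$ Range, i.e., variance $\le$ (Range)$^2$.
Empirical relation between measures of dispersion :
Mean deviation $= \frac{4}{5}$ (standard deviation)
S.D. of the first n natural numbers is $\sqrt{\frac{n^2 - 1}{12}}$.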
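The combined-variance formula, the relation S² = σ² + d² and the S.D. of the first n natural numbers can each be compared with a direct computation. This is a minimal sketch under stated assumptions: the two short lists and the helper names are made up for illustration, and `statistics.pvariance` (population variance, dividing by N) is taken as the reference.

```python
import math
import statistics as st


def combined_variance(n1, m1, var1, n2, m2, var2):
    """Variance of two series combined, from their sizes, means and variances."""
    m = (n1 * m1 + n2 * m2) / (n1 + n2)  # mean of the combined series
    d1, d2 = m1 - m, m2 - m              # shifts of the component means
    return (n1 * (var1 + d1 ** 2) + n2 * (var2 + d2 ** 2)) / (n1 + n2)


def mean_square_deviation(xs, A):
    """S^2: mean of the squared deviations taken about an arbitrary number A."""
    return sum((x - A) ** 2 for x in xs) / len(xs)


# Illustrative data (not from the notes)
a = [2, 4, 6]
b = [1, 3, 5, 7, 9]

by_formula = combined_variance(len(a), st.mean(a), st.pvariance(a),
                               len(b), st.mean(b), st.pvariance(b))
direct = st.pvariance(a + b)
print(by_formula, direct)

# S^2 = sigma^2 + d^2 with d = mean - A, here for A = 10
A = 10
lhs = mean_square_deviation(b, A)
rhs = st.pvariance(b) + (st.mean(b) - A) ** 2
print(lhs, rhs)

# Variance of the first n natural numbers versus (n^2 - 1) / 12
n = 10
print(st.pvariance(list(range(1, n + 1))), (n ** 2 - 1) / 12)
```

Because d² is never negative, the second check also shows why the mean square deviation is smallest when A equals the mean.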
SOLVED EXAMPLES

Example 1 :
If the mean of the distribution is 2.6, then the value of y is

Variate x :          1  2  3  4  5
Frequency f of x :   4  5  y  1  2

(a) 24 (b) 13 (c) 8 (d) 3

Solution (c): We know that, Mean $= \frac{\sum_{i=1}^{n} f_i x_i}{\sum_{i=1}^{n} f_i}$
i.e. $2.6 = \frac{1 \times 4 + 2 \times 5 + 3 \times y + 4 \times 1 + 5 \times 2}{4 + 5 + y + 1 + 2}$ or 31.2 + 2.6y = 28 + 3y or 0.4y = 3.2, so y = 8.

Example 2 :
In a class of 100 students there are 70 boys whose average marks in a subject are 75. If the average marks of the complete class are 72, then what are the average marks of the girls?
(a) 73 (b) 65 (c) 68 (d) 74

Solution (b) : Let the average marks of the girl students be x, then
$72 = \frac{70 \times 75 + 30 \times x}{100}$, i.e. 7200 = 5250 + 30x, so x = 65.

Example 3 :
If the mean of the set of numbers x1, x2, x3, ..., xn is $\bar{x}$, then the mean of the numbers $x_i + 2i$, $1 \le i \le n$ is
(a) $\bar{x} + 2n$ (b) $\bar{x} + n + 1$ (c) $\bar{x} + 2$ (d) $\bar{x} + n$

Solution (b) : We know that $\bar{x} = \frac{\sum_{i=1}^{n} x_i}{n}$ i.e., $\sum_{i=1}^{n} x_i = n\bar{x}$
$\frac{\sum_{i=1}^{n} (x_i + 2i)}{n} = \frac{\sum_{i=1}^{n} x_i + 2\sum_{i=1}^{n} i}{n} = \frac{n\bar{x} + 2(1 + 2 + \dots + n)}{n} = \frac{n\bar{x} + 2 \cdot \frac{n(n+1)}{2}}{n} = \bar{x} + (n + 1)$

Example 4 :
The harmonic mean of 4, 8, 16 is
(a) 6.4 (b) 6.7 (c) 6.85 (d) 7.8

Solution (c): H.M. of 4, 8, 16 $= \frac{3}{\frac{1}{4} + \frac{1}{8} + \frac{1}{16}} = \frac{48}{7} = 6.85$

Example 5 :
Solution (b): $M = \frac{x_1 + x_2 + x_3 + \dots + x_n}{n}$ i.e.,
$nM = x_1 + x_2 + x_3 + \dots + x_{n-1} + x_n$
$nM - x_n = x_1 + x_2 + x_3 + \dots + x_{n-1}$
$\frac{nM - x_n + x'}{n} = \frac{x_1 + x_2 + x_3 + \dots + x_{n-1} + x'}{n}$
New average $= \frac{nM - x_n + x'}{n}$

Example 6 :
Mean of 100 items is 49. It was discovered that three items which should have been 60, 70, 80 were wrongly read as 40, 20, 50 respectively. The correct mean is
(a) 48 (b) $82\frac{1}{2}$ (c) 50 (d) 80

Solution (c):
Sum of 100 items = 49 × 100 = 4900
Sum of items added = 60 + 70 + 80 = 210
Sum of items replaced = 40 + 20 + 50 = 110
New sum = 4900 + 210 - 110 = 5000
Correct mean $= \frac{5000}{100} = 50$

Example 7 :
The following data gives the distribution of height of students

Height (in cm)        160  150  152  161  156  154  155
Number of students     12    8    4    4    3    3    7

The median of the distribution is
(a) 154 (b) 155 (c) 160 (d) 161

Solution (b): Arranging the data in ascending order of magnitude, we obtain

Height (in cm)         150  152  154  155  156  160  161
Number of students       8    4    3    7    3   12    4
Cumulative frequency     8   12   15   22   25   37   41

Here, total number of items is 41, i.e., an odd number. Hence, the median is the $\left(\frac{41+1}{2}\right)^{th}$ i.e. 21st item.
From the cumulative frequency table, we find that the median, i.e., the 21st item, is 155.
(All items from the 16th to the 22nd are equal, each = 155)

Example 8 :
The median of a set of 9 distinct observations is 20.5. If each of the largest 4 observations of the set is increased by 2, then the median of the new set
(a) Is increased by 2
(b) Is decreased by 2
(c) Is two times the original median
(d) Remains the same as that of the original set

Solution (d): n = 9, then median term $= \left(\frac{9+1}{2}\right)^{th}$ term = 5th term. Only the last four observations are increased by 2. The median is the 5th observation, which remains unchanged.
There will be no change in the median.

Example 9 :
Compute the median from the following table

Marks obtained   No. of students
0-10              2
10-20            18
20-30            30
30-40            45
40-50            35
50-60            20
60-70             6
70-80             3

(a) 36.55 (b) 35.55
(c) 40.05 (d) None of these

Solution (a):

Marks obtained   No. of students   Cumulative frequency
0-10              2                   2
10-20            18                  20
20-30            30                  50
30-40            45                  95
40-50            35                 130
50-60            20                 150
60-70             6                 156
70-80             3                 159
                 n = $\sum f$ = 159

Here n = 159, which is odd.
Median number $= \frac{1}{2}(n + 1) = \frac{1}{2}(159 + 1) = 80$, which is in the class 30-40 (see the row of cumulative frequency 95, which contains 80).
Applying the grouped-data formula with l = 30, N/2 = 79.5, C = 50, f = 45 and i = 10,
Median $= 30 + \frac{79.5 - 50}{45} \times 10 \approx 36.55$

Example 10 :
A batsman scores runs in 10 innings 38, 70, 48, 34, 42, 55, 63, 46, 54, 44, then the mean deviation is
(a) 8.6 (b) 6.4 (c) 10.6 (d) 9.6

Solution (a): Arranging the given data in ascending order, we have
34, 38, 42, 44, 46, 48, 54, 55, 63, 70
Here median $M = \frac{46 + 48}{2} = 47$
(since n = 10, the median is the mean of the 5th and 6th items)
Mean deviation $= \frac{\sum |x_i - M|}{n} = \frac{\sum |x_i - 47|}{10} = \frac{13 + 9 + 5 + 3 + 1 + 1 + 7 + 8 + 16 + 23}{10} = 8.6$

Example 11 :
S.D. of data is 6. When each observation is increased by 1, then the S.D. of new data is
(a) 5 (b) 7 (c) 6 (d) 8

Solution (c): S.D. and variance of data are not changed when each observation is increased (or decreased) by the same constant.

Example 12 :
In a series of 2n observations, half of them equal a and remaining half equal -a. If the standard deviation of the observations is 2, then |a| equals

Example 13 :
If $\mu$ is the mean of distribution $(y_i, f_i)$, then $\sum f_i (y_i - \mu) =$
(a) M.D. (b) S.D. (c) 0 (d) Relative frequency

Solution (c): We have $\sum f_i (y_i - \mu) = \sum f_i y_i - \mu \sum f_i = \mu \sum f_i - \mu \sum f_i = 0$, since $\mu = \frac{\sum f_i y_i}{\sum f_i}$.

Example 14 :
What is the standard deviation of the following series

Measurements   0-10  10-20  20-30  30-40
Frequency         1      3      4      2

(a) 81 (b) 7.6 (c) 9 (d) 2.26

Solution (c): Take $u_i = \frac{y_i - A}{10}$ with A = 25.

Class    Frequency f_i   y_i   u_i   f_i u_i   f_i u_i^2
0-10      1               5    -2    -2         4
10-20     3              15    -1    -3         3
20-30     4              25     0     0         0
30-40     2              35     1     2         2
Total    10                          -3         9

$\sigma^2 = c^2\left[\frac{\sum f_i u_i^2}{\sum f_i} - \left(\frac{\sum f_i u_i}{\sum f_i}\right)^2\right]$
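$= 10^2\left[\frac{9}{10} - \left(\frac{-3}{10}\right)^2\right] = 90 - 9 = 81$, so $\sigma = 9$.

Example 15 :
In an experiment with 15 observations on x, the following results were available: $\sum x^2 = 2830$, $\sum x = 170$. One observation that was 20 was found to be wrong and was replaced by the correct value 30. Then the corrected variance is
(a) 78.00 (b) 188.66 (c) 177.33 (d) 8.33

Solution (a): $\sum x = 170$, $\sum x^2 = 2830$
Increase in $\sum x$ = 10, then $\sum x' = 170 + 10 = 180$
Increase in $\sum x^2$ = 900 - 400 = 500, then $\sum x'^2 = 2830 + 500 = 3330$
Variance $= \frac{1}{n}\sum x'^2 - \left(\frac{\sum x'}{n}\right)^2 = \frac{3330}{15} - \left(\frac{180}{15}\right)^2 = 222 - 144 = 78$

Example 16 :
The S.D. of a variate x is $\sigma$. The S.D. of the variate $\frac{ax + b}{c}$, where a, b, c are constants, is
(a) $\left(\frac{a}{c}\right)\sigma$ (b) $\left|\frac{a}{c}\right|\sigma$
(c) $\left(\frac{a^2}{c^2}\right)\sigma$ (d) None of these

Solution (b):
Let $y = \frac{ax + b}{c}$ i.e., $y = \frac{a}{c}x + \frac{b}{c}$ i.e. y = Ax + B, where $A = \frac{a}{c}$, $B = \frac{b}{c}$.
$\bar{y} = A\bar{x} + B$
$y - \bar{y} = A(x - \bar{x})$, so $(y - \bar{y})^2 = A^2 (x - \bar{x})^2$
$\sum (y - \bar{y})^2 = A^2 \sum (x - \bar{x})^2$, so $n\sigma_y^2 = A^2 \cdot n\sigma_x^2$, i.e. $\sigma_y^2 = A^2 \sigma_x^2$
$\sigma_y = |A| \sigma_x$, i.e. $\sigma_y = \left|\frac{a}{c}\right| \sigma_x$
Thus, new S.D. $= \left|\frac{a}{c}\right| \sigma$.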
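Several of the solved examples reduce to short arithmetic that is easy to re-run. The sketch below recomputes Examples 4, 10, 14, 15 and 16 with Python's standard `statistics` module. Variable names are illustrative, and the data and constants used for Example 16 are arbitrary assumptions, since that example is stated for a general variate.

```python
import math
import statistics as st

# Example 4: harmonic mean of 4, 8, 16
hm = st.harmonic_mean([4, 8, 16])

# Example 10: mean deviation of the runs about their median
runs = [38, 70, 48, 34, 42, 55, 63, 46, 54, 44]
med = st.median(runs)
md = sum(abs(x - med) for x in runs) / len(runs)

# Example 14: S.D. of grouped data by step deviations u = (y - A) / c
mids = [5, 15, 25, 35]  # class mid-values y_i
freqs = [1, 3, 4, 2]    # class frequencies f_i
A, c = 25, 10           # assumed mean and class width
N = sum(freqs)
us = [(y - A) / c for y in mids]
var_u = (sum(f * u * u for f, u in zip(freqs, us)) / N
         - (sum(f * u for f, u in zip(freqs, us)) / N) ** 2)
sd14 = c * math.sqrt(var_u)

# Example 15: corrected variance after replacing the wrong value 20 by 30
n, sum_x, sum_x2 = 15, 170, 2830
sum_x += 30 - 20
sum_x2 += 30 ** 2 - 20 ** 2
var15 = sum_x2 / n - (sum_x / n) ** 2

# Example 16: S.D. of (a*x + b)/c0 equals |a/c0| times the S.D. of x
xs = [2, 3, 7, 11, 13]  # arbitrary illustrative data
a, b, c0 = -3, 5, 2     # arbitrary constants; a < 0 shows why |a/c0| is needed
ys = [(a * x + b) / c0 for x in xs]
sd_ratio = st.pstdev(ys) / st.pstdev(xs)

print(hm, md, sd14, var15, sd_ratio)
```

Choosing a negative constant in the last check makes the role of the absolute value visible: the ratio of the two standard deviations comes out as 1.5, not -1.5.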
15. Statement I The average marks of boys in a class is 52 and that of girls is 42. The average marks of boys and girls combined is 50. The percentage of boys in the class is 80%.
Statement II Mean marks scored by the students of a class is 53. The mean marks of the girls is 55 and the mean marks of the boys is 50. The percentage of girls in the class is 64%.
(a) Only statement I is true (b) Only statement II is true
(c) Both statements are true (d) Both statements are false
16. The variance of the first n natural numbers is
(a) $\frac{n^2 - 1}{12}$ (b) $\frac{n^2 - 1}{6}$
(c) $\frac{n^2 + 1}{6}$ (d) $\frac{n^2 + 1}{12}$
17. If v is the variance and $\sigma$ is the standard deviation, then
(a) $v^2 = \sigma$ (b) $v = \sigma^2$
(c) $v = \frac{1}{\sigma}$ (d) $v = \frac{1}{\sigma^2}$
18. If each observation of a raw data whose variance is $\sigma^2$, is increased by $\lambda$, then the variance of the new set is
(a) $\sigma^2$ (b) $\lambda^2 \sigma^2$
(c) $\lambda + \sigma^2$ (d) $\lambda^2 + \sigma^2$
19. ...
(a) $\sigma^2$ (b) $\lambda^2 \sigma^2$
(c) $\lambda + \sigma^2$ (d) $\lambda^2 + \sigma^2$
20. If the variance of observations x1, x2, ..., xn is $\sigma^2$, then the variance of observations ax1, ax2, ..., axn, $a \ne 0$ is
(a) $\sigma^2$ (b) $a\sigma^2$
(c) $a^2 \sigma^2$ (d) $\frac{\sigma^2}{a^2}$
21. The standard deviation of 25 numbers is 40. If each of the numbers is increased by 5, then the new standard deviation will be
(a) 40 (b) 45
(c) $40 + \frac{21}{25}$ (d) None of these
22. The median of a set of 9 distinct observations is 20.5. If each of the largest 4 observations of the set is increased by 2, then the median of the new set is
(a) increased by 2 (b) decreased by 2
(c) two times the original median (d) remains the same as that of original set
23. The mean age of a combined group of men and women is 25 yrs. If the mean age of the group of men is 26 and that of the group of women is 21, then the ratio of men and women in the group is
(a) 1 : 4 (b) 4 : 1
(c) 3 : 1 (d) 1 : 3
24. The mean of five observations is 4 and their variance is 5.2. If three observations are 1, 2, and 6, the other two are
(a) 2 and 9 (b) 3 and 8
(c) 4 and 7 (d) 5 and 6
25. Consider any set of observations x1, x2, x3, ..., x101; it being given that x1 < x2 < x3 < ... < x101; then the mean deviation of this set of observations about a point k is minimum when k equals
(a) x1 (b) x51
26. Statement I The mean and variance for first n natural numbers are $\frac{n + 1}{2}$ and $\frac{n^2 - 1}{12}$, respectively.
Statement II The mean and variance for first 10 positive multiples of 3 are 16.5 and 74.25, respectively.
(a) Only statement I is true (b) Only statement II is true
(c) Both statements are true (d) Both statements are false
27. The standard deviation of the data 6, 5, 9, 13, 12, 8, 10 is
(a) $\sqrt{\frac{52}{7}}$ (b) $\frac{52}{7}$
(c) $\sqrt{6}$ (d) 6
28. Variance of the data 2, 4, 5, 6, 8, 17 is 23.33. Then, variance of the data 4, 8, 10, 12, 16, 34 will be
(a) 23.33 (b) 25.33
(c) 93.32 (d) 98.32
29. The mean of 100 observations is 50 and their standard deviation is 5. The sum of squares of all observations is

Further, another set of 15 observations x1, x2, ..., x15 (also in seconds) is now available and we have $\sum_{i=1}^{15} x_i = 279$ and $\sum_{i=1}^{15} x_i^2 = 5524$. The standard deviation of all 40
(c) $\frac{1}{n}$ (d) $\frac{2}{n}$
12. Statement-1 : The variance of first n even natural numbers is $\frac{n^2 - 1}{4}$
Statement-2 : The sum of first n natural numbers is $\frac{n(n + 1)}{2}$ and the sum of squares of first n natural numbers is $\frac{n(n + 1)(2n + 1)}{6}$ (2009)
(a) Statement 1 is true, Statement 2 is true; Statement 2 is not a correct explanation for Statement 1.
(b) Statement 1 is true, Statement 2 is false
(c) Statement 1 is false, Statement 2 is true.
(d) Statement 1 is true, Statement 2 is true; Statement 2 is a correct explanation for Statement 1
13. For two data sets, each of size 5, the variances are given to be 4 and 5 and the corresponding means are given to be 2 and 4, respectively. The variance of the combined data set is (2010)
(a) $\frac{5}{2}$ (b) $\frac{11}{2}$
(c) 6 (d) $\frac{13}{2}$
14. If the mean deviation about the median of the numbers a, 2a, ..., 50a is 50, then |a| equals (2011)
(a) 4 (b) 5
(c) 2 (d) 3
15. Let $x_1, x_2, \dots, x_n$ be n observations, and let $\bar{x}$ be their arithmetic mean and $\sigma^2$ be their variance. (2012)
Statement 1 : Variance of $2x_1, 2x_2, \dots, 2x_n$ is $4\sigma^2$.
Statement 2 : Arithmetic mean of $2x_1, 2x_2, \dots, 2x_n$ is $4\bar{x}$.
(a) Statement 1 is true, Statement 2 is true; Statement 2 is not a correct explanation for Statement 1.
(b) Statement 1 is true, Statement 2 is false
(c) Statement 1 is false, Statement 2 is true.
(d) Statement 1 is true, Statement 2 is true; Statement 2 is a correct explanation for Statement 1
16. All the students of a class performed poorly in Mathematics. The teacher decided to give grace marks of 10 to each of the students. Which of the following statistical measures will not change even after the grace marks were given ? (2013)
(a) median (b) mode
(c) variance (d) mean
17. The mean of a data set consisting of 20 observations is 40. If one observation 53 was wrongly recorded as 33, then the correct mean will be (2013/Online Set 1)
(a) 41 (b) 49
(c) 40.5 (d) 42.5
18. Mean of 5 observations is 7. If four of these observations are 6, 7, 8, 10 and one is missing, then the variance of all the five observations is (2013/Online Set 2)
(a) 4 (b) 6
(c) 8 (d) 2
19. If the median and the range of four numbers {x, y, 2x + y, x - y}, where 0 < y < x < 2y, are 10 and 28 respectively, then the mean of the numbers is (2013/Online Set 3)
(a) 18 (b) 10
(c) 5 (d) 14
20. In a set of 2n observations, half of them are equal to a and the remaining half are equal to -a. If the standard deviation of all the observations is 2, then the value of |a| is (2013/Online Set 4)
(a) 2 (b) $\sqrt{2}$
(c) 4 (d) $2\sqrt{2}$
21. The variance of first 50 even natural numbers is (2014)
(a) 833 (b) 437
(c) $\frac{437}{4}$ (d) $\frac{833}{4}$
22. In a set of 2n distinct observations, each of the observations below the median of all the observations is increased by 5 and each of the remaining observations is decreased by 3. Then the mean of the new set of observations: (2014/Online Set 1)
(a) increases by 1 (b) decreases by 1
(c) decreases by 2 (d) increases by 2
23. Let $\bar{X}$ and M.D. be the mean and the mean deviation about $\bar{X}$ of n observations $x_i$, i = 1, 2, ..., n. If each of the observations is increased by 5, then the new mean and the mean deviation about the new mean, respectively, are : (2014/Online Set 3)
(a) $\bar{X}$, M.D. (b) $\bar{X} + 5$, M.D.
(c) $\bar{X}$, M.D. + 5 (d) $\bar{X} + 5$, M.D. + 5
24. Let $\bar{x}$, M and $\sigma^2$ be respectively the mean, mode and variance of n observations $x_1, x_2, \dots, x_n$ and $d_i = -x_i - a$, i = 1, 2, ..., n, where a is any number. (2014/Online Set 4)
Statement-1 : Variance of $d_1, d_2, \dots, d_n$ is $\sigma^2$
Statement-2 : Mean and mode of $d_1, d_2, \dots, d_n$ are $-\bar{x} - a$ and $-M - a$, respectively.
(a) Statement-1 and Statement-2 are both false
(b) Statement-1 and Statement-2 are both true
(c) Statement-1 is true and Statement-2 is false
(d) Statement-1 is false and Statement-2 is true
25. The mean of the data set comprising 16 observations is 16. If one of the observations valued 16 is deleted and three new observations valued 3, 4 and 5 are added to the data, then the mean of the resultant data is : (2015)
(a) 16.0 (b) 15.8
(c) 14.0 (d) 16.8
26. If the standard deviation of the numbers 2, 3, a and 11 is 3.5, then which of the following is true ? (2016)
29. The mean age of 25 teachers in a school is 40 years. A teacher retires at the age of 60 years and a new teacher is appointed in his place. If now the mean age of the teachers in this school is 39 years, then the age (in years) of the newly appointed teacher is (2017/Online Set 1)
(a) 25 (b) 35
(c) 30 (d) 40
30. The sum of 100 observations and the sum of their squares are 400 and 2474, respectively. Later on, three observations 3, 4 and 5, were found to be incorrect. If the incorrect observations are omitted, then the variance of the remaining observations is (2017/Online Set 2)
(a) 8.00 (b) 8.25
(c) 9.00 (d) 8.50
31. If $\sum_{i=1}^{9} (x_i - 5) = 9$ and $\sum_{i=1}^{9} (x_i - 5)^2 = 45$, then the standard deviation of the 9 items $x_1, x_2, \dots, x_9$ is : (2018)
(a) 9 (b) 4
(c) 2 (d) 3
32. The mean of a set of 30 observations is 75. If each observation is multiplied by a non-zero number $\lambda$ and then each of them is decreased by 25, their mean remains the same. Then $\lambda$ is equal to : (2018/Online Set 1)
(a) $\frac{1}{3}$ (b) $\frac{2}{3}$
(c) $\frac{4}{3}$ (d) $\frac{10}{3}$
33. If the mean of the data : 7, 8, 9, 7, 8, 7, $\lambda$, 8 is 8, then the variance of this data is : (2018/Online Set 2)
ANSWER KEY
EXERCISE - 1 : BASIC OBJECTIVE QUESTIONS
1. (c) 2. (d) 3. (b) 4. (b) 5. (d) 6. (a) 7. (d) 8. (c) 9. (b) 10. (a)
11. (c) 12. (b) 13. (c) 14. (a) 15. (a) 16. (a) 17. (b) 18. (a) 19. (b) 20. (c)
21. (a) 22. (d) 23. (b) 24. (c) 25. (b) 26. (b) 27. (a) 28. (c) 29. (c) 30. (a)
31. (c) 32. (a) 33. (c) 34. (c) 35. (c) 36. (b) 37. (d) 38. (a) 39. (c) 40. (a)