Professional Documents
Culture Documents
Chapter 3 Probability and Statistics
Chapter 3 Probability and Statistics
Cont…
For example, you want to describe the age of
students attending the Adama Science
Technology University. Therefore if you randomly
ask 700 students for their age, the data will be
as follows:
Cont…
5 10
xi x1 x2 x3 x4 x5 , xi x4 x5 x6 x7 x8 x9 x10
i 1 i 4
x
i 1
i x1 x2 x3 x4 x5 5 7 7 6 8 33
Properties of Summation
Exercise
x i
x1 x2 ... xn
x i 1
n n
Cont..
If we take an entire population the mean is denoted by μ and
is given by: n
x x x ... x i
i 1
1 2 N
N N
Where N stands for the total number of observations in the
population.
Example 1: Find the mean of the mark of 9 students (out of
100) given below: 52, 75, 70, 67, 35, 52, 70, 70, and 49.
Solution: n = 9
n
x i
x1 x2 ... x9 52 75 70 67 35 52 70 70 49 540
x i 1
60.
n 9 9 9
Exercise: Find the mean of the following data: 10.5, 2.4 ,3.6, 5.9 & 8.7
f i xi
f x f 2 x2 ... f k xk fx i i k
x i 1
1 1 i 1
, n fi
k
f1 f 2 ... f k n
fi 1
i
i 1
No of 1 2 3 4 5 6 7 Total
children
frequency 5 9 12 17 14 10 6 73
Solution:
k
fx i i
5 1 9 2 ... 6 7 299
x i 1
4.09 4
n 73 73
f i xi
x i 1
where n
k is the number of classes
n is total frequencies
xi is the ith class mark
Cont…
Example: Find the mean for the following continuous
data.
C.L 1-5 6-10 11-15 16-20 21-25 26-30 31-35 Total
fi 4 8 12 6 3 4 3 40
C.M (xi) 3 8 13 18 23 28 33
fixi 12 64 156 108 69 112 99 620
Solution:
k
f i xi
4 3 8 8 ... 3 33
x i 1
n 40
620
15.5
40
Cont…
Exercise
The following table gives the daily wages of
laborers. Calculate the average daily wages paid
to a laborer.
Wages in dollar 11-13 13-15 15-17 17-19 19-21 21-23 23-25
Number of 3 4 5 6 6 4 3
laborer
1. The algebraic n
sum of deviations from the mean is always
zero. i.e. ( xi x) 0
i 1
2. The sum of squares of deviations from the mean is
n
Cont…
Example: The mean of 200 observations was 50. Later on,
it was discovered that two observations were wrongly
read as 92 and 8 instead of 192 and 88. Find the correct
mean.
Solution: n = 200, wrong mean = 50
wrong values = 92+8 = 100
correct values = 192+88 = 280
Correct values - wrong values
Correct Mean Wrong Mean
n
280 - 100
50 50.9.
200
Cont…
5. Combined mean:
n1 x1 n2 x2 ... nk xk
xc
n1 n2 ... nk
Example: Last year there were three sections taking Probability &
Statistics course course in ASTU. At the end of the semester,
the three sections got average marks of 80, 83 and 76. There
were 28, 32 and 35 students in each section respectively. Find the
mean mark for the entire students.
n1 x1 n2 x2 n3 x3 7556
xc 79.54
n1 n2 n3 95
Weighted Mean ( 𝒙𝒘 )
In the calculation of arithmetic mean, all items
were assumed to be of equally importance.
That is, each value in the data set has equal weight.
When the observations have different weight, we
use weighted average.
Weights are assigned to each item in proportion
to its relative importance.
If 𝑥1 , 𝑥2 ,…, 𝑥𝑛 represent values of the
observations and 𝑤1 , 𝑤2 ,…, 𝑤𝑛 are the
corresponding weights, then the weighted mean is
given by
Cont…
n
w x i i
w1 x1 w2 x2 ... wn xn
xw i 1
n
w1 w2 ... wn
w
i 1
i
w x i i
48
x w GPA i 1
n
3.0
16
w i 1
i
Example
Find the G. M of a) 3 and 12 b) 2, 4 and 8
Solution:
a) 𝐺. 𝑀 = 𝑥1 . 𝑥2 = 3 × 12 = 36 = 6
3 3
b) 𝐺. 𝑀= 3 𝑥1 . 𝑥2 . 𝑥3 = 2×4×8= 64 = 4
Properties of geometric mean
• It is less affected by extreme values.
• It takes each and every observation into consideration.
• If the value of one observation is zero its values
becomes zero.
Harmonic Mean
It is a suitable measure of central tendency when the data
relates to speed, rate and time.
The harmonic mean of n values is defined as n divided by
the sum of their reciprocal.
𝒏
𝑯. 𝑴 =
𝟏 𝟏 𝟏
+ + ⋯+
𝒙𝟏 𝒙𝟐 𝒙𝒏
H.M for discrete and continuous data:
𝒏
𝑯. 𝑴 = 𝒘𝒉𝒆𝒓𝒆 𝒏 = 𝒇𝒊
𝒇𝟏 𝒇𝟐 𝒇𝒌
+ + ⋯+
𝒙𝟏 𝒙𝟐 𝒙𝒌
For continuous data, xi is the ith class mark.
Median
o It divided a given set of data into two equal parts
o It is obtained by arranging the data in an increasing or decreasing
order of magnitude
o It denoted by 𝑥
Case 1: Median for individual series of data
To determine the median:
arranging the data in an increasing or decreasing order
Identify the total number of observations is either odd or even.
Then,
(𝑛:1
2
)𝑡ℎ 𝑣𝑎𝑙𝑢𝑒 𝑖𝑓 𝑛 𝑖𝑠 𝑜𝑑𝑑
𝑥= (𝑛 2 )𝑡ℎ 𝑣𝑎𝑙𝑢𝑒 + (𝑛 2 +1)𝑡ℎ 𝑣𝑎𝑙𝑢𝑒
𝑖𝑓 𝑛 𝑖𝑠 𝑒𝑣𝑒𝑛
2
Cont…
Example: Find the median of the following discrete
data
Number of 1 2 3 4 5 6 7 Total
children
fi 5 9 12 17 14 10 6 73
L.C.F 5 14 26 43 57 67 73
Solution: n = 73 is odd
73+1 th
𝑥 = (𝑛:1
2
)𝑡ℎ 𝑣 = ( ) v= 37th v = 4
2
Where
𝐿𝑚𝑒𝑑 is the LCB of the median class
w is class width
𝑓𝑚𝑒𝑑 is frequency of the median class
C.F is the L.C.F of the class immediately preceding the
median class
Median Class:- is the class contains the minimum L.C.F
greater than or equal to n/2.
Solution: n = 40
𝑛 40
= = 20.
2 2
The minimum L.C.F greater than or equal to 20 is 24.
Therefore, the 3rd class is the median class.
Thus, 𝐿𝑚𝑒𝑑 =10.5, w=5, 𝑓𝑚𝑒𝑑 =12 , C.F = 12
𝑥 = 𝐿𝑚𝑒𝑑 + 𝑓 𝑤 𝑛2−𝐶.𝐹 =10.5+12 5
20−12 = 13.83
𝑚𝑒𝑑
Merits of median
• It is less affected by extreme values.
• Median can be calculated even in case of open-ended
intervals.
• It can be computed for ratio, interval, and ordinal
level of data.
Demerits of median
Its value is not determined by each & every
observation.
It is not a good representative of the data if the
number of items (data) is small.
The arrangement of items in order of magnitude is
sometimes very boring process if the number of items
is very large.
Wages in 126 and 127-135 136-144 145-153 154-162 163-171 172 and
Birr below above
No. of 3 5 9 12 5 4 2
Employees
Mode
It is the third measure of central tendency.
The mode is the value that occurs most often in the data
set.
The mode is the value with the highest frequency
It denoted by 𝑥 (read as “x-hat”).
A data set may not have a mode or may have more than
one mode.
A data set that has only one value that occurs with the
greatest frequency is said to be unimodal.
If a data set has two values that occur with the same
greatest frequency, both values are considered to be the
mode and the data set is said to be bimodal.
Cont…
If a data set has more than two values that occur with the
same greatest frequency, each value is used as the mode,
and the data set is said to be multimodal.
Example: Find the mode of the following data.
Data X: 3, 4, 6, 12, 31, 8, 9, 8. The Mode (𝑥 ) = 8
Data Y: 6, 8, 12, 13, 11, 12, 6. The Mode (𝑥 ) = 6 and 12
Data Z: 2, 6, 3, 5, 7, 8, 12, 11. No Mode
Exercise: The marks obtained by ten students in a semester
exam in statistics (out of 100) are: 70, 65, 68, 70,75, 73, 80,
70, 83 and 86. Find the mode of the students’ marks.
Merits of mode
Mode is not affected by extreme values.
We can change the size of the observations without
changing the mode.
It can be computed for all level of data i.e. ratio, interval,
ordinal or nominal.
Demerits of mode
It may not exist.
It does not take every value into consideration.
Mode may not exist in the series and if it exists it may
not be unique.
Quartiles
Quartiles: are values which divide the data set in to
approximately four equal parts, denoted by 𝑄1 ,
𝑄2 𝑎𝑛𝑑 𝑄3 .
𝑄1 - the first quartile (the lower quartile)
- 25% of the observations value is below it.
𝑄2 - the 2nd quartile
- 50% of the observations value is
below/above
𝑄3 - the 3rd quartile (the upper quartile)
- 75 % the observations value is below it.
xi 16 11 12 13 14 15 10 17 18
fi 20 8 25 48 65 40 2 9 2
Solution:
1(𝑛+1) 𝑡ℎ (219+1) 𝑡ℎ
𝑄1 = 𝑣 = 𝑣 = 55th v = 13.
4 4
2(𝑛+1) 𝑡ℎ 2(219+1) 𝑡ℎ
𝑄2 = 𝑣 = 𝑣 = 110th v = 14
4 4
3(𝑛+1) 𝑡ℎ 3(219+1) 𝑡ℎ
𝑄3 = 𝑣 = 𝑣 = 165th v = 15
4 4
where
𝐿𝑄𝑖 is the LCB of the ith quartile class,
𝑓𝑄𝑖 is frequency of the ith quartile class,
𝐶. 𝐹 is the L.C.F of the class immediately
preceding the ith quartile class
𝑄1 = 𝐿𝑄1 + 𝑓𝑤 𝑛
−𝐶.𝐹 , 𝑄2 = 𝐿𝑄2 + 𝑓𝑤 2𝑛
−𝐶.𝐹 and
𝑄1 4 𝑄2 4
𝑄3 = 𝐿𝑄3 + 𝑓𝑤 3𝑛
−𝐶.𝐹
𝑄3 4
fi 4 8 15 5 9 5 4
L.C.F 4 12 27 32 41 46 50
Solution:
𝑄1 : 𝑛4 = 50
4
=12.5.
𝑄2 : 2𝑛
4
=
2×50
4
=25.
𝑄3 : 3𝑛
4
=
3×50
4
= 37.5.
Thank you!!!