Probability – Statistics
Lecture 1
What is Statistics?
Statistics is the science of
conducting studies to collect,
organize, present, summarize,
analyze, and draw conclusions
(decisions) from data.
Types of Data
(i) Quantitative: consists of numbers
representing counts or measurements.
Sample mean:
$$\bar{X} = \frac{\sum_{i=1}^{n} X_i}{n}$$
Example 1
The data represent the number of days off per year for a
sample of individuals selected from 9 different countries.
Find the sample mean.
20 26 40 36 23
42 35 24 30
Solution
• n = 9.
$$\bar{X} = \frac{20 + 26 + 40 + 36 + 23 + 42 + 35 + 24 + 30}{9} = \frac{276}{9} \approx 30.7$$
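The arithmetic in Example 1 can be checked with a few lines of Python. This is a minimal sketch; the variable names (`days_off`, `sample_mean`) are illustrative and not part of the lecture.

```python
# Days off per year for the 9 sampled countries (Example 1).
days_off = [20, 26, 40, 36, 23, 42, 35, 24, 30]

n = len(days_off)          # sample size
total = sum(days_off)      # sum of all observations
sample_mean = total / n    # X-bar = (sum of X_i) / n

print(f"n = {n}, sum = {total}, mean = {sample_mean:.1f}")
# prints: n = 9, sum = 276, mean = 30.7
```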
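If n is odd,
then MD (the median) = the middle value of the sorted data,
i.e., the value in position $\frac{n+1}{2}$.
If n is even,
then MD = (sum of the two middle points) / 2,
i.e., the average of the values in positions $\frac{n}{2}$ and $\frac{n}{2} + 1$.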
1 2 6 7 12
13 2 6 9 5
18 7 3 15 15
4 17 1 14 5
Step 1: Sorting…
1 1 2 2 3
4 5 5 6 6
7 7 9 12 13
14 15 15 17 18
Step 2: n = 20, which is an even number, so the two values
in the middle are in positions n/2 = 20/2 = 10 and
(n/2) + 1 = 11.
In the sorted data, the values in positions 10 and 11 are 6 and 7,
so MD = (6 + 7)/2 = 6.5.
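The odd/even rule for the median can be sketched in Python as follows. The function name `median` and the variable names are assumptions made for illustration; the data are the 20 values of this example.

```python
def median(values):
    """Return the median using the odd/even position rule."""
    ordered = sorted(values)
    n = len(ordered)
    mid = n // 2
    if n % 2 == 1:
        # Odd n: the single middle value, at 1-based position (n + 1) / 2.
        return ordered[mid]
    # Even n: average of the values at 1-based positions n/2 and n/2 + 1.
    return (ordered[mid - 1] + ordered[mid]) / 2


data = [1, 2, 6, 7, 12, 13, 2, 6, 9, 5,
        18, 7, 3, 15, 15, 4, 17, 1, 14, 5]
md = median(data)
print(md)  # prints: 6.5
```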
8 9 9 14 8 8
10 7 6 9 7 8
10 14 11 8 14 11
Solution: Sort the data for convenience.
6 7 7 8 8 8
8 8 9 9 9 10
10 11 11 14 14 14
Identify the value with the highest frequency:
the value 8 occurs 5 times, more often than any other value,
so the mode is 8.
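Finding the value with the highest frequency can be sketched with `collections.Counter` from the Python standard library. The variable names are illustrative; the list comprehension keeps every value that ties for the top count, which is an assumption about how to treat multimodal data rather than something the lecture states.

```python
from collections import Counter

data = [8, 9, 9, 14, 8, 8,
        10, 7, 6, 9, 7, 8,
        10, 14, 11, 8, 14, 11]

counts = Counter(data)                 # value -> number of occurrences
top_count = max(counts.values())       # highest frequency in the data
modes = sorted(v for v, c in counts.items() if c == top_count)

print(modes, top_count)  # prints: [8] 5
```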
A: 55 60 65 70 75 80 85 90 95 100
B: 73 74 75 76 77 78 79 80 81 82
$$\mu_A = \frac{55 + 60 + 65 + 70 + 75 + 80 + 85 + 90 + 95 + 100}{10} = 77.5$$
$$\mu_B = \frac{73 + 74 + 75 + 76 + 77 + 78 + 79 + 80 + 81 + 82}{10} = 77.5$$
• The two distributions have the same mean!
• Are they the same?
• How exactly are they different?
– The difference is in the spread of values around the mean.
– In population B, the data values are clustered closer to the
mean.
– The grades of population B are more consistent.
Measures of Variation
Dispersion
• We’ll consider two measures of variation:
1. The variance.
$$\sigma^2 = \frac{\sum_{i=1}^{N} (X_i - \mu)^2}{N}$$
• Note
– The variance is an average.
– It is the mean of the square of distances to the
population mean.
– The squaring makes every distance non-negative, so deviations
above and below the mean do not cancel.
Example (7): Computing the variance of population A in Example (6)
55 60 65 70 75 80 85 90 95 100
1. We know that µ = 77.5
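The remaining steps (subtract the mean from each value, square, and average over N) can be sketched in Python. This is a minimal sketch under the population-variance definition given above; the function name `population_variance` is illustrative. Population B is included to show numerically that it is less spread out than A.

```python
def population_variance(values):
    """Mean of the squared distances to the population mean."""
    N = len(values)
    mu = sum(values) / N
    return sum((x - mu) ** 2 for x in values) / N


A = [55, 60, 65, 70, 75, 80, 85, 90, 95, 100]
B = [73, 74, 75, 76, 77, 78, 79, 80, 81, 82]

var_A = population_variance(A)
var_B = population_variance(B)
print(var_A, var_B)  # prints: 206.25 8.25
```

Both populations have mean 77.5, but the variance of A is much larger, which matches the observation that the values of B cluster closer to the mean.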
Sample variance (shortcut formula):
$$S^2 = \frac{n \sum_{i=1}^{n} x_i^2 - \left( \sum_{i=1}^{n} x_i \right)^2}{n(n-1)}$$
2. The Standard Deviation
1. Find the sum of the values
$$\sum_{i=1}^{6} x_i = 11.2 + 11.9 + 12.0 + 12.8 + 13.4 + 14.3 = 75.6$$
2. Square each value and find the sum
$$\sum_{i=1}^{6} x_i^2 = (11.2)^2 + (11.9)^2 + (12.0)^2 + (12.8)^2 + (13.4)^2 + (14.3)^2 = 958.94$$
3. Substitute into the formula
$$S^2 = \frac{n \sum_{i=1}^{n} x_i^2 - \left( \sum_{i=1}^{n} x_i \right)^2}{n(n-1)} = \frac{6(958.94) - (75.6)^2}{6 \cdot 5} = 1.276$$
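The three steps can be reproduced in Python, and the shortcut formula can be compared against the definition of the sample variance (sum of squared deviations divided by n − 1). This is a sketch with illustrative variable names; the two results should agree up to floating-point rounding.

```python
x = [11.2, 11.9, 12.0, 12.8, 13.4, 14.3]
n = len(x)

sum_x = sum(x)                       # step 1: sum of the values
sum_x2 = sum(v * v for v in x)       # step 2: sum of the squared values

# Step 3: shortcut formula  S^2 = (n*sum(x^2) - (sum x)^2) / (n*(n-1))
s2_shortcut = (n * sum_x2 - sum_x ** 2) / (n * (n - 1))

# Cross-check with the definition  S^2 = sum((x - mean)^2) / (n - 1)
mean = sum_x / n
s2_definition = sum((v - mean) ** 2 for v in x) / (n - 1)

print(f"{sum_x:.1f} {sum_x2:.2f} {s2_shortcut:.3f} {s2_definition:.3f}")
```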
$$\bar{X} = \frac{4 + 5 + 8 + 10 + 13}{5} = 8$$
So,
$$S^2 = \frac{\sum (x - \bar{X})^2}{n - 1} = \frac{(4-8)^2 + (5-8)^2 + (8-8)^2 + (10-8)^2 + (13-8)^2}{4} = 13.5$$
Also,
$$S = \sqrt{13.5} \approx 3.67$$
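Python's standard `statistics` module provides `variance` and `stdev`, which use the n − 1 divisor of the sample formulas, so it can serve as a cross-check of this calculation. The variable names below are illustrative.

```python
import statistics

data = [4, 5, 8, 10, 13]

x_bar = statistics.mean(data)     # sample mean
s2 = statistics.variance(data)    # sample variance (divides by n - 1)
s = statistics.stdev(data)        # sample standard deviation = sqrt(s2)

print(f"mean = {x_bar}, S^2 = {s2}, S = {s:.2f}")
# prints: mean = 8, S^2 = 13.5, S = 3.67
```

For the population versions (divisor N), the same module offers `pvariance` and `pstdev`.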
3] Coefficient of Variation (C.V)
$$\text{C.V} = \frac{S}{\bar{X}} \cdot 100\%$$
Population Coefficient of variation
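$$\text{C.V} = \frac{\sigma}{\mu} \cdot 100\%$$
The coefficient of variation is generally expressed as a percentage. This measure
allows us to compare the relative variability of two data sets, even when they are
measured in different units.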
Example (10): Page 29. Measurements made with one micrometer of the
diameter of a ball bearing have a mean of 3.92 mm and a standard
deviation of 0.0152 mm, whereas measurements made with another
micrometer of the unstretched length of a spring have a mean of 1.54
inches and a standard deviation of 0.0086 inches. Which of these two
measuring instruments is relatively more precise?
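Solution: Calculating the two coefficients of variation, we get

Ball bearing                                Spring
$\bar{X}$ = 3.92 mm                         $\bar{X}$ = 1.54 inches
S = 0.0152 mm                               S = 0.0086 inches
C.V = (0.0152 / 3.92) · 100% ≈ 0.39%        C.V = (0.0086 / 1.54) · 100% ≈ 0.56%

Since 0.39% < 0.56%, the measurements made with the first micrometer (the ball
bearing) are relatively more precise.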
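The comparison can be sketched in Python. The helper name `coefficient_of_variation` is an assumption made for illustration; it simply applies C.V = (S / mean) · 100%.

```python
def coefficient_of_variation(std_dev, mean):
    """Return the coefficient of variation as a percentage."""
    return std_dev / mean * 100


cv_ball = coefficient_of_variation(0.0152, 3.92)     # ball bearing, in mm
cv_spring = coefficient_of_variation(0.0086, 1.54)   # spring, in inches

print(f"ball bearing: {cv_ball:.2f}%, spring: {cv_spring:.2f}%")
# prints: ball bearing: 0.39%, spring: 0.56%
```

Because the units cancel in the ratio, the two percentages are directly comparable even though one data set is in millimetres and the other in inches.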
Measures of Position
Quartiles, Deciles and Percentiles:
Used to locate the relative position of a data
value in a data set.
Deciles: the nine values D1, D2, ..., D9 that divide the data into ten equal parts.
3-Percentiles
99 Percentiles
The values dividing the data into one hundred parts are
called percentiles and are denoted by P1, P2,....., P99.
D9 = P90
Sample Percentiles
• How to get the 100p-th percentile?
Solution:
1- Sort: 2, 3, 5, 6, 8, 10, 12, 15, 18, 20.
Step 2
Q1 = (170 + 175)/2 = 172.5
Q2 = (190 + 210)/2 = 200.0
Q3 = (235 + 240)/2 = 237.5
Definitions
1- Range = Max. value – Min. value
2- Interquartile Range (IQR) = Q3 – Q1
3- Semi-interquartile Range = (Q3 – Q1)/2
Example (14): Back to Example (13). Find the IQR.
Solution:
• Step 1:
Rank data and divide into 4 parts:
150, 165, 170 | 175, 180, 190 | 210, 210, 235 | 240, 260, 270
(Q1, Q2, and Q3 fall at the three boundaries between the four groups.)
Step 2
Q1 = (170 + 175)/2 = 172.5
Q2 = (190 + 210)/2 = 200.0
Q3 = (235 + 240)/2 = 237.5
Step 3: Calculate IQR = Q3 – Q1
Q3 – Q1 = 237.5 – 172.5 = 65
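The three steps can be sketched in Python. This sketch assumes the "median of halves" convention (Q1 is the median of the lower half of the sorted data, Q3 the median of the upper half), which reproduces the values above for this data set; other quartile conventions exist and can give slightly different results. Variable names are illustrative.

```python
from statistics import median

data = [150, 165, 170, 175, 180, 190, 210, 210, 235, 240, 260, 270]

ordered = sorted(data)                 # Step 1: rank the data
n = len(ordered)
lower = ordered[: n // 2]              # lower half (middle value excluded if n is odd)
upper = ordered[(n + 1) // 2 :]        # upper half

q1 = median(lower)                     # Step 2: quartiles
q2 = median(ordered)
q3 = median(upper)

iqr = q3 - q1                          # Step 3: interquartile range
semi_iqr = iqr / 2                     # semi-interquartile range

print(q1, q2, q3, iqr, semi_iqr)
# prints: 172.5 200.0 237.5 65.0 32.5
```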
Example (15): Page 30 & 31
Consider the data collected in a nanotechnology setting. Engineers
fabricating a new transmission-type electron multiplier created an array
of silicon nanopillars on a flat silicon membrane. The precise structure
can influence the electrical properties, so the heights of 50 nanopillars
were measured in nanometers (nm), or $10^{-9}$ meters. The ordered
heights of the nanopillars are:
221 234 245 253 265 266 271 272 274 276
276 276 278 284 289 290 290 292 292 296
297 298 300 303 304 305 305 308 308 309
310 311 312 314 315 315 323 330 333 336
337 338 343 346 355 364 366 373 390 391