© m j winter, ss2003
Example: Starting Salaries
Example - 2
Measure of Central Tendency
Advantages, Disadvantages
Questions
For ten numbers $x_1 < x_2 < x_3 < x_4 < x_5 < x_6 < x_7 < x_8 < x_9 < x_{10}$, calculate the mean
$$\mu = \frac{x_1 + x_2 + \cdots + x_{10}}{10}$$
and the median
$$m = \frac{x_5 + x_6}{2}$$
Now increase the largest number by 20. What is the new
mean? The new median?
New mean:
$$\frac{x_1 + x_2 + \cdots + (x_{10} + 20)}{10} = \mu + \frac{20}{10} = \mu + 2$$
The median does not change; it is still
$$\frac{x_5 + x_6}{2}$$
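The effect can be checked numerically. The sketch below uses a made-up list of ten increasing numbers (the values are an assumption, not data from the slides) and compares the mean and median before and after adding 20 to the largest value.

```python
from statistics import mean, median

# Hypothetical data: any ten increasing numbers would do.
data = [1, 2, 3, 4, 5, 6, 7, 8, 9, 10]

# Increase only the largest value by 20.
shifted = data[:-1] + [data[-1] + 20]

mean_change = mean(shifted) - mean(data)        # expected: 20 / 10
median_change = median(shifted) - median(data)  # expected: no change

print("change in mean:", mean_change)
print("change in median:", median_change)
```

The mean moves by 20/10 because every value contributes equally to the sum; the median depends only on the two middle values, which were not touched.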
Detour – weighted averages
Calculate the average of: 3.2, 3.2, 3.2, 4.0, 2.5, 2.5
$$\frac{3.2 + 3.2 + 3.2 + 4.0 + 2.5 + 2.5}{6} = \frac{3(3.2) + 1(4.0) + 2(2.5)}{6} = \frac{3}{6}(3.2) + \frac{1}{6}(4.0) + \frac{2}{6}(2.5) = 3.10$$

$$\frac{3}{4}(3.1) + \frac{1}{4}(2.8) = \frac{3(3.1) + 1(2.8)}{4} = 3.025$$
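A minimal sketch of the same calculation in code. The function name `weighted_mean` is a hypothetical helper introduced here for illustration; the values and weights are the ones from the two examples above.

```python
def weighted_mean(values, weights):
    """Return sum(w * x) / sum(w) for paired values and weights."""
    total_weight = sum(weights)
    return sum(w * x for x, w in zip(values, weights)) / total_weight

# First example: 3.2 occurs three times, 4.0 once, 2.5 twice.
first = weighted_mean([3.2, 4.0, 2.5], [3, 1, 2])

# The plain average of the full list should agree with it.
plain = sum([3.2, 3.2, 3.2, 4.0, 2.5, 2.5]) / 6

# Second example: weights 3/4 and 1/4, written here as counts 3 and 1.
second = weighted_mean([3.1, 2.8], [3, 1])

print(round(first, 3), round(plain, 3), round(second, 3))
```

Grouping repeated values and weighting each by its count gives the same result as averaging the full list, up to floating-point rounding.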
Mean or Average of Grouped Data
Set of 17 integers between 2 and 9 (inclusive)

Interval   Freq
[2, 3]     7
[4, 5]     3
[6, 7]     3
[8, 9]     4

[Figure: frequency histogram of the four intervals]
Use the mid-interval value of each bin:
$$\bar{x} = \frac{7 \cdot 2.5 + 3 \cdot 4.5 + 3 \cdot 6.5 + 4 \cdot 8.5}{17} = 4.9705\ldots$$
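The mid-interval calculation above can be sketched as follows. `grouped_mean` is a hypothetical helper name; the intervals and frequencies are those of the 17-integer example.

```python
def grouped_mean(bins):
    """Estimate the mean from ((low, high), frequency) pairs.

    Each bin is represented by its mid-interval value (low + high) / 2.
    """
    total = sum(freq for _, freq in bins)
    weighted_sum = sum((low + high) / 2 * freq for (low, high), freq in bins)
    return weighted_sum / total

bins = [((2, 3), 7), ((4, 5), 3), ((6, 7), 3), ((8, 9), 4)]
estimate = grouped_mean(bins)
print(round(estimate, 4))
```

This is only an estimate of the true mean, since every value in a bin is treated as if it sat at the midpoint.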
Calculating the mean from a relative
frequency (density) histogram
Midpoint   Freq   Relative freq
2.5        7      .412
4.5        3      .176
6.5        3      .176
8.5        4      .235

[Figure: relative frequency histogram with bin midpoints 2.5, 4.5, 6.5, 8.5]
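With relative frequencies, the mean is the sum of midpoint times relative frequency, with no further division needed. A short sketch, assuming the same midpoints and counts as the grouped example (the variable names are illustrative):

```python
midpoints = [2.5, 4.5, 6.5, 8.5]
frequencies = [7, 3, 3, 4]

n = sum(frequencies)
relative = [f / n for f in frequencies]  # approximately .412, .176, .176, .235

# The relative frequencies sum to 1, so they act directly as weights.
mean_from_relative = sum(m * p for m, p in zip(midpoints, relative))

print([round(p, 3) for p in relative])
print(round(mean_from_relative, 4))
```

The result agrees with the estimate obtained from the raw counts, because dividing each count by 17 first is the same as dividing the weighted sum by 17 at the end.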
Value   Freq   Percent
2       4      23.53%
3       3      17.65%
4       2      11.76%
5       1      5.88%
6       3      17.65%
7       0      0.00%
8       3      17.65%
9       1      5.88%

[Figure: frequency histogram with one bin per integer value]
mean value: 4.76
The wider the bins, the more information
you lose.
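To see the loss of information concretely, the sketch below rebuilds the 17 values from the value/frequency listing above and estimates the mean with bins covering 1, 2, and 4 integers each. The helper `binned_mean` and the choice of bin start (2) are assumptions made for illustration.

```python
# Frequencies of each integer value, taken from the listing above.
counts = {2: 4, 3: 3, 4: 2, 5: 1, 6: 3, 7: 0, 8: 3, 9: 1}
data = [value for value, freq in counts.items() for _ in range(freq)]

def binned_mean(data, width, start=2):
    """Estimate the mean after replacing each value by its bin midpoint.

    Bins cover `width` consecutive integers: [start, start + width - 1], ...
    """
    total = 0.0
    for x in data:
        low = start + ((x - start) // width) * width
        total += low + (width - 1) / 2  # midpoint of the inclusive bin
    return total / len(data)

exact = sum(data) / len(data)
errors = {}
for width in (1, 2, 4):
    estimate = binned_mean(data, width)
    errors[width] = abs(estimate - exact)
    print(width, round(estimate, 4), round(errors[width], 4))
```

For this data set the estimate drifts further from the exact mean as the bins widen. That trend is typical but not guaranteed for every data set.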
Elevator-Simulation Examples
Number of times passengers got off at
different floors (3 passengers, 6 floors)
• List 1: (10 trials)
6, 8, 8, 6, 9, 6, 5, 7, 5, 9, 6, 6, 3, 6, 5
Sorted Lists
List 1
3 5 5 5 6 6 6 6 6 6 7 8 8 9 9
Mean = Median = Mode =
List 2
49 49 50 51 51 52 52 55 56 56 56 57 58 61 63 64
Sorted Lists
List 1
3 5 5 5 6 6 6 6 6 6 7 8 8 9 9
Mean = 6.33 Median = 6 Mode = 6
List 2
49 49 50 51 51 52 52 55 56 56 56 57 58 61 63 64
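The three measures can be computed for both lists with Python's standard `statistics` module. This is a sketch for checking the values by machine, not part of the original slides.

```python
from statistics import mean, median, mode

list1 = [3, 5, 5, 5, 6, 6, 6, 6, 6, 6, 7, 8, 8, 9, 9]
list2 = [49, 49, 50, 51, 51, 52, 52, 55, 56, 56, 56, 57, 58, 61, 63, 64]

for name, data in (("List 1", list1), ("List 2", list2)):
    # With an even number of elements (List 2), median() averages
    # the two middle values.
    print(name, "mean =", round(mean(data), 2),
          "median =", median(data), "mode =", mode(data))
```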
3 5 5 5 6 6 6 6 6 6 7 8 8 9 9
The median is the midpoint: the number of elements below the
median equals the number above it.
First quartile: Take the median of the lower half.
Third quartile: Take the median of the upper half.
3 5 5 5 6 6 6 6 6 6 7 8 8 9 9
interquartile range: 8 – 5 = 3
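A minimal sketch of the median-of-halves rule for quartiles. It assumes that, when the list has an odd number of elements, the median itself is left out of both halves; that convention reproduces the values 5 and 8 used above, but other conventions exist and can give different quartiles. The function name `quartiles` is illustrative.

```python
from statistics import median

def quartiles(sorted_data):
    """Return (Q1, median, Q3) using the median-of-halves rule.

    Assumption: for an odd-length list the middle element is excluded
    from both the lower and the upper half.
    """
    n = len(sorted_data)
    half = n // 2
    lower = sorted_data[:half]
    upper = sorted_data[half + (n % 2):]
    return median(lower), median(sorted_data), median(upper)

list1 = [3, 5, 5, 5, 6, 6, 6, 6, 6, 6, 7, 8, 8, 9, 9]
q1, q2, q3 = quartiles(list1)
iqr = q3 - q1
five_number = (min(list1), q1, q2, q3, max(list1))

print("Q1 =", q1, "median =", q2, "Q3 =", q3, "IQR =", iqr)
print("5-number summary:", five_number)
```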
Interquartile Range, Box Plot, 5-number summary
[Figure: box plot with quartiles marked at 5 and 8 and regions labeled 25.0%]
$$\frac{1}{4}(4) + \frac{1}{4}(5.5) + \frac{1}{4}(7) + \frac{1}{4}(8.5) = \frac{4 + 5.5 + 7 + 8.5}{4} = \frac{25}{4} = 6.25$$
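The four values 4, 5.5, 7, 8.5 match the midpoints of the four quarters of List 1's five-number summary (3, 5, 6, 8, 9), each holding 25% of the data. Reading the calculation as an estimate of the mean from the box plot is an assumption; the sketch below reproduces it on that basis.

```python
# Five-number summary of List 1: min, Q1, median, Q3, max.
five_number = [3, 5, 6, 8, 9]

# Midpoint of each quarter (assumed interpretation of 4, 5.5, 7, 8.5).
midpoints = [(a + b) / 2 for a, b in zip(five_number, five_number[1:])]

# Each quarter holds 25% of the data, so each midpoint gets weight 1/4.
estimate = sum(0.25 * m for m in midpoints)

print(midpoints, estimate)
```

The estimate of 6.25 is close to the mean of 6.33 computed from the full list.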
Commonly reported
statistical results
List 1: (10 trials)
6, 8, 8, 6, 9, 6, 5, 7, 5, 9, 6, 6, 3, 6, 5
[Figure: frequency histogram of List 1]