Professional Documents
Culture Documents
JOHN S. LOUCKS
St. Edward’s University
Measures of Location
Measures of Variability
Measures of Relative Location and Detecting Outliers
Exploratory Data Analysis
Measures of Association Between Two Variables
The Weighted Mean and
Working with Grouped Data
%
x Slide
2
Measures of Location
Mean
Median
Mode
Percentiles
Quartiles
Slide
3
Example: Apartment Rents
Slide
4
Mean
Slide
5
Example: Apartment Rents
Mean
xi 34, 356
x 490.80
n 70
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
Slide
6
Median
Slide
7
Median
Slide
8
Example: Apartment Rents
Median
Median = 50th percentile
i = (p/100)n = (50/100)70 = 35.5
Averaging the 35th and 36th data values:
Median = (475 + 475)/2 = 475
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
Slide
9
Mode
Slide
10
Example: Apartment Rents
Mode
450 occurred most frequently (7 times)
Mode = 450
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
Slide
11
Percentiles
Slide
12
Percentiles
i = (p/100)n
• If i is not an integer, round up. The p th percentile is
the value in the i th position.
• If i is an integer, the p th percentile is the average of the
values in positions i and i +1.
Slide
13
Example: Apartment Rents
90th Percentile
i = (p/100)n = (90/100)70 = 63
Averaging the 63rd and 64th data values:
90th Percentile = (580 + 590)/2 = 585
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
Slide
14
Quartiles
Slide
15
Example: Apartment Rents
Third Quartile
Third quartile = 75th percentile
i = (p/100)n = (75/100)70 = 52.5 = 53
Third quartile = 525
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
Slide
16
Measures of Variability
Slide
17
Measures of Variability
Range
Interquartile Range
Variance
Standard Deviation
Coefficient of Variation
Slide
18
Range
Slide
19
Example: Apartment Rents
Range
Range = largest value - smallest value
Range = 615 - 425 = 190
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
Slide
20
Interquartile Range
Slide
21
Example: Apartment Rents
Interquartile Range
3rd Quartile (Q3) = 525
1st Quartile (Q1) = 445
Interquartile Range = Q3 - Q1 = 525 - 445 = 80
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
Slide
22
Variance
Slide
23
Variance
Slide
24
Standard Deviation
Slide
25
Coefficient of Variation
Slide
26
Example: Apartment Rents
Variance
s2
( xi x ) 2
2 ,996.16
n 1
Standard Deviation
s s 2 2996. 47 54. 74
Coefficient of Variation
s 54. 74
100 100 11.15
x 490.80
Slide
27
Measures of Relative Location
and Detecting Outliers
z-Scores
Chebyshev’s Theorem
Empirical Rule
Detecting Outliers
Slide
28
z-Scores
Slide
29
Example: Apartment Rents
Slide
30
Chebyshev’s Theorem
Slide
31
Example: Apartment Rents
Chebyshev’s Theorem
Slide
32
Example: Apartment Rents
Slide
33
Empirical Rule
Slide
34
Empirical Rule
Slide
35
Empirical Rule
Slide
36
Example: Apartment Rents
Empirical Rule
Interval % in Interval
Within +/- 1s 436.06 to 545.54 48/70 = 69%
Within +/- 2s 381.32 to 600.28 68/70 = 97%
Within +/- 3s 326.58 to 655.02 70/70 = 100%
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
Slide
37
Detecting Outliers
Slide
38
Example: Apartment Rents
Detecting Outliers
The most extreme z-scores are -1.20 and 2.27.
Using |z| > 3 as the criterion for an outlier,
there are no outliers in this data set.
Standardized Values for Apartment Rents
-1.20 -1.11 -1.11 -1.02 -1.02 -1.02 -1.02 -1.02 -0.93 -0.93
-0.93 -0.93 -0.93 -0.84 -0.84 -0.84 -0.84 -0.84 -0.75 -0.75
-0.75 -0.75 -0.75 -0.75 -0.75 -0.56 -0.56 -0.56 -0.47 -0.47
-0.47 -0.38 -0.38 -0.34 -0.29 -0.29 -0.29 -0.20 -0.20 -0.20
-0.20 -0.11 -0.01 -0.01 -0.01 0.17 0.17 0.17 0.17 0.35
0.35 0.44 0.62 0.62 0.62 0.81 1.06 1.08 1.45 1.45
1.54 1.54 1.63 1.81 1.99 1.99 1.99 1.99 2.27 2.27
Slide
39
Exploratory Data Analysis
Five-Number Summary
Box Plot
Slide
40
Five-Number Summary
Smallest Value
First Quartile
Median
Third Quartile
Largest Value
Slide
41
Example: Apartment Rents
Five-Number Summary
Lowest Value = 425 First Quartile = 450
Median = 475
Third Quartile = 525 Largest Value = 615
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
Slide
42
Box Plot
Slide
43
Box Plot (Continued)
Slide
44
Example: Apartment Rents
Box Plot
Slide
45
Measures of Association
Between Two Variables
Covariance
Correlation Coefficient
Slide
46
Covariance
Slide
47
Covariance
Slide
48
Correlation Coefficient
sxy
rxy
sx s y
If the data sets are populations, the coefficient is .
xy
xy xy
x y
Slide
49
The Weighted Mean and
Working with Grouped Data
Weighted Mean
Mean for Grouped Data
Variance for Grouped Data
Standard Deviation for Grouped Data
Slide
50
Weighted Mean
Slide
51
Weighted Mean
x = wi xi
wi
where:
xi = value of observation i
wi = weight for observation i
Slide
52
Grouped Data
Slide
53
Mean for Grouped Data
Sample Data
x
fMi i
f i
Population Data
fMi i
N
where:
fi = frequency of class i
Mi = midpoint of class i
Slide
54
Example: Apartment Rents
Slide
56
Variance for Grouped Data
Sample Data
f i ( M i x ) 2
s2
n 1
Population Data
f i ( M i ) 2
2
N
Slide
57
Example: Apartment Rents
s 2 3, 017.89
s 3, 017.89 54. 94
Slide
58
End of Chapter 3
Slide
59