Professional Documents
Culture Documents
Descriptive Statistics: Learning Objectives
Descriptive Statistics: Learning Objectives
Chapter 3
Descriptive Statistics
LEARNING OBJECTIVES
The focus of Chapter 3 is on the use of statistical techniques to describe data, thereby
enabling you to:
6. Compute the mean, median, standard deviation, and variance on grouped data.
It can be emphasized in this chapter that there are at least two major dimensions
along which data can be described. One is the measure of central tendency with which
statisticians attempt to describe the more central portions of the data. Included here are
the mean, median, mode, percentiles, and quartiles. It is important to establish that the
Chapter 3: Descriptive Statistics 2
median is a useful device for reporting some business data such as income and housing
costs because it tends to ignore the extremes. On the other hand, the mean utilizes every
number of a data set in its computation. This makes the mean an attractive tool in
statistical analysis.
CHAPTER OUTLINE
Quartiles
Correlation
KEY TERMS
3.1 Mode
2, 2, 3, 3, 4, 4, 4, 4, 5, 6, 7, 8, 8, 8, 9
The mode = 4
4 is the most frequently occurring value
3.3 Median
Arrange terms in ascending order:
073, 167, 199, 213, 243, 345, 444, 524, 609, 682
There are 10 terms.
Since there are an even number of terms, the median is the average of the
two middle terms:
The median is located halfway between the 5th and 6th terms.
5th term = 243 6th term = 345
Halfway between 243 and 345 is the median = 294
3.4 Mean
17.3
44.5 µ = x/N = 333.6/8 = 41.7
31.6
40.0
52.8 x = x/n = 333.6/8 = 41.7
38.8
30.1
78.5 (It is not stated in the problem whether the
x = 333.6 data represent as population or a sample).
3.5 Mean
7
-2
5 µ = x/N = -12/12 = -1
9
0
-3
-6 x = x/n = -12/12 = -1
-7
-4
-5
2
-8 (It is not stated in the problem whether the
x = -12 data represent a population or a sample).
11, 13, 16, 17, 18, 19, 20, 25, 27, 28, 29, 30, 32, 33, 34
35
i (15) 5.25
100
P35 = 19
Chapter 3: Descriptive Statistics 6
55
i (15) 8.25
100
P55 = 27
Q1 = P25
25
i (15) 3.75
100
Q1 = 17
Q2 = Median
th
15 1
The median is located at the 8th term
2
Q2 = 25
Q3 = P75
75
i (15) 11 .25
100
Q3 = 30
80, 94, 97, 105, 107, 112, 116, 116, 118, 119, 120, 127,
128, 138, 138, 139, 142, 143, 144, 145, 150, 162, 171, 172
n = 24
20
i (24) 4.8
100
Chapter 3: Descriptive Statistics 7
P20 = 107
47
i (24) 11 .28
100
P47 = 127
83
i (24) 19.92
100
P83 = 145
Q1 = P25
25
i (24) 6
100
Q2 = Median
Q3 = P75
75
i (24) 18
100
Q2 = Median = 2,264
63
i (15) 9.45
100
P63 = 2,646
th
10 1
3.9 The median is located at the 5.5th position
2
75
Q3 = P75: i (10) 7.5
100
Q3 = 751
For P20:
20
i (10) 2
100
For P60:
60
i (10) 6
100
For P80:
80
i (10) 8
100
93
i (10) 9.3
100
P93 = 1096
3.10 n = 17
Mean = x 61 3.59
N 17
th
17 1
The median is located at the = 9th term
2
Median = 4
Mode = 4
75
Q3 = P75: i= (17) 12.75
100
Q3 = 4
11
P11: i= (17) 1.87
100
P11 = 1
35
P35: i= (17) 5.95
100
Chapter 3: Descriptive Statistics 10
P35 = 3
58
P58: i= (17) 9.86
100
P58 = 4
67
P67: i= (17) 11 .39
100
P67 = 4
3.11 x x -µ (x-µ)2
6 6-4.2857 = 1.7143 2.9388
2 2.2857 5.2244
4 0.2857 .0816
9 4.7143 22.2246
1 3.2857 10.7958
3 1.2857 1.6530
5 0.7143 .5102
x = 30 x-µ = 14.2857 (x -µ) = 43.4284
2
x 30
4.2857
N 7
a.) Range = 9 - 1 = 8
x 14.2857
b.) M.A.D. = 2.041
N 7
( x ) 2 43.4284
c.) 2 = = 6.204
N 7
( x ) 2
d.) = 6.204 = 2.491
N
Chapter 3: Descriptive Statistics 11
e.) 1, 2, 3, 4, 5, 6, 9
Q1 = P25
25
i = (7) = 1.75
100
Q3 = P75:
75
i = (7) = 5.25
100
IQR = Q3 - Q1 = 6 - 2 = 4
6 4.2857
f.) z = = 0.69
2.491
2 4.2857
z = = -0.92
2.491
4 4.2857
z = = -0.11
2.491
9 4.2857
z = = 1.89
2.491
1 4.2857
z = = -1.32
2.491
3 4.2857
z = = -0.52
2.491
5 4.2857
z = = 0.29
2.491
3.12 x x x ( x x) 2
4 0 0
3 1 1
0 4 16
5 1 1
2 2 4
9 5 25
4 0 0
Chapter 3: Descriptive Statistics 12
5 1 1
x = 32 x x = 14 ( x x ) = 48
2
x 32
x =4
n 8
a) Range = 9 - 0 = 9
xx 14
b) M.A.D. = = 1.75
n 8
( x x ) 2 48
c) s2 = = 6.857
n 1 7
( x x ) 2
d) s = 6.857 = 2.619
n 1
e) Numbers in order: 0, 2, 3, 4, 4, 5, 5, 9
Q1 = P25
25
i = (8) = 2
100
Q1 = 2.5
Q3 = P75
75
i = (8) = 6
100
Q3 = (5 + 5)/2 = 5
3.13 a.)
Chapter 3: Descriptive Statistics 13
x (x-µ) (x -µ)2
12 12-21.167= -9.167 84.034
23 1.833 3.360
19 -2.167 4.696
26 4.833 23.358
24 2.833 8.026
23 1.833 3.360
x = 127 (x -µ) = -0.002 (x -µ) = 126.834
2
x 127
= = 21.167
N 6
( x ) 2 126.834
= 21.139 = 4.598 ORIGINAL FORMULA
N 6
b.)
x x2
12 144
23 529
19 361
26 676
24 576
23 529
x = 127 x = 2815
2
Chapter 3: Descriptive Statistics 14
=
( x ) 2 (127) 2
x 2 2815
N 6 2815 2688.17 126.83
21.138
N 6 6 6
3.14 s2 = 433.9267
s = 20.8309
x = 1387
x2 = 87,365
n = 25
x = 55.48
3.15
2 = 58,631.359
= 242.139
x = 6886
x2 = 3,901,664
n = 16
µ = 430.375
3.16
14, 15, 18, 19, 23, 24, 25, 27, 35, 37, 38, 39, 39, 40, 44,
46, 58, 59, 59, 70, 71, 73, 82, 84, 90
Q1 = P25
25
i = (25) = 6.25
100
Q1 = 25
Q3 = P75
75
i = ( 25) = 18.75
100
Q3 = 59
IQR = Q3 - Q1 = 59 - 25 = 34
3.17
1 1 3
a) 1 2
1 .75
2 4 4
1 1
b) 1 2
1 .84
2.5 6.25
1 1
c) 1 2
1 .609
1.6 2.56
1 1
d) 1 2
1 .902
3.2 10.24
3.18
Set 1:
x 262
1 65.5
N 4
( x ) 2 (262) 2
x 2 17,970
N 4 = 14.2215
1
N 4
Set 2:
x 570
2 142.5
N 4
Chapter 3: Descriptive Statistics 16
( x ) 2 (570) 2
x 2 82,070
N 4 = 14.5344
2
N 4
14.2215
CV1 = (100) = 21.71%
65.5
14.5344
CV2 = (100) = 10.20%
142.5
3.19
x x x ( x x) 2
7 1.833 3.361
5 3.833 14.694
10 1.167 1.361
12 3.167 10.028
9 0.167 0.028
8 0.833 0.694
14 5.167 26.694
3 5.833 34.028
11 2.167 4.694
13 4.167 17.361
8 0.833 0.694
6 2.833 8.028
106 32.000 121.665
x 106
x = 8.833
n 12
xx 32
a) MAD = = 2.667
n 12
( x x) 2 121.665
b) s2 = = 11.06
n 1 11
c) s = s 2 11 .06 = 3.326
3 5 6 7 8 8 9 10 11 12 13 14
Q1 = P25: i = (.25)(12) = 3
Chapter 3: Descriptive Statistics 17
Q1 = (6 + 7)/2 = 6.5
Q3 = P75: i = (.75)(12) = 9
6 8.833
e.) z = = - 0.85
3.326
(3.326)(100)
f.) CV = = 37.65%
8.833
3.20 n = 11 x x-
768 475.64
429 136.64
323 30.64
306 13.64
286 6.36
262 30.36
215 77.36
172 120.36
162 130.36
148 144.36
145 147.36
x = 3216 x-µ = 1313.08
x 1313.08
b.) MAD = = 119.37
N 11
( x ) 2 (3216) 2
x 2
1,267,252
c.) 2 = N 11 = 29,728.23
N 11
Q1 = 162
Q3 = 323
x 172 292.36
Z = = -0.70
172.42
172.42
g.) CV = (100) 292.36 (100) = 58.98%
3.21 µ = 125 = 12
3.22 µ = 38 =6
Chapter 3: Descriptive Statistics 19
x1 - µ = 50 - 38 = 12
x2 - µ = 26 - 38 = -12
x1 12
= 2
6
x 2 12
= -2
6
1 1 1 3
1 2
1 2 1 = .75
K 2 4 4
x1 - µ = 62 - 38 = 24
x2 - µ = 14 - 38 = -24
x1 24
= 4
6
x 2 24
= -4
6
K=4
1 1 1 15
1 2
1 2 1 = .9375
K 4 16 16
1
1- =.89
K2
1
.11 =
K2
Chapter 3: Descriptive Statistics 20
.11 K2 = 1
1
K2 =
.11
K2 = 9.09
K = 3.015
With µ = 38, = 6 and K = 3.015 at least 89% of the values fall within:
µ ± 3.015
38 ± 3.015 (6)
38 ± 18.09
1
3.23 1- = .80
K2
1
1 - .80 =
K2
1
.20 =
K2
.20K2 =1
K2 = 5 and K = 2.236
1 = 46 - 43 = 3
3 = 51 - 43 = 8
= 2.67
Chapter 3: Descriptive Statistics 21
µ = 28 and 77% of the values lie between 24 and 32 or + 4 from the mean:
1
1- = .77
K2
k2 = 4.3478
k = 2.085
2.085 = 4
4
= = 1.918
2.085
3.25 µ = 29 =4
x1 - µ = 21 - 29 = -8
x2 - µ = 37 - 29 = +8
x1 8
= -2 Standard Deviations
4
x2 8
= 2 Standard Deviations
4
Since the distribution is normal, the empirical rule states that 95% of
the values fall within µ ± 2 .
Exceed 37 days:
Since 95% fall between 21 and 37 days, 5% fall outside this range. Since the normal
distribution is symmetrical 2½% fall below 21 and above 37.
Exceed 41 days:
x 41 29 12
= 3 Standard deviations
4 4
x 25 29 4
= -1 Standard Deviation
4 4
29 ± 1(4) = 29 ± 4
from 25 to 33.
32% lie outside this range with ½(32%) = 16% less than 25.
3.26 x
97
109
111
118
120
130
132
133
137
137
x = 1224 x2 = 151,486 n = 10 x = 122.4 s = 13.615
Bordeaux: x = 137
137 122.4
z = = 1.07
13.615
Montreal: x = 130
130 122.4
z = = 0.56
13.615
Edmonton: x = 111
Chapter 3: Descriptive Statistics 23
111 122.4
z = = -0.84
13.615
Hamilton: x = 97
97 122.4
z = = -1.87
13.615
3.27 Mean
Class f M fM
0- 2 39 1 39
2- 4 27 3 81
4- 6 16 5 80
6- 8 15 7 105
8 - 10 10 9 90
10 - 12 8 11 88
12 - 14 6 13 78
f=121 fM=561
fM 561
= = 4.64
f 121
3.28
Class f M fM
1.2 - 1.6 220 1.4 308
1.6 - 2.0 150 1.8 270
2.0 - 2.4 90 2.2 198
2.4 - 2.8 110 2.6 286
2.8 - 3.2 280 3.0 840
f=850 fM=1902
fM 1902
Mean: = = 2.24
f 850
3.29 Class f M fM
20-30 7 25 175
30-40 11 35 385
40-50 18 45 810
Chapter 3: Descriptive Statistics 24
50-60 13 55 715
60-70 6 65 390
70-80 4 75 300
Total 59 2775
fM 2775
= = 47.034
f 59
f ( M ) 2 10,955.93
=
2 = 185.694
f 59
= 185.694 = 13.627
(fM ) 2 (618) 2
fM 2 8182
s2 = n 54 8182 7071.7 = 20.9
n 1 53 53
s = S2 20.9 = 4.57
66 - 72 15 69 1,035 71,415
f= 231 fM= 10,371 fM = 508,671
2
b.) Mode. The Modal Class = 36-42. The mode is the class midpoint = 39
(fM ) 2 (10,371) 2
fM 2 508,671
c.) s2 = n 231 43,053.5 = 187.2
n 1 230 230
3.32
a.) Mean
Class f M fM fM2
fM 258.5
= = 1.89
f 137
c.) Variance:
(fM ) 2 (258.5) 2
fM 2 682.25
2 = N 137 = 1.4197
N 137
= 2 1.4197 = 1.1915
3.33 f M fM fM2
a.) Mean:
fM 760
= = 38
f 20
c.) Variance:
(fM ) 2 (760) 2
fM 2 33,900
2 = N 20 = 251
N 20
= 2 251 = 15.843
Chapter 3: Descriptive Statistics 27
f = 49 fM2 = 1,970,000
fM 1,970,000
= = 40,204
f 49
The actual mean for the ungrouped data is 37,816. This computed group
mean, 40,204, is really just an approximation based on using the class
midpoints in the calculation. Apparently, the actual numbers of farms per
state in some categories do not average to the class midpoint and in fact
might be less than the class midpoint since the actual mean is less than the
grouped data mean.
fM2 = 1.185x1011
= 28,319.59
The actual standard deviation was 29,341. The difference again is due to
the grouping of the data and the use of class midpoints to represent the
data. The class midpoints due not accurately reflect the raw data.
The stock prices are skewed to the right. While many of the stock prices
are at the cheaper end, a few extreme prices at the higher end pull the
mean.
3.36 mean = 51
median = 54
mode = 59
Chapter 3: Descriptive Statistics 28
The distribution is skewed to the left. More people are older but the most
extreme ages are younger ages.
3( M d ) 3(5.51 3.19)
3.37 Sk = = 0.726
9.59
3.38 n = 25 x = 600
x = 24 x2 = 15,462 s = 6.6521
3( x M d ) 3( 24 23)
Sk = = 0.451
s 6.6521
3.40 n = 18 Q1 = P25:
25
i = (18) = 4.5
100
Q1 = 5th term = 66
Q3 = P75:
75
i = (18) = 13.5
100
Q3 = 14th term = 90
Chapter 3: Descriptive Statistics 29
th
( n 1)th (18 1)th 19
Median: = 9.5th term
2 2 2
Median = 74
IQR = Q3 - Q1 = 90 - 66 = 24
There are no extreme outliers. The only mild outlier is 21. The
distribution is positively skewed since the median is nearer to Q1 than Q3.
x y
xy n
( x) 2
( y ) 2
x y
2 2
n n
r = =
(80)(69)
624
7
(80)
2
(69) 2 164.571
1,148 815
7 7 ( 233.714)(134.857) =
r = =
164.571
r = 177.533 = -0.927
Chapter 3: Descriptive Statistics 30
x y
xy n
( x) 2
( y ) 2
x
2 2
y
n n
r = =
(1,087)(2,032)
507,509
5
(1,087)
2
(2,032) 2
322,345 878,686
5 5
r = =
65,752.2 65,752.2
r = (86,031.2)(52,881.2) = 67,449.5 = .975
47.6 15.1
46.3 15.4
50.6 15.9
52.6 15.6
52.4 16.4
52.7 18.1
x y
xy n
( x) 2
( y ) 2
x y
2 2
n n
r = =
(302.2)(96.5)
4,870.11
6
(302.2)
2
(96.5) 2
15, 259 .62 1,557 .91
6 6
r = = .6445
Chapter 3: Descriptive Statistics 31
x y
xy n
( x) 2
( y ) 2
x
2 2
y
n n
r = =
(6,087)(1,050)
1,130,483
9
(6,087)
2
(1,050) 2
6, 796,149 194 ,526
9 9
r = =
420,333 420,333
r = ( 2,679,308)(72,026) = 439,294.705 = .957
x y
xy n
( x)
2
( y ) 2
x y
2 2
n n
r = =
(17.09)(15.12)
48.97
8
(17.09)
2
(15.12) 2
58. 7911 41. 7054
8 8
r = =
16.6699 16.6699
r = (22.28259)(13.1286) = 17.1038 = .975
Chapter 3: Descriptive Statistics 32
x y
xy n
( x) 2
( y ) 2
x y
2 2
n n
r = =
(15.12)(15.86)
41.5934
8
(15.12)
2
(15.86) 2
41 .7054 42 . 0396
8 8
r = =
11 .618 11 .618
r = (13.1286)(10.59715) = 11 .795 = .985
x y
xy n
( x) 2
( y ) 2
x y
2 2
n n
r = =
(17.09)(15.86)
48.5827
8
(17.09)
2
(15.86) 2
58.7911 42.0396
8 8
r =
14.702 14.702
r = ( 2.2826)(10.5972) = 15.367 = .957
1, 1, 1, 1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2,
3, 3, 3, 3, 3, 3, 4, 4, 5, 6, 8
x 75
Mean: x = 2.5
n 30
Mode = 2 (There are eleven 2’s)
Range = 8 - 1 = 7
Q1 = P25:
25
i = (30) = 7.5
100
Q3 = P75:
75
i = (30) = 22.5
100
IQR = Q3 - Q1 = 3 - 1 = 2
3.47 P10:
10
i = (40) = 4
100
P80:
80
i = (40) = 32
100
Q1 = P25:
25
i = (40) = 10
100
Q3 = P75:
75
i = (40) = 30
100
Range = 81 - 19 = 62
x 126,904
3.48 µ = = 6345.2
N 20
P30: i = (.30)(20) = 6
P60: i = (.60)(20) = 12
Chapter 3: Descriptive Statistics 35
P90: i = (.90)(20) = 18
Q1 = P25: i = (.25)(20) = 5
Q1 = (4464+4507)/2 = 4485.5
Q3 = P75: i = (.75)(20) = 15
Q3 = (6796+8687)/2 = 7741.5
(x) 2 (9332908) 2
x 2
1.06357 x10
13
= N 10 = 438,789.2
N 10
3.50
x
a.) µ= = 26,675/11 = 2425
N
Median = 1965
Chapter 3: Descriptive Statistics 36
Q3 = 2867
c.) Variance:
( x ) 2 ( 26,675) 2
x 2
86,942,873
2 = N 11 = 2,023,272.55
N 11
standard Deviation:
= 2 2,023,272.55 = 1422.42
d.) Texaco:
x 1532 2425
z = = -0.63
1422.42
ExxonMobil:
x 6300 2425
z = = 2.72
1422.42
Chapter 3: Descriptive Statistics 37
e.) Skewness:
3( M d ) 3(2425 1965)
Sk = = 0.97
1422.42
3.51 a.)
x 20,310
Mean: = 2,031
n 10
1670 1920
Median: = 1795
2
Mode: No Mode
1
Q1: (10) 2.5 Located at the 3rd term. Q1 = 1570
4
3
Q3: (10) 7.5 Located at the 8th term. Q3 = 2550
4
x x x x x 2
xx 5252
MAD = = 525.2
n 10
x x
2
s2 = 3,972,490 = 441,387.78
n 1 9
Chapter 3: Descriptive Statistics 38
s= s2 441,387.78 = 664.37
3( x M d ) 3(2031 1795)
Sk = = 1.066
s 664.37
No apparent outliers
3.52 f M fM fM2
a.) Mean:
fM 5680
= = 33.412
f 170
Mode: The Modal Class is 30-35. The class midpoint is the mode = 32.5.
b.) Variance:
(fM ) 2 (5680) 2
fM 2 199,662.5
2 = N 170 = 58.139
N 170
standard Deviation:
= 2 58.139 = 7.625
a) Mean:
Mode: The Modal Class is 0-20. The midpoint of this class is the mode = 10.
b) standard Deviation:
(fM ) 2 (3860) 2
fM 2 253,000
n 90 253,000 165,551.1
s = n 1 89 89
87,448.9
982.571
89
= 31.346
x y
xy n
( x) 2
( y ) 2
x y
2 2
n n
r = =
(36)(44)
188
7
(36)
2
(44) 2
256 300
7 7
r = =
38.2857 38.2857
r = (70.8571)(23.42857) = 40.7441 = -.940
Chapter 3: Descriptive Statistics 41
3.55
x 3.45
CVx = (100%) (100%) = 10.78%
x 32
y 5.40
CVY = (100%) 84 (100%) = 6.43%
y
3.56 µ = 7.5
3 = 14 - 7.5 = 6.5
= 2.167
3.57 µ = 419, = 27
400 419
c.) x = 400. z = = -0.704. This worker is in the lower half of
27
workers but within one standard deviation of the mean.
3.58
x x2
Albania 1,650 2,722,500
Bulgaria 4,300 18,490,000
Croatia 5,100 26,010,000
Germany 22,700 515,290,000
x=33,750 x2= 562,512,500
x 33,750
= = 8,437.50
N 4
(x ) 2 (33,750) 2
x
2
562,512,500
= N 4 = 8332.87
N 4
b.)
x x2
Hungary 7,800 60,840,000
Poland 7,200 51,840,000
Romania 3,900 15,210,000
Bosnia/Herz 1,770 3,132,900
x=20,670 x =131,022,900
2
x 20,670
= = 5,167.50
N 4
(x) 2 (20,670) 2
x 2 131,022,900
= N 4 = 2,460.22
N 4
c.)
1 8,332.87
CV1 = (100) (100) = 98.76%
1 8,437.50
2 2,460.22
CV2 = (100) (100) = 47.61%
2 5,167.50
since these three measures are not equal, the distribution is skewed. The
distribution is skewed to the right. Often, the median is preferred in reporting
income data because it yields information about the middle of the data while
ignoring extremes.
x y
xy n
( x) 2
( y ) 2
x
2 2
y
n n
r = =
(36.62)(57.23)
314.9091
8
(36.62)
2
(57.23) 2
217. 137 479. 3231
8 8
r = =
52.938775
r = ( 49.50895)(69.91399) = .90
3.61
a.) Q1 = P25:
25
i = ( 20) = 5
100
Q3 = P75:
Chapter 3: Descriptive Statistics 44
75
i = ( 20) = 15
100
Median:
th th
n 1 20 1
= 10.5th term
2 2
Inner Fences:
Q1 - 1.5 IQR = 43.85 – 44.775 = - 0.925
Q3 + 1.5 IQR = 73.7 + 44.775 = 118.475
Outer Fences:
Q1 - 3.0 IQR = 43.85 – 89.55 = - 45.70
Q3 + 3.0 IQR = 73.7 + 89.55 = 163.25
b.) and c.) There are no outliers in the lower end. There is one extreme
outlier in the upper end (214.2). There are two mile outliers at the
upper end (133.7 and 158.8). since the median is nearer to Q1, the
distribution is positively skewed.
20 120 220
Tonnage
1.459 = 32
= 21.93
2.425 = 44
= 18.14