Professional Documents
Culture Documents
Numerical Discriptive Measure PDF
Numerical Discriptive Measure PDF
10th Edition
Chapter 3
population
To calculate the coefficient of variation and Z-
scores
To construct and interpret a box-and-whisker plot
Basiccorrelation
Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-2
Chapter Topics
Measures of central tendency, variation, and
shape
Mean, median, mode, geometric mean
Quartiles
Range, interquartile range, variance and standard
deviation, coefficient of variation, Z-scores
Symmetric and skewed distributions
Population summary measures
Mean, variance, and standard deviation
The empirical rule and Chebyshev rule
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-3
Chapter Topics
(continued)
Mode Variance
Coefficient of Variation
∑X i
X G = ( X1 × X 2 × × Xn )1/ n
X= i =1
n Midpoint of Most
ranked frequently
values observed
value
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-6
Arithmetic Mean
The arithmetic mean (mean) is the most
common measure of central tendency
∑X i
X1 + X 2 + + Xn
X= i=1
=
n n
0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10
Mean = 3 Mean = 4
1 + 2 + 3 + 4 + 5 15 1 + 2 + 3 + 4 + 10 20
= =3 = =4
5 5 5 5
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-8
Median
In an ordered array, the median is the “middle”
number (50% above, 50% below)
0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10
Median = 3 Median = 3
n +1
Median position = position in the ordered data
2
If the number of values is odd, the median is the middle number
If the number of values is even, the median is the average of
the two middle numbers
n +1
Note that is not the value of the median, only the
2
position of the median in the ranked data
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-10
Mode
A measure of central tendency
Value that occurs most often
Not affected by extreme values
Used for either numerical or categorical
(nominal) data
There may may be no mode
There may be several modes
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 0 1 2 3 4 5 6
$2,000,000
500,000 $500 K
300,000 $300 K
100,000
100,000
$100 K
$100 K
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-12
Review Example:
Summary Statistics
House Prices:
Mean: ($3,000,000/5)
$2,000,000 = $600,000
500,000
300,000
100,000
100,000
Median: middle value of ranked data
Sum $3,000,000
= $300,000
Q1 Q2 Q3
The first quartile, Q1, is the value for which 25% of the
observations are smaller and 75% are larger
Q is the same as the median (50% are smaller, 50% are
2
larger)
Only 25% of the observations are greater than the third
quartile
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-15
Quartile Formulas
(n = 9)
Q1 is in the (9+1)/4 = 2.5 position of the ranked data
so use the value half way between the 2nd and 3rd values,
so Q1 = 12.5
(n = 9)
Q1 is in the (9+1)/4 = 2.5 position of the ranked data,
so Q1 = 12.5
XG = ( X1 × X 2 × × Xn ) 1/ n
R G = [(1 + R1 ) × (1 + R 2 ) × × (1 + Rn )]1/ n − 1
Where Ri is the rate of return in time period i
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-19
Example
Geometric R G = [(1 + R1 ) × (1 + R 2 ) × × (1 + Rn )] − 1
1/ n
mean rate
= [(1 + ( −50%)) × (1 + (100%))]1/ 2 − 1 More
of return:
accurate
= [(.50) ×10e
Basic Business Statistics, (2)] © − 1 = 1 − 1 = 0%
1/ 2 1/ 2
result
2006 Prentice-Hall, Inc. Chap 3-21
Measures of Variation
Variation
Same center,
Basic Business Statistics, 10e ©
different variation
2006 Prentice-Hall, Inc. Chap 3-22
Range
Simplest measure of variation
Difference between the largest and the smallest
values in a set of data:
Example:
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
Range = 14 - 1 = 13
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-23
Disadvantages of the Range
Ignores the way in which data are distributed
7 8 9 10 11 12 7 8 9 10 11 12
Range = 12 - 7 = 5 Range = 12 - 7 = 5
Sensitive to outliers
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,5
Range = 5 - 1 = 4
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,120
Range = 120 - 1 = 119
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-24
Interquartile Range
Example:
Median X
X Q1 Q3 maximum
minimum (Q2)
25% 25% 25% 25%
12 30 45 57 70
Interquartile range
= 57 – 30 = 27
S =2 i=1
n -1
Where X = mean
n = sample size
Xi = ith value
Basic Business Statistics, 10e of
© the variable X
2006 Prentice-Hall, Inc. Chap 3-27
Standard Deviation
Most commonly used measure of variation
Shows variation about the mean
Is the square root of the variance
Has the same units as the original data
n
Sample standard deviation: ∑ (X − X)
i
2
S= i=1
n -1
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-28
Calculation Example:
Sample Standard Deviation
Sample
Data (Xi) : 10 12 14 15 17 18 18 24
n=8 Mean = X = 16
Data A
Mean = 15.5
11 12 13 14 15 16 17 18 19 20 21 S = 3.338
Data B
Mean = 15.5
11 12 13 14 15 16 17 18 19 20 21 S = 0.926
Data C
Mean = 15.5
11 12 13 14 15 16 17 18 19 20 21 S = 4.567
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-31
Advantages of Variance and
Standard Deviation
S
CV =
X
⋅100%
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-33
Comparing Coefficient
of Variation
Stock A:
Average price last year = $50
Standard deviation = $5
S $5
CVA = ⋅ 100% = ⋅ 100% = 10%
X $50 Both stocks
have the same
Stock B: standard
Average price last year = $100 deviation, but
stock B is less
Standard deviation = $5
variable relative
to its price
S $5
CVB = ⋅ 100% = ⋅ 100% = 5%
X $100
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-34
Z Scores
Example:
If the mean is 14.0 and the standard deviation is 3.0,
what is the Z score for the value 18.5?
X − X 18.5 −14.0
Z= = =1.5
S 3.0
The value 18.5 is 1.5 standard deviations above the
mean
(A negative Z-score would mean that a value is less
than the Statistics,
Basic Business mean) 10e ©
2006 Prentice-Hall, Inc. Chap 3-36
Shape of a Distribution
Click OK
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-40
Excel output
Microsoft Excel
descriptive statistics output,
using the house price data:
House Prices:
$2,000,000
500,000
300,000
100,000
100,000
∑X i
X1 + X 2 + + XN
µ= i=1
=
N N
Where μ = population mean
N = population size
Xi = ith value
Basic Business Statistics, 10eof©the variable X
2006 Prentice-Hall, Inc. Chap 3-42
Population Variance
σ2 = i=1
N
N
Population standard deviation:
∑ i − 2
(X μ)
σ= i=1
N
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-44
The Empirical Rule
68%
μ
± 1σ
Basic Business Statistics, 10eμ ©
2006 Prentice-Hall, Inc. Chap 3-45
The Empirical Rule
μ ± 2σ contains about 95% of the values in
the population or the sample
μ ± 3σ contains about 99.7% of the values
in the population or the sample
95% 99.7%
μ ± 2σ μ ± 3σ
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-46
Chebyshev Rule
∑m f
j=1
j j
X=
n
Where n = number of values or sample size
c = number of classes in the frequency distribution
mj = midpoint of the jth class
Basic Business Statistics, 10e
fj = number of © in the jth class
values
2006 Prentice-Hall, Inc. Chap 3-48
Approximating the Standard Deviation
from a Frequency Distribution
Assume that all values within each class interval
are located at the midpoint of the class
∑ (m − X)
j =1
j
2
fj
S=
n -1
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-49
Exploratory Data Analysis
Example:
Q1 Q2 Q3 Q1 Q2 Q3 Q1 Q2 Q3
00 22 33 55 27
27
The data are right skewed, as the plot depicts
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-53
The Sample Covariance
The sample covariance measures the strength of the
linear relationship between two variables (called
bivariate data)
∑ ( X − X)( Y − Y )
i i
cov ( X , Y ) = i=1
n −1
Only concerned with the strength of the relationship
Basic Business Statistics, 10e ©
No causal effect is implied
cov (X , Y) = i=1
SX = i=1
SY = i=1
n −1 n −1 n −1
Basic Business Statistics, 10e ©
2006 Prentice-Hall, Inc. Chap 3-56
Features of
Correlation Coefficient, r
Unit free
Ranges between –1 and 1
The closer to –1, the stronger the negative linear
relationship
The closer to 1, the stronger the positive linear
relationship
The closer to 0, the weaker the linear relationship
X X X
r = -1 r = -.6 r=0
Y
Y Y
X X X
Basic Business
r = +1 Statistics, 10er ©
= +.3 r=0
2006 Prentice-Hall, Inc. Chap 3-58
Using Excel to Find
the Correlation Coefficient
Select
Tools/Data Analysis
Choose Correlation from
the selection menu
Click OK . . .
100
95
There is a relatively
Test #2 Score
90
80
relationship between 75
test score #1 70
70 75 80 85 90 95 100