Professional Documents
Culture Documents
Numerical Descriptive
Measures
of normal distributions is
that most of the scores pile
up in the middle.
■Normal distributions are
symmetrical in that the right
and left sides of the graph
are identical.
Skewness
Statistic <0 0 >0
Sharper Peak
Than Bell-Shaped
(Kurtosis > 0)
Bell-Shaped
(Kurtosis = 0)
Flatter Than
Bell-Shaped
(Kurtosis < 0)
11 12 13 14 15 16 17 18 19 20 11 12 13 14 15 16 17 18 19 20
Mean = 13 Mean = 14
11 12 13 14 15 16 17 18 19 20 11 12 13 14 15 16 17 18 19 20
Median = 13 Median = 13
■ The location of the median when the values are in numerical order
(smallest to largest):
Note that is not the value of the median, only the position of
the median in the ranked data
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 0 1 2 3 4 5 6
Mode = 9 No Mode
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 26
Measures of Central Tendency:
Review Example
Central Tendency
deviation
Same center,
different variation
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 38
Measures of Variation:
The Range
Example:
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14
Range = 13 - 1 = 12
7 8 9 10 11 12 7 8 9 10 11 12
Range = 12 - 7 = 5 Range = 12 - 7 = 5
▪ Sensitive to outliers
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,5
Range = 5 - 1 = 4
1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,120
Range = 120 - 1 = 119
■ Sample variance:
Excel File
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 44
Measures of Variation:
Sample Standard Deviation:
Calculation Example
Sample
Data (Xi) : 10 12 14 15 17 18 18 24
n=8 Mean = X = 16
Data A
Mean = 15.5
11 12 13 14 15 16 17 18 19 20 21 S = 3.338
Both stocks
■ Stock B: have the same
standard
■ Average price last year = $100 deviation, but
■ Standard deviation = $5 stock B is less
variable relative
to its price
Stock C has a
■ Stock C: much smaller
standard
■ Average price last year = $8 deviation but a
■ Standard deviation = $2 much higher
coefficient of
variation
House Prices:
$2,000,000
500,000
300,000
100,000
100,000
Total
Variable Count Mean SE Mean StDev Variance Sum Minimum
House Price 5 600000 357771 800000 6.40000E+11 3000000 100000
N for
Variable Median Maximum Range Mode Mode Skewness Kurtosis
House Price 300000 2000000 1900000 100000 2 2.01 4.13
Q1 Q2 Q3
■ The first quartile, Q1, is the value for which 25% of the
observations are smaller and 75% are larger
■ Q2 is the same as the median (50% of the observations
are smaller and 50% are larger)
■ Only 25% of the observations are greater than the third
quartile
(n = 9)
Q1 is in the (9+1)/4 = 2.5 position of the ranked data
so use the value half way between the 2nd and 3rd values,
so Q1 = 12.5
Q1 and Q3 are measures of non-central location
Q2 = median, is a measure of central tendency
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 71
Quartile Measures
Calculating The Quartiles: Example
(n = 9)
Q1 is in the (9+1)/4 = 2.5 position of the ranked data,
so Q1 = (12+13)/2 = 12.5
■ Measures like Q1, Q3, and IQR that are not influenced
by outliers are called resistant measures
Example:
X Median X
minimu Q1 (Q2) Q3 maximu
m m
25% 25% 25%
25%
12 30 45 57
70
Interquartile range
= 57 – 30 = 27
> ≈ <
Xlargest – Median Xlargest – Median Xlargest – Median
Q1 – Xsmallest Q1 – Xsmallest Q1 – Xsmallest
> ≈ <
Xlargest – Q3 Xlargest – Q3 Xlargest – Q3
Median – Q1 Median – Q1 Median – Q1
> ≈ <
Q3 – Median Q3 – Median Q3 – Median
Q1 Q2 Q 3 Q1 Q 2 Q3 Q1 Q2 Q3
0 2 3 5 27
𝟑 ( 𝒎𝒆𝒂𝒏 − 𝒎𝒆𝒅𝒊𝒂𝒏 )
𝑺𝑲 =
𝑺𝒕𝒂𝒏𝒅𝒂𝒓𝒅 𝒅𝒆𝒗𝒊𝒂𝒕𝒊𝒐𝒏
𝐹𝑖 𝑋 𝑖
´𝑋 = ∑
∑ 𝐹𝑖
■ Solution b - The
median position is
𝒏
𝑴𝒆𝒅𝒊𝒂𝒏=𝒍 +𝒉 × ( 𝟐
− 𝒄𝒇
𝒇 )
𝑛=𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑜𝑏𝑠𝑒𝑟𝑣𝑎𝑡𝑖𝑜𝑛
𝑓 =𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑜𝑓 𝑚𝑒𝑑𝑖𝑎𝑛 𝑐𝑙𝑎𝑠𝑠
h=𝑐𝑙𝑎𝑠𝑠 𝑠𝑖𝑧𝑒
𝑐𝑓 =𝑐𝑢𝑚𝑢𝑙𝑎𝑡𝑖𝑣𝑒 𝑓𝑟𝑒𝑞𝑢𝑒𝑛𝑐𝑦 𝑜𝑓 𝑡h𝑒 𝑝𝑟𝑒𝑐𝑒𝑑𝑖𝑛𝑔 𝑚𝑒𝑑𝑖𝑎𝑛 𝑐𝑙𝑎𝑠𝑠
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 84
Measuring the Skew 𝟑 ( 𝒎𝒆𝒂𝒏 − 𝒎𝒆𝒅𝒊𝒂𝒏 𝑭𝒊 𝑿 𝒊 ) ∑
𝑺𝑲 = ´ =
𝑿
𝑺𝒕𝒂𝒏𝒅𝒂𝒓𝒅 𝒅𝒆𝒗𝒊𝒂𝒕𝒊𝒐𝒏
∑ 𝑭𝒊
No. of
No. of
Output Mid Point Employee
Employee
(cumulative)
No. of
No. of
Output Mid Point Employee
Employee
(cumulative)
No. of
No. of
Output Mid Point Employee
Employee
(cumulative)
■ Population variance:
Variance
Standard
Deviation
68%
95% 99.7%
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 100
Percentiles
■ 80th Percentile
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 101
Percentiles
■ 75th Percentile is 3rd Quartile
Example:
X Median X
minimu Q1 (Q2) Q3 maximu
m m
25% 25% 25%
25%
12 30 45 57
70
Interquartile range
= 57 – 30 = 27
75th Percentile
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 102
We Discuss Two Measures Of The Relationship
Between Two Numerical Variables
▪ The Covariance
▪ The Coefficient of Correlation
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 103
The Covariance
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 104
Interpreting Covariance
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 105
Coefficient of Correlation
where
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 106
Features of the
Coefficient of Correlation
■ The population coefficient of correlation is referred as ρ.
■ The sample coefficient of correlation is referred to as r.
■ Either ρ or r have the following features:
■ Unit free
■ Range between –1 and 1
■ The closer to –1, the stronger the negative linear relationship
■ The closer to 1, the stronger the positive linear relationship
■ The closer to 0, the weaker the linear relationship
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 107
Scatter Plots of Sample Data with
Various Coefficients of Correlation
Y Y
X X
r = -1 r = -.6
Y
Y Y
X X X
r = +1 r = +.3 r=0
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 108
The Coefficient of Correlation Using
Microsoft Excel Function
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 109
The Coefficient of Correlation Using
Microsoft Excel Data Analysis Tool
1. Select Data
2. Choose Data Analysis
3. Choose Correlation &
Click OK
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 110
The Coefficient of Correlation
Using Microsoft Excel
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 111
Interpreting the Coefficient of Correlation
Using Microsoft Excel
▪ r = .733
▪ There is a relatively
strong positive linear
relationship between test
score #1 and test score
#2.
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 112
Pitfalls in Numerical
Descriptive Measures
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 113
Ethical Considerations
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 114
Chapter Summary
Copyright © 2016, 2013, 2010 Pearson Education, Inc. Chapter 3, Slide 115