Exploring Data
Chapter 4
Unsorted: 9 14 5 16 12 11 10 8 15 13 17 20 8 19 15
Sorted: 5 8 8 9 10 11 12 13 14 15 15 16 17 19 20
Unsorted: 76 34 65 42 74 22 83 59 90 44
Sorted: 22 34 42 44 59 65 74 76 83 90
43, 49, 61, 71, 75, 87, 88, 91, 101, 104
Interquartile Range = Q3 – Q1
Answer:
Step 1: Arrange the data in ascending order:
3, 5, 7, 8, 9, 11, 15, 16, 20, 21
Step 2: Find the quartiles and the interquartile range:
Q1 = 6.5, Q3 = 17
Interquartile range = 17 – 6.5 = 10.5
Step 3: Multiply the interquartile range by 1.5: 10.5 × 1.5 = 15.75
Step 4: Compute the limits: 6.5 – 15.75 = -9.25 and 17 + 15.75 = 32.75
Step 5: Check the data set for any data values that fall outside the
interval from -9.25 to 32.75. The values -10 and 40 are outside this
interval; hence, they can be considered outliers.
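The steps above can be sketched in Python. This is a minimal illustration, not the textbook's own code: the function names `quartile` and `outlier_limits` are made up for this example, and it assumes quartiles are located at position (n + 1) × p in the sorted data with linear interpolation, which reproduces the Q1 and Q3 shown in the answer. It also assumes at least three data values.

```python
import math

def quartile(sorted_values, p):
    """Value at location (n + 1) * p in sorted data, with linear interpolation.

    Assumes at least three values, so the location falls inside the data.
    """
    n = len(sorted_values)
    location = (n + 1) * p               # 1-based position in the sorted data
    lower = math.floor(location)         # whole part of the position
    fraction = location - lower          # fractional part of the position
    upper = min(lower + 1, n)            # do not step past the last value
    low_val = sorted_values[lower - 1]
    high_val = sorted_values[upper - 1]
    return low_val + fraction * (high_val - low_val)

def outlier_limits(values):
    """Return Q1, Q3 and the limits Q1 - 1.5(IQR) and Q3 + 1.5(IQR)."""
    data = sorted(values)
    q1 = quartile(data, 0.25)
    q3 = quartile(data, 0.75)
    step = 1.5 * (q3 - q1)
    return q1, q3, q1 - step, q3 + step

data = [3, 5, 7, 8, 9, 11, 15, 16, 20, 21]
q1, q3, low, high = outlier_limits(data)
print(q1, q3, low, high)  # 6.5 17.0 -9.25 32.75
```

Any value below `low` or above `high` would be flagged as a possible outlier.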
4-12 Copyright 2019 by McGraw-Hill Education. All rights reserved.
Exercise
Check the following data set for outliers.
22, 6, 50, 13, 15, 18, 5, 12
Answer:
5, 6, 12, 13, 15, 18, 22, 50
Q1 = 7.5, Q3 = 21, IQR = 21 – 7.5 = 13.5
Q1 – 1.5(IQR) = 7.5 – 1.5(13.5) = -12.75
Q3 + 1.5(IQR) = 21 + 1.5(13.5) = 41.25
Check the data set for any data values that fall outside the
interval from -12.75 to 41.25. The value 50 is outside this
interval; hence, it can be considered an outlier.
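The exercise can be cross-checked with Python's standard library. This is a sketch under one assumption: `statistics.quantiles` with `method="exclusive"` locates quartiles at (n + 1) × p, which agrees with the hand calculation here. Other software may use a different quartile convention and give slightly different limits.

```python
import statistics

data = [22, 6, 50, 13, 15, 18, 5, 12]

# method="exclusive" places the quartiles at (n + 1) * p in the sorted data.
q1, median, q3 = statistics.quantiles(data, n=4, method="exclusive")
iqr = q3 - q1
lower_limit = q1 - 1.5 * iqr
upper_limit = q3 + 1.5 * iqr

# Values outside the interval [lower_limit, upper_limit] are possible outliers.
outliers = [x for x in data if x < lower_limit or x > upper_limit]

print(q1, q3, iqr)               # 7.5 21.0 13.5
print(lower_limit, upper_limit)  # -12.75 41.25
print(outliers)                  # [50]
```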
Box Plots
A box plot is a graphical display, based on quartiles, that shows the general
shape of a variable’s distribution.
What are the smallest and largest values, the first and third
quartiles, and the median? Would you agree that the
distribution is symmetrical?
Estimate the interquartile range.
[Figure: box plot of gasoline mileage (variable C1); horizontal axis from 10 to 50]
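A box plot is drawn from five numbers: the smallest value, Q1, the median, Q3, and the largest value. The sketch below computes them for the ten values 43, 49, ..., 104 listed earlier in this chapter, since the gasoline mileage data behind the figure are not available here. The function name `five_number_summary` is made up for this example, and the (n + 1) × p quartile convention is an assumption.

```python
import statistics

values = [43, 49, 61, 71, 75, 87, 88, 91, 101, 104]

def five_number_summary(data):
    """Return (smallest, Q1, median, Q3, largest) for a box plot."""
    q1, median, q3 = statistics.quantiles(data, n=4, method="exclusive")
    return min(data), q1, median, q3, max(data)

smallest, q1, median, q3, largest = five_number_summary(values)
print(smallest, q1, median, q3, largest)  # 43 58.0 81.0 93.5 104

# Rough symmetry check: compare the two halves of the box.
print(median - q1, q3 - median)  # 23.0 12.5
print(q3 - q1)                   # interquartile range: 35.5
```

Because the lower half of the box (23.0) is wider than the upper half (12.5), these values do not look symmetrical around the median.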
Skewness
The coefficient of skewness is a measure of the symmetry of a
distribution.
Two formulas for coefficient of skewness
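The two formulas themselves did not survive on this slide. As an assumption, the sketch below uses the pair commonly taught together in introductory statistics: Pearson's coefficient, sk = 3(mean – median)/s, and the coefficient reported by many software packages, sk = [n/((n – 1)(n – 2))] × Σ((x – mean)/s)³, where s is the sample standard deviation. The function names are made up for this example, and the data are the ten values 43, 49, ..., 104 listed earlier in this chapter.

```python
import statistics

def pearson_skewness(data):
    """Pearson's coefficient: 3 * (mean - median) / s (assumed formula)."""
    mean = statistics.mean(data)
    median = statistics.median(data)
    s = statistics.stdev(data)           # sample standard deviation
    return 3 * (mean - median) / s

def software_skewness(data):
    """Software coefficient: n / ((n-1)(n-2)) * sum(((x - mean) / s) ** 3)."""
    n = len(data)
    mean = statistics.mean(data)
    s = statistics.stdev(data)
    total = sum(((x - mean) / s) ** 3 for x in data)
    return n / ((n - 1) * (n - 2)) * total

values = [43, 49, 61, 71, 75, 87, 88, 91, 101, 104]
print(round(pearson_skewness(values), 3))
print(round(software_skewness(values), 3))
```

For these values both coefficients come out negative, which indicates a longer tail toward the smaller values. A result near zero would indicate a roughly symmetric distribution.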
Describing the Relationship Between Two Variables
When we study a single variable, we refer to the data as
univariate.
When we study the relationship between two variables, we
refer to the data as bivariate.
Would it be reasonable to conclude that the more expensive
vehicles are purchased by older buyers?
Is there a relationship between the profit earned on a vehicle
sale and the age of the purchaser?
Do tall fathers tend to have tall children?
One graphical technique we use to show the relationship between
variables is called a scatter diagram.
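A scatter diagram plots one variable on the horizontal axis and the other on the vertical axis, one point per observation. The sketch below assumes the third-party matplotlib library is installed. The ages and profits are hypothetical values invented for illustration only, not data from the chapter, and `plot_scatter` is a made-up helper name.

```python
# Hypothetical bivariate data, invented for illustration only.
ages = [21, 25, 32, 38, 44, 47, 53, 59, 64, 70]                           # buyer age in years
profits = [1200, 1100, 1500, 1650, 1900, 1750, 2100, 2300, 2200, 2600]    # profit in dollars

# Each observation is one (age, profit) pair, which becomes one point.
pairs = list(zip(ages, profits))

def plot_scatter(x, y):
    """Draw a scatter diagram of y against x (requires matplotlib)."""
    import matplotlib.pyplot as plt      # imported here so the data code runs without it
    plt.scatter(x, y)
    plt.xlabel("Age of purchaser (years)")
    plt.ylabel("Profit on vehicle sale ($)")
    plt.title("Profit versus age (hypothetical data)")
    plt.show()

# To display the diagram: plot_scatter(ages, profits)
```

If the points drift upward from left to right, as these hypothetical points would, the two variables tend to increase together.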
How to report the relationships in Scatter Diagrams?