Professional Documents
Culture Documents
data
Bar Chart
When to Use Categorical data
How to construct
– Draw a horizontal line; write the categories or
labels below the line at regularly spaced
intervals
– Draw a vertical line; label the scale using
frequency or relative frequency
– Place equal-width rectangular bars above each
category label with a height determined by its
frequency or relative frequency
Bar Chart (continued)
What to Look For
Frequently or infrequently occurring
categories
How to construct
– MUST first calculate relative frequencies
– Draw a bar representing 100% of the group
– Divide the bar into segments corresponding to
the relative frequencies of the categories
Remember the Princeton survey . . .
Relative Frequency
Firstdraw a bar
Ideal Distance Students that represents
Parents
Less than 250 miles .35 .53 100% of the
250 to 500 miles .31 .30 students who
500 to 1000 miles .19 .11 answered the
More than 1000 miles .15 .06 survey.
Relative Frequency
Notice
Ideal that
Distance this segmented
Students bar chart displays the
Parents
same
Less thanrelationship
250 miles between
.35 the
.53opinions of students
250
andtoparents
500 milesconcerning
.31 the ideal
.30 distance that college
500 to is
1000 miles
from home as.19the double.11 bar chart does.
More than 1000 miles .15 .06
1.0 First
Next, draw a bar
divide the
Do that
the same thing for
represents
0.8 Less thanbar into segments.
250 miles
parents100% – don’t
of theforget
250 to 500 miles
0.6 a key denoting
students whoeach
Relative frequency
0.2
survey.
Students Parents
Pie (Circle) Chart
When to Use Categorical data
How to construct
– Draw a circle to represent the entire data set
– Calculate the size of each “slice”:
Relative frequency × 360°
– Using a protractor, mark off each slice
To describe
– comment on which category had the largest
proportion or smallest proportion
Typos on a résumé do not make a very good
impression when applying for a job. Senior
executives were asked how many typos in a
résumé would make them not consider a job
candidate. The resulting data are summarized
in the table below.
Number of Typos Frequency Relative Frequency Create a pie
1 60 .40 chart for
2 54 .36 these data.
3 21 .14
4 or more 10 .07
Don’t know 5 .03
Number of Typos Frequency Relative Frequency
1
What does this 60
pie chart tell.40 us about the number of
2
typos occurring 54
in résumés before.36
the applicant would not be considered
for a job?
3 21 .14
4 or more 10 .07
Don’t know 5 .03
First draw a circle
Next,
Repeat calculate
to represent
for each the
theslice.
size of the
entire slice
data for “1
set.
typo”
Here is the
completed pie chart
.40×360º
created=144º
using
Minitab.
Draw that slice.
Dotplot
When to Use Small numerical data
sets
How to construct
– Draw a horizontal line and mark it with an
appropriate numerical scale
– Locate each value in the data set along the scale
and represent it by a dot. If there are two are
more observations with the same value, stack the
dots vertically
Dotplot (continued)
What to Look For
– The representative or typical value
– The extent to which the data values spread out
– The nature of the distribution along the number line
– The presence of unusual values
Collect the following data and then display the data in a dotplot:
How to construct
– Constructed like bar charts, but with two (or
more) groups being compared
– MUST use relative frequencies on the vertical
axis
– MUST include a key to denote the different bars
Why MUST we use relative frequencies?
Stem-and-Leaf Displays
When to Use Univariate numerical data
How to construct
Can Each
– Select
also number
one
create is split
orcomparative
more of theinto two parts:
leading digitsdisplays
stem-and-leaf for the
Remember
stem the data set collected in Chapter 1 – how many
– List Stem
piercingsthe possible
do you
– have?
consistsstem of values
Would the firstinUse
a vertical
a stem-and-leaf
digit(s) column
display be a
Record
– good the
graph leaf
for thisfor each
distribution? Whyfor
observation
small
orbeside
why to
not?
Leaf - consistsstem
each corresponding value
of the final digit(s)
moderate sized data
– Indicate the units for stems sets. and leaves in awork
Doesn’t key
or legend Be sure to list every
If you have a long well for large data
stemlists
from of the
leaves
sets.
To describe behind a few stems,
smallest youlargest
to the can
– comment on the center, spread,
split stemsand shape
in order toofspread
value the
distribution and if there areoutanythe
unusual features
distribution.
The following data are price per ounce for various brands
of different brands of dandruff shampoo at a local
grocery store.
0.32 0.21 0.29 0.54 0.17 0.28 0.36 0.23
Notice
We use L for lower leaf values (0-4)that
andnow
H foryou can
see the shape of this
higher leaf values (5-9).
distribution.
Central Africa
58.3 34.6 35.5 45.4 38.6 63.8 53.9 61.9 69.9
43.0 85.0 63.4 58.4 61.9 40.9 73.9 34.8 74.4
97.4 61.0 66.7 79.6
Histograms
When to Use Univariate numerical data
12 2 4 6 6 7 8 7 8 11 8 3 5 6 7 10 1 9
7 6 9 7 5 4 7 4 6 7 8 10
0
0 1 2 3 4 5 6 7 8 9 10 11 12
How to construct
- construct similar to histograms with continuous data,
but with density on the vertical axis
0.6
Approximately 0.55
0.4
0.2
2 4 6 8 10 12 14
Rainfall
1.0 Approximately 30% of the years had
annual rainfall less than what
0.8 amount?
Cumulative relative frequency
0.6
0.4
0.2
Rainfall
Which interval of rainfall
amounts had a larger 1.0
proportion of years –
0.8
9 to 10 inches or
Cumulative relative frequency
10 to 11 inches?
0.6
Explain
0.4
The interval 10 to 11 inches,
because its slope is steeper,
0.2 indicating a larger proportion
occurred.
2 4 6 8 10 12 14
Rainfall
Displaying Bivariate
Numerical Data
Scatterplots
When to Use Bivariate numerical data
Scatterplots are discussed in much greater depth in
Chapter 5.
How to construct
- Draw a horizontal scale and mark it with
appropriate values of the independent variable
- Draw a vertical scale and mark it appropriate
values of the dependent variable
- Plot each point corresponding to the
observations
To describe
- comment the relationship between the variables
Time Series Plots
When to Use
- measurements collected over time at regular
intervals
How to construct
- Draw
Can a horizontalbivariate
be considered scale and mark it with
appropriate
data where thevalues of time
y-variable is
-the
Draw a vertical
variable scale and
measured and mark it appropriate
values of the observed variable
theeach
- Plot x-variable
point is time
corresponding to the
observations and connect
To describe
- comment on any trends or patterns over time
The accompanying time-series plot of movie box
office totals (in millions of dollars) over 18
weeks in the summer for 2001 and 2002
appeared in USA Today (September 3, 2002).
Describe any
trends or
patterns that
you see.