Professional Documents
Culture Documents
2.1
Definitions…
A variable is some characteristic of a population or sample.
E.g. student grades.
Ordinal
Values must represent the ranked order of the data.
Data may be treated as nominal but not as interval.
Nominal
Values are the arbitrary numbers that represent categories.
Data may not be treated as ordinal or interval.
30 25
19 17
20 13
10
0
1 2 3 4 5 6 7
5
21%
2
7%
4
4%
3
22%
Pie Charts show relative frequencies…
Copyright © 2009 Cengage Learning 2.11
Nominal Data
Light Beer Brand Frequency Relative Frequency
Michelob Light
62
13
21.8
4.6
(based on the same data).
Miller Lite 59 20.7
Just different presentation.
Natural Light 25 8.8
80
70
60
Frequency
50
40
30
20
10
0
15 30 45 60 75 90 105 120
Bills
(18+28+14=60)÷200 = 30%
about half (71+37=108)
i.e. nearly a third of the phone bills
of the bills are “small”,
are $90 or more.
i.e. less than $30
Frequency
Frequency
Variable Variable Variable
Frequency
Variable Variable
Bimodal
Unimodal
Frequency
Frequency
Variable Variable
A modal class
Copyright © 2009 Cengage Learning 2.25
Shapes of Histograms…
A special type of symmetric unimodal histogram is one that
is bell shaped:
Frequency
Variable
Bell Shaped
Copyright © 2009 Cengage Learning 2.26
Stem & Leaf Display…
• Retains information about individual observations that
would normally be lost in the creation of a histogram.
Stem Leaf
0 0000000000111112222223333345555556666666778888999999
1 000001111233333334455555667889999
2 0000111112344666778999
3 001335589
4 124445589
5 33566
6 3458
7 022224556789
8 334457889999 Thus, we still have access to our
9 00112222233344555999 original data point’s value!
10 001344446699
11 124557889
first class…
next class: .355+.185=.540
:
:
“around $35”
(Refer also to Fig. 2.13 in your textbook)
Copyright © 2009 Cengage Learning 2.34
Describing Time Series Data
Observations measured at successive points in time are
called time-series data.
Line chart plots the value of the variable on the vertical axis
against time periods on the horizontal axis.
3.5
2.5
1.5
0.5
0
1 25 49 73 97 121 145 169 193 217 241 265 289 313 337
1.800
1.600
1.400
1.200
1.000
0.800
0.600
0.400
0.200
0.000
1 25 49 73 97 121 145 169 193 217 241 265 289 313 337
10
0
30 36 42 48 54 60 66
Ages
❖ Cross – tabulation y
❖ Scatter Diagram
x
Crosstabulations and Scatter
Diagrams
◼ So far we have focused on methods that are used
to summarize the data for one variable at a time.
◼ Often a manager is interested in tabular and
graphical methods that will help understand the
relationship between two variables.
◼ Cross-tabulation and a scatter diagram are two
methods for summarizing the data for two (or more)
variables simultaneously.
◼ The left and top margin labels define the classes for
the two variables.
Frequency distribution
for the price variable
Frequency distribution
for the home style variable
x = Number of y = Number of
Interceptions Points Scored
1 14
3 24
2 18
1 17
3 30
30
25
20
15
10
5
0 x
0 1 2 3 4
Number of Interceptions
Interval Nominal
Data Data
Histogram Frequency and
Single Set of Relative Frequency
Data Tables, Bar and Pie
Charts
Relationship Scatter Diagram Bar Charts
Between
Two Variables