Professional Documents
Culture Documents
Nsbe9ege Ism Ch01
Nsbe9ege Ism Ch01
1.1
a. Categorical – ordinal. Size indicates rank ordering.
b. Numerical – discrete. The number comes from a counting process.
c. Numerical – continuous. Since the newborn baby’s weight arises from a
measurement (not a counting) process.
d. Categorical– nominal. Since the nationality does not imply ranking of
governor, it is nominal.
1.2
a. Categorical, nominal (The response is categorical because the responses can be
grouped into classes or categories, in this case yes/no. The measurement levels are
nominal because the responses are words that describe the categories.)
b. Categorical, ordinal (The response is categorical because the responses can be
grouped into classes or categories. The measurement levels are ordinal because these
are rankings of the data.)
c. Numerical, discrete (The response is numerical because the responses cannot be
grouped into classes or categories. Since the response is an actual cost, it is discrete
because the value comes from a counting process.)
1.3
The response to the amount spent in a year is numerical since it cannot be grouped into
classes or categories. The measurement levels are discrete as the responses come from a
counting process.
The response to the level of satisfaction received is categorical since it can be grouped
into classes or categories. The measurement levels are ordinal because the responses can
be ranked from 1 to 5.
1.4
a. Categorical – Qualitative – ordinal
b. Numerical – Quantitative – discrete
c. Categorical – Qualitative – nominal
d. Categorical – Qualitative – nominal
1.5
a. Categorical, nominal (The response is nominal since the categories cannot be ranked).
b. Categorical, nominal (The response is nominal since the categories cannot be ranked).
c. Numerical, continuous (The response is a numerical and continuous since the amount of
money spent on food can be any numerical value, it can obtain a decimal value as well).
d. Numerical, discrete (The response is a numerical -discrete since it can take only the integer
values).
e. Categorical, ordinal, (The response is ordinal since the categories can be ranked regarding
the length of time interval).
1.6
a. Categorical – Qualitative – nominal
b. Numerical – Quantitative - discrete
c. Categorical – Qualitative – nominal: yes/no response
d. Categorical – Qualitative – ordinal
1.7
a. An example of a categorical variable with ordinal responses is the "Internet"
variable.
b. An example of a categorical variable with nominal responses is the "More Use"
variable.
c. An example of a numerical variable with discrete responses is the "Books Each
Time" variable.
1.8
a. Various answers – Categorical variable with ordinal responses:
PIR_grp (1-5)
b. Various answers – Categorical variable with nominal responses: female
(male/female – 0/1)
c. Various answers – Numerical variable with continuous responses: daily_cost
d. Various answers – Numerical variable with discrete responses: PIR_p
1.9
a. Pie chart
b. Bar chart
1.10
Pareto diagram – possible defects for a product line
1.11
a. Bar chart
b. Pie chart
Very Satisfied
Moderately Satisfied
No Opinion
Very Satisfied Moderately Dissatisfied
49% Very Dissatisfied
Moderately
Satisfied
39%
1.12
Time employees need to get to work according to type of transportation
1.13
Pie chart
1.14
a. Bar chart indicating the number of employees of domestic credit institutions in the
selected countries of European Union (EU) in 2012.
600000
Number of employees
500000
400000
300000
200000
100000
0
Belgium Bulgaria Czech Republic Denmark Germany
Country
b. Bar chart indicating the number of employees of domestic credit institutions in the
selected countries of European Union (EU) in 2013.
600000
Number of employees
500000
400000
300000
200000
100000
0
Belgium Bulgaria Czech Republic Denmark Germany
Country
c. Bar chart comparing the number of employees of domestic credit institutions in 2012
to the number of employees of domestic credit institutions in 2013 in the selected EU
countries.
600000
Number of employees
500000
400000
300000
200000
100000
0
2012 2013 2012 2013 2012 2013 2012 2013 2012 2013
C1 Belgium Bulgaria Czech Republic Denmark Germany
1.15
1.16
Describe the data graphically
1.17
a. Cluster bar chart of the respondents’ major and gender
Student Majors
16
14
12
Number of Students
10
Male
8
Female
6
0
Accounting Financial Management Marketing Undecided
Major
1.18
a. Pie chart of the percent of males in each of the activity level categories.
b. Pie chart of the percent of females in each of the activity level categories.
1.19
a. Pie chart for worldwide market shares for February 2011.
b. Answer will vary: Pie chart for worldwide market shares for October 2011.
c. Answers will vary; Pie chart for Asian market shares for October 2011.
1.20
1.21
Time-series plot of number of male and female tourists from India visiting Korea
from March 2018 through December 2018.
5000
4000
3000
2000
1000
, , il, , , , , r, r, r, r
ry ch r ay ne ly st be
ua ar Ap M Ju Ju gu be o be be m
br M Au
pt
m ct e m ce
Fe Se
O ov De
N
Month
1.22
a. Time-series plot of Gross domestic investment and Gross private domestic
1.23
a. Time-series plot of real GDP growth rate for EU (28 countries) from 2011 to 2017.
10
8
EU GDP growth rate
b. Time-series plot of real GDP growth rate for Malta from 2011 to 2017.
1.24
a. The Euro (EUR) compared to 1 U.S. Dollar (USD)
1.25
a. Time-series plot of worldwide market shares of IE, Firefox, Chrome, Safari,
and Opera.
b. Time-series plot of European market shares of IE, Firefox, Chrome, Safari, and Opera.
1.26
Time-series plot of a stock market index (Dow Jones Industrial Average) over 14
years from the internet
1.27
400
300
200
100
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14
YEAR
1.28
Time-series plot of Housing Starts data
1.29
The earnings per share increased and decreased a fair amount, though sudden jumps up or down
in consecutive years were uncommon.
1.30
a. 5 – 7 classes
b. 7 – 8 classes
c. 8 – 10 classes
d. 8 – 10 classes
e. 10 – 11 classes
1.31
1.32
a. frequency distribution
Bin Frequency
10 0
20 5
30 3
40 8
50 3
60 5
70 4
More 0
b. histogram and
c. ogive
Histogram
9 120.00%
8
Frequency
100.00%
7
6 80.00%
5
60.00%
4
3 40.00%
2
20.00%
1
0 0.00%
10 20 30 40 50 60 70 More
Bin
Frequency Cumulative %
d. stem-and-leaf display
Stem-and-Leaf Display: Data_Ex1.32
Stem-and-leaf of Data_Ex1.32 N = 28
Leaf Unit = 1.0
2 1 23
5 1 557
7 2 14
8 2 8
9 3 2
(6) 3 567799
13 4 0144
9 4
9 5 14
7 5 699
4 6 24
2 6 55
1.33
Stem-and-Leaf Display: Price of 20 Cryptocurrencies
N = 20.
Leaf Unit = 0.10
Stem unit: 1
1 468
2 126
033566
3 9
4 458
5 025
6 2
1.34
1.35
Scatter plot
1.36
a.
1.37
For the file Water - construct a frequency distribution, cumulative frequency
distribution, histogram, ogive and stem-and-leaf display. Various answers – one
possibility is to use 8 classes with a width of .1.
Bin Frequency Cum Freq
3.5 0 0
3.6 1 1
3.7 8 9
3.8 30 39
3.9 22 61
4 12 73
4.1 1 74
4.2 1 75
More 0
Histogram
40 120.00%
Frequency
100.00%
30
80.00%
20 60.00%
40.00%
10
20.00%
0 0.00%
5 6 7 8 9 4 1 2 e
3. 3. 3. 3. 3. 4. 4. or
M
Bin
Frequency Cumulative %
1 35 7
3 36 34
9 36 577799
21 37 111122344444
(17) 37 55566777777889999
37 38 0111112222244
24 38 556677899
15 39 01334444
7 39 56689
2 40
2 40 6
1 41 1
1.38
a. Histogram of the Returns data
Histogram
8 120.00%
7 100.00%
6
Frequency
5 80.00%
4 60.00%
3 40.00%
2
1 20.00%
0 0.00%
100 150 200 250 300 350 400 More
Bin
Frequency Cumulative %
b. stem-and-leaf display
1 1 3
4 1 899
11 2 0014444
(3) 2 589
11 3 0000122
4 3 689
1 4
1 4
1 5 0
1.39
Describe the data in the Sun data file.
Histogram
40 120.00%
Frequency 100.00%
30
80.00%
20 60.00%
40.00%
10
20.00%
0 0.00%
224.5 229.5 234.5 239.5 244.5 249.5 More
Bin
Frequency Cumulative %
1 22 4
1 22
5 22 8999
16 23 11111111111
25 23 222223333
38 23 4444444445555
(17) 23 66666666677777777
45 23 8888888888999
32 24 000000111111
20 24 22222222333
9 24 44455
4 24 67
2 24 99
1.40
450
400
350
300
250
200
150
100
50
0
60 80 100 120 140 160 180 200 220
Prices per day
1.41
1.42
Scatter plot
1.43
Scatter plot of price of orange juice vs. weekly quantities sold – Bishops data
file
The scatter plot shows the expected negative relationship between price and quantity
demanded per unit of time. This is what economic theory suggests for a typical demand
curve.
1.44
Bus transportation is the relationship between the number of delays of the sixteen bus
companies and the number of passengers transported.
50000
number of passengers
40000
30000
20000
2 3 4 5 6 7 8 9
number of delays
There appears to be no relationship between the numebr of delays of each company and number
of passengers transported in a six month time period.
1.45
a. Row percentages are listed below
Rows: Dayweek Columns: SlsLevel
1 2 3 4 All RowPcts
1 6 4 27 41 78 .1667
2 29 27 11 11 78 .1667
3 29 24 12 14 79 .1688
4 28 14 23 11 76 .1624
5 18 27 19 15 79 .1688
6 7 21 24 26 78 .1667
All 117 117 116 118 468
b. Major differences in sales level by day of week indicate that sales levels are highest
for days 1 and 6. This is where the largest number of Sales Level ‘4’ occur (the largest
daily sales). Days 2 and 3 are similar with most of the sales in the lowest two levels.
Days 4 and 5 have sales that are building towards the two largest days of 6 and 1.
1.46
Scatter plot of Citydat – taxbase versus comper
No relationship between the two variables and hence no evidence that emphasis
on attracting a larger percentage of commercial property increases the tax base.
The two outlier points on the right side of the plot might be used to argue that a
very high percentage of commercial property will provide a larger tax base. That
argument, however, is contrary to the overall pattern of the data.
1.47
a. Histogram
1.48
a. Time-series plot with vertical scale from 5,000 to 5,900.
5800
5700
Number of tickets sold
5600
5500
5400
5300
5200
5100
5000
Jan, Feb, Mar, Apr, May, Jun, Jul, Aug, Sep, Oct, Nov, Dec
Month
6500
Number of tickets sold
6000
5500
5000
4500
4000
1 2 3 4 5 6 7 8 9 10 11 12
Month
c. Differences between the two graphs include the variability of the data series.
One graph suggests greater variability in the data series while the other one
suggests less variability. Keep in mind the scale on which the measurements
are made.
1.49
a. Time-series plot of Exchange Rates with vertical axis from 92 to
106.
c. Differences between the two graphs include the variability of the data series.
One graph suggests greater variability in the data series while the other one
suggests a relatively flat line with less variability. Keep in mind the scale on
which the measurements are made.
1.50
Draw two time-series plots for Inventory Sales with different vertical ranges.
Differences between the two graphs include the variability of the data series. One
graph suggests greater variability in the data series while the other one suggests a
relatively flat line with less variability. Keep in mind the scale on which the
measurements are made.
1.51
Describe graphically the amount (in €) that the individual spends per month.
1 1 9
3 2 34
4 2 5
7 3 114
9 3 59
(3 4 013
)
10 4 57
8 5 03
6 5 57
4 6 02
2 6 58
1.52
a. Draw a histogram of 30 daily stock return.
Histogram of daily stock return
20
15
Frequency
10
0
20 30 40 50 60 70
Return
d. Cumulative relative frequencies are in the last column of the table above. These numbers
indicate the percentage of daily price stock returns that sum up all previous classes, up to and
including the current class.
1.53
Bar chart to plot percentage of online shoppers according to their income
On line shoppers
50%
Percentage of on line shoppers
40%
30%
20%
10%
0%
< 50 000 50 000 - 74 999 ≥75 000
Income
1.54
Management level
40
Percentage of managers
30
20
10
0
Management Level 1. and 2. level 3. level 1. and 2. level 3. level 1. and 2. level 3. level
Small Firms Medium firms Large Firms
1.55
Customer familiarity with e-commerce.
Using e-commerce
90%
80%
70%
Percentage of customers
60%
50%
40%
30%
20%
10%
0%
connect to internet downloadd an app could use PayPal basic internet skills basic computer skills
Familliiarity with technology
1.56
County Appraiser’s Office – Data Entry Process
a. Pareto diagram
1.58
1.59
Scatter plot of price of milk versus demand for milk.
Price and quantity demanded are negatively related as predicted by the law of
demand.
1.60
Plot the data for advertising expenditures and total sales
1.61
Scatterplot of weekly sales versus age
200
150
weekly sales
100
50
10 20 30 40 50
age
1.62
5
muscle mass percentage
15 20 25 30 35 40
amount of fiber (in grams)
1.63
a. Describe the data with a cross table
1.64
a. Describe the new product data with a cross table
1.65
a. Pie chart of soft drink preferences
1.66
a. Scatterplot
From the graph, it is evident that the temperature is positively related to the
time it takes to mow.
1.67
Time-series plot of population growth for the state of New York
1.68
a. Cross table of method of payment and day of purchase for Florin data file.
Payment M T W Th F S Tot
Am Ex 7 0 3 4 3 6 23
MC 1 4 4 2 4 9 24
Visa 6 6 4 5 8 10 39
Cash 3 1 0 0 3 9 16
Other 2 0 4 4 7 6 23
Subtotal 19 11 15 15 25 40 125
1.69
Cross table and bar chart of drink preference vs. type of vehicle
1.70
a. Time-series plot of the closing costs
1.71
Time-Series Plot of Facebook New Visitors
1.72
a. Histogram of shopping times
b. Stem-and-leaf display
Stem-and-Leaf Display: Time
9 1 888888899
19 2 0011111333
26 2 5555558
44 3 000011113344444444
52 3 77777788
52 4 00001222233
41 4 555678
35 5 00112222
27 5 5777799
20 6 00334
15 6 7889999
8 7 00001133
1.73
Stem-and-leaf display
Stem-and-Leaf Display: GPA
4 21 2689
6 22 25
9 23 013
11 24 56
23 25 113345555668
25 26 25
28 27 889
35 28 2788888
40 29 01289
45 30 15678
60 31 000488888888888
74 32 01112444568999
(53) 33 11111111111222223333444444444555566666666677777777999
29 34 00000022222222222222222
6 35
6 36 57
4 37 012
1 38 5
1.74
Scatter plot
Quiz
First Scorescore
exam VS. Test Score
vs test score
60
50
First exam score Y
40
Test Score (Y)
30
20
10 15 20 25 30
Test score X
Quiz Score (X)