Professional Documents
Culture Documents
Assignment 3
Data Files needed for these problems are in the Attached Files.
Problems:
2.25
I find the pie chart to be most useful because visually, for me, it is easer to see the area
of a circle for each section relative to the others when they meet at a central point.
However do find the automatic ranking you get from the Pareto chart useful.
c. What conclusions can you reach concerning how college students spend their day?
I’ve learned that my personal habits are quite inline with the rest of my peers.
2.31
a. Construct a side-by-side bar chart and a doughnut chart of project outcome and
category.
Side by Side bar chart
40000
35000
30000
Targetd amount 25000
20000
15000
10000
5000
0
Film \& Video Games Music Technology
Category
Doughnut Chart
b. What conclusions concerning the pattern of successful Kickstarter projects can you
reach?
Music the most successful category, though it still fails half the time, Technology appears to
2.33
Stem Leaf
5 34
6 9
7 4
8
9 38
2.35
a. Construct an ordered array.
91,94,97,100,102,102,103,108,111,112,115,115,116,116,117,117,117,122,122,123,124
,128,129,130,132
The stem and leaf plot is nicer that in it is more dense, as well as showing at a glance
yes it falls off to either side of the 11 range so they are centered around 11.
2.41
Percent Histogram
35
30
25
20
15
10
0
207 224.1 241.2 258.3 275.4 292.5 309.6 326.7 343.8 360.9
b. Construct a cumulative percentage polygon.
100
80
60
40
20
0
207 224.1 241.2 258.3 275.4 292.5 309.6 326.7 343.8 360.9
c. What conclusions can you reach concerning the time Americans living in cities spend
The most common commuting time is 224.1-224.2 minute range, taking 30% of the recorded average
commutes.
2.45
a. Construct a percentage histogram and a percentage polygon.
Percentage Histogram
35
30
25
20
15
10
0
65 73 47 2 93 67 4 13 87 6 33 07 8 53 7
6. 8. 0. 1. 3. 5. 7. 8. 0. 2. 4. 5. 7. .2
13 20 28 35 42 49 56 63 71 78 85 92 99 069
1
Percentage Polygon
35
30
25
20
15
10
0
0 200 400 600 800 1000 1200
100
80
60
40
20
0
0 200 400 600 800 1000 1200
c. What can you conclude about call center performance if a call duration target of less
Looking at the histograms and polynomials leads to an over estimate of how often they meet their goal,
since we have the raw data we can see they meet the goal of <240 exactly 30 out of 50 times, or 60%,
however the histograms include a few data points over that and end up in the high 70’s.
2.47
a. Construct a histogram and a percentage polygon.
Histogram
10
9
8
7
6
Count
5
4
3
2
1
0
1.89 1.91 1.92 1.94 1.95 1.97 1.98 1.99 2.01 2.02 2.04 2.05 2.07 2.08 2.09
Volume
100
80
Percent
60
40
20
0
1.85 1.9 1.95 2 2.05 2.1 2.15
Volume
c. On the basis of the results in (a) and (b), does the amount of soft drink filled in the
Yes it seems like there is a concentration of samples very close to and just over 2.0.
2.49
Time series
25
20
15
Sales
10
0
2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018
Year
b. Does there appear to be any change in annual sales over time? Explain.
Yes, the sales increase slowly over time up until 2013 where they start to slowly fall
back down until they are the same was they were in 2008
2.51
a. Construct a scatter plot with Bundle score on the X axis and typical cost on the Y axis.
Scatter plot
120
100
80
Cost
60
40
20
0
0 10 20 30 40 50 60 70 80 90 100
Bundle Score
b. What conclusions can you reach about the relationship between Bundle score and
typical cost?
For the lower values on the range, the bundles scores are all over the place, once you
start spending at least 40, the bundle scores tend to be at least 60 or higher.
2.73
Like a pie chart, you can easily see the breakdown of relative sizes of the categories.
claim is not supported by the graph since the bin is not broken at the 5 year mark.
8
18
14
12+
9-12
4-8
2-4
0-2
Already done
23
25
12
2.77
The polygon and the histogram show the exact same data, they only differ in presentation and that the
2.79
What are the advantages and disadvantages of using a bar chart, a pie chart, a doughnut chart,
Bar chart: + Can compare multiple data sets, allows full values, not just ratios
- Only allows one data set, does not work well with lots of categories.
-Requires familiarity with the chart to understand what’s going on. Less visually appealing.
2.81
A time series plot has time as the X axis, and the point are in a sequential order, scatter plots do not have to be
in a specific order.
2.83
What are the three different ways to break down the percentages in a contingency table?
2.85
What type of insights can you gain from a contingency table that contains three variables that
you cannot gain from a contingency table that contains two variables?
The table with 3 variables can show you the combinations of any 2 of the 3 as well as the combination
2.87
A sparklines shows overall trends for multiple sources at a time, often with lower resolution and detail
than you would expect in a time-series plot which only shows one data set.