Professional Documents
Culture Documents
Instructions
Part
1
Skittles Project Data Collection
Project Overview
This project will allow you to pull together many of the concepts you are studying this semester, including
organizing and analyzing data, drawing conclusions using confidence intervals and hypothesis tests, and
presenting your work in a well organized paper. Your overall report will be a narrative that clearly
explains your process and your conclusions.
You will complete the project in several stages, submitting portions throughout the semester. Your
submissions must be word processed, including using an equation editor for mathematical work.
Graphics, tables, etc. should all be included in your document, not as separate pages. You should use
either the .pdf or .doc file formats for submissions. Compile the various parts as they are completed, as
you will load the completed project into your ePortfolio later in the semester.
Some portions of this project may be completed in assigned groups, while other portions will be
completed individually.
Each student in the class will purchase one 2.17-ounce bag of Original Skittles and record the
following data:
Count only whole candies, disregarding any partial candies in the bag. Please note that a bay of any
other size or type of Skittles will not work for your project, and will not receive credit. In addition to
submitting your data here, be sure to record them in a table like the one below: you will use the data
throughout the semester.
This quiz is a place for you to submit the number of Skittles for each of the colors required for the
project. I will collect the data in a table and post it so that you can do the comparisons for the entire
class. You can return to this survey at any time to see your responses.
Question 1
ou Answered
12
Question 2
ou Answered
12
https://slcc.instructure.com/courses/605036/quizzes/1692834 2/3
4/23/2020 Skittles Project Part 1: Project Data Collection: MATH-1040-352-803-Sp20
Question 3
ou Answered
11
Question 4
ou Answered
12
Question 5
ou Answered
12
https://slcc.instructure.com/courses/605036/quizzes/1692834 3/3
Erik Ureno
PART 2: Organizing and Displaying Qualitative Data: COLOR
TO TURN IN (feel free to copy, paste, and then edit this with your answers into another document)
Using the work you’ve completed, answer the four questions/prompts below in a single document
called “Skittles_Project_Part_2”. Save as a .pdf file and upload that document to Canvas (Skittles Project
Part 2 graphics assignment).
1. Predictions: What proportion of the Skittles do you expect to see of each color? Complete the table
below, and also briefly explain why you made those predictions.
2. Data: Create a table that displays the counts and the proportions by color and also the totals from
your own bag of candies, together with the data for the entire class sample.
3. Graphics for Qualitative Data: Using StatCrunch, create a pie chart and a Pareto chart for the total
number of candies of each color in our class data set. Insert copies of your graphs into this report
document. (You could use a “snip” tool or copy the image from StatCrunch and insert.)
4. Skittles Colors: Write a well‐thought‐out paragraph discussing your observations of this data.
Respond to the following prompts:
Are you surprised by anything about the proportions of colors in your bag and/or the class data?
Why? I am not surprised at all by the data that I saw in the over all class. My own bag seemed
to be pretty consistent with the amounts of skittles there were per color. If you look at the
data from the whole class, the spread is not significant, it’s minus one or two percent.
Examine the distribution of colors in your bag and in other individual bags in the class sample,
then look at the color proportions in the total class sample. What do you notice about large and
small samples and variation in the color proportions? The proportion is much higher when you
Erik Ureno
look at the entire class of course and the proportion of the total class sample varies much
more than an individual sample.
BAR GRAPH
Erik Ureno
PARETO CHART
PART 3: Organizing and Displaying Quantitative Data: Total Candies per Bag
Mean number of candies per bag 59
Standard deviation of the number of candies per bag 8.92
(MIN: 45) (Q1: 57) (Q2: 58.5)
5‐number summary for the number of candies per bag
(Q3: 60) (MAX: 95)
What is the shape of the distribution for “Total candies in each bag? Is this what you expected?
Why?
Are there any observations that appear to be outliers? If so, what impact might they have on
graphics and summary statistics?
The shape of this data in the Histogram is right skewed. This means that the mean is higher
than the median. That was to be expected when we were asked to do the 5‐number
summary. As you can see, the mean is 59, whereas the Median is 58.5. Looking at the data on
the excel sheet also made me assume that the mean would be higher than the median. There
are defiantly some outliers, someone reported that there were 95 skittles in their bag. When
looking at the mean, it looks like the data was pretty resistant to the few outliers, this also
appeared to be the case with the Median.
Skittles Project Part 4
1. Proportion Yellow Candies: Construct a 99% confidence interval estimate for the
population proportion of yellow candies. Use technology to find the interval, and then
also verify your result using the formulas. Show your computations, using an equation
editor to format your work. Use a complete sentence to interpret this interval, in context.
Red: 284
Orange: 288
Yellow: 287
Green: 267
Purple: 303
Total: 1,429
np̂(1-p̂) ≥ 10
1429(0.2008)(1-0.2008)=229.325
229.325 ≥ 10 ✅
N ≥ 0.005N ✅
2. Mean Number of Candies: Construct a 95% confidence interval estimate for the
population mean number of candies per bag. Use technology to find the interval,
and then also verify your result using the formulas. Show your computations, using
an equation editor to format your work. Use a complete sentence to interpret this
interval, in context.
The mean number of skittles in a bag is 60 w/ a total number of skittles being 1,429.
60/1,429 = 0.0420
np̂(1-p̂) ≥ 10
1429(0.0420)(1-0.0.0420)=57.497
57.497 ≥ 10 ✅
N ≥ 0.005N ✅
3
From MATH-1040-352-803-Sp20
Part
4 What is a Con dence Interval?
Skittles Project
This discussion board will be visible only to members of your group and to your instructor.
I recommend you appoint a leader and determine a schedule for your submission work.
Collaborate with your group to discuss confidence intervals. Consider factors such as sample
size, variability, etc.
Once you have arrived at a consensus, one of your group members needs to post with "FINAL
SUBMISSION" and then your responses to the questions.
Reply
The confidence interval is a range of values that we are fairly sure contain the true values of
certain data. It is used to estimate certain values in a data set that doesn't have a parameter.
https://slcc.instructure.com/groups/283170/discussion_topics/3334221 1/2
4/23/2020 Topic: Skittles Project Part 4 Discussion: What is a Confidence Interval? - Group 4
There are 3 factors. The value of the multipier, the standard deviation, and the sample size.
Reply
The general purpose of a confidence interval is a range of values that would be likely near the
true value.
Reply
The main purpose of a confidence interval is to show the range of various values deriving from
our estimated population.
Factors that affect confidence interval are sample size, standard deviation, and multiplier
value.
Reply
https://slcc.instructure.com/groups/283170/discussion_topics/3334221 2/2