Professional Documents
Culture Documents
LO 1.1
The Relevance of Statistics –Example 2
• The CFO of Starbucks Corp. claims that business is picking up since
sales at stores open at least a year climbed 4% in the quarter ended
December 27, 2009.
LO 1.1
The Relevance of Statistics –Example 3
• Researchers found that infants who sleep with a nightlight are much
more likely to develop myopia later in life.
LO 1.1
Statistics
• Statistics is the
methodology of extracting
useful information from a
data set.
Keys to do good statistical analysis
Find the right data. Use the appropriate statistical tools. Clearly communicate the numerical
information into written language.
Two Branches of Statistics
- Nominal
Qualitative Variables
- Ordinal
- Interval
Quantitative Variables
- Ratio
LO 1.4
Scales of Measurement
• Nominal scale: data are simply categories for grouping the
data
• Ordinal scale: may be categorized and ranked with respect to
some characteristic or trait.
Example
Tweens Survey
• What is the scale of measurement of the radio station data?
Solution: These are ordinal since they can be both categorized and ranked.
Scales of Measurement
• The Interval Scale
• Data may be categorized and ranked with respect to some
characteristic or trait.
• Differences between interval values are meaningful. Thus the
arithmetic operations of addition and subtraction are meaningful.
LO 1.4
Scales of Measurement
• The Ratio Scale
• The strongest level of measurement.
• Ratio data may be categorized and ranked with respect to
some characteristic or trait.
• Differences between interval values are meaningful.
• Business Examples: Sales, Profits, and Inventory Levels
LO 1.4
Example
Tweens Survey
• How are the time data classified? In what ways do the time data differ
from ordinal data? What is a potential weakness of this measurement
scale?
• Solution: Clock time responses are on an interval scale. With this type of
data we can calculate meaningful differences, however, there is no
apparent zero point.
LO 1.4
Example
Tweens Survey
• What is the measurement scale of the money data? Why is it
considered the most sophisticated form of data?
LO 2.1
Summarizing Qualitative Data
• Categories: Cloudy, Rainy, or Sunny.
• Calculate relative frequency by dividing each
category’s frequency by the sample size.
Cloudy 1 1/28=0.036
Rainy 20 20/28=0.714
Sunny 7 7/28=0.250
Total 28 28/28=1.000
LO 2.1
Summarizing Qualitative Data
• A pie chart is a segmented circle whose segments
portray the relative frequencies of the categories of a
qualitative variable.
• In this example,
circle is divided
into sectors
proportional to
categories of the
variable Marital
Status.
Source: Pew Research Center analysis of Decennial Census (1960 -
2000) and American Community Survey data (2008, 2010)
LO 2.2
Summarizing Qualitative Data
• A bar chart depicts the frequency or the relative
frequency for each category of the qualitative data as
a series of horizontal or vertical bars which are
proportional to the values that are to be depicted.
LO 2.2
Summarizing Quantitative Data
LO 2.3
Summarizing Quantitative Data
LO 2.3
Example
300 up to 400 4
400 up to 500 11
500 up to 600 14
600 up to 700 5
700 up to 800 2
Total = 36
LO 2.3
Summarizing Quantitative Data
Question:
What is the price range Class (in $1,000s) Frequency
over this time period? 300 up to 400 4
400 up to 500 11
Question: 500 up to 600 14
How many of the houses 600 up to 700 5
sold in the $500,000 700 up to 800 2
up to $600,000 range? Total = 36
LO 2.3
Summarizing Quantitative Data
Question:
What is the price range Class (in $1,000s) Frequency
over this time period? 300 up to 400 4
400 up to 500 11
$300,000 up to $800,000
500 up to 600 14
600 up to 700 5
Question:
700 up to 800 2
How many of the houses
Total = 36
sold in the $500,000
up to $600,000 range?
14 houses
LO 2.3
Summarizing Quantitative Data
• A cumulative frequency distribution specifies how
many observations fall below the upper limit of a
particular class.
Class (in $1,000s) Frequency Cumulative Frequency
300 up to 400 4 4
400 up to 500 11 4 + 11 = 15
500 up to 600 14 4 + 11 + 14 = 29
600 up to 700 5 4 + 11 + 14 + 5 = 34
700 up to 800 2 4 + 11 + 14 + 5 + 2 = 36
Total 36
Class frequency
Class relative frequency =
Total number of observations
LO 2.3
Summarizing Quantitative Data
• Here are the relative frequency and the cumulative relative
frequency distributions for the house-price data.
2.2 Summarizing
400 up to 500
500 up to 600 14
Quantitative
11
Data (7)
11/36 = 0.31
14/36 = 0.39
0.11 + 0.31 = 0.42
0.11 + 0.31 + 0.39 = 0.81
700 up to 800 2 2/36 = 0.06 0.11 + 0.31 + 0.39 + 0.14 + 0.06 1.0
Total 36 1.0
LO 2.3
Summarizing Quantitative Data
Use the data on the previous slide to answer the
following two questions.
Histogram
Polygon
Ogive
LO 2.4
Summarizing Quantitative Data
• A histogram is a visual representation of a
frequency or a relative frequency distribution.
LO 2.4
Summarizing Quantitative Data
• Here are the frequency and relative frequency
histograms for the house-price data.
LO 2.4
Summarizing Quantitative Data
• Shape of Distribution: typically symmetric or
skewed
Symmetric — mirror image on both sides of its
center.
Symmetric Distribution
LO 2.4
Summarizing Quantitative Data
• Skewed distribution
Positively skewed - data
form a long, narrow tail to
the right.
LO 2.4
Summarizing Quantitative Data
• A polygon is a visual representation of a frequency
or a relative frequency distribution.
LO 2.4
Summarizing Quantitative Data
• Here is a polygon for the house-price data.
LO 2.4
Summarizing Quantitative Data
• An ogive is a visual representation of a
cumulative frequency or a cumulative relative
frequency distribution.
Plot the cumulative frequency (or cumulative
relative frequency) of each class above the upper
limit of the corresponding class.
The neighboring points are then connected.
LO 2.4
Summarizing Quantitative Data
• Here is an ogive for the house-price data.
LO 2.5
Stem-and-Leaf Diagrams (2)
• The following data set shows the wealthiest
people in the world and their associated ages.
• The leftmost digit is the stem while the last digit is
the leaf as shown here. Age = 36
LO 2.5
Discussion:
A police officer is concerned with excessive speeds on a portion of
Interstate 90 with a posted speed limit of 65 miles per hour. Using his radar
gun, he records the following speeds for 25 cars and trucks:
Positive linear
relationship: as x
increases, so does y.
Negative linear
relationship (shown
here): as x increases, y
decreases.
LO 2.6
Scatterplots (3)
• Nonlinear relationship
As x increases,
y increases at an
increasing (or
decreasing) rate.
As x increases y
decreases, at an
increasing (or
decreasing) rate.
LO 2.6
Scatterplots (4)
• No relationship: data are randomly scattered with
no discernible pattern.
LO 2.6