Professional Documents
Culture Documents
TOTAL 100%
Note: University wide passing is 50% of the total
required credit points. Your FINAL GRADE is 50%
MIDTERM + 50% FINALS GRADE
LMS: https://onefeu.instructure.com / https://feu.instructure.com
(CANVAS)
LOGIN USING YOUR FEU EMAIL / MAKE SURE
TO CHECK YOUR ACCESS
ONCE YOU LOGIN PERSONALIZE YOUR
PASSWORD
NOTE: TAKE TIME TO READ THE MANUAL and MAKE SURE TO SET THE AUTO-
NOTIFICATION TO SEND EMAIL or SMS TO YOUR CELLPHONE.
Course Title: Business Statistics
TEXTBOOK: Essentials of
Modern Business Statistics
with Microsoft Office Excel – Anderson,
Sweeney, Williams, Camm, Cochran
FINAL EXAMINATION
login.cengagebrain.com/cb/
REGISTER USING YOUR SECTIONS ASSIGNED COURSE
LINK URL
How to access your MindTap course
MGT1103 BUSINESS STATISTICAL ANALYSIS with SOFTWARE APPLICATIONS – SECTION 2
Instructor : Byron Cabarloc
Start Date : 01/20/2020
What is MindTap?
MindTap empowers you to produce your best work – consistently.
MindTap is designed to help you master the material. Interactive videos, animations, and activities create a learning path designed by
your instructor to guide you through the course and focus on what's important. Get started today!
Registration
Connect to https://login.cengagebrain.com/course/MTPNR0TNMB3D
Follow the prompts to register your MindTap course.
Payment
After registering for your course, you will need to pay for access using one of the options below:
Online: You can pay online using a credit or debit card, or PayPal.
Bookstore: You may be able to purchase access to MindTap at your bookstore. Check with the bookstore to find out what they offer
for your course.
Free Trial: If you are unable to pay at the start of the semester you may choose to access MindTap until 11:59 PM on
02/03/2020 during your free trial. After the free trial ends you will be required to pay for access.
Please note: At the end of the free trial period, your course access will be suspended until your payment has been made. All your
scores and course activity will be saved and will be available to you after you pay for access.
Already registered an access code? Bought MindTap at your bookstore or online? Now use the course link from your instructor to
register for the class: https://login.cengagebrain.com/course/MTPNR0TNMB3D
System Check
To check whether your computer meets the requirements for using MindTap, go
to http://ng.cengage.com/static/browsercheck/index.html
Please Note: the System Check is also accessible in the drop down box next to your name located in the upper right corner of your
MindTap page.
WORK ON BELOW TOPIC (YELLOW PAD): NOT LESS
THAN 500 WORDS
DURATION: 30 MINUTES
CHAPTER 1: Data and Statistics
Essentials of
Modern Business
Statistics (7e)
Anderson, Sweeney, Williams, Camm, Cochran
© 2018 Cengage Learning
25
Chapter 1 - Data and Statistics
Statistics
Applications in Business and Economics
Data Sources
Descriptive Statistics
Statistical Inference
Data Mining
Statistical Analysis Using Microsoft Excel
26
What is Statistics?
The term statistics can refer to numerical
facts such as averages, medians,
percentages, and maximums that help us
understand a variety of business and
economic situations.
Statistics can also refer to the art and
science of collecting, analyzing, presenting,
and interpreting data.
27
Applications in Business and Economics
Accounting
Public accounting firms use statistical sampling procedures when
conducting audits for their clients.
Economics
Economists use statistical information in making forecasts about the
future of the economy or some aspect of it.
Finance
Financial advisors use price-earnings ratios and dividend yields to guide
their investment advice.
28
Applications in Business and Economics
Marketing
Electronic point-of-sale scanners at retail checkout counters are
used to collect data for a variety of marketing research
applications.
Production
A variety of statistical quality control charts are used to monitor
the output of a production process.
Information Systems
A variety of statistical information helps administrators assess the
performance of computer networks.
29
Data and Data Sets
Data: facts and figures from which conclusions
can be drawn
Data set: the data that are collected for a
particular study
Elements: the entities on which data are
collected it may be people, objects, events, or
other entries
Variable: any characteristics, number, or quantity
that can be measured or counted
30
Observations
The set of measurements obtained for a
particular element is called an observation.
A data set with n elements contains n
observations.
31
Data, Data Sets, Elements, Variables,
and Observations
Variables
Data Set
32
Scales of Measurement
Scales of measurement include
• Nominal
• Ordinal
• Interval
• Ratio
The scale determines the amount of information
contained in the data.
The scale indicates the data summarization and
statistical analyses that are most appropriate.
33
Scales of Measurement
Nominal
– Data are labels or names used to identify an
attribute of the element.
– A non-numeric label or numeric code may be
used.
34
Scales of Measurement
Ordinal
– The data have the properties of nominal data and
the order or rank of the data is significant.
– A non-numeric label or numeric code may be
used.
35
Scales of Measurement
Interval
– The data have the properties of ordinal data, and the
interval between observations is expressed in terms
of a fixed unit of measure.
– Interval data are always numeric.
36
Scales of Measurement
Ratio
– The data have all the properties of interval data and the
ratio of two values is meaningful.
– Variables such as distance, height, weight, and time use
the ratio scale.
– This scale must contain a zero value that indicates that
nothing exists for the variable at the zero point.
37
CATEGORICAL (QUALITATIVE) AND QUANTITATIVE DATA
38
Scales of Measurement
Data
Categorical Quantitative
Non-
Numeric Numeric
numeric
40
Time Series Data
A TIME SERIES is a series of data points indexed (or listed or
graphed) in time order. Most commonly, a time series is a
sequence taken at successive equally spaced points in time.
Example:
U.S average price per gallon of
conventional regular gasoline
between 2010 and 2015.
Graphs of time series help
analysts understand what
happened in the past, identify
any trends over time, and
project future values for the time
series
41
Data Sources
Existing Sources
Internal company records – almost any
department
Business database services – Dow Jones & Co.
Government agencies - U.S. Department of Labor
Industry associations – Travel Industry Association
of America
Special-interest organizations – Graduate
Management Admission Council (GMAT)
Internet – more and more firms
42
Data Sources
Data Available From Internal Company
Records
Record Some of the Data Available
Employee records Name, address, social security number
Production Part number, quantity produced, direct labor
records cost, material cost
Inventory records Part number, quantity in stock, reorder level,
economic order quantity
Sales records Product number, sales volume, sales volume by
region
Credit records Customer name, credit limit, accounts receivable
balance
Customer profile Age, gender, income, household size
43
Data Sources
Data Available From Selected Government
Agencies
Government Agency Some of the Data Available
Census Bureau Population data, number of households, household income
Federal Reserve Board Data on money supply, exchange rates, discount rates
44
TYPES OF STATISTICAL STUDIES
Statistical Studies – Observational
– In observational (nonexperimental) studies no
attempt is made to control or influence the
variables of interest
• A survey is a good example
Studies of smokers and nonsmokers are
observational studies because researchers do not
determine or control who will smoke and who will
not smoke
45
TYPES OF STATISTICAL STUDIES
Statistical Studies – Experimental
– In experimental studies the variable of interest is
first identified. Then one or more variables are
identified and controlled so that data can be
obtained about how they influence the variable of
interest.
– The largest experimental study ever conducted is
believed to be the 1954 Public Health Service
experiment for the Salk polio vaccine. Nearly two
million U.S. children (grades 1- 3) were selected.
46
Data Acquisition Considerations
Time Requirement
– Searching for information can be time consuming.
– Information may no longer be useful by the time it is
available.
Cost of Acquisition
– Organizations often charge for information even when it
is not their primary business activity.
Data Errors
– Using any data that happen to be available or were
acquired with little care can lead to misleading
information.
47
Descriptive Statistics
Table 1. The table shows the average salaries for
DESCRIPTIVE STATISTICS is the term various occupations in the United States in 1999.
given to the analysis of data that $112,760 pediatricians
$106,130 dentists
helps describe, show or summarize $100,090 podiatrists
data in a meaningful way such $ 76,140 physicists
48
Two General Types of Statistic That Are Used to Describe
Data:
50
TERMINOLOGIES
Population - The set of all elements of interest
in a particular study.
Sample - A subset of the population.
Statistical inference - The process of using
data obtained from a sample to make estimates
and test hypotheses about the characteristics of a
population.
Census - Collecting data for the entire
population.
Sample survey - Collecting data for a sample.
51
Analytics
Scientific process of transforming data into insight
for making better decisions.
Types
– Descriptive analysis – Analytical techniques that describe
what happened in the past.
– Predictive analysis
– Analytical techniques that use models constructed
from past data to predict future.
– Helps assess the impact the impact of one variable on
another
– Prescriptive analysis – Analytical techniques that yield a
best course of action to take.
52
Data Warehousing
Organizations obtain large amounts of data on a daily
basis by means of magnetic card readers, bar code
scanners, point of sale terminals, and touch screen
monitors.
Wal-Mart captures data on 20-30 million transactions
per day.
Visa processes 6,800 payment transactions per second.
Capturing, storing, and maintaining the data, referred to
as data warehousing, is a significant undertaking.
53
Data Mining
Analysis of the data in the warehouse might aid in
decisions that will lead to new strategies and higher
profits for the organization.
Using a combination of procedures from statistics,
mathematics, and computer science, analysts “mine
the data” to convert it into useful information.
The most effective data mining systems use automated
procedures to discover relationships in the data and
predict future outcomes, … prompted by only general,
even vague, queries by the user.
54
Example: Hudson Auto Repair
The manager of Hudson Auto would
like to have a better understanding of
the cost of parts used in the engine
tune-ups performed in her shop. She
examines 50 customer invoices for
tune-ups. The costs of parts, rounded
to the nearest dollar, are listed on the
next slide.
55
Example: Hudson Auto Repair
Sample of Parts Cost ($) for 50 Tune-ups
91 78 93 57 75 52 99 80 97 62
71 69 72 89 66 75 79 75 72 76
104 74 62 68 97 105 77 65 80 109
85 97 88 68 83 68 71 69 67 74
62 82 98 101 79 105 79 69 62 73
56
Tabular Summary:
Frequency and Percent Frequency
Parts Cost ($) Frequency Percent Frequency
50-59 2 4%
60-69 13 26%
70-79 16 32%
80-89 7 14%
90-99 7 14%
100-109 5 10%
TOTAL 50 100%
57
Graphical Summary: Bar Chart
Example: Hudson Auto
Hudson Auto
18
16
14
12
Frequency
10
0
50-59 60-69 70-79 80-89 90-99
Parts Auto cost
58
Process of Statistical Inference
Example: Hudson Auto
59
EXERCISE A
Classify each of the following qualitative variables
as ordinal or nominative. Explain your answers.
QUALITATIVE VARIABLE CATEGORIES
Statistics course letter grade A B C D F
Door choice on Let's Make A Deal Door #1 Door #2 Door #3
Television show classifications TV-G TV-PG TV-14 TV-MA
Personal computer ownership Yes No
Restaurant rating
Income tax filing status Married filing jointly Married filing separately
Single Head of household Qualifying
widow(er)
EXERCISE A (answers)
Letter Grades: Ordinal
Door Choices: Nominative
TV Classifications: Ordinal
PC Ownership: Nominative
Restaurant Ratings: Ordinal
Filing Status: Nominative.
EXERCISE B
Classify each of the following qualitative variables as ordinal or
nominative. Explain your answers.
QUALITATIVE VARIABLE CATEGORIES
Personal computer operating system Windows XP Windows Vista Windows 7 Windows 8
Motion picture classifications G PG PG-13 R NC-17 X
Level of education Elementary Middle school High school College
Graduate school
Rankings of the top 10 college football 1 2 3 4 5 6 7 8 9 10
teams
Exchange on which a stock is traded AMEX NYSE NASDAQ Other
Zip code 45056 90015 etc.
EXERCISE B (answers)
PC OS: Nominative
Movie Classifications: Ordinal
Education Level: Ordinal
Football Rankings: Ordinal
Stock Exchanges: Nominative
Zip Codes: Nominative.
EXERCISE C
Given the data set: 4 , 10 , 7 , 7 , 6 , 9 , 3 , 8 , 9
Find:
a) the mode (a statistical term that refers to the most frequently
occurring number found in a set of numbers),
b) the median (a simple measure of central tendency),
c) the mean (the usual average),
d) the sample standard deviation (measure that is used to quantify the
amount of variation or dispersion of a set of data values)
e) If we replace the data value 6 in the data set above by 24, will the
standard deviation increase, decrease or stay the same?
EXERCISE C (answers)
• The given data set has 2 modes: 7 and 9
• order data : 3 , 4 , 6 , 7 , 7 , 8 , 9 , 9 , 10 : median = 7
• (mean) : m = (3+4+6+7+7+8+9+9+10) / 9 = 7
x x - m (x - m)2
4 -3 9
10 3 9
7 0 0
7 0 0
6 -1 1
9 2 4
3 -4 16
8 1 1
9 2 4 SUM = 44
• The standard deviation will increase since 24 is further from away from the
other data values than 6.
EXERCISE D
Which of these variables are quantitative and
which are qualitative?
a. The dollar amount on an accounts receivable invoice.
b. The net profit for a company in 2009.
c. The stock exchange on which a company’s stock is
traded.
d. The national debt of the United States in 2009.
e. The advertising medium (radio, television, or print)
used to promote a product.
EXERCISE D (answers)
a. Quantitative; dollar amounts correspond to values
on the real number line.
b. Quantitative; net profit is a dollar amount.
c. Qualitative; which stock exchange is a category.
d. Quantitative; national debt is a dollar amount.
e. Qualitative; media is categorized into radio,
television, or print.
EXERCISE E
DISPLAY SIZE BATTERY LIFE CPU
TABLET COST($) OPERATING SYSTEM (INCHES) (HOURS) MANUFACTURER
b. 5 variables: Cost ($), Operating System, Display Size (inches), Battery Life
(hours), CPU Manufacturer
75
Chapter 2
Descriptive Statistics: Tabular and Graphical Displays
76
Summarizing Categorical Data
Frequency Distribution
Bar Chart
Pie Chart
77
Frequency Distribution
A frequency distribution is a tabular summary of
data showing the number (frequency) of
observations in each of several non-overlapping
categories or classes.
78
Frequency Distribution
Example
• Soft drink purchasers were asked to select one among the five popular
soft drinks: Coca-Cola, Diet Coke, Dr. Pepper, Pepsi and Sprite.
• Soft drink selected by a sample of 20 purchasers are:
79
Frequency Distribution
Example
Rating Frequency
Coca-Cola 2
Diet Coke 3
Dr Pepper 5
Pepsi 9
Sprite 1
Total 20
80
Relative Frequency Distribution
The relative frequency of a class is the fraction or
proportion of the total number of data items
belonging to a class.
Frequency of the class
Relative frequency of a class =
𝑛
81
Example of Relative Frequency Distribution
Suppose that a frequency distribution is based on a sample
of 200 supermarkets. It turns out that 50 of these
supermarkets charge a price between ₱120.00 and
₱130.00 for a kilo of beef. In a relative frequency
distribution, the number assigned to this class would be
0.25 (50/200). In other words, that’s 25 percent of the total.
Percent Frequency Distribution
The percent frequency of a class is the relative
frequency multiplied by 100.
83
Relative Frequency and Percent Frequency
Distributions
Example
84
Bar Chart
A bar chart is a graphical display for depicting
qualitative data.
• On one axis (usually the horizontal axis), we specify the
labels that are used for each of the classes.
• A frequency, relative frequency, or percent frequency
scale can be used for the other axis (usually the vertical
axis).
• Using a bar of fixed width drawn above each class
label, we extend the height appropriately.
• The bars are separated to emphasize the fact that each
class is separate.
85
Bar Chart
Bar Chart for Purchase of Soft Drink
10
6
Frequency
0
Coca-Cola Diet Coke Dr. Pepper Pepsi Sprite
Soft Drink
86
Pie Chart
• The pie chart is a commonly used graphical
display for presenting relative frequency and
percent frequency distributions for categorical
data.
• First draw a circle; then use the relative
frequencies to subdivide the circle into sectors
that correspond to the relative frequency for
each class.
• Since there are 360 degrees in a circle, a class
with a relative frequency of .25 would consume
.25(360) = 90 degrees of the circle.
87
Pie Chart
Coca-Cola
Diet Coke
Dr. Pepper
Pepsi
Sprite
88
Pie Chart
Example
Inferences from the Pie Chart
Almost one-half of the customers surveyed
preferred Pepsi (looking at the left side of the
pie).
The second preference is for Dr. Pepper with
25% of the customers opting for it.
Only 5% of the customers opted for Sprite.
89
Summarizing Quantitative Data
Frequency Distribution
Dot Plot
Histogram
Cumulative Distributions
Stem-and-Leaf Display
90
Frequency Distribution
Example
Sanderson and Clifford, a small public accounting
firm wants to determine time in days required to
complete year end audits. It takes a sample of 20
clients.
91
Frequency Distribution
Example: Sanderson and Clifford
92
Frequency Distribution
The three steps necessary to define the classes
for a frequency distribution with quantitative
data are:
Step 1 - Determine the number of non-overlapping
classes.
Step 2 - Determine the width of each class.
Step 3 - Determine the class limits.
93
HOW TO FIND NUMBER OF CLASSES
Step 1: Find the number of classes. One rule for finding an appropriate
number of classes says that the number of classes should be the smallest
whole number K that makes the quantity 2K greater than the number of
measurements in the data set.
96
Frequency Distribution
Guidelines for Determining the Width of Each
Class
• Use classes of equal width.
• Approximate Class Width =
Largest data value;Smallest data value
Number of classes
• Making the classes the same width reduces the
chance of inappropriate interpretations.
97
Frequency Distribution
Note on Number of Classes and Class Width
• In practice, the number of classes and the appropriate
class width are determined by trial and error.
• Once a possible number of classes is chosen, the
appropriate class width is found.
• The process can be repeated for a different number of
classes.
• Ultimately, the analyst uses judgment to determine the
combination of the number of classes and class width
that provides the best frequency distribution for
summarizing the data.
98
Frequency Distribution
Guidelines for Determining the Class Limits
• Class limits must be chosen so that each data item
belongs to one and only one class.
• The lower class limit identifies the smallest possible
data value assigned to the class.
• The upper class limit identifies the largest possible data
value assigned to the class.
• The appropriate values for the class limits depend on
the level of accuracy of the data.
• An open-end class requires only a lower class limit or
an upper class limit.
99
Frequency Distribution
Class Midpoint
• In some cases, we want to know the
midpoints of the classes in a frequency
distribution for quantitative data.
• The class midpoint is the value halfway
between the lower and upper class limits.
100
Frequency Distribution
Example: Sanderson and Clifford
• If we choose five classes:
• Approximate Class Width = (33 - 12)/5 = 4.2 4
Time in days Frequency
10-14 4
15-19 8
20-24 5
25-29 2
30-34 1
Total 20
101
Relative Frequency and Percent
Frequency Distributions
102
Relative Frequency and Percent
Frequency Distributions
Example: Sanderson and Clifford
Insights obtained from the Percent Frequency
Distribution:
40% of the audits required from 15 to 19 days.
Another 25% of the audits required 20 to 25 days.
Only 5% of the audits required more than 30 days.
103
Dot Plot
• One of the simplest graphical summaries of
data is a dot plot.
• A horizontal axis shows the range of data
values.
• Then each data value is represented by a dot
placed above the axis.
104
Dot Plot
Example: Sanderson and Clifford
105
Histogram
• Another common graphical display of quantitative
data is a histogram.
• The variable of interest is placed on the horizontal
axis.
• A rectangle is drawn above each class interval with
its height corresponding to the interval’s frequency,
relative frequency, or percent frequency.
• Unlike a bar graph, a histogram has no natural
separation between rectangles of adjacent classes.
106
Histogram
Example: Sanderson and Clifford
107
Cumulative Distributions
• Cumulative frequency distribution - shows the
number of items with values less than or equal to
the upper limit of each class.
108
Cumulative Distributions
109
Cumulative Distributions
Example: Sanderson and Cliffords
Audit time Cumulative Cumulative Cumulative
(Days) Frequency Relative Percent
Frequency Frequency
≤ 14 4 .20 20
≤ 19 12 .60 60
≤ 24 17 .85 85
≤ 29 19 .95 95
≤ 34 20 1.00 100
110
Stem-and-Leaf Display
• A stem-and-leaf display shows both the rank order and
shape of the distribution of the data.
• It is similar to a histogram on its side, but it has the
advantage of showing the actual data values.
• The first digits of each data item are arranged to the
left of a vertical line.
• To the right of the vertical line we record the last digit
for each item in rank order.
• Each line (row) in the display is referred to as a stem.
• Each digit on a stem is a leaf.
111
Constructing a Stem-and-Leaf Display
1. Decide what units will be used for the stems and the leaves. Each leaf
must be a single digit and the stem values will consist of appropriate
leading digits. As a general rule, there should be between 5 and 20
stem values.
2. Place the stem values in a column to the left of a vertical line with the
smallest value at the top of the column and the largest value at the
bottom.
3. To the right of the vertical line, enter the leaf for each measurement into
the row corresponding to the proper stem value. Each leaf should be a
single digit—these can be rounded values that were originally more
than one digit if we are using an appropriately defined leaf unit.
4. Rearrange the leaves so that they are in increasing order from left to
right.
Stem-and-Leaf Display
Example
The number of questions answered correctly on an
aptitude test by 50 students analysed with the help
of a Stem – and – leaf display here. The relevant data
is given in the following table.
113
Stem-and-Leaf Display
Number of questions answered correctly by
50 students
112 73 126 82 92 115 95 84 68 100
114
Stem-and-Leaf Display
6 9 8
7 2 3 6 3 6 5
8 6 2 3 1 1 0 45
9 7 2 2 6 2 1 588 5 4
10 7 4 8 0 2 6 606
11 2 8 5 9 3 5 9
12 6 8 7 4
13 2 4
14 1
Stem Leaf
115
Stretched Stem-and-Leaf Display
116
Stretched Stem-and-Leaf Display
6 8 9
7 2 3 3
7 5 6 6
8 0 1 1 2 3 4
8 5 6
9 1 2 2 2 4
9 5 5 6 7 8 8
10 0 0 2 4
10 6 6 6 7 8
11 2 3
11 5 5 8 9 9
12 4
12 6 7 8
13 2 4
14 1
117
Stem-and-Leaf Display
Leaf Units
118
Stem-and-Leaf Display
Example: Leaf Unit = 0.1
If we have data with values such as
8.6 11.7 9.4 9.1 10.2 11.0 8.8
Leaf Unit = 0.1
8 6 8
9 1 4
10 2
11 0 7
119
Stem-and-Leaf Display
Example: Leaf Unit = 10
If we have data with values such as
1806 1717 1974 1791 1682 1910 1838
Leaf Unit = 10
16 8
17 1 9
18 0 3
19 1 7
The 82 in 1682 is rounded down to 80 and is
represented as an 8.
120
• EXERCISES SUMMARIZING DATA FOR A
CATEGORICAL VARIABLE
• EXERCISES SUMMARIZING DATA FOR
QUANTITATIVE VARIABLE
Chapter 2: Descriptive Statistics: Tabular and
Graphical Displays (Part B)
123
Crosstabulation
CROSSTABULATION can be used when:
• one variable is categorical and the other is
quantitative,
• both variables are categorical, or
• both variables are quantitative.
• The left and top margin labels define the
classes for the two variables.
124
Crosstabulation
Example: Zagat’s Restaurant Review
Crosstabulation of quality rating and meal price
data for 300 Los Angeles restaurants is given
here.
125
Crosstabulation
Example: Zagat’s Restaurant Review
Insights Gained from Preceding Crosstabulation
Greatest number of restaurants in the sample (64)
have a very good rating and the meal price in the
$20-29 range.
Only 2 restaurants have an excellent rating and a
meal price in the range of $10-19 range
126
Crosstabulation
Example: Zagat’s Restaurant Review
127
Crosstabulation: Row or Column
Percentages
128
Crosstabulation: Row Percentages
Example: Zagat’s Restaurant Review
Meal Price
Quality $10- $20-
Rating 19 29 $30-39 $40-49 Total
Good 50 47.6 2.4 0 100
Very
Good 22.7 42.7 30.6 4 100
Excellent 3 21.2 42.4 33.4 100
129
Crosstabulation: Simpson’s Paradox
• Data in two or more crosstabulations are
often aggregated to produce a summary Average scores of male and
crosstabulation. female students in two schools
130
Summarizing Data for Two Variables
Using Graphical Display
• Scatter diagrams and trendlines are useful in
exploring the relationship between two
variables.
131
Scatter Diagram and Trendline
132
Scatter Diagram
A Positive Relationship
133
Scatter Diagram
A Negative Relationship
134
Scatter Diagram
• No Apparent Relationship
135
Scatter Diagram
Example:
A Stereo and sound equipment store in San Franscisco wants to analyze
the relationship between sales and advertising. Sample data for ten
weeks with sales in hundreds of dollars is shown below:
136
Scatter Diagram and Trendline for the
Stereo and Sound Equipment Store
137
Scatter Diagram
Example
Insights Gained from the Stereo and Sound
Equipment store Scatter Diagram
The scatter diagram indicates a positive
relationship between the number of commercials
and sales.
Higher sales is associated with greater number of
commercials.
The relationship is not perfect; all plotted points in
the scatter diagram are not on a straight line.
138
Side-by-Side Bar Chart
– A side-by-side bar chart is a graphical display for
depicting multiple bar charts on the same display.
– Each cluster of bars represents one value of the
first variable.
– Each bar within a cluster represents one value of
the second variable.
139
Side-by-Side Bar Chart
Side by Side Bar Chart for the Quality
and Price Meal Data
60
50
Frequency
40
30
20
10
0
$10-19 $20-29 $30-39 $40-49
Meal Price ($)
Good Very Good Excellent
140
Stacked Bar Chart
– A stacked bar chart is another way to display and
compare two variables on the same display.
– It is a bar chart in which each bar is broken into
rectangular segments of a different color.
– If percentage frequencies are displayed, all bars
will be of the same height (or length), extending
to the 100% mark.
141
Stacked Bar Chart
Stacked bar chart for the quality rating and meal
price data
100%
Percentage frequency
90%
80%
70%
60%
50%
40%
30%
20%
10%
0%
$10-19 $20-29 $30-39 $40-49
142
Skewed and Symmetric Data
Skewed to the Left
The situation reverses itself when we deal with data
skewed to the left. Data that are skewed to the left
have a long tail that extends to the left. An
alternate way of talking about a data set skewed to
the left is to say that it is negatively skewed. In this
situation, the mean and the median are both less
than the mode. As a general rule, most of the time
for data skewed to the left, the mean will be less
than the median. In summary, for a data set skewed
to the left:
• Always: mean less than the mode
• Always: median less than the mode
• Most of the time: mean less than median
144
Tabular and Graphical Displays
Summary
Data
145
Chapter 3: Descriptive Statistics:
Numerical Measures
The three most common measures of location are the mean, the median,
and the mode.
The mean is the sum of the values, divided by the number of values. It has
the smallest possible sum of squared differences from members of the list.
The median is the middle value in the sorted list. It is the smallest number
that is at least as big as at least half the values in the list. It has the smallest
possible sum of absolute differences from members of the list.
The mode is the most frequent value in the list (or one of the most frequent
values, if there are more than one). It differs from the fewest possible
members of the list.
MEAN
• The mean is sometimes referred to as the arithmetic mean.
• Perhaps the most important measure of location is the mean,
or average value, for a variable. The mean provides a
measure of central location for the data. If the data are for a
sample, the mean is denoted by x̄; if the data are for a
population, the mean is denoted by the Greek letter µ.
𝑋𝑖
SAMPLE MEAN: 𝑋=
𝑛
MEAN (Example)
Table 2.9: Data on Selling
Home Sale Price ($)
Home Sales
1 138,000
in a Cincinnati, Ohio,
2 254,000
Suburb 3 186,000
4 257,500
5 108,000
6 254,000
7 138,000
8 298,000
9 199,500
10 208,000
11 142,000
12 456,250
MEAN (Example)
Computation of Sample Mean:
• Illustration: Computation of the mean home selling
price for the sample of 12 home sales.
x i x1 x2 x12
x
n 12
138,000 254,000 456,250
12
2,639,250
219,937.50
12
MEDIAN
The median is another measure of central
location.
Arrange the data in ascending order (smallest
value to largest value).
(a)For an odd number of observations, the
median is the middle value.
(b)For an even number of observations, the
median is the average of the two middle
values.
MEDIAN: EXAMPLE
Computation of Sample Median:
– Illustration: When the number of observations are odd,
– Consider the class size data for a sample of five
college classes:
46 54 42 46 32
– Arrange the class size data in ascending order:
32 42 46 46 54
– Middlemost value in the data set = 46.
– Median is 46.
MEDIAN EXAMPLE
Computation of Sample Median:
Illustration: When the number of observations are even:
– Consider the data on home sales in Cincinnati, Ohio,
Suburb (Table 2.9).
– Arrange the data in ascending order:
108,000 138,000 138,000 142,000 186,000 199,500
208,000 254,000 254,000 257,500 298,000 456,250
• Median = average of two middle values:
199,500 208,000
Median 203,750
2
MODE
Another measure of location is the mode.
The mode is the value that occurs with greatest frequency.
Example 2: In a crash test, 11 cars were tested to determine what impact speed
was required to obtain minimal bumper damage. Find the mode of the speeds
given in miles per hour below.
24, 15, 18, 20, 18, 22, 24, 26, 18, 26, 24
Answer: Since both 18 and 24 occur three times, the modes are 18 and 24 miles
per hour. This data set is bimodal.
𝑊𝑖 𝑋𝑖
𝑋=
𝑊𝑖
WHERE: 𝑾𝒊 is weight for observation i
Weighted Mean Example
Narciso wants to buy a new camera, and decides
on the following rating system: Image Quality
50%; Battery Life 30%; Zoom Range 20%
The Sonu camera gets 8 (out of 10) for Image Quality, 6 for
Battery Life and 7 for Zoom Range
𝑛
Formula: 𝑋𝑔 = 𝑋1 𝑋2 𝑋3 … 𝑋𝑛
GEOMETRIC MEAN
𝑌𝑒𝑎𝑟2 − 𝑌𝑒𝑎𝑟1
RATE OF RETURN 𝑅1 =
𝑌𝑒𝑎𝑟 1
𝑛
GEOMETRIC MEAN 𝑅𝑔 = 1 + 𝑅1 (1 + 𝑅2 ) … (1 + 𝑅𝑛 ) - 1
VALUE OF
INVESTMENT AFTER 𝑛
N YEARS 𝐼𝑁𝑉𝐸𝑆𝑇𝑀𝐸𝑁𝑇 𝑥 (1 + 𝑅𝑔 )
GEOMETRIC MEAN SAMPLE
The average person’s monthly salary in a certain town
jumped from $2,500 to $5,000 over the course of ten years.
Using the geometric mean, what is the average yearly
increase?
CALCULATE THE GEOMETRIC MEAN RATE OF RETURN FOR EACH STOCK INDEX FOR THE PERIOD
(B.) FROM 2008 TO 2010.
DJIA =((1-0.307)*(1+0.171))^(1/2)-1 = (0.099)
NASDAQ =((1-0.375)*(1+0.414))^(1/2)-1 = (0.060)
S&P =((1-0.356)*(1+0.216))^(1/2)-1 = (0.115)
SUPPOSE THAT AN INVESTMENT OF $100, 000 IS MADE IN 2008 AND THAT THE PORTFOLIO
PERFORMS WITH RETURNS EQUAL TO THOSE OF THE DJIA. WHAT IS THE INVESTMENT WORTH IN
(C.) 2010?
DJIA =100000*((1+0.099)^(2)) = $ 81,150.30
(D.) REPEAT PART C FOR THE NASDAQ AND THE S&P 500.
NASDAQ $ 100,000.00 $ 88,375.00
S&P $ 100,000.00 $ 78,310.40
Refer to the first exercise. The values of the DJIA
on the first day of trading in 2005, 2006, and 2007
were 10,729.43, 10,847.41, and 12,474.52.
𝑃
𝐿𝑝 = x (n+1)
100
P – percentile / n – sample size
PERCENTILE EXAMPLE
There are 25 test scores such
as: 72,54, 56, 61, 62, 66, 68,
43, 69, 69, 70, 71,77, 78, 79, 85,
87, 88, 89, 93, 95, 96, 98, 99,
99. Find the 60th percentile?
PERCENTILE EXAMPLE
POSITION TEST SCORES
1
2
72
54
STEP 1: Sort the data set
3
4
56
61 STEP 2: Get the location of the required
5 62
6
7
66
68
percentile using the formula
8
9
43
69
STEP 3: Get the value
10 69
11 70
12 71
𝑃
13 77
𝐿60 = x (n+1) = (60 / 100) x (25+1) = 15.6
14 78 100
15 79
16 85
17 87
18 88 VALUE0.6 = (85-79) = 6
19 89
20 93
Although the range is the easiest of the measures of variability to compute, it is seldom used
as the only measure. The reason is that the range is based on only two of the observations
and thus is highly influenced by extreme values. Suppose the highest paid graduate received a
starting salary of $10,000 per month. In this case, the range would be rather than 615. This
large value for the range would not be especially descriptive of the variability in the data
because 11 of the 12 starting salaries are closely grouped between 3710 and 4130.
Interquartile Range
Where x̄ = the mean; Step 1: Subtract the mode from the mean
Mo = the mode; and s = Step 2: Divide by the standard deviation
the standard deviation
for the sample.
Where x̄ = the
mean; Md = the
median and s = the Step 1: Subtract the median from the mean
standard deviation Step 2: Multiply Step 1 by 3
for the sample. Step 3: Divide by the standard deviation:
It is generally used
when you don’t
know the mode.
CAUTION
Caution: Pearson’s first coefficient of skewness uses the mode. Therefore, if the mode
is made up of too few pieces of data it won’t be a stable measure of central
tendency. For example, the mode in both these sets of data is 9:
1 2 3 4 5 6 7 8 9 9.
1 2 3 4 5 6 7 8 9 9 9 9 9 9 9 9 9 9 9 9 10 12 12 13.
In the first set of data, the mode only appears twice. This isn’t a good measure of central
tendency so you would be cautioned not to use Pearson’s coefficient of skewness. The
second set of data has a more stable set (the mode appears 12 times). Therefore,
Pearson’s coefficient of skewness will likely give you a reasonable result.
Interpretation In general:
• The direction of skewness is given by the sign.
• The coefficient compares the sample distribution with a normal distribution.
The larger the value, the larger the distribution differs from a normal
distribution.
• A value of zero means no skewness at all.
• A large negative value means the distribution is negatively skewed.
• A large positive value means the distribution is positively skewed.
Z-Score
z
The z-score, i, can be interpreted as the number of standard deviations is from the
mean .
A z-score greater than zero occurs for observations with a value greater than the
mean, and a z-score less than zero occurs for observations with a value less than the
mean. A z-score of zero indicates that the value of the observation is equal to the
mean.
Z-SCORE
How many standard deviations a value is
from the mean.
For the test scores between 60 and 80, we note that 60 is two standard
deviations below the mean and 80 is two standard deviations above the mean.
Using Chebyshev’s theorem, we see that at least .75, or at least 75%, of the
observations must have values within two standard deviations of the mean. Thus,
at least 75% of the students must have scored between 60 and 80.
For the test scores between 58 and 82, we see that (58-70)/5 indicates 58 is 2.4
standard deviations below the mean and that (82-70)/5 indicates 82 is 2.4
standard deviations above the mean. Applying Chebyshev’s theorem with , we
have z = 2.4
At least 82.6% of the students must
have test scores between 58 and 82.
EMPIRICAL RULE
The empirical rule is a statistical rule which states that for a
normal distribution, almost all data will fall within three standard
deviations of the mean.
The empirical rule shows that 68% will fall within the first
standard deviation, 95% within the first two standard
deviations, and 99.7% will fall within the first three standard
deviations of the distribution's average.
The smallest value is 3710 and the largest value is 4325. We showed how to compute
the quartiles (Q1 = 3857.50; Q2 = 3905; Q3 = 4025). Thus, the five-number summary
for the monthly starting salary data is:
The five-number summary indicates that the starting salaries in the sample are
between 3710 and 4325 and that the median or middle value is 3905. The first and
third quartiles show that approximately 50% of the starting salaries are between
3857.5 and 4025.
BOX PLOT
A box plot is a graphical
display of data based on a
five-number summary. A key
to the development of a box
plot is the computation of the
interquartile range.
1. A box is drawn with the ends of the box located at the first, Q1 = 3857.5 and third quartiles, Q3 =
4025. For the salary data, and . This box contains the middle 50% of the data.
2. A horizontal line is drawn in the box at the location of the median (3905 for the salary data). An X
indicates the value of the mean (3940 for the salary data).
3. By using the interquartile range, IQR = Q3 – Q1 , limits are located at 1.5(IQR) below Q1 , and
1.5(IQR) above Q3. For the salary data, IQR = Q3 – Q1 = 4025 – 3857.50 = 167.5. Thus, the limits
are 3857.5 – 1.5(167.5) = 3606.25 and 4025 – 1.5(167.5) = 4276.25. Data outside these limits are
considered outliers.
4. The vertical lines extending from each end of the box called whiskers. The whiskers are drawn from
the ends of the box to the smallest and largest values inside the limits computed in step 3. Thus,
the whiskers end at salary values of 3710 and 4130.
5. Finally, the location of each outlier is shown with a small dot. In the figure we see one outlier,
4325.
COVARIANCE
Covariance is a measure of the joint variability of two random
variables.
Population covariance, xi x yi y
xy .
N
SAMPLE COVARIANCE
Calculate covariance for the following data set:
x: 2.1, 2.5, 3.6, 4.0 (mean = 3.1)
y: 8, 10, 12, 14 (mean = 11)
= (-1)(-3) + (-0.6)(-1)+(.5)(1)+(0.9)(3) / 3
= 3 + 0.6 + .5 + 2.7 / 3
= 6.8/3
= 2.267
a. i = (90/100) 13 = 11.7. Rounding the index up to 12, the twelfth value is x12 =
192 or $192,000
b. i = (50/100) 13 = 6.5. Rounding the index up to 7, Md = x7 = 152 or $152,000
c. i = (25/100) 13 = 3.25. Rounding the index up to 4, Q1 = x4 = 141 or
$141,000
d. i = (75/100) 13 = 9.75. Rounding the index up to 10, Q3 = x10 = 171 or
$171,000
e. i = (10/100) 13 = 1.3. Rounding the index up to 2, x2 = 132 or $132,000
f. The Inter Quartile Range, IQR = Q3 – Q1 = 171 – 141 = 30 or $30,000
SOLUTION
3.50 a.
Household Incomes
SOLUTION
GOODLUCK ON YOUR
PRELIM EXAM!
Chapter 4: Introduction to Probability
Probability
Probability is a numerical measure of the
likelihood that an event will occur.
Probability values are always assigned on a scale
from 0 to 1. A probability near zero indicates an
event is unlikely to occur; a probability near 1
indicates an event is almost certain to occur.
Managers often base their decisions on an analysis of
uncertainties such as the following:
P(A) = f / N.
P(A) means ―probability of event A‖ (event A is whatever event you are looking
for, like winning the lottery). “f” is the frequency, or number of possible times the
event could happen. N is the number of times the event could happen
The odds of rolling a 2 on a fair die are one out of 6, or 1/6. That’s one possible
outcome (there’s only one way to roll a 1!) divided by the number of possible
outcomes (1,2,3,4,5,6).
What is the Relative Frequency
Method?
The Relative Frequency Method of assigning probabilities is appropriate when
data are available to estimate the proportion of the time the experimental
outcome will occur if the experiment is repeated a large number of times.
For example, to estimate the probability that a randomly selected consumer prefers
Coca-Cola to all other soft drinks, an experiment is performed wherein randomly
selected consumer were ask for his or her preference. There are two possible sample
space outcomes: “prefers Coca-Cola” and “does not prefer Coca-Cola.” However, we
have no reason to believe that these sample space outcomes are equally likely, so we
cannot use the classical method. We might perform the experiment, say, 1,000 times
by surveying 1,000 randomly selected consumers. Then, if 140 of those surveyed said
that they prefer Coca-Cola, we would estimate the probability that a randomly selected
consumer prefers Coca-Cola to all other soft drinks to be 140/1,000 = .14.
What is the Subjective Method of
Assigning Probabilities?
The subjective method of assigning probabilities is most appropriate when one
cannot realistically assume that the experimental outcomes are equally likely and
when little relevant data are available.
Example: John and Marsha Puruntong made an offer to purchase a house. Two
outcomes are possible:
Marsha believes that the probability their offer will be accepted is .8; thus, Marsha
would set P(E1) = 0.8 and P(E2) = 0.2. John, however, believes that the probability that
their offer will be accepted is .6; hence, John would set P(E1) = 0.6 and P(E2) = 0.4 .
Note that Tom’s probability estimate for reflects a greater pessimism that their offer
will be accepted.
EXPERIMENTS
Where:
And, by definition,
Example
The GRAND lottery system uses the random selection of 6 numbers from a
group of 55 numbers to determine each week's lottery winner. There are
Thus, 20 outcomes are possible for the experiment of randomly selecting two parts
from a group of five when the order of selection must be taken into account. If we
label the parts A, B, C, D, and E, the 20 permutations are AB, BA, AC, CA, AD, DA, AE,
EA, BC, CB, BD, DB, BE, EB, CD, DC, CE, EC, DE, and ED.
What is the Difference Between
Combination and Permutation?
EXAMPLE: "My fruit salad is a combination of apples, grapes and bananas"
We don't care what order the fruits are in, they could also be "bananas,
grapes and apples" or "grapes, apples and bananas", its the same fruit salad.
EXAMPLE: "The combination to the safe is 472". Now the order is important.
"724" won't work, nor will "247". It has to be exactly 4-7-2.
z = (x - µ) / σ
(or, equivalently, the population of all
possible observed values of z) is
normally distributed with mean 0 and
standard deviation 1. A normal
distribution (or curve) with mean 0 and
standard deviation 1 is called a
standard normal distribution (or curve).
TYPES OF RANDOM VARIABLES
Three Important Areas under the Normal Curve
P(µ - σ ≤ X ≤ µ + σ) = 0.6826
This means that 68.26 percent of all possible
observed values of x are within (plus or minus)
one standard deviation of μ.
This says that if the insurance company sells a very large number of these
policies, it will average a profit of $280 per policy per year. Because insurance
companies actually do sell large numbers of policies, it is reasonable for these
companies to make profitability decisions based on expected values.
EXERCISE
The following table summarizes investment outcomes and
corresponding probabilities for a particular oil well:
x = the outcome in $ p(x)
−$40,000 (no oil) .25
10,000 (some oil) .70
70,000 (much oil) .05
a. Graph p(x); that is, graph the probability distribution of x.
b. Find the expected monetary outcome. Mark this value on
your graph of part a. Then interpret this value.
c. Calculate the standard deviation of x.
EXERCISE
a. Graph p(x); that is, graph the probability distribution of x
EXERCISE
b. Find the expected monetary outcome. Mark this value on your graph of part
a. Then interpret this value.
μ = ∑ x•p(x) = -$4.20
EXERCISE
The DVD Case: Managing Inventory
A large discount store sells 50 packs of HX-150 blank DVDs and
receives a shipment every Monday. Historical sales records
indicate that the weekly demand, x, for these 50 packs is normally
distributed with a mean of μ=100 and a standard deviation of σ=10.
How many 50 packs should be stocked at the beginning of a week
so that there is only a 5 percent chance that the store will run short
during the week?
252
Discrete Random Variable
with a Finite Number of Values
Example: An accountant taking CPA examination
The examination has four parts.
Let random variable x = the number of parts of
the CPA examination passed
x may assume the finite number of values 0,1,2,3
or 4.
253
Discrete Random Variable
with an Infinite Number of Values
254
Discrete Probability Distributions
• The probability distribution for a random
variable describes how probabilities are
distributed over the values of the random
variable.
• We can describe a discrete probability
distribution with a table, graph, or formula.
255
Discrete Probability Distributions
Two types of discrete probability distributions:
256
Discrete Probability Distributions
• The probability distribution is defined by a
probability function, denoted by f(x), that
provides the probability for each value of the
random variable.
• The required conditions for a discrete
probability function are:
f(x) > 0 and f(x) = 1
257
Discrete Probability Distributions
• There are three methods for assigning
probabilities to random variables:
Classical method,
Subjective method, and
Relative frequency method.
• The use of the relative frequency method to
develop discrete probability distributions
leads to what is called an empirical discrete
distribution.
258
Discrete Probability Distributions
Example: DiCarlo Motors
Using past data on daily car sales for 300 days, a tabular
representation of the probability distribution for sales was
developed. Number of cars Number of x f(x)
sold days
0 54 0 .18
1 117 1 .39
2 72 2 .24
3 42 3 .14
4 12 4 .04
5 3 5 .01
Total 300 1.00
259
Discrete Probability Distributions
Example: DiCarlo Motors
Graphical
representation
of Probability
Distribution.
260
Discrete Probability Distributions
• In addition to tables and graphs, a formula that
gives the probability function, f(x), for every value
of x is often used to describe the probability
distributions.
• Some of the discrete probability distributions
specified by formulas are
Discrete – uniform distribution
Binomial distribution
Poisson distribution
Hypergeometric distribution
261
Discrete Probability Distributions
• The discrete uniform probability distribution is
the simplest example of a discrete probability
distribution given by a formula.
• The discrete uniform probability function is
f(x) = 1/n
262
Expected Value
• The expected value, or mean, of a random
variable is a measure of its central location.
E(x) = =∑x f(x)
• The expected value is a weighted average of
the values the random variable may assume.
The weights are the probabilities.
• The expected value does not have to be a
value the random variable can assume.
263
Variance and Standard Deviation
• The variance summarizes the variability in the
values of a random variable.
Var(x) = 2 = (x - )2f(x)
• The variance is a weighted average of the
squared deviations of a random variable from
its mean. The weights are the probabilities.
• The standard deviation, , is defined as the
positive square root of the variance.
264
Discrete Probability Distributions
Example: DiCarlo Motors
x f(x) xf(x)
0 .18 .00
1 .39 .39
2 .24 .48
3 .14 .42
4 .04 .16
5 .01 .05
1.00 1.50
265
Discrete Probability Distributions
Example: DiCarlo Motors
x x- (x - )2 f(x) (x - )2
f(x)
0 0 – 1.5 = - 2.25 .18 2.25 (.18) =
1.5 .4050
1 1 – 1.5 = -.5 .25 .39 .0975 Variance of
2 2 – 1.5 = .5 .25 .24 .0600 daily sales = 2 = 1.25
3 3 – 1.5 = 1.5 2.25 .14 .3150
4 4 – 1.5 = 2.5 6.25 .04 .2500
5 5 – 1.5 = 3.5 12.25 .01 .1225
Standard deviation of
1.00 1.2500
daily sales = 1. 118 cars
266
Bivariate Distributions
• A probability distribution involving two random
variables is called a bivariate probability
distribution.
• Each outcome of a bivariate experiment consists
of two values, one for each random variable.
Example: Rolling a pair of dice
• When dealing with bivariate probability
distributions, we are often interested in the
relationship between the random variables.
267
A Bivariate discrete probability
distribution
Example: DiCarlo Motors
The crosstabulation of daily car sales for 300
days at DiCarlo’s Saratoga and Geneva
dealership is given below:
Geneva Saratoga Dealership Total
Dealership 0 1 2 3 4 5
0 21 30 24 9 2 0 86
1 21 36 33 18 2 1 111
2 9 42 9 12 3 2 77
3 3 9 6 3 5 0 26
Total 54 117 72 42 12 3 300
268
A Bivariate discrete probability
distribution
Example: DiCarlo Motors
Bivariate empirical discrete probability distribution for
daily sales at DiCarlo dealerships in Saratoga and
Geneva New York is shown below.
Geneva Saratoga Dealership Total
Dealership 0 1 2 3 4 5
0 .0700 .1000 .0800 .0300 .0067 .0000 .2867
1 .0700 .1200 .1100 .0600 .0067 .0033 .3700
2 .0300 .1400 .0300 .0400 .0100 .0067 .2567
3 .0100 .0300 .0200 .0100 .0167 .0000 .0867
Total .18 .39 .24 .14 .04 .01 1.0000
269
A Bivariate discrete probability
distribution
Example: DiCarlo Motors
Expected value and Variance for daily car sales
at Geneva dealership.
x f(x) xf(x) x – E(x) (x – E(x))2 (x –
E(x))2f(x)
0 .2867 .0000 -1.1435 1.3076 .3749
1 .3700 .3700 -.1435 0.0206 .0076
2 .2567 .5134 .8565 0.8565 .1883
3 .0867 .2601 1.8565 1.8565 .2988
E(x) = 1.1435 Var(x) =
.8696
270
A Bivariate discrete probability
distribution
Example: DiCarlo Motors Expected value and Variance
for total daily car sales data.
s f(s) sf(s) s – E(s) (s – E(s))2 (s – E(s))2f(s)
0 .0700 .0000 -2.6433 6.9872 .4891
1 .1700 .1700 -1.6433 2.7005 .4591
2 .2300 .4600 -0.6433 0.4139 .0952
3 .2900 .8700 0.3567 0.1272 .0369
4 .1267 .5067 1.3567 1.8405 .2331
5 .0667 .3333 2.3567 5.5539 .3703
6 .0233 .1400 3.3567 11.2672 .2629
7 .0233 .1633 4.3567 18.9805 .4429
8 .0000 .0000 5.3567 28.6939 .0000
E(s) = 2.6433 Var(s) = 2.3895
271
A Bivariate discrete probability
distribution
272
A Bivariate discrete probability
distribution
Correlation between random variables x and y
𝝈𝒙𝒚
𝝆𝒙𝒚 =
𝝈𝒙 𝝈 𝒚
𝜍𝑥 = .8696 = .9325
𝜍𝑦 = 1.25 = 1.1180
.1350
𝜌𝑥𝑦 = = .1295
.9325 (1.1180)
273
Binomial Probability Distribution
Four Properties of a Binomial Experiment
1. The experiment consists of a sequence of n
identical trials.
2. Two outcomes, success and failure, are
possible on each trial.
3. The probability of a success, denoted by p,
and failure denoted by 1-p does not
change from trial to trial. (This is
referred to as the stationarity assumption.)
4. The trials are independent.
274
Binomial Probability Distribution
• Our interest is in the number of successes
occurring in the n trials.
• We let x denote the number of successes
occurring in the n trials.
275
Binomial Probability Distribution
Binomial Probability Function
𝒏!
𝒇 𝒙 = 𝒑𝒙 (𝟏 − 𝒑)(𝒏;𝒙)
𝒙! 𝒏 − 𝒙 !
where:
x = the number of successes
p = the probability of a success on one trial
n = the number of trials
f(x) = the probability of x successes in n trials
n! = n(n – 1)(n – 2) ….. (2)(1)
276
Binomial Probability Distribution
• Binomial Probability Function
𝑛!
𝑓 𝑥 = 𝑝 𝑥 (1 − 𝑝)(𝑛;𝑥)
𝑥! 𝑛 − 𝑥 !
277
Binomial Probability Distribution
Example: Martin Clothing store
279
Binomial Probability Distribution
Example: Martin Clothing store
𝑛!
𝑓 𝑥 = 𝑝 𝑥 (1 − 𝑝)(𝑛;𝑥)
𝑥! 𝑛 − 𝑥 !
3!
𝑓 1 = 0.3 2 (0.7)1 = .189
2! 3;2 !
281
Binomial Probability Distribution
Example: Martin Clothing store
282
Binomial Probabilities and Cumulative
Probabilities
• Statisticians have developed tables that give
probabilities and cumulative probabilities for a
binomial random variable.
• These tables can be found in some statistics
textbooks.
• With modern calculators and the capability of
statistical software packages, such tables are
almost unnecessary.
283
Expected Value and Variance for
Binomial Distribution
• Expected Value E(x) = = np
• Variance Var(x) = 2 = np(1 – p)
• Standard Deviation 𝜍 = 𝑛𝑝(1 − 𝑝)
284
Expected Value and Variance for
Binomial Distribution
Example: Martin Clothing store
285
Poisson Probability Distribution
286
Poisson Probability Distribution
Examples
• Number of knotholes in 14 linear feet of pine
board
• Number of vehicles arriving at a toll booth in
one hour
• Number of leaks in 100 miles of pipeline
287
Poisson Probability Distribution
Properties of a Poisson Experiment
288
Poisson Probability Distribution
Poisson Probability Function
𝜇𝑥 𝑒 −𝜇
𝑓 𝑥 =
𝑥!
where:
x = the number of occurrences in an interval
f(x) = the probability of x occurrences in an interval
= mean number of occurrences in an interval
e = 2.71828
x! = x(x – 1)(x – 2) . . . (2)(1)
289
Poisson Probability Distribution
Poisson Probability Function
290
Poisson Probability Distribution
Example: Arrivals at the drive-up teller window
of a bank
105 (2.71828);10
𝑓 5 =
5!
= .0378
292
Poisson Probability Distribution
A property of the Poisson distribution is that the
mean and variance are equal.
=2
293
Poisson Probability Distribution
Example: Arrivals at the drive-up teller window
of a bank
= 2 = 10
294
Hypergeometric Probability Distribution
𝑟 𝑁−𝑟
𝑥 𝑛−𝑥
𝑓 𝑥 =
𝑁
𝑛
296
Hypergeometric Probability
Distribution
Hypergeometric Probability Function
𝑟 𝑁;𝑟
𝑥 𝑛;𝑥
𝑓 𝑥 = 𝑁 for 0 < x < r
𝑛
number of ways number of ways
x successes can n – x failures can be
be selected selected
from a total of r number of ways
successes n elements can be from a total of N – r
in the population selected failures
from a population of size in the population
N
297
Hypergeometric Probability
Distribution
Hypergeometric Probability Function
299
Hypergeometric Probability
Distribution
Example: Ontario electric
Using the probability function:
𝑟 𝑁;𝑟 5 7 5! 7!
𝑥 𝑛;𝑥 1 2 1!4! 2!5! (5)(21)
𝑓 𝑥 = 𝑁 = 12 = 12! = = .4773
220
𝑛 3 3!9!
Variance
2 𝑟 𝑟 𝑁−𝑛
𝑉𝑎𝑟 𝑥 = 𝜍 = 𝑛 1−
𝑁 𝑁 𝑁−1
301
Hypergeometric Probability
Distribution
Example: Ontario electric
Mean
𝑟 5
𝜇=𝑛 =3 = 1.25
𝑁 12
Variance
2
5 5 12 − 3
𝜍 =3 1− = .60
12 12 12 − 1
Standard deviation
𝜍 = .77
302
Hypergeometric Probability
Distribution
• Consider a hypergeometric distribution with n trials and let p =
(r/n) denote the probability of a success on the first trial.
• Note that these are the expressions for the expected value and
variance of a binomial distribution.
303
Hypergeometric Probability
Distribution
When the population size is large, a
hypergeometric distribution can be
approximated by a binomial distribution with n
trials and a probability of success p = (r/N).
304
EXERCISES
The probability distribution for the rate of return on an investment is
a. What is the probability that the rate of return will be at least 10%?
b. What is the expected rate of return?
c. What is the variance of the rate of return?
SOLUTION
EXERCISES
The number of electrical outages in a city varies from day
to day. Assume that the number of electrical outages (x)
in the city has the following probability distribution.
x f(x)
0 0.80
1 0.15
2 0.04
3 0.01
n=5
x=2
p = 0.02
Answer: 0.0037648
EXERCISES
Where:
n=5
x=2
p = 0.32
Answer = 0.321978368
EXERCISES
Ten percent of the items produced by a machine are
defective. Out of 15 items chosen at random,
x
x
327
Continuous Probability Distributions
• A continuous random variable can assume any
value in an interval on the real line or in a
collection of intervals.
• It is not possible to talk about the probability
of the random variable assuming a particular
value.
• Instead, we talk about the probability of the
random variable assuming a value within a
given interval.
328
Continuous Probability Distributions
• The probability of the random variable assuming a value
within some given interval from x1 to x2 is defined to be the
area under the graph of the probability density function
between x1 and x2.
f (x) Exponential
Uniform Normal
f (x) f (x)
x
x1 x2
x x
x1 x2
x1 x2
329
Uniform Probability Distribution
A random variable is uniformly distributed
whenever the probability is proportional to the
interval’s length.
The uniform probability density function is:
f (x) = 1/(b – a) for a < x < b
= 0 elsewhere
where: a = smallest value the variable can
assume
b = largest value the variable can
assume
330
Uniform Probability Distribution
Expected Value of x
E(x) = (a + b)/2
Variance of x
Var(x) = (b - a)2/12
331
Uniform Probability Distribution
Example: Flight time of an airplane traveling
from Chicago to New York
332
Uniform Probability Distribution
Uniform Probability Density Function
where:
x = Flight time of an airplane traveling from Chicago
to New York
333
Uniform Probability Distribution
Expected Value of x
E(x) = (a + b)/2
= (120 + 140)/2
= 130
Variance of x
Var(x) = (b - a)2/12
= (140 – 120)2/12
= 33.33
334
Uniform Probability Distribution
Example: Flight time of an airplane traveling
from Chicago to New York
335
Uniform Probability Distribution
Example: Flight time of an airplane traveling from Chicago to
New York
Probability of a flight time between 120 and 130 minutes
P(120 < x < 130) = 1/20(10) = .5
336
Area as a Measure of Probability
• The area under the graph of f(x) and
probability are identical.
• This is valid for all continuous random
variables.
• The probability that x takes on a value
between some lower value x1 and some
higher value x2 can be found by computing the
area under the graph of f(x) over the interval
from x1 to x2.
337
Normal Probability Distribution
– The normal probability distribution is the most important
distribution for describing a continuous random variable.
– It is widely used in statistical inference.
– It has been used in a wide variety of applications including:
Heights of people
Test scores
Rainfall amounts
Scientific measurements
– Abraham de Moivre, a French mathematician, published
The Doctrine of Chances in 1733. He derived the normal
distribution.
338
Normal Probability Distribution
Normal Probability Density Function
1 ;(𝑥;𝜇)2 /2𝜎 2
𝑓 𝑥 = 𝑒
𝜍 2𝜋
Where µ = mean
𝜍 = standard deviation
= 3.14159
e = 2.71828
339
Normal Probability Distribution
Characteristics
340
Normal Probability Distribution
Characteristics
• The entire family of normal probability
distributions is defined by its mean and its
standard deviation .
341
Normal Probability Distribution
Characteristics
• The highest point on the normal curve is at
the mean, which is also the median and
mode.
342
Normal Probability Distribution
Characteristics
• The mean can be any numerical value:
negative, zero, or positive.
343
Normal Probability Distribution
Characteristics
• The standard deviation determines the width of the
curve: larger values result in wider, flatter curves.
344
Normal Probability Distribution
Characteristics
• Probabilities for the normal random variable
are given by areas under the curve. The total
area under the curve is 1 (.5 to the left of the
mean and .5 to the right).
.5 x
.5 345
Normal Probability Distribution
Characteristics (basis for the empirical rule)
346
Normal Probability Distribution
Characteristics (basis for the empirical rule)
347
Standard Normal Probability
Distribution
Characteristics
A random variable having a normal distribution
with a mean of 0 and a standard deviation of 1 is
said to have a standard normal probability
distribution.
348
Standard Normal Probability
Distribution
Characteristics
The letter z is used to designate the standard
normal random variable.
349
Standard Normal Probability
Distribution
Converting to Standard Normal Distribution
𝑥;𝜇
z=
𝜎
350
Using excel to compute standard
normal probabilities
Excel has two functions for computing probabilities
and z values for a standard normal probability
distribution.
351
Standard Normal Probability
Distribution
Example: Grear Tire Company Problem
356
Standard Normal Probability
Distribution
Example: Grear Tire Company Problem
z = (x - )/
= (40,000 – 36,500)/5,000
= .7
• Step 2: Find the area under the standard normal
curve to the left of z = .7.
358
Standard Normal Probability Distribution
Example: Grear Tire Company Problem
Cumulative Probability Table for the Standard Normal
Distribution
z .00 .01 .02 .03 .04 .05 .06 .07 .08 .09
. . . . . . . . . . .
.5 .6915 .6950 .6985 .7019 .7054 .7088 .7123 .7157 .7190 .7224
.6 .7257 .7291 .7324 .7357 .7389 .7422 .7454 .7486 .7517 .7549
.7 .7580 .7611 .7642 .7673 .7704 .7734 .7764 .7794 .7823 .7852
.8 .7881 .7910 .7939 .7967 .7995 .8023 .8051 .8078 .8106 .8133
. . . . . . . . . . .
359
Standard Normal Probability
Distribution
Example: Grear Tire Company Problem
Solving for the Probability
• Step 3: Compute the area under the standard
normal curve to the right of z = .7
P(z > .7) = 1 – P(z < .7)
= 1- .7580
= .2420
360
Standard Normal Probability
Distribution
Example: Grear Tire Company Problem
361
Standard Normal Probability
Distribution
Example: Grear Tire Company Problem
0 .7
362
Standard Normal Probability
Distribution
Example: Grear Tire Company Problem
What should be the guaranteed mileage if Grear
wants no more than 10% of tires to be eligible
for the discount guarantee?
(Hint: Given a probability, we can use the
standard normal table in an inverse fashion to
find the corresponding z value.)
363
Standard Normal Probability
Distribution
Example: Grear Tire Company Problem
Solving for the guaranteed mileage
364
Standard Normal Probability
Distribution
Example: Grear Tire Company Problem - Solving for the
guaranteed mileage
Step 1: Find the z value that cuts off an area of .1 in the
left tail of the standard normal distribution.
z .00 .01 .02 .03 .04 .05 .06 .07 .08 .09
. . . . . . . . . . .
-1.5 0.0668 0.0655 0.0643 0.0630 0.0618 0.0606 0.0594 0.0582 0.0571 0.0559
-1.4 0.0808 0.0793 0.0778 0.0764 0.0749 0.0735 0.0721 0.0708 0.0694 0.0681
-1.3 0.0968 0.0951 0.0934 0.0918 0.0901 0.0885 0.0869 0.0853 0.0838 0.0823
-1.2 0.1151 0.1131 0.1112 0.1093 0.1075 0.1056 0.1038 0.1020 0.1003 0.0985
-1.1 0.1357 0.1335 0.1314 0.1292 0.1271 0.1251 0.1230 0.1210 0.1190 365
0.1170
Standard Normal Probability
Distribution
From the table we see that z = -1.28 cuts off an area
of 0.1 in the lower tail.
366
Using Excel to Compute Normal
Probabilities
Excel has two functions for computing
cumulative probabilities and x values for any
normal distribution:
– NORM.DIST is used to compute the cumulative
probability given an x value.
– NORM.INV is used to compute the x value given a
cumulative probability.
367
Exponential Probability Distribution
• The exponential probability distribution is useful
in describing the time it takes to complete a task.
• The exponential random variables can be used to
describe:
• Time between vehicle arrivals at a toll booth
• Time required to complete a questionnaire
• Distance between major defects in a highway
• In waiting line applications, the exponential
distribution is often used for service times.
370
Exponential Probability Distribution
• A property of the exponential distribution is
that the mean and standard deviation are
equal.
• The exponential distribution is skewed to the
right. Its skewness measure is 2.
371
Exponential Probability Distribution
Density Function
1 ;𝑥/𝜇
𝑓 𝑥 = 𝑒 for x > 0
𝜇
372
Exponential Probability Distribution
Cumulative Probabilities
373
Exponential Probability Distribution
Example: Loading time for trucks
374
Exponential Probability Distribution
Example: Loading time for trucks
= .3297
375
Using Excel to Compute Exponential
Probabilities
The EXPON.DIST function can be used to
compute exponential probabilities.
The EXPON.DIST function has three arguments:
– 1st The value of the random variable x
– 2nd 1/m - the inverse of the mean number of
occurrences in an interval
– 3rd “TRUE” or “FALSE - We will always enter
“TRUE” because we’re seeking a cumulative
probability.
376
Using Excel to Compute Exponential
Probabilities
379
Relationship between the Poisson and
Exponential Distributions
380
EXERCISES
Suppose x is a normally distributed random variable
with a mean of 22 and a standard deviation of 5. The
probability that x is less than 9.7 is _____.
SOLUTION
EXERCISES
The ages of students at a university are normally
distributed with a mean of 21. What percentage of the
student body is at least 21 years old?
SOLUTION
MEAN = 21 ; PERCENTAGE OF
AGE 21 = 50%
EXERCISES
The weight of football players is normally distributed with a
mean of 200 pounds and a standard deviation of 25 pounds.
The probability of a player weighing more than 241.25 pounds
is _____.
SOLUTION
mean = 200
mean = 40000
std dev = 5000 z= (X - µ) / δ =
P(X ≥ 47,500) 1.5
using table 0.93332
P(X ≥ 47,500) 0.06668
EXERCISES
Density Function
1 ;𝑥/𝜇
𝑓 𝑥 = 𝑒 for x > 0
𝜇
𝜇 = 10
EXERCISES
A random variable x is uniformly distributed between
45 and 150.
a. Determine the probability of x = 48.
b. What is the probability of x ≤ 60?
c. What is the probability of x ≥ 50?
d. Determine the expected vale of x and its standard
deviation.
SOLUTION
Formulas for the theoretical mean and standard deviation
are
0.2000
e = 2.71828
.
f(x) = (1/μ) e-x/μ for x ≥ 0 .
f(x) = (1/5) e-x/5 for x ≥ 0
MEAN = 40
a. What is the probability Lexeleen will wait less than 5 minutes for the ferry?
b. What is the probability Lexeleen will wait less than 10 minutes for the ferry?
c. What is the probability Lexeleen will wait less than 15 minutes for the ferry?
d. What is the probability Lexeleen will not have to wait for the ferry?
e. Suppose Lexeleen leaves at 8:05 A.M. What is the probability Lexeleen will
wait (1) less than 5 minutes for the ferry; (2) less than 10 minutes for the ferry?
f. Suppose Lexeleen leaves at 8:10 A.M. What is the probability Lexeleen will
wait (1) less than 5 minutes for the ferry; (2) less than 10 minutes for the ferry?
g. What appears to be the best time for Lexeleen to leave home if she wishes
to maximize the probability of waiting less than 10 minutes for the ferry?
EXERCISE
Suppose that the random variable x is normally distributed
with mean μ = 500 and standard deviation σ = 100. For each
of the following, use the normal table to find the needed
value k.
a. P(x ≥ k) = .025
b. P(x ≥ k) = .05
c. P(x < k) = .025
d. P(x ≤ k) = .015
e. P(x < k) = .985
f. P(x > k) = .95
g. P(x ≤ k) = .975
h. P(x ≥ k) = .0228
i. P(x > k) = .9772
SOLUTION
a. P(x k) = 0.0250 Distribution Plot
Normal, Mean=500, StDev=100
Distribution Plot
Normal, Mean=500, StDev=100
Density
Density
0.002
0.002
Density
0.002
H
0.025 0.001
0.000
500 696.0
P(x 664.5) = 0.0500
0.000
X
500 717.0 0.0228
X
0.000
A
500 699.9
Distribution Plot X
c. P(z < -1.96) = 0.0250 Normal, Mean=500, StDev=100
E
0.004 Distribution Plot
k = -1.96(100) + 500 = 304 Normal, Mean=500, StDev=100
I
Density
0.95 0.004
0.002 0.003
0.9772
B
d. P(z ≤ -2.17) = 0.0150
Density
0.003
k = -2.17(100) + 500 = 283 0.001 0.002
F
Density
P(x 283) = 0.0150 0.000
500 664.5
0.05
0.001
0.002
X
k = 2.17(100) + 500 = 717
C
Distribution Plot
0.000
Normal, Mean=500, StDev=100 300.1 500
G
0.004 Distribution Plot
Normal, Mean=500, StDev=100
0.975
0.002 0.003
0.002
i. P(z > -2.00) = 0.9772
k = -2.00(100) + 500 = 300 0.001
283.0 500
X
EXERCISE
Weekly demand at a grocery store for a brand of breakfast
cereal is normally distributed with a mean of 800 boxes and
a standard deviation of 75 boxes.
a. What is the probability that weekly demand is959 boxes
or less? More than 1, 004 boxes? Less than 650 boxes
or greater than 950 boxes?
b. The store orders cereal from a distributor weekly. How
many boxes should the store order for a week to have only
a 2.5 percent chance of running short of this brand of cereal
during the week?
SOLUTION
a. (1) P(x 959) = P(x-μσ ≤ 959-80075) = P(z 2.12) = .9830
(2) P(x > 1004) = P(x-μσ > 1004-80075) = P(z > 2.72) = 1 – .9967 = .0033
(3) P(x < 650) + P(x > 950) = P(z < –2) + P(z > 2) = .0228 + (1 – .9772) = .0456
𝑥;𝜇 𝑘;0
P(z > 0.50) = .3085 implies that z = = = 0.50 Thus k = 5000, and we will investigate Process B if
σ 10000
the cost variance exceeds $5000.
If Process B is out of control we use μB = $7,500 and σB = $10,000. Thus the probability of investigating an
out of control Process B is:
𝑥;𝜇 5000;7500
P(xB > 5,000) = P( > ) = P(zB > -0.25) = 1 – 0.4013 = 0.5987
σ 10000
EXERCISE
A business executive, transferred from Chicago to Atlanta, needs to sell her house in
Chicago quickly. The executive’s employer has offered to buy the house for $210,000, but
the offer expires at the end of the week. The executive does not currently have a better offer
but can afford to leave the house on the market for another month. From conversations with
her realtor, the executive believes the price she will get by leaving the house on the market
for another month is uniformly distributed between $200,000 and $225,000.
A) If she leaves the house on the market for another month, what is the mathematical
expression for the probability density function of the sales price?
B) If she leaves it on the market for another month, what is the probability that she will get at
least $215,000 for the house?
C) If she leaves it on the market for another month, what is the probability that she will get
less than $210,000?
D) Should the executive leave the house on the market for another month? Why or why not?
SOLUTION
EXERCISES
The NCAA estimates that the yearly value of a full athletic
scholarship at in-state public universities is $19,000 (The
Wall Street Journal, March 12, 2012). Assume the
scholarship value is normally distributed with a standard
deviation of $2100.
A. For the 10% of athletic scholarships of least value, how
much are they worth?
B. What percentage of athletic scholarships are valued at
$22,000 or more?
C. For the 3% of athletic scholarships that are most valuable,
how much are they worth?
SOLUTION
EXERCISE
Motorola used the normal distribution to determine the probability of defects
and the number of defects expected in a production process. Assume a
production process produces items with a mean weight of 10 ounces.
Calculate the probability of a defect and the expected number of defects for
a 1000-unit production run in the following situations.
A. The process standard deviation is .15, and the process control is set at
plus or minus one standard deviation. Units with weights less than 9.85
or greater than 10.15 ounces will be classified as defects.
B. Through process design improvements, the process standard deviation
can be reduced to .05. Assume the process control remains the same,
with weights less than 9.85 or greater than 10.15 ounces being
classified as defects.
C. What is the advantage of reducing process variation, thereby causing
process control limits to be at a greater number of standard deviations
from the mean?
SOLUTION
EXERCISE
During early 2012, economic hardship was stretching the limits of
France’s welfare system. One indicator of the level of hardship was the
increase in the number of people bringing items to a Paris pawnbroker.
That number had risen to 658 per day (Bloomberg Businessweek, March
5–March 11, 2012). Assume the number of people bringing items to the
pawnshop per day in 2012 is normally distributed with a mean of 658.
A.Suppose you learn that on 3% of the days, 610 or fewer people brought
items to the pawnshop. What is the standard deviation of the number of
people bringing items to the pawnshop per day?
B.On any given day, what is the probability that between 600 and 700
people bring items to the pawnshop?
C.How many people bring items to the pawnshop on the busiest 3% of
days?
SOLUTION
EXERCISE
The port of South Louisiana, located along 54 miles of the Mississippi
River between New Orleans and Baton Rouge, is the largest bulk cargo
port in the world. The U.S. Army Corps of Engineers reports that the port
handles a mean of 4.5 million tons of cargo per week (USA Today,
September 25, 2012). Assume that the number of tons of cargo handled
per week is normally distributed with a standard deviation of .82 million
tons.
A. What is the probability that the port handles less than 5 million tons of
cargo per week?
B. What is the probability that the port handles 3 million or more tons of
cargo per week?
C. What is the probability that the port handles between 3 million and 4
million tons of cargo per week?
SOLUTION
Chapter 7
Sampling and
Sampling
Distributions
SAMPLE
A collection of data from part of the population.
Answer: AB, AC, AD, AE, BC, BD, BE, CD, CE, DE
Using simple random sampling, what is the probability that each sample of size 2 is
selected?
The sampling distribution of the sample mean 𝑋 and its mean and
standard deviation are:
𝜍𝑋= 𝑋2 𝑓 𝑋 − 𝑋 𝑓 (𝑋) 2 =
997 63 2
− = 0.3632
36 12
Introduction to Sampling Distribution
The mean and the standard deviation of the population are:
X 4 5 5 7
𝑿 = 𝟐𝟏
𝑋2 16 25 2 49
𝑋 2 = 115
𝑋 21
𝜇= = = 5.25
𝑁 4
𝜍 = 0.3632
Formula:
𝜎
𝜇= 𝑥± 𝑍𝛼
2 𝑛
Where:
x = mean ; Zα = confidence coefficient
2
452
Margin of Error and the Interval
Estimate
• A point estimator cannot be expected to provide the
exact value of the population parameter.
• An interval estimate can be computed by adding and
subtracting a margin of error to the point estimate.
Point Estimate +/- Margin of Error
• The purpose of an interval estimate is to provide
information about how close the point estimate is to
the value of the parameter.
453
Margin of Error and the Interval
Estimate
𝑥 + Margin of Error
454
Interval Estimate of a Population
Mean: Known
455
Interval Estimate of a Population
Mean: Known
There is a 1 - probability that the value of a
sample
mean will provide a margin of error of 𝑧𝛼/2 𝜍𝑥 or
less.
/ /
2 2
456
Interval Estimate of a Population
Mean: Known
457
Interval Estimate of a Population
Mean: Known
Interval Estimate of
𝜍
𝑥 ± 𝑧𝛼/2
𝑛
458
Interval Estimate of a Population
Mean: Known
Values of z/2 for the Most Commonly Used Confidence
Levels
459
Meaning of Confidence
Because 90% of all the intervals constructed using 𝑥 +
1.645𝜍𝑥 will contain the population mean, we say
we are 90% confident that the interval
𝑥 + 1.645𝜍𝑥 includes the population mean .
460
Interval Estimate of a Population
Mean: Known
Example: Lloyds Department store
Each week Lloyds department store selects a simple
random sample of 100 customers in order to learn about
the amount spent per shopping trip. The historical data
indicates that the population follows a normal
distribution.
During most recent week, Lloyd’s surveyed 100
customers (n = 100) and obtained a sample mean of 𝑥 =
$82. Based on historical data, Lloyd’s now assumes a
known value of 𝜍 = $20. The confidence coefficient to be
used in the interval estimate is .95.
461
Interval Estimate of a Population
Mean: Known
Example: Lloyds Department store
𝜍 20
𝑧𝛼/2 = 1.96 = 3.92
𝑛 100
462
Interval Estimate of a Population Mean:
Known
Example: Lloyds Department store
$82 + $ 3.92
or
$78.08 to $85.29
463
Interval Estimate of a Population
Mean: Known
Example: Lloyds Department store
Confidence level Margin of Error Interval
estimate
90% 3.92 78.08 – 85.92
464
Interval Estimate of a Population
Mean: Known
Adequate Sample Size
– In most applications, a sample size of n ≥ 30 is
adequate.
– If the population distribution is highly skewed or
contains outliers, a sample size of 50 or more is
recommended.
– If the population is not normally distributed but is
roughly symmetric, a sample size as small as 15 will
suffice.
– If the population is believed to be at least
approximately normal, a sample size of less than 15
can be used.
465
Interval Estimate of a Population
Mean: Unknown
• If an estimate of the population standard deviation
cannot be developed prior to sampling, we use the
sample standard deviation s to estimate .
• This is the unknown case.
• In this case, the interval estimate for is based on
the t distribution.
• (We’ll assume for now that the population is
normally distributed.)
466
t Distribution
467
t Distribution
468
t Distribution
• A t distribution with more degrees of
freedom has less dispersion.
469
t Distribution
Comparison of the standard normal distribution
with t distributions having 10 and 20 degrees of
freedom.
470
t Distribution
471
t Distribution
Selected values
from the t
distribution table
472
Interval Estimate of a Population
Mean: Unknown
𝑠
𝑥 ± 𝑡𝛼/2
𝑛
where: 𝑥 = the sample mean
1 - = the confidence coefficient
t/2 = the t value providing an area of /2 in the
upper tail of a t distribution with n - 1 degrees of freedom
s = the sample standard deviation
n = the sample size
473
Interval Estimate of a Population
Mean: Unknown
Example: Credit card debt for the population of
US households
The credit card balances of a sample of 70
households provided a mean credit card debt of
$9312 with a sample standard deviation of $4007.
Let us provide a 95% confidence interval estimate
of the mean credit card debt for the population of
US households. We will assume this population to
be normally distributed.
474
Interval Estimate of a Population
Mean: Unknown
• At 95% confidence, = .05, and /2 = .025.
• t.025 is based on n - 1 = 70 - 1 = 69 degrees of
freedom.
475
Interval Estimate of a Population
Mean: Unknown
Example: Credit card debt for the population of US
households
𝑠
𝑥 ± 𝑡.025
𝑛
4007
9312 + 1.995 = 9312 + 955
70
476
Interval Estimate of a Population
Mean: Unknown
Adequate Sample Size
• Usually, a sample size of n ≥ 30 is adequate when
using the expression 𝑥 ± 𝑡𝛼/2 𝑠/ 𝑛 to develop an
interval estimate of a population mean.
• If the population distribution is highly skewed or
contains outliers, a sample size of 50 or more is
recommended.
• If the population is not normally distributed but is roughly
symmetric, a sample size as small as 15 will suffice.
• If the population is believed to be at least approximately
normal, a sample size of less than 15 can be used.
477
Summary of Interval Estimation Procedures
for a Population Mean
478
Sample Size for an Interval Estimate of
a Population Mean
• Let E = the desired margin of error.
• E is the amount added to and subtracted from
the point estimate to obtain an interval
estimate.
• If a desired margin of error is selected prior to
sampling, the sample size necessary to satisfy
the margin of error can be determined.
479
Sample Size for an Interval Estimate of
a Population Mean
• Margin of Error
𝜍
𝐸 = 𝑧𝛼/2
𝑛
• Necessary Sample Size
(𝑧𝛼/2 )2 𝜎 2
n=
𝐸2
480
Sample Size for an Interval Estimate of
a Population Mean
• The Necessary Sample Size equation requires a
value for the population standard deviation .
• If is unknown, a preliminary or planning value
for can be used in the equation.
1. Use the estimate of the population standard
deviation computed in a previous study.
2. Use a pilot study to select a preliminary study and
use the sample standard deviation from the study.
3. Use judgment or a “best guess” for the value of .
481
Sample Size for an Interval Estimate of
a Population Mean
Example: Cost of renting Automobiles in United States
A previous study that investigated the cost of renting
automobiles in the United States found a mean cost of
approximately $55 per day for renting a midsize
automobile with a standard deviation of $9.65.
Suppose the project director wants an estimate of the
population mean daily rental cost such that there is a .95
probability that the sampling error is $2 or less.
How large a sample size is needed to meet the required
precision?
482
Sample Size for an Interval Estimate of
a Population Mean
Example: Cost of renting Automobiles in United States
𝜍
𝐸 = 𝑧𝛼/2 =2
𝑛
(1.96)2 (9.65)2
𝑛= = 89.43 ⋍ 90
(2)2
The sample size needs to be at least 90 mid size automobile
rentals in order to satisfy the project director’s $2 margin-of-
error requirement.
483
Interval Estimate of a Population
Proportion
The general form of an interval estimate of a
population proportion is:
𝑝 + Margin of Error
484
Interval Estimate of a Population
Proportion
• The sampling distribution of 𝑝 plays a key role
in computing the margin of error for this
interval estimate.
• The sampling distribution of 𝑝 can be
approximated by a normal distribution
whenever np > 5 and n(1 – p) > 5.
485
Interval Estimate of a Population
Proportion
Normal Approximation of Sampling Distribution
of 𝑝
486
Interval Estimate of a Population
Proportion
𝑝(1 − 𝑝)
𝑝 ± 𝑧𝛼/2
𝑛
487
Interval Estimate of a Population
Proportion
Example: Survey of women golfers
A national survey of 900 women golfers was
conducted to learn how women golfers view their
treatment at golf courses in United States. The
survey found that 396 of the women golfers were
satisfied with the availability of tee times.
Suppose one wants to develop a 95% confidence
interval estimate for the proportion of the
population of women golfers satisfied with the
availability of tee times.
488
Interval Estimate of a Population
Proportion
Example: Survey of women golfers
𝑝(1 − 𝑝)
𝑝 ± 𝑧𝛼/2
𝑛
.44(1;.44)
. 44 ±1.96 = .44 ± .0324
900
489
Sample Size for an Interval Estimate of
a Population Proportion
Margin of Error
𝑝(1;𝑝)
E = 𝑧𝛼/2
𝑛
Solving for the necessary sample size n, we get
2
𝑧𝛼/2 𝑝 1 − 𝑝
𝑛=
𝐸2
2 ∗
𝑧𝛼/2 𝑝 1 − 𝑝∗
𝑛=
𝐸2
491
Sample Size for an Interval Estimate of
a Population Proportion
Example: Survey of women golfers
492
Sample Size for an Interval Estimate of
a Population Proportion
Example: Survey of women golfers
𝑝∗ (1;𝑝∗ )
E = 𝑧𝛼/2 = .025
𝑛
2 ∗
𝑧𝛼/2 𝑝 1 − 𝑝∗ 1.96 2 (.44) .56
𝑛= 2
= 2
= 1514.5
𝐸 (.025)
A sample of size 1515 is needed to reach a desired precision
of + .025 at 95% confidence.
493
Sample Size for an Interval Estimate of
a Population Proportion
Note: We used .44 as the best estimate of p in
the preceding expression. If no information is
available about p, then .5 is often assumed
because it provides the highest possible sample
size. If we had used p = .5, the recommended n
would have been 1537.
494
Implications of Big Data
• As the sample size becomes extremely large, the
margin of error becomes extremely small and
resulting confidence intervals become extremely
narrow.
• No interval estimate will accurately reflect the
parameter being estimated unless the sample is
relatively free of non-sampling error.
• Statistical inference along with information
collected from other sources can help in making
the most informed decision.
495
EXERCISE
A statistician selected a sample of 16
accounts receivable and determined the
mean of the sample to be $5,000 with a
standard deviation of $400. She reported
that the sample information indicated the
mean of the population ranges from
$4,739.80 to $5,260.20. She did not report
what confidence coefficient she had used.
Based on the above information, determine
the confidence coefficient that was used.
SOLUTION
𝑠
𝑥 ± 𝑡𝛼/2
𝑛
where:
𝑥 is the sample mean
1 - is the confidence coefficient
z/2 is the z value providing an area of
/2 in the upper tail of the standard
normal probability distribution
is the population standard deviation
n is the sample size
GIVEN:
n =16
𝑥 = 5,000
𝑠 = 400 𝐸 = ± 𝑡𝛼/2 (400/ 16) =
Margin of Error =$4,739.80 to
520.40 ( 16) = ± 𝑡𝛼/2 (400)
$5,260.20 = $520.40
± 𝑡𝛼/2 = 5.204 = 0.98
DF = n -1 = 15
EXERCISE
1. Formulate the null hypothesis H_0 (commonly, that the observations are the result of pure
chance) and the alternative hypothesis H_a (commonly, that the observations show a real
effect combined with a component of chance variation).
2. Identify a test statistic that can be used to assess the truth of the null hypothesis.
3. Compute the P-value, which is the probability that a test statistic at least as significant as
the one observed would be obtained assuming that the null hypothesis were true. The smaller
the P-value, the stronger the evidence against the null hypothesis.
4. Compare the p-value to an acceptable significance value alpha (sometimes called an alpha
value). If p≤ alpha, that the observed effect is statistically significant, the null hypothesis is
ruled out, and the alternative hypothesis is valid.
Rare Event Rule for Inferential Statistics
pq proportions
n
x - µx Test statistic for
z=
mean
n
Test statistic for
(n – 1)s2
2 = standard
2 deviation
Example: A survey of n = 880 randomly
selected adult drivers showed that 56% (or p = 0.56) of
those respondents admitted to running red lights. Find
the value of the test statistic for the claim that the
majority of all adult drivers admit to running red lights.
pq
n (0.5)(0.5)
880
Interpretation: We know from previous chapters
that a z score of 3.56 is exceptionally large. It
appears that in addition to being “more than half,”
the sample result of 56% is significantly more than
50%.
Traditional method:
P-value method:
Another option:
Confidence Intervals:
b) With a claim of p 0.25, the test is two-tailed. Because the test is two-
tailed, and because the test statistic of z = 2.34 is to the right of the center,
Figure 8-6 shows that the P-value is twice the area to the right of z = 2.34. We
refer to Z-Table and find that the area to the right of z = 2.34 is 0.0096, so P-
value = 2 x 0.0096 = 0.0192. The P-value of 0.0192 is less than or equal to the
significance level, so we reject the null hypothesis. The small P-value o 0.0192
shows that the sample results are not likely to occur by chance.
Wording of Final Conclusion
Figure 8-7
Accept Versus Fail to Reject
564
Inferences About the Difference Between
Two Population Means: 1 and 2 Known
• Interval Estimation of 1 – 2
• Hypothesis Tests About 1 – 2
565
Estimating the Difference Between
Two Population Means
• Let 1 equal the mean of population 1 and 2 equal
the mean of population 2.
• The difference between the two population means is
1 2.
• To estimate 1 2, we will select a simple random
sample of size n1 from population 1 and a simple
random sample of size n2 from population 2.
• Let 𝑥1 equal the mean of sample 1 and 𝑥2 equal the
mean of sample 2.
• The point estimator of the difference between the
means of the populations 1 and 2 is 𝑥1 − 𝑥2 .
566
Sampling Distribution of 𝑥1 − 𝑥2
• Expected Value
𝐸(𝑥1 − 𝑥2 )= 𝜇1 − 𝜇2
• Standard Deviation (Standard Error)
𝜎1 2 𝜎2 2
𝜍𝑥1;𝑥2 = +
𝑛1 𝑛2
where: 1 = standard deviation of population 1
2 = standard deviation of population 2
n1 = sample size from population 1
n2 = sample size from population 2
567
Interval Estimate of 1 - 2: 1 and 2
Known
• Interval Estimate
𝜍1 2 𝜍2 2
𝑥1 − 𝑥2 ± 𝑧𝛼/2 +
𝑛1 𝑛2
where:
1 - is the confidence coefficient
568
Interval Estimate of 1 - 2: 1 and 2
Known
Example: Homestyle Furniture
569
Interval Estimate of 1 - 2: 1 and 2
Known
Example: Homestyle Furniture
570
Interval Estimate of 1 - 2: 1 and 2
Known
Example: Homestyle Furniture
571
Estimating the Difference Between Two
Population Means
Population 1 Population 2
Inner-City Store Customers Suburban store customers
1 = mean age of 2 = mean age of
Inner-City Store Customers Suburban store
customers
1 – 2 = difference between
the mean ages
572
Point Estimate of 1 - 2
Example: Homestyle Furniture
where:
1 = mean age of Inner-City Store Customers
2 = mean age of Suburban store customers
573
Interval Estimation of 1 - 2: 1 and
2 Known
Example: Homestyle Furniture
𝜎1 2 𝜎2 2 (9)2 (10)2
𝑥1 − 𝑥2 ± 𝑧𝛼/2 + = 5 ± 1.96 +
𝑛1 𝑛2 36 49
574
Hypothesis Tests About 1 - 2: 1 and
2 Known
• Hypotheses
H0: 1 – 2 > D0 H0: 1 – 2 < D0 H0: 1 – 2 = D0
Ha: 1 – 2 < D0 Ha: 1 – 2 > D0 Ha: 1 – 2 ≠ D0
• Test Statistic
𝑥1 − 𝑥2 − 𝐷0
𝑧=
(𝜍1 )2 (𝜍2 )2
+
𝑛1 𝑛2
575
Hypothesis Tests About 1 - 2: 1 and
2 Known
Example: Training Centers
576
Hypothesis Tests About 1 - 2: 1 and
2 Known
Example: Training Centers
A B
Sample Size 30 40
Sample Mean 82 78
Standard deviation 10 10
(Based on previous
studies)
577
Hypothesis Tests About 1 - 2: 1 and
2 Known
Example: Training Centers
578
Hypothesis Tests About 1 - 2: 1 and
2 Known
Example: Training Centers
p –Value and Critical Value Approaches
1. Develop the hypotheses.
H0: 1 - 2 = 0
Ha: 1 - 2 ≠ 0 (two -tailed test)
Where
µ1 = The mean examination score for the population of
individuals trained at center A
µ2 = The mean examination score for the population of
individuals trained at center B
579
Hypothesis Tests About 1 - 2: 1 and
2 Known
Example: Training Centers
p –Value and Critical Value Approaches
𝑥1 − 𝑥2 − 𝐷0
𝑧=
(𝜍1 )2 (𝜍2 )2
+
𝑛1 𝑛2
82 − 78 − 0 4
𝑧= = = 1.66
(10)2 (10)2 2.4152
+
30 40
580
Hypothesis Tests About 1 - 2: 1 and
2 Known
Example: Training Centers
p –Value Approach
4. Compute the p–value.
For z = 1.66, the area to the left is .9515.
The area in the upper tail of the distribution is
1.0000 -.9515 = .0485
p –value = 2(.0485) = .0970
581
Hypothesis Tests About 1 - 2: 1 and
2 Known
Example: Training Centers
Critical Value Approach
4. Determine the critical value and rejection rule.
For = .05, z.025 = 1.96
Reject H0 if z > 1.96
582
Inferences About the Difference Between
Two Population Means: 1 and 2 Unknown
– Interval Estimation of 1 – 2
– Hypothesis Tests About 1 – 2
583
Interval Estimation of 1 - 2: 1 and 2
Unknown
When 1 and 2 are unknown, we will:
– Use the sample standard deviations s1 and s2 as
estimates of 1 and 2 , and
– Replace z/2 with t/2.
584
Interval Estimation of 1 - 2: 1 and
2 Unknown
– Interval Estimate
𝑠1 2 𝑠2 2
𝑥1 − 𝑥2 ± 𝑡𝛼/2 +
𝑛1 𝑛2
2
𝑠1 2 𝑠2 2
𝑛1 + 𝑛2
𝑑𝑓 = 2 2
1 𝑠1 2 1 𝑠2 2
𝑛1 − 1 𝑛1 +𝑛 −1 𝑛2
2
585
Difference Between Two Population
Means: 1 and 2 Unknown
Example: Clearwater National Bank
586
Difference Between Two Population
Means: 1 and 2 Unknown
Example: Clearwater National Bank
587
Difference Between Two Population
Means: 1 and 2 Unknown
Example: Clearwater National Bank
588
Point Estimate of 1 - 2
Example: Clearwater National Bank
where:
1 = mean checking account balance maintained by the
population of Cherry Grove customers
2 = mean checking account balance maintained by the
population of Beechmont customers
589
Interval Estimation of 1 - 2: 1 and
2 Unknown
Example: Clearwater National Bank
2
(150)2 (125)2
:
28 22
𝑑𝑓 = 2 2 = 47.8 = 47
1 (150)2 1 (125)2
28−1 28
:22−1 22
590
Interval Estimation of 1 - 2: 1 and
2 Unknown
Example: Clearwater National Bank
𝑠1 2 𝑠2 2
𝑥1 − 𝑥2 ± 𝑡𝛼/2 +
𝑛1 𝑛2
(150)2 (125)2
1025 − 910 ± 2.012 +
28 22
591
Hypothesis Tests About 1 - 2: 1 and
2 Unknown
• Hypotheses
H0: 1 – 2 > D0 H0: 1 – 2 < D0 H0: 1 – 2 = D0
Ha: 1 – 2 < D0 Ha: 1 – 2 > D0 Ha: 1 – 2 ≠ D0
• Test Statistic
𝑥1 − 𝑥2 − 𝐷0
𝑡=
(𝑠1 )2 (𝑠2 )2
+
𝑛1 𝑛2
592
Hypothesis Tests About 1 - 2: 1 and
2 Unknown
Example: Computer software package
593
Hypothesis Tests About 1 - 2: 1 and 2
Unknown
Current Technology New Software
Example: Computer software package
300 274
280 220
Summary Statistics 344 308
Sample Size 12 12 385 336
Sample Mean 325 286 372 198
Sample SD 40 44 360 300
288 315
321 258
376 318
290 310
301 332
283 263
594
Hypothesis Tests About 1 - 2: 1 and
2 Unknown
Example: Computer software package
p –Value approach
1. Develop the hypotheses.
H0: 1 - 2 < 0
Ha: 1 - 2 > 0
(right-tailed test)
where:
1 = the mean project completion time for system analysts using the
current technology
2 = the mean project completion time for system analysts using the
new software package
595
Hypothesis Tests About 1 - 2: 1 and
2 Unknown
Example: Computer software package
p –Value approach
325 − 286 − 0
𝑡= = 2.27
(40)2 (44)2
+
12 12
596
Hypothesis Tests About 1 - 2: 1 and
2 Unknown
Example: Computer software package
p –Value Approach
4. Compute the p –value.
The degrees of freedom for t are:
2
(40)2 (44)2
+
12 12
𝑑𝑓 = 2 2 = 21.8 = 21
1 (40) 2 1 (44) 2
+
12 − 1 12 12 − 1 12
597
Hypothesis Tests About 1 - 2: 1 and
2 Unknown
Example: Computer software package
598
Inferences About the Difference Between Two
Population Means: Matched Samples
599
Inferences About the Difference Between
Two Population Means: Matched Samples
Example: Comparison of production methods
Two production methods are tested under similar conditions. A
random sample of six workers is used.
Task Completion Times For a Matched Sample Design
Worker Completion Time for Completion Time for Difference in
Method 1 (Minutes) Method 2 (Minutes) Completion Times
(di )
1 6.0 5.4 .6
2 5.0 5.2 -.2
3 7.0 6.5 .5
4 6.2 5.9 .3
5 6.0 6.0 .0
6 6.4 5.8 .6
600
Inferences About the Difference
Between Two Population Means:
Matched Samples
Example: Comparison of production methods
601
Inferences About the Difference
Between Two Population Means:
Matched Samples
p –Value approach
1. Develop the hypotheses.
H0: d = 0
Ha: d
602
Inferences About the Difference
Between Two Population Means:
Matched Samples
p –Value approach
2. Specify the level of significance. = .05
603
Inferences About the Difference Between
Two Population Means: Matched Samples
p –Value Approach
4. Compute the p –value.
For t = 2.20 and df = 5, the p–value is between .10
and .05. (This is a two-tailed test, so we double the
upper-tail areas of .05 and .025.)
604
Excel’s “t-Test: Paired Two Sample
for Means” Tool
• Step 1 Click the Data tab on the Ribbon
• Step 2 In the Analysis group, click Data Analysis
• Step 3 Choose t-Test: Paired Two Sample for
Means
from the list of Analysis Tools
• Step 4 When the t-Test: Paired Two Sample for
Means
dialog box appears: (see details on next
slide)
605