Professional Documents
Culture Documents
9-1
Statistics
For Business & Economics
II
Prof.dr.ir. Wouter VERBEKE
Drs. Jeroen BERREVOETS
Statistics II
Introductory example
• Classroom experiment
• Media: bullying
• Notation:
• Population proportion: 𝑝 Parameter
• Sample proportion: 𝑝̂ Estimate
• In a simulation,
• We have only two possible outcomes for an event, which
are labeled “success” and “failure”,
• We set the true proportion of successes to a known value,
• Draw random samples,
• And then record the sample proportion of successes.
Observations:
• Not every sample has a sample proportion equal to
0.2.
• Sample proportions bigger than 0.24 and smaller than
0.16 are rare.
• Most sample proportions are between 0.18 and 0.22.
Copyright © 2015 Pearson Education. All rights reserved.
12 9-12
9.1 The Distribution of Sample Proportions
& is Normal
The sampling distribution of 𝒑
Copyright © 2015 Pearson Education. All rights reserved.
14 9-14
9.1 The Distribution of Sample Proportions
*,
• And has a standard deviation equal to )
• 𝑋 has standard deviation 𝑛𝑝𝑞
( )*, )*, *,
• Then ) has standard deviation )
= )-
= )
æ pq ö
• The particular Normal model,N çç p, ÷÷ ,
è n ø
• is a sampling distribution model for the sample proportion.
Simulation:
• SD of sample proportions = 0.0126
*, (6.8)(6.:)
• Theoretical SD = = =
) ;666
0.0126
Copyright © 2015 Pearson Education. All rights reserved.
18 9-18
9.1 The Distribution of Sample Proportions
• A model?
I. Independence Assumption:
The sampled values must be independent of each other.
A. Randomization Condition:
• If your data come from an experiment, subjects should have
been randomly assigned to treatments.
B. 10% Condition:
• If sampling has not been made with replacement,
• then the sample size, n, must be no larger than 10% of the
population.
C. Success/Failure Condition:
• The sample size must be big enough
• so that both the number of “successes,” np, and the number
of “failures,” nq, are expected to be at least 10.
æ (0.92)(0.08) ö
N çç 0.92, ÷÷ = N ( 0.92, 0.021)
è 160 ø
pˆ - p 0.95 - 0.92
z= = = 1.429
SD( pˆ ) 0.021
P ( z ³ 1.429 ) = 0.0765
• Plan:
• Assumptions & conditions
• Sampling distribution model and parameters
• Report conclusion
https://www.gallup.com/home.aspx
pq
SD ( pˆ ) = , where q = 1 - p
n
• We also know from the Central Limit Theorem that the shape
of the sampling distribution is approximately Normal, when
the sample is large enough
*,
• We do not know )
ˆˆ
pq (0.42)(1 - 0.42)
SE ( pˆ ) = = = 0.008
n 3559
Copyright © 2015 Pearson Education. All rights reserved.
31 9-31
9.2 A Confidence Interval for a Proportion
• Now, we use these facts to draw our best guess of the
sampling distribution for the true proportion who think
the economy is getting better:
• So, if you reach out 2SE’s on both sides, you are 95%
sure that p is in your grasp
Copyright © 2015 Pearson Education. All rights reserved.
33 9-33
9.2 A Confidence Interval for a Proportion
• The 95% confidence interval for the true proportion of all US
adults who think the economy is improving is given by:
42.0% ± 2 ´ 0.8%
42.0% ± 1.6%
40.4% to 43.6%
Copyright © 2015 Pearson Education. All rights reserved.
34 9-34
9.2 A Confidence Interval for a Proportion
What Can We Say about a Proportion?
Here’s what we would like to be able to say, however, we can’t say
most of these things…
1) “42.0% of all U.S. adults thought the economy was
improving.”
“We are 95% confident that between 40.4% and 43.6% of U.S.
adults thought the economy was improving.”
“We are 95% confident that between 40.4% and 43.6% of all
U.S. adults thought the economy was improving.”
pˆ ± 2 SE ( pˆ )
• The extent of that interval on either side of 𝑝̂ is called the
margin of error (ME).
• The general confidence interval can now be expressed in
terms of the ME.
estimate ± ME
1.96 Instead of 2
Note:
• the size of the population does not directly affect the finding,
• only the size of the sample does
Plan:
• Statistical research question:
• What can we conclude about the proportion of all
French adults who sympathize with the practice of
bossnapping?
• Setup:
• One-proportion z-interval allows to calculate a
confidence interval for the true proportion
• We choose confidence level of 95%
• Check:
• Assumptions & conditions!
Copyright © 2015 Pearson Education. All rights reserved.
51 9-51
Guided Example (continued):
Independence assumption:
• Randomization Condition:
The sample was selected randomly.
• 10% Condition:
The sample is certainly less than 10% of the population.
(0.5)(0.5)
0.03 = 1.96 Þ n = 1067.1
n
• To be safe, always round up!
• Do take responsibility.
Copyright © 2015 Pearson Education. All rights reserved.
61 9-61
Violations of Assumptions
• You can claim to have the specified level of confidence that the
interval you have computed actually covers the true value.