Professional Documents
Culture Documents
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
Hypothesis testing
• Want to know something about a
population
• Take a sample from that population
• Measure the sample
• What would you expect the sample to
look like under the null hypothesis?
• Compare the actual sample to this
expectation
population
sample
Y = 2675.4
Hypothesis testing
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
The experiment and the
results
• Sheep were trained to get a reward
near a certain other sheep’s picture
• Then placed in a Y-shaped maze
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
3 0.0011
4 0.0046
5 0.015
6 0.037
7 0.074
8 0.12
9 0.16
10 0.18
11 0.16
12 0.12
13 0.074
14 0.037
15 0.015
16 0.0046
17 0.0011
18 0.00018
19 0.00002
20 0.000001
• Test statistic = a quantity calculated
from the data that is used to evaluate
how compatable the data are with the
expectation under the null hypothesis
The null distribution of p
0.2
0.18
0.16
0.14
Test statistic = 16
0.12
0.1
Frequency
0.08
0.06
0.04
0.02
0
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Number of correct choices
The null distribution of p
0.2
0.18
0.16
Values at least
0.14
as extreme as
0.12 the test statistic
0.1
Frequency
0.08
0.06
0.04
0.02
0
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Number of correct choices
• P-value - the probability of obtaining the
data* if the null hypothesis were true
0.18
0.16
P=
0.012
0.14
0.12
0.1
Frequency
0.08
0.06
0.04
0.02
0
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Number of correct choices
P-value calculation
P
=2*(Pr[16]+Pr[17]+Pr[18]+Pr[19]+Pr[20])
=2*(0.005+0.001+0.0002+0.00002+0.000001)
= 0.012
How to find P-values
• Get test statistic
• Compare with null distribution from:
– Simulation
– Parametric tests
– Non-parametric tests
– Re-sampling
Statistical significance
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
21
pˆ = = 0.778
27
€
Sampling distribution of null
hypothesis
Number of right-handed flowers in a random sample
Probability
of 27
0-8 0
9 0.000005
10 0.000031
11 0.000124
12 0.000492
13 0.001752
14 0.005301
15 0.013830
16 0.031094
17 0.060673
18 0.100891
19 0.143051
20 0.172339
21 0.171782
22 0.141034
23 0.091477
24 0.045499
25 0.016409
26 0.003759
27 0.000457
The P-value:
P =
0.83.
Rock-paper-scissors battle
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
QuickTime™ and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
Jargon
Significance level
• A probability used as a criterion for
rejecting the null hypothesis
• Called
• If p < , reject the null hypothesis
• For most purposes, = 0.05 is
acceptable
Type I error
• Rejecting a true null hypothesis
• Power = 1-
Reality
Ho true Ho false
Result
Reject Ho Type I error correct
Test statistic
One-sided tests
• Also called one-tailed tests
• Only used when one side of the null
distribution is nonsensical