Professional Documents
Culture Documents
Untitled
Untitled
eight patients, two of whom had a recurrence. Suppose we take any group of eight
people, tag two of them, and divide the eight at random into two groups of four.
The probability that both of the tagged people will fall into one of the groups is
around .30. If there were only four women in each group, the fact that all the
recurrences fell in the placebo group is not significant. If the study included 500
women in each group, it would be highly unlikely that all 250 with recurrences
were on the placebo, unless the drug was working. The probability that all 250
would fall in one group if the drug was no better than the placebo is the p-value,
which calculates to be less than .0001.
The p-value is a probability, and this is how it is computed. Since it is used
to show that the hypothesis under which it is calculated is false, what does it really
mean? It is a theoretical probability associated with the observations under
conditions that are most likely false. It has nothing to do with reality. It is an
indirect measurement of plausibility. It is not the probability that we would be
wrong to say the drug works. It is not the probability of any kind of error. It is not
the probability that a patient will do as well on the placebo as on the drug. But, to
determine which tests are better than others, Neyman had to find a way to put
hypothesis testing within a framework wherein probabilities associated with the
decisions made from the test could be calculated. He needed to connect the p-
values of the hypothesis test to real life.