Professional Documents
Culture Documents
Statistics 132
BINOMIAL TEST
Assumptions
1. The data consist of a sample of the outcomes of n repetitions of some process.
2. The outcome of each trial can be classified as a “success” or a “failure”. These
designations are purely arbitrary. The number of successes B is the number of
outcomes having a given characteristic.
3. The n trials are independent.
4. The probability of a success, denoted by p, remains constant from trial to trial. We
use p to designate the proportion of the population having the characteristic of
interest.
Remarks
1. We see that when 𝐻𝑜 is true, i.e., 𝑝 = 𝑝0 , the statistic B has a binomial distribution
with parameters n and 𝑝0 . Thus, the critical constant 𝑏(𝛼, 𝑛, 𝑝0 ) is chosen so that
the probability of rejecting 𝐻𝑜 when it is true is 𝛼. Also, without further
assumptions on the underlying distribution from which the dichotomous data
emanate, the distribution of B is binomial. Thus, the test based on B is a
distribution-free test of 𝐻𝑜 .
2. Because B has a discrete distribution, we can find the critical constants
𝑏(𝛼, 𝑛, 𝑝0 ) only for certain values of 𝛼, the natural levels.
when 𝐻𝑜 is TRUE, the statistic B* has an asymptotic (n → ) N(0,1) distribution, that is,
B* N(0,1) as n →
2.2 Examples
2. It is estimated that at least half of the men who currently undergo an operation to
remove prostate cancer suffer from a particular undesirable side effect. In an effort to
reduce the likelihood of this side effect, FDA studied a new method of performing the
operation. Out of the 19 operations, only 3 men suffered the unpleasant side effect. Is
it safe to conclude the new method of operating is effective in reducing side effect?
Use 𝛼 = 0.05.
The estimator of the probability of success, 𝑝, associated with the test statistic B is 𝑝̂ =
𝐵⁄ . The statistic 𝑝̂ is simply the observed relative frequency of success in n Bernoulli trials
𝑛
satisfying the assumptions of the test. Thus, 𝑝̂ qualifies as a natural estimator of 𝑝, the
unknown probability of success in a single Bernoulli trial.
1⁄
The quantity [𝑝̂ (1 − 𝑝̂ )/𝑛] 2 is a consistent estimator for the standard deviation of the
point estimator 𝑝̂ .
A two-sided (1- )100% confidence interval (CI) for p is given by (pL(𝛼), pU(𝛼) ) since we
have P { pL(𝛼) < p < pU(𝛼) } 1- .
𝑝̂(1−𝑝̂) 1⁄2
For large n, the (1- )100% CI for p is (𝑝̂ ∓ 𝑧𝛼⁄2 [ ] ).
𝑛
Note that 𝑝̂ is the midpoint of the approximate confidence intervals for 𝑝̂ . However, this
is not true in general for the confidence intervals for p given by Table A.3.
1. A supermarket is presently closed all day Sunday. They have made an extensive study of
the local situation and determined that it will be worthwhile to start opening on Sunday
only if more than 25% of their present potential customers state that they would do their
regular shopping there on Sunday if the market were open. A random sample of 10
households in the immediate vicinity are interviewed with regard to their shopping habits
and preferences. Each of the 10 is asked about the proposed availability of Sunday
shopping and 4 respond favorably. Should the market open on Sundays? Use 𝛼 = 0.05.
2. Stanton investigated the problem of paroling criminal offenders. He studied the behavior
of all male criminals paroled from New York’s correctional institutions to original parole
supervision. The parolees were observed for three years following their releases or until
they exhibited some delinquent parole behavior. In a study involving a very large number
of subjects, Stanton considered criminals convicted of crimes other than first- or second-
degree murder. He found that approximately 60% of these parolees did not have any
delinquent behavior during the 3 years following their releases. After 5 years he sampled
another group of paroled murderers and found out that 56 of the 65 murderers had no
delinquent parole behavior. Is there reason to believe that there is a change in the
proportion of murderers who do not exhibit delinquent parole behaviors? Use 𝛼 = 0.05.