0% found this document useful (0 votes)
241 views2 pages

The Hypergeometric Distribution: Math 394

1. The document discusses features of the hypergeometric distribution, including its moments. It shows that the expected value and variance of a hypergeometric random variable X approach those of a binomial random variable as the population N and number of successes m approach infinity while the success probability p remains fixed. 2. It further shows that in this limit, the probability mass function of the hypergeometric distribution approaches that of the binomial distribution. Specifically, the probability that X equals some value k approaches pk(1-p)n-k, which is the binomial probability mass function. 3. Therefore, as the document demonstrates, in the appropriate limit the binomial distribution can be viewed as a limiting case of the hypergeometric distribution, both

Uploaded by

Ramswaroop swami
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
241 views2 pages

The Hypergeometric Distribution: Math 394

1. The document discusses features of the hypergeometric distribution, including its moments. It shows that the expected value and variance of a hypergeometric random variable X approach those of a binomial random variable as the population N and number of successes m approach infinity while the success probability p remains fixed. 2. It further shows that in this limit, the probability mass function of the hypergeometric distribution approaches that of the binomial distribution. Specifically, the probability that X equals some value k approaches pk(1-p)n-k, which is the binomial probability mass function. 3. Therefore, as the document demonstrates, in the appropriate limit the binomial distribution can be viewed as a limiting case of the hypergeometric distribution, both

Uploaded by

Ramswaroop swami
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

The Hypergeometric Distribution

Math 394
We detail a few features of the Hypergeometric distribution that are discussed in the book
by Ross

1 Moments
  
mN −m
 
k
Let P [X = k] =  n− k (with the convention that
l
= 0 if j < 0, or j > l.
N j
n
We detail the recursive argument from Ross. Consider, for k = 1, 2, . . .
  
n
m N −m
X k n−k
E [X r ] = kr   (1)
N
k=0
n
The sum can also be extended from 1 to n, since the term with k = 0 does not contribute to
the result. Observe that
m! m · (m − 1)! (m − 1)!
   
m m−1
k =k =k =m =m
k k!(m − k)! k · (k − 1)!(m − k)! (k − 1)!(m − 1 − [k − 1])! k−1
and also, as a consequence,
1 1
     
N N N −1
= ·n· = N
n n n n n−1

Inserting both identities in (1), after writing k r = k r−1 · k,


  
n
m−1 N −m
nm X k−1 n−k
E [X r ] = k r−1  
N N −1
k=1
n−1
We can now change the summation index from k − 1, . . . , n to k − 1 = j = 0, . . . , n − 1, and
notice that n − k = n − (j + 1) = (n − 1) − j, while N − m = (N − 1) − (m − 1), so that
  
n−1
m−1 (N − 1) − (m − 1)
nm X j (n − 1) − j
E [X r ] = (j + 1)r−1  
N N −1
j=0
n−1
Now, the red part of the formula can be compared with (1) to show that calling Y a
hypergeometrically distributed random variable with parameters N − 1, m − 1, n − 1,
  nm  
E Xk = E (Y + 1)r−1
N

1
2 The Binomial Distribution as a Limit of Hypergeometric Distributions 2

This allows a simple recursion:

nm
• E [X] = N

(n−1)(m−1)
  nm nm

• E X2 = N
E [Y + 1] = N N−1
+1

Consequently,
2
nm (n − 1)(m − 1) nm
  
V ar [X] = E X 2 − (E [X])2 =
 
+1 − =
N N −1 N
nm (n − 1)(m − 1) nm
 
= +1−
N N −1 N
m
Notice that, if we set = p, and let N, m → ∞, with m
N N
fixed (that is, if we assume
n ≪ N, m) this expression tends to np (1 = p), the variance of a binomial (n, p). Incidentally,
even without taking the limit, the expected value of a hypergeometric random variable is also
np.

2 The Binomial Distribution as a Limit of Hypergeometric Distributions

The connection between hypergeometric and binomial distributions is to the level of the
distribution itself, not only their moments. Indeed, consider hypergeometric distributions
m
with parameters N, m, n, and N, m → ∞, N = p fixed. A random variable with such a
distribution is such that
  
m N −m
k n−k m! (N − m)! n!(N − n)!
P [X = k] =   = · · =
N (m − k)!k! (n − k)!(N − m − n + k)! N!
n
m! (N − m)! (N − n)!
 
n
= · · =
k (m − k)! (N − m − n + k)! N!
m(m − 1)(m − 2) · · · (m − k + 1) (N − m)(N − m − 1) · · · (N − m − (n − k) + 1)
 
n
·
k N (N − 1)(N − 2) · · · (N − k + 1) (N − k)(N − k − 1) · · · (N − n + 1)
The color coding should help in keeping track where the various terms are coming from. In
the last denominator, note also that N − n + 1 = N − k − (n − k) + 1. Let’s take the limit
now. We notice that:
m(m − 1)(m − 2) · · · (m − k + 1) mk
≈ k = pk
N (N − 1)(N − 2) · · · (N − k + 1) N
since k is kept finite while m and N diverge. Also, by the same argument, that is n is kept
finite,
(N − m)(N − m − 1) · · · (N − m − (n − k) + 1) (N − m)n−k
 N − m n−k
≈ = = (1 − p)n−k
(N − k)(N − k − 1) · · · (N − n + 1) N n−k N
Adding it all up, we have found that
 
n
lim P [X = k] = pk (1 − p)n−k
N,m→∞ m =p k
N

You might also like