You are on page 1of 12

Binomial Distribution

Posted by Ted Hessing

What is Binomial Distribution?


A Binomial Distribution describes the probability of an event that only has 2
possible outcomes. For Example, Heads or tails.

It can also be used to describe the probability of a series of independent


events that only have 2 possible outcomes occurring. For Example. Flipping a
coin 10 times and having it land with 5 on heads exactly 5 times. Or, flipping a
coin 10 times and having at least 5 land on heads.

Furthermore, the probability of an event with n trials and f failures follows a


binomial distribution.

Visualizing Binomial Distribution

The Binomial Distribution looks like so when graphed:


By Tayste – Own work, Public Domain, Link

When Would You Use Binomial Distribution?


Requirements and Conditions for a Binomial Distribution

 Must be a fixed number of trials.


 Continuous data are not binomial.
 Probability of success should be the same on every trial, and also
probability of success is constant.
 Two state. Two possible outcomes.
o (true or false, hot or cold, success or failure, defective or not
defective.)
 Independent trials – trials are statistically independent.
o (For example: The flip of one coin means nothing to the results of
the next coin flip.)
 Use Binomial Distribution when you are sampling with replacement.

You Cannot Use Binomial Distribution On:

 When the probability of success is not constant for an event.


oEx. The probability of it snowing or not snowing in NYC would not
fit the criteria for a Binomial Distribution because the probability
of success is not constant. The chance of snow on winter days is
higher than summer days.
 When you sample WITHOUT replacement.
o In that case, use Hypergeometric.

Cumulative vs Non-Cumulative

There are (2) ways I’ve seen Binomial Distribution Problems be represented in
Six Sigma Exams:

 Non-cumulative questions
 Cumulative questions (with or without a chart)

The questions can either be about the actual equations and translating a word
problem into an actual solution. Or they can be about the necessary scenarios
when you would apply binomial probability – in other words, what MUST be
true to use it, or if an event is binomial, what can we infer about it’s properties.

Non Cumulative Probability

This is just a fancy statistics term of single events taken on their own.

Example: a single flip of a coin

Cumulative Probability

Another fancy statistics term for multiple events. I like to remember that
Cumulative events literally accumulate multiple probabilities.

One thing to be aware of is that you can be asked to either manually calculate
a binomial cumulative probability question or you could be used to leverage a
chart.

Notes About Sampling With Replacement

Binomial is for N trials and F failures and describes the probability


of k successes in n draws with replacement from a finite population of
size N containing exactly K successes.
Remember, binomial is for independent events where the likelihood of each
event occurring is the same as the others. If you sample without replacing,
you’re affecting the likelihood of independence.

This is in contrast to the hypergeometric distribution, which describes the


probability of k successes in n draws when sampling without replacement.

Examples of Binomial Distribution Problems


Non Cumulative Binomial Probability

Non-Cumulative Calculation Example 1: “Contains Exactly”


If a coin is tossed 10 times, what is the probability of obtaining exactly four
heads? (If an honest coin is tossed 10 times, what is the probability of 4
heads?)

With a coin toss there are only 2 possible outcomes; heads or tails. So you
should be on alert for using binomial.

binomial successes
Non-Cumulative Calculation Example 2: “Contains Exactly”
A manufacturing process creates 3.4% defective parts. A sample of 10 parts
from the production process is selected. What is the probability that the
sample contains exactly 3 defective parts?

As soon as you see the word defective, you should be alert to using the
Binomial equation. Since defect in this sense means that a part is in a binary
state, either functioning or defective, it meets our criteria.

binomial successes 2

Non-Cumulative Calculation Example 3: “Contains Exactly”


Forty-five percent of all registered voters in a national election are female. A
random sample of 8 voters is selected. The probability that the sample
contains 2 males is:
Since in this question the assumption is that there are (2) biological genders
measured, male or female, you should be on alert to use the binomial
distribution.

binomial successes 3

Non-Cumulative Chart Example: 


You can see that the same percentage was found on the chart. (In this case, x
is the probability of the event happening instead of p.)

Examples of Cumulative Binomial Probability Questions

Cumulative Calculation Example: “At least” or “Fewer Than” or “At Most”


79% percent of the students of a large class passed the final exam. A random
sample of 4 students are selected to be analyzed by the school. What is the
probability that the sample contains fewer than 2 students that passed the
exam?
Since we are examining data that only has a binary state, pass or fail, you
should be on alert to using the binomial equation.

Also, since you’re asked for ‘x or fewer’, you have to calculate the probability
of ALL POSSIBLE events.

In this example we must calculate the odds of EXACTLY one pass PLUS the
odds of EXACTLY no pass in the sample.

binomial successes 4
Cumulative Calculation Example: “More than”

An example of this kind of question is “Test the probability of the number of


dropped calls will exceed a certain number.”

Here you’d do the opposite of the “Fewer” example above and count the
probabilities above.

So, if we re-wrote the question above to be ‘What’s the probability of sampling


10 students and 8 or more passed’ it would be the probability of EXACTLY 8
passing, PLUS EXACTLY 9 passing, PLUS EXACTLY 10 passing.

Binomial Probability Questions Short Cuts


This can be a LOT of calculations. Fortunately, there are some short cuts:

Factorial Math short cuts:

I find that a lot of these binomial questions can be made easier by taking
advantage of reducing the factorials by cancelling out terms before you plug
them into the calculator.

Just for sake of simplicity let’s say that you have 5! / 3! If you write that out by
hand it’s essentially (5*4*3*2*1 / 3*2*1).. Also, you can eliminate a lot of the
terms by recognizing this and rewriting 5! / 3! as 5*4.

Distribution table short cuts:

Binomial Probability Charts

Here are some good references:

 https://www.statisticshowto.datasciencecentral.com/tables/binomial-
distribution-table/
 http://www.math.hawaii.edu/~xander/fa12_471/
Binomial_Probabilities.pdf

Binomial Chart Question Example: (Without Calculations)

 If I were using a Binomial Distribution Table, which values of X would I


add together to get the probability of at most N defective?
o You would add the probabilities of N, n-1, n-2… all the way to
zero.
o For Example, “which values of X would I add together to get the
probability of at most 4 defective?”
o Add the probabilities of 4 + 3 + 2 +1 + 0.

Binomial Chart Question Example: (“More Than”)

Let’s assume that 20% of drivers on the road have NO car insurance. What is
the probability that more than 7 out of a sample of 15 people have no car
insurance?

1. Find n = 15 in the first column on the left.


2. And then find the column containing p = 0.20.
3. Find the 7 in the second column on the left, since we want to find F(7)
= P(X ≤ 7).

Now, all we need to do is read the probability value where the p = 0.20 column
and also the (n = 15, x = 7) row intersect. What do you get?
So, the cumulative binomial probability table tells us that P(X ≤ 7) = 0.9958.
Therefore: P(X > 7) = 1 − 0.9958 = 0.0042

That is, the probability that more than 7 in a random sample of 15 would have
no car insurance is 0.0042

Example of Using Binomial Probability in a Six Sigma


Project
Typically in a Six Sigma project you are using DMAIC methodology. Let’s say
during the Define Phase you chartered a project to see if you could improve a
process that had a binomial outcome. You even might use the Binomial
Distribution to articulate the business case for why you should do the project
in the first place – for instance, event A should be binomial, but it’s clearly not.
You might use the Binomial distribution to see if samples of the process
outcome were in fact following a binomial distribution during the Analysis
phase.

If you are running a pilot implementation during the Improve Phase, you might


make the assumption that some events that are inputs to your process in fact
will follow a binomial distribution.

Also, In the Control Phase you might monitor that a specific process is


generating Binomially distributed results if it should.

What You Need to Know for Certification


Six Sigma Green Belts

The ASQ Six Sigma Green Belt BOK requires:

Probability distributions
Describe and interpret normal, binomial, and Poisson, chi square, Student’s t,
and also F distributions. (Apply)

Six Sigma Black Belts

The ASQ Six Sigma Black Belt BOK Requires:

Commonly used distributions

Describe, apply and interpret the following


distributions: normal, Poisson, binomial, chi square, Student’s t and F
distributions. (Evaluate)

The Villanova Six Sigma Black Belt BOK requires the following

Advanced statistical process control

Distributions commonly used by black belts. Describe and


apply binomial, Poisson, normal, chi-square, Student’s t, and F distributions.

Videos on Binomial Distributions


Additional Helpful Resources
Hypothesis Testing with Binomial Distribution

 https://internal.ncl.ac.uk/ask/numeracy-maths-statistics/statistics/
hypothesis-testing/hypothesis-testing-with-the-binomial-
distribution.html
 https://www.real-statistics.com/binomial-and-related-distributions/
hypothesis-testing-binomial-distribution/

You might also like