Professional Documents
Culture Documents
YOUR NOTES
AS Maths CIE
3. Statistical Distributions
CONTENTS
3.1 Probability Distributions
3.1.1 Discrete Probability Distributions
3.1.2 E(X) & Var(X) (Discrete)
3.2 Binomial & Geometric Distribution
3.2.1 The Binomial Distribution
3.2.2 Calculating Binomial Probabilities
3.2.3 The Geometric Distribution
3.3 Normal Distribution
3.3.1 The Normal Distribution
3.3.2 Standard Normal Distribution
3.3.3 Normal Distribution - Calculations
3.3.4 Finding Sigma and Mu
3.4 Working with Distributions
3.4.1 Modelling with Distributions
3.4.2 Normal Approximation of Binomial
Page 1 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Page 2 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Page 3 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Page 4 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
1
(a) Show that k = .
30
Page 5 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
YOUR NOTES
ExamTryTip
to draw a table if there are a finite number of values that the
discrete random variable can take
When finding a probability, it will sometimes be quicker to subtract the
probabilities of the unwanted values from 1 rather than adding together
the probabilities of the wanted values
Always make sure that the probabilities are between 0 and 1, and that
they add up to 1!
Page 6 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Σ x P(X = x )
Look out for symmetrical distributions (where the values of X are symmetrical and
their probabilities are symmetrical) as the mean of these is the same as the
median
For example if X can take the values 1, 5, 9 with probabilities 0.3, 0.4, 0.3
respectively then by symmetry the mean would be 5
How do I calculate E(X²)?
E(X²) means the expected value or the mean of a random variable defined as X²
For a discrete random variable, it is calculated by:
Squaring each value of X to get the values of X2
Multiplying each value of X2 with its corresponding probability
Adding all these terms together
Σ x 2P (X = x )
In a similar way E(f(x)) can be calculated for a discrete random variable by:
Applying the function f to each value of to get the values of f(X)
Multiplying each value of f(X ) with its corresponding probability
Adding all these terms together
Σ f (x ) P (X = x )
Page 7 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
YOUR NOTES
This is the mean of the squares of X minus the square of the mean of X
Compare this to the definition of the variance of a set of data
Var(X) is always positive
The standard deviation of a random variable X is the square root of Var(X)
Page 8 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
(b)
Find the value of E (X 2) .
(c)
Find the value of Var (X ) .
(a)
Find the value of E(X ) .
(b)
Find the value of E (X 2) .
(c)
Find the value of Var (X ) .
ExamCheck
Tip
if your answer makes sense. The mean should fit within the range
of the values of X.
Page 9 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Page 10 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
YOUR NOTES
Page 11 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Page 12 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
The number of times that a person can swim a length of a swimming pool in YOUR NOTES
under a minute when swimming 50 lengths
The probability of swimming a lap in under a minute will decrease as the
person gets tired
Worked Example
It is known that 8% of a large population are immune to a particular virus.
Mark takes a sample of 50 people from this population. Mark uses a
binomial model for the number of people in his sample that are immune to
the virus
(a)
State the distribution that Mark uses.
(b)
State the two assumptions that Mark must make in order to use a binomial
model.
ExamIf you
Tip
are asked to criticise a binomial model always consider whether
the trials are independent, this is usually the one that stops a variable
from following a binomial distribution!
Page 13 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Page 14 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
P (X ≥ r ) is asking you to find the probabilities of all values greater than and YOUR NOTES
including r
This means all values that are at least r
It could also be written as P (X ≥ r ) = 1 − P (X < r )
P (X > r ) is asking you to find the probabilities of all values greater than but not
including r
This means all values that are more than r
Start at r + 1
It could also be written as P (X > r ) = 1 − P (X ≤ r )
If calculating P (a ≤ X ≤ b ) pay attention to whether the probability of a and b should
be included in the calculation or not
For example, P (4 < X ≤ 10) :
You want the integers 5 to 10
Worked Example
If X is the random variable X ∼ B(10,0 . 35) . Find:
(i)
P(X = 3)
(ii)
P(X ≤ 3)
(iii)
P(X > 3)
(iv)
P(3 < X < 6)
Page 15 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
YOUR NOTES
ExamLooking
Tip
carefully at the inequality within the probability is key here,
make sure you consider which integers should be counted within your
calculations.
Page 16 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
e.g. If the probability of success in a single trial is 0.3, then the probability it will
take 6 trials to obtain the first success is given by
P (X = 6) = (1 − 0 . 3) 6 − 1 (0 . 3) = (0 . 7) 5 (0 . 3) = 0 . 050421
Page 17 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Page 18 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Page 19 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
e.g. a skateboarder performing a trick - the probability of success should YOUR NOTES
increase after practising the trick
Page 20 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Worked Example
Given that X ∼ Geo(0 . 4) find
(i)
P(X = 5)
(ii)
P(X > 5)
(iii)
P(3 ≤ X < 8)
(iv)
P(X = 8 | X > 5)
(v)
The mode of X .
Page 21 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
YOUR NOTES
ExamTryTip
not to get bogged down with formulae for the geometric
distribution, most questions can be deduced using logic
If you are asked to criticise a geometric model always consider whether
trials are independent
especially if it involves “practising” or “performing” a skill
most people will improve after they’ve made several attempts at a
skill
so the probability of success should gradually increase over time
If finding the number of trials required (r ) then be careful counting
calculator presses; remember you are likely to be finding r − 1(the
number of failures before success) in the first instance
Page 22 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Page 23 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
YOUR NOTES
Page 24 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Worked Example
The random variable S represents the speeds (mph) of a certain species of
cheetahs when they run. The variable is modelled using N(40, 100).
(a)
Write down the mean and standard deviation of the running speeds of
cheetahs.
(b)
State the two assumptions that have been made in order to use this model.
(a)
Write down the mean and standard deviation of the running speeds of cheetahs.
(b)
State the two assumptions that have been made in order to use this model.
Page 25 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
YOUR NOTES
Exam Tip
Remember the second number in N(x, y) is the variance, if you want the
standard deviation then you need to square root it.
Page 26 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Page 27 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
How is the table of the normal distribution function used? YOUR NOTES
In your formula booklet you have the table of the normal distribution which
provides probabilities for the standard normal distribution
The probabilities are provided for Φ(z ) = P(Z ≤ z ) = P(Z < z )
To find other probabilities you should use the symmetry property of the
normal distribution curve
The table gives probabilities for values of z between 0 and 3
For negative values of z, the symmetry property of the normal distribution is
used
For values greater than z = 3 the probabilities are small enough to be
considered negligible
The tables give the probabilities to 4 decimal places
To read probabilities from the normal distribution table for a z value of up to 2
decimal places:
The very first column lists all z values to 1 decimal place from z = 0.0 to z =
2.9
The top row gives the second decimal place for each of these z values
So the value of Φ(1. 23) = P (Z ≤ 1. 23) would be found at the point where the ‘1.2’
row meets the ‘3’ column
P (Z ≤ 1 . 23) = 0 . 8907
To read probabilities from the normal distribution table for a z value of 3 decimal
places:
There is an extra section to the right of the tables that gives the amount to
add on to the probabilities for the third decimal place
The values given in the columns represent one ten-thousandth
If the value is 7 we add 0.0007 to the probability
If the value is 23 we add 0.0023 to the probability
To find the value of Φ(1. 234) we would need to find the amount to add on to
0.8907
Find the point where the 1.2 row meets the ADD 4 column, this gives us the
number 7
Add the value 0.0007 to the probability for Φ(1. 23)
P (Z ≤ 1 . 234) = 0 . 8914
How is the table used to find probabilities that are not listed?
The property that the area under the graph is 1 allows probabilities to be found for
P( Z > z)
Use the formula P (Z > z ) = 1 − Φ(z )
The symmetrical property of the normal distribution gives the following results:
P(Z ≤ z ) = P(Z ≥ − z )
P(Z ≥ z ) = P(Z ≤ − z )
This allows probabilities to be found for negative values of z or for P (Z > z )
Φ( − z) = P (Z ≤ − z ) = P (Z > z) =1 − Φ(z )
Therefore:
Φ( − z ) = 1 − Φ(z )
P (Z > z ) = 1 − Φ(z )
P (Z > − z ) = P (Z ≤ z ) = Φ(z )
Page 28 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
How are z values found from the table of the normal distribution
function?
To find the value of z for which P (Z ≤ z ) = p look for the value of p from within the
table and find the corresponding value of z
If the probability is given to 4 decimal places most of the time the value will
exist somewhere in the tables
Occasionally you may have to use the ADD columns to find the exact value
If the values in the ADD columns don’t exactly match up use the closest value
or find the midpoint of the z values that are either side of the probability
If your probability is 0.5 or greater look through the tables to find the
corresponding z value
For P (Z < z ) ≥ 0. 5 use the z value found in the table
For P (Z > z ) ≥ 0. 5 take the negative of the z value found in the table
If the probability is less than 0.5 you will need to subtract it from one before
using the tables to find the corresponding z value
For P (Z < z ) < 0. 5 take the negative of the z value found in the table
For P (Z > z ) < 0. 5 use the z value found in the table
Always draw a sketch so that you can see these clearly
The formula booklet also contains a table of the critical values of z
This gives z values to 3 decimal places for common probabilities
The probabilities in this table are 0.75, 0.9, 0.95, 0.975, 0.99, 0.995, 0.9975,
0.999 and 0.9995
Page 29 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
(i)
P(Z ≤ 0 . 957)
(ii)
P(Z > 0 . 957)
(iii)
P(Z ≤ − 0 . 957)
(iv)
P( − 0 . 957 < Z ≤ 0 . 957)
(b)
Find the value of z such that P ( Z < z ) = 0.3
(a)
By sketching a graph and using the table of the normal distribution, find the
following:
(i)
P(Z ≤ 0 . 957)
(ii)
P(Z > 0 . 957)
(iii)
P(Z ≤ − 0 . 957)
(iv)
P( − 0 . 957 < Z ≤ 0 . 957)
Page 30 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
YOUR NOTES
(b)
Find the value of z such that P (Z < z) = 0 . 3
Page 31 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
YOUR NOTES
ExamA sketch
Tip
will always help you to visualise the required probability and
can be used to check your answer. Check whether the area shaded is
more or less than 50% and compare this with your answer.
Page 32 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Page 33 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Page 34 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
(a)
P (X ≤ 22) ,
(b)
P(18 ≤ X < 27)
(a)
P (X ≤ 22) ,
(b)
P(18 ≤ X < 27)
Page 35 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
If your probability is less than 0.5 you will need to subtract it from one to find the
corresponding z value
Remember that the position of the z-value will not change, only the direction
of the inequality
Once you have the correct z value substitute it into the formula z = a −σ μ and
solve to find the value of a
Page 36 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Always check that your answer makes sense by considering where a is in relation YOUR NOTES
to the mean
Given the value of P(µ- a < X < µ + a) I find the value of a ?
A sketch making use of the symmetry of the graph is essential
⎛ 100 + α ⎞
If you are given P (μ − a < X < μ + a ) = α % then P (X < μ + a ) will be ⎜⎜ 2 ⎟⎟ %
⎝ ⎠
This is easier to see from a sketch than to remember
You can then look through the tables for the corresponding z-value and
(μ + a ) − μ a
substitute into the formula z = σ
=
σ
Worked Example
The random variable W ∼ N(50, 36)
Find the value of w such that P (W > w ) = 0 . 7676
Page 37 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
YOUR NOTES
ExamTheTip
most common mistake students make when finding values from
given probabilities is forgetting to check whether the z-value should be
negative or not. Avoid this by checking early on using a sketch whether
z is positive or negative and writing a note to yourself before starting
the other calculations.
Page 38 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
How do I find the mean (μ) and the standard deviation (σ) if both of them
are unknown?
If both of them are unknown then you will be given two probabilities for two
specific values of x
The process is the same as above
You will now be able to calculate two z-values
You can form two equations (rearranging to the form x = μ + σz is helpful)
You now have to solve the two equations simultaneously (you can use your
calculator to do this)
Be careful not to mix up which z-value goes with which value of x
Page 39 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
ExamThese
Tip
questions are normally given in context so make sure you identify
the key words in the question. Check whether your z-values are positive
or negative and be careful with signs when rearranging.
Page 40 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Page 41 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
YOUR NOTES
Can the binomial distribution and the normal distribution be used in the
same question?
Some questions might require you to first use the normal distribution to find the
probability of success and then use the binomial distribution
Remember a discrete distribution is either a binomial or geometric distribution
The key is to make sure you are very clear about what each parameter/variable
represents
Worked Example
In a population of cows, the masses of the cows can be modelled using a
normal distribution with mean 550 kg and standard deviation 80 kg. A
farmer classifies cows as beefy if they weigh more than 700 kg. The farmer
takes a random sample of 10 cows and weighs them.
Find the probability that at most one cow is beefy.
Page 42 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
YOUR NOTES
ExamAlways
Tip
state what your variables and parameters represent. Make sure
you know the conditions for when each distribution is (or is not) a
suitable model.
Page 43 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Page 44 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
In your exam you must use the formula and not a calculator to find binomial YOUR NOTES
probabilities so you are limited to small values of n
What are continuity corrections?
The binomial distribution is discrete and the normal distribution is continuous
A continuity correction takes this into account when using a normal
approximation
The probability being found will need to be changed from a discrete variable, X,
to a continuous variable, XN
For example, X = 4 for binomial can be thought of as 3. 5 ≤ X N < 4. 5 for normal
as every number within this interval rounds to 4
Remember that for a normal distribution the probability of a single value is
zero so P(3. 5 ≤ X N < 4. 5) = P(3. 5 < X N < 4. 5)
How do I apply continuity corrections?
Think about what is largest/smallest integer that can be included in the inequality
for the discrete distribution and then find its upper/lower bound
P (X = k ) ≈ P (k − 0 . 5 < X N < k + 0 . 5)
P (X ≤ k ) ≈ P (X < k + 0 . 5)
N
You add 0.5 as you want to include k in the inequality
P (X < k ) ≈ P (X < k − 0 . 5)
N
You subtract 0.5 as you don't want to include k in the inequality
P (X ≥ k ) ≈ P (X > k − 0 . 5)
N
You subtract 0.5 as you want to include k in the inequality
P (X > k ) ≈ P (X > k + 0 . 5)
N
You add 0.5 as you don't want to include k in the inequality
For a closed inequality such as P (a < X ≤ b )
Think about each inequality separately and use above
P (X > a ) ≈ P (X N > a + 0 . 5)
P (X ≤ b ) ≈ P (X < b + 0. 5)
N
Combine to give
P (a + 0 . 5 < X N < b + 0 . 5)
Page 45 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Worked Example
The random variable X ∼ B (1250, 0 . 4) .
Use a suitable approximating distribution to approximate P(485 ≤ X ≤ 530)
.
ExamIn the
Tip
exam, the question will often tell you to use a normal
approximation but sometimes you will have to recognise that you
should do so for yourself. Look for the conditions mentioned in this
revision note, n is large, p is close to 0.5, np > 5 and nq > 5.
Page 46 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Page 47 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Page 48 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers
Head to savemyexams.co.uk for more awesome resources
Page 49 of 49
© 2015-2021 Save My Exams, Ltd. · Revision Notes, Topic Questions, Past Papers