You are on page 1of 15

TOPIC 5

Random Variables
and Probability
Distributions

Random Variable
 A random variable, RV (pemboleh ubah rawak) is a
variable that can take on different values according to
the outcome of an experiment.

An upper-case letter will represent the name of the


random variable, usually X. Its lower-case counterpart
will represent the value of the random variable.

 A discrete random variable (pemboleh ubah rawak


diskret) is one that can only take on a finite number of
values. A continuous random variable (pemboleh
ubah rawak selanjar) is one whose values are
uncountable.

1
Discrete Probability Distribution
 A probability distribution (taburan kebarangkalian) is a
table, formula, or graph that describes the values (x) of a
RV and the probability, p(x) associated with these values.

 Requirements for a discrete probability distribution:

(a) 0  p(x)  1, for all x and (b) 𝑝 𝑥 =1

 Example of discrete prob. distribution: Binomial, Poisson

 The prob. distribution represents a population. We can


describe the prob. distribution by computing the mean
and variance.

Example 1:
A financial counselor conducts investment seminars with
each seminar limited to 6 attendees. Because of the small size
of the seminar group and the personal attention each person
receives, some of the attendees become clients following the
seminar.

For the past 20 seminars she has conducted, the number of


attendees who become clients, X has had the probability
distribution shown below.

x 0 1 2 3 4 5 6
p(x) 0.05 m 0.20 0.25 0.15 0.15 m

2
a) Find the value of m.

2m = 1 – 0.8 = 0.2

Therefore, m = 0.1

b) Find the probability that at least 4 attendees become


clients of the financial counselor.

P(X ≥ 4) = P(X = 4) + P(X = 5) + P(X = 6) = 0.40

c) Find the probability that less than 2 attendees become


clients of the financial counselor.

P(X < 2) = P(X = 0) + P(X = 1) = 0.15

Continuous Probability Distribution


 Since a continuous random variable can assume an
uncountable number of values, we cannot list the
possible values because there is an infinite number of
them. Therefore, the probability that any particular value
will be assumed as zero

P(X = c) = P(X = d) = 0

 Hence, it is only meaningful to find the probability that X


will assume a value within a particular interval.

 Such probability is found using the probability density


function (fungsi ketumpatan kebarangkalian), f (x)
associated with X.

3
 The probability that X will take a value in the interval
c  X  d is given by the area under the graph of f (x)
between the values c and d.

𝑃 𝑐≤𝑋≤𝑑 = 𝑓 𝑥 𝑑𝑥

 The following equality holds for any continuous random


variable X

P(c < X < d) = P(c ≤ X ≤ d) = P(c < X ≤ d) = P(c ≤ X < d)

 Requirements for a probability density function f (x)


whose range is a ≤ X ≤ b

a) f (x)  0 for all x between a and b

b) the total area under the curve between


a and b is 1.0

4
Normal Distribution
 The normal distribution (taburan normal) is the most
important of all probability distributions because of its crucial
role in statistical inference.

 Characteristics of a Normal Probability Distribution:

 normal curve is bell-shaped & has a single peak at the


exact center of the distribution

 arithmetic mean, median, & mode of the distribution are


equal & located at the peak

 normal probability distribution is symmetrical about its


mean

 half the area under the curve is above the mean and half
is below it

 normal curve extends to infinity and gets closer and


closer to the X-axis but never actually touches it

 there is a "family" of normal probability distribution with


different combination of mean,  and standard
deviation, 

5
Normal curve is symmetrical
two halves identical

Curve Curve
extends Area Area extends
to - = 0.5 = 0.5 to +

mean = median = mode

 the probability density function of a normal random


variable, X is

1 , - ≤ x ≤ 
𝑓(𝑥) = 𝑒
𝜎 2𝜋
where
 is the mean of the normal distribution
 is the standard deviation of the normal distribution

 normal distribution is described by 2 parameters,


the mean () and the standard deviation ()
 X ~ N( ,  2)

6
Standard Normal Distribution
 The standard normal distribution (taburan normal
piawai) or Z distribution (taburan Z) is a special normal
distribution with a mean of 0 and a standard deviation
of 1  Z ~ N(0, 1)

 Z-score (skor Z) is the distance between a selected


value, designated x , and the population mean ()
divided by the population standard deviation ()
𝑥−𝜇
𝑍=
𝜎

The Z-score is how far above or below the individual value is


compared to the population mean in units of standard deviation.

90 hours is located 1 180 hours is located


standard deviation 2 standard deviation
below the mean. above the mean.

7
By standardizing (mempiawaikan) the normal RV, the prob. that
a normal RV falls into any interval could be found using the
standardized normal, Z table (jadual normal piawai, Z).

Example 2:
Determine the following probability.

a) P(Z  -0.86) f) P(Z  0.7)


b) P(Z > -1.53) g) P(0 < Z  1.49)
c) P(-2.1  Z < 0) h) P(1.16 < Z < 2.15)
d) P(-1.88  Z  -0.95) i) P(-2.6 < Z < 0.34)
e) P(Z < 1.03) j) P(-1  Z  1)

8
a) P(Z  -0.86)
= 0.1949

-0.86 0 Z

b) P(Z > -1.53)


= 1 – 0.0630
= 0.937

-1.53 0 Z

c) P(-2.1  Z < 0)
= 0.5 – 0.0179
= 0.4821

-2.1 0 Z

d) P(-1.88  Z  -0.95)
= P(Z  -0.95) –
P(Z  -1.88)
= 0.1711 – 0.0301
= 0.141

-1.88 -0.95 0 Z

9
e) P(Z < 1.03)
= 0.8485

0 1.03 Z

f) P(Z  0.7)
= 1 – 0.7580
= 0.242

0 0.7 Z

g) P(0 < Z  1.49)


= 0.9319 – 0.5
= 0.4319

0 1.49 Z

h) P(1.16 < Z < 2.15)


= P(Z < 2.15) –
P(Z < 1.16)
= 0.9842 – 0.8770
= 0.1072

0 1.16 2.15 Z

10
i) P(-2.6 < Z < 0.34)
= P(Z < 0.34) – P(Z < -2.6)
= 0.6331 – 0.0047
= 0.6284

-2.6 0 0.34 Z

j) P(-1  Z  1)
= P(Z  1) – P(Z  -1)
= 0.8413 – 0.1587
= 0.6826 (Empirical Rule)

-1 0 1 Z

Example 3:
Following their production, industrial generator shafts
are tested for static and dynamic balance, and the
necessary weight is added to predrilled holes in order to
bring each shaft within balance specifications. From past
experience, the amount of weight added to a shaft has been
normally distributed, with an average of 35 grams and a
standard deviation of 9 grams. What is the probability that
a randomly selected shaft will require

11
a) more than 26 grams of weight for proper balance?

𝑃 𝑋 > 26 = 𝑃 𝑍 >

= 𝑃 𝑍 > −1 = 0.8413

b) between 41 and 49 grams of weight for proper balance?

41 − 35 49 − 35
𝑃 41 < 𝑋 < 49 = 𝑃 <𝑍<
9 9

= 𝑃 0.67 < 𝑍 < 1.56 = 0.192

Finding Values of Z
 Sometime, we are required to determine the value of Z
given a probability.

 For example, what is the value of a standard normal


random variable (Z ) such that

i) P(Z > Z ) =  or ii) P(Z < -Z ) = 

P(Z < -Z ) =  P(Z > Z ) = 

-Z 0 Z Z

12
Example 4:
a) Find the value of -Z0.004

P(Z < -2.64) = 0.004

-Z0.004 = -2.64

b) Find the value of Z0.035

P(Z > 1.81)  0.035

0.965

Z0.035  1.81

For selected right-tail or left-tail probability value, following


table can be used to find the respective Z-score.

The column  is the prob. and the u is the Z-score.

13
c) Find the value of Z0.015 d) Find the value of Z0.01

P(Z > 2.1701) = 0.015 P(Z > 2.3263) = 0.01

Z0.015 = 2.1701 Z0.01 = 2.3263

e) Find the value of -Z0.01

Since normal prob. distribution is symmetrical about its mean,

P(Z > 2.3263) = P(Z < -2.3263) = 0.01

-Z0.01 = -2.3263

Example 5:

During 2014, the average number of flying hours for aircraft


operated by a regional airline was 2389.

Assume a normal distribution and a standard deviation of 300


hours, what number of annual flying hours was exceeded by
75% of the aircraft?

Solution:

Given that  = 2389 and  = 300


Assumed that the required number of annual flying hours is
denoted by Xm where P(X > Xm) = 0.75  Refer Figure (a)

14
From Figure (b),
Figure (a)
P(Z < -Z0.25) = 0.25

Refer to the statistical table,

Xm 2389 X -Z0.25 = -0.6745

Using Z-score formula,

Figure (b) 𝑋 −𝜇
= −0.6745
𝜎
0.25 Xm = 2389 – 0.6745(300)

Xm = 2186.65 hours
-Z0.25 0 Z

15

You might also like