Discrete and Continuous Random Variables

Discrete and Continuous
Random Variables
Summer 2003
Random Variables
A random variable is a rule that assigns a numerical
value to each possible outcome of a probabilistic
experiment.
We denote a random variable by a capital letter (such as
“X”)
Examples of random variables:

r.v. X: the age of a randomly
selected student here today.
r.v. Y: the number of planes
completed in the past week.
15.063 Summer 2003 2

Discrete or Continuous
A discrete r.v. can take only distinct, separate
values
– Examples?
A continuous r.v. can take any value in some

interval (low,high)
– Examples?
15.063 Summer 2003 3

Discrete Random Variables
A probability distribution for a discrete r.v. X
consists of:
– Possible values x1, x2, . . . , xn
– Corresponding probabilities p1, p2, . . . , pn
with the interpretation that
p(X = x1) = p1, p(X = x2) = p2, . . . , p(X = xn) = pn
Note the following:
– Variable names are capital letters (e.g., X)
– Values of variables are lower case letters (e.g., x1)
– Each pi ≥ 0 and p1 + p2 + . . . + pn = 1.0
15.063 Summer 2003 4

Probability tree and probability distribution for
r.v. X (total # Heads in experiment 1)
Outcome X (#Hs) Probab
H3
H1H2H3 3 0.125
H2 0.5
0.5
T3
H1H2T3 2 0.125
H1 0.5
0.5
H3
H1T2H3 2 0.125
T2 0.5
P(X = 0) = 1/8
0.5
T3
P(X = 1) = 3/8 0.5
H1T2T3 1 0.125
P(X = 2) = 3/8 H3
T1H2H3 2 0.125
P(X = 3) = 1/8 H2 0.5
0.5
T3
T1H2T3 1 0.125
T1 0.5
0.5
H3
T1T2H3 1 0.125
T2 0.5
0.5
T3
T1T2T3 0 0.125
0.5 1.000
15.063 Summer 2003 5

A histogram is a display of probabilities as a bar chart
r.v. X = number of heads when tossing 3 unbiased coins
5/8
4/8
3/8
2/8
1/8
0.00
0 1 2 3
Now let’s consider experiment 2: “number of heads when tossing

3 biased coins (p(H) = 0.30)”. Again r.v. X: total number of heads
obtained when performing experiment 2…..
15.063 Summer 2003 6
Probability tree and probability distribution for
r.v. X (total # Heads in experiment 2)
Outcome X (#Hs) Probabil
H3
H1H2H3 3 0.027
H2 0.3
0.3
T3
H1H2T3 2 0.063
H1 0.7
0.3
H3
H1T2H3 2 0.063
T2 0.3
X p(X) 0.7
0 0.343 T3
1 0.441 H1T2T3 1 0.147
2 0.189 0.7
3 0.027
1.000
H3
T1H2H3 2 0.063
H2 0.3
0.3
T3
T1H2T3 1 0.147
T1 0.7
0.7
H3
T1T2H3 1 0.147
T2 0.3
0.7
T3
T1T2T3 0 0.343
0.7 1.000
15.063 Summer 2003 7

Histogram for experiment 2
r.v. X = total number of heads when tossing 3 biased
coins with p(H) = 0.30.
5/8 p(X)
4/8 0.500
0.441
0.450
0.400
3/8 0.343
0.350
0.300
2/8 0.250 p(X)
0.189
0.200
0.150
1/8
0.100
0.050 0.027
0/8 0.000 0 1 2 3
0 1
x
2 3 x
15.063 Summer 2003 8

Example 2: Let X be the random variable that denotes the
number of orders for aircraft for next year. Suppose that the
number of orders for aircraft for next year is estimated to obey the
following distribution:
Orders for aircraft next year Probability

xi pi
42 0.05
43 0.10
44 0.15
45 0.20
46 0.25
47 0.15
48 0.10
15.063 Summer 2003 9

AA histogram
histogram isis aa display
display of
of probabilities
probabilities as
as aa bar
bar chart
chart
Probability Distribution of the

Number of Orders of Aircraft Next Year
0.35
0.30
0.25
Probability
0.20
0.15
0.10
0.05
0.00
43 44 45 46 47 48
Number of Orders
15.063 Summer 2003 10

Probability Distribution Function of Eastern
Division Sales
Eastern Division 0.40

0.35
Sales 0.30
Probability
($ million) Probability 0.25
3.0 0.05 0.20

0.15
4.0 0.20
0.10
5.0 0.35 0.05
6.0 0.30 0.00
3.0 4.0 5.0 6.0 7.0 8.0
7.0 0.10
Sales ($ million)
8.0 0.00
Probability Distribution Function of Western
Western Division Division Sales
Sales
0.40
($ million) Probability
0.35
3.0 0.15 0.30
Probability
4.0 0.20 0.25
0.20
5.0 0.25
0.15
6.0 0.15 0.10
7.0 0.15 0.05
0.00
8.0 0.10
3.0 4.0 5.0 6.0 7.0 8.0
Sales ($ million)
X and Y denote the sales next year in the eastern division and the western division
of a company, respectively. X and Y obey the following distributions:
15.063 Summer 2003 11

Consider two random variables X, Y, with the following
histograms:
(A) (B)
0.50 0.50
0.40 0.40
0.30 0.30
P(Y = y)
P(X =x) 0.20 0.20
0.10 0.10
0.00 0.00
0 1 2 3 4 5 6 0 1 2 3 4 5 6
x y
Random variable X Random variable Y
How do we describe and compare X and Y?
15.063 Summer 2003 12

Summary Statistics for a r.v.: Three important measures
- Mean or Expected Value:

Represents “average” outcome; a measure of “central tendency”
n n
E(X) = µX = ∑ P(X = x i )x i = ∑ pi x i
i =1 i =1
- Variance:
Squared deviation around the mean; a measure of “spread”
n n
Var(X) = σ 2
X = ∑ P(X = x i )(xi − µX ) = ∑ pi (x i − µX )2
2
i =1 i =1
- Standard Deviation :
Square root of the variance. A measure of spread in the same units
as the random variable X.
2
SD(X) = σ X = σX
15.063 Summer 2003 13
Example: Let X be the number that comes up on a roll of one die.
Compute the mean, variance and standard deviation of X.
Outcome P(X=x)
1 1/6
2 1/6
3 1/6
4 1/6
5 1/6
6 1/6
µx= 1/6*1 +1/6 *2 +1/6 *3 + 1/6*4+1/6*5+1/6*6 = 3.5

(σx)2= 1/6*(1-3.5)2 +1/6 *(2-3.5) 2 +1/6 *(3-3.5) 2 + 1/6 *(4-3.5) 2
+1/6 *(4-3.5) 2 +1/6 *(5-3.5) 2 +1/6 *(6-3.5) 2 = 2.917
σx= Sqrt{2.917}= 1.708 15.063 Summer 2003 14

Example 2:
Compute the mean, variance ,standard deviation of
X (eastern division sales) and Y (western division sales)
µ X = $5.2 mil µY = $5.25 mil
σ = 1.06 mil
2
X
2
σ Y2 = 2.3875 mil 2
σX = $1.029 mil σ Y = $1.545mil
15.063 Summer 2003 15

Continuous Random Variables
A continuous random variable can take any value in some interval

Example: X = time a customer spends waiting in line at the store
• “Infinite” number of possible values for the random variable.

How can we describe a probability distribution?
(We can no longer list the pi’s and xi’s!)
• For a continuous random variable, questions are phrased in terms

of a range of values.
Example: We might talk about the event that a customer waits

between 5.0 and 10.0 minutes, and not about the event that a
customer waits exactly 5.25 minutes! (why?)
15.063 Summer 2003 16

Continuous
Continuous Probability
Probability Distributions
Distributions
Continuous R.V.’s have continuous probability
distributions known also as the probability density
function (PDF)
Since a continuous R.V. X can take an infinite
number of values on an interval, the probability that a
continuous R.V. X takes any single given value is
zero: P(X=c)=0
Probabilities for a continuous RV X are calculated for
a range of values: P (a ≤ X ≤ b)
P (a ≤ X ≤ b) is the area under
the probability distribution function,
f(x) , for continuous R.V. X.
The total area under f(x) is 1.0
ab c
15.063 Summer 2003 17
The Uniform Distribution
X is uniform on [a,b] if X is equally likely to take any value in
the range from a to b.
Example: Suppose that transit time of the subway between Alewife
Station and Downtown Crossing is uniformly distributed between
10.0 and 20.0 minutes.
a) What is the mean transit time?
E(X) = µx = ?
b) What is the probability that the transit
time exceeds 12.0 minutes?
P(X >= 12.0) = ?
c) What is the probability that the transit
time is between 14.0 and 18.0 minutes?
P(14<= X <= 18) = ?
15.063 Summer 2003 18

Uniform
Uniform Distribution
Distribution
A continuous R.V. X is uniform in the interval [a-b]
if it is equally likely to take any value in the interval
[a,b]. Thus f(x) for a continuous uniform R.V. is a
horizontal line (i.e., a constant). f(x)=1/(b-a) so
that the area under the curve is 1.0.
⎧ 1
⎪b − a for a≤ x≤b 1
⎪ b−a
f ( x) = ⎨
⎪ 0
f (x)
for all other values
⎪⎩
Area = 1
a x b
15.063 Summer 2003 19
Uniform
Distribution
transit time of the subway between Alewife
Station and Downtown Crossing is uniformly
distributed between 10.0 and 20.0 minutes.
⎧ 1
⎪ 20 − 10 for 10 ≤ x ≤ 20
⎪ 1
f ( x) = ⎨
⎪ 0 b10− a
for all other valu es
⎪ f (x)
⎩
Area = 1
P(X >= 12.0) = 0.80
P(14<= X <= 18) = 0.40 10 x 20
15.063 Summer 2003 20

Lot
Lot Weights
Weights inin aa Warehouse
Warehouse are
are Uniformly
Uniformly
distributed
distributed Between
Between 41
41 and
and 47
47 lbs.
lbs.
⎧ 1
⎪ 47 − 41 for 41 ≤ x ≤ 47
⎪ 1 1
f ( x) = ⎨ =
⎪ 0 47 − 41 6
⎪⎩
for all other values f (x)
Area = 1
41 47 x
15.063 Summer 2003 21

What
What is
is the
the probability
probability that
that the
the weight
weight of
of aa
randomly
randomly selected
selected lot
lot is
is between
between 42 42 and
and 45
45 lbs?
lbs?
P ( x1 ≤ X ≤ x 2 ) =
1
(x − x )
b−a 2 1
P(42 ≤ X ≤ 45) = (45− 42) =

1 1
(45 − 42 ) 1 =
1
6 2 6 2
f (x)
1/ 6
Area
= 0.5
41 42 45 47 x
15.063 Summer 2003 22

Uniform
Distribution
Mean
Mean and
and Standard
Standard Deviation
Deviation
Mean Mean
a + b 41 + 47 88
µ = µ = = = 44
2 2 2
Standard Deviation Standard Deviation

b−a 47 − 41 6
σ= σ= = = 1. 732
12 12 3. 464
15.063 Summer 2003 23

Cumulative Distribution Function (CDF)
t
P ( X ≤ t) = ∫
− ∞
f ( x ) dx = F (t )
1.0
F(t)
t 0.0
t
P( X ≤ s) F (s) = P( X ≤ s)
1.0
F(s)
s
0.0 s
P ( s ≤ X ≤ t ) = P ( X ≤ t ) − P ( X < s) = F (t ) − F ( s)
15.063 Summer 2003 24
Let X be uniformly distributed on [a,b]. Then density
function of X is:
f(x)
⎧ 1
⎪b − a for a≤ x≤b
⎪
f ( x) = ⎨ 1/(b-a)
⎪ 0 for all other values
⎪⎩
Find the CDF: a b

x
F (x) = ∫
− ∞
f ( u ) du
F(x)
⎧ 1 1
⎪ (b − a ) x − a for a≤ x≤b
(b − a )
⎪ 1
F ( x) = ⎨ 0 x<a
⎪ 1 for x>b
⎪
⎩ a b
15.063 Summer 2003 25

Summary
Discrete and continuous random variables
have to be understood differently
Histograms are very useful for discrete
variables
Next session we will talk about “binomial”
distributions
Uniform continuous random variables are
a good place to start for our later work with
normal distributions etc.
15.063 Summer 2003 26

Discrete and Continuous Random Variables

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Discrete and Continuous Random Variables

Uploaded by

Copyright:

Available Formats

Discrete and Continuous

Examples of random variables:

15.063 Summer 2003 2

A continuous r.v. can take any value in some

15.063 Summer 2003 3

15.063 Summer 2003 4

P(X = 3) = 1/8 H2 0.5

15.063 Summer 2003 5

Now let’s consider experiment 2: “number of heads when tossing

15.063 Summer 2003 7

15.063 Summer 2003 8

Orders for aircraft next year Probability

15.063 Summer 2003 9

Probability Distribution of the

15.063 Summer 2003 10

Eastern Division 0.40

3.0 0.05 0.20

15.063 Summer 2003 11

Random variable X Random variable Y

How do we describe and compare X and Y?

15.063 Summer 2003 12

- Mean or Expected Value:

µx= 1/6*1 +1/6 *2 +1/6 *3 + 1/6*4+1/6*5+1/6*6 = 3.5

σx= Sqrt{2.917}= 1.708 15.063 Summer 2003 14

µ X = $5.2 mil µY = $5.25 mil

σX = $1.029 mil σ Y = $1.545mil

15.063 Summer 2003 15

A continuous random variable can take any value in some interval

• “Infinite” number of possible values for the random variable.

• For a continuous random variable, questions are phrased in terms

Example: We might talk about the event that a customer waits

15.063 Summer 2003 16

15.063 Summer 2003 18

15.063 Summer 2003 20

15.063 Summer 2003 21

P(42 ≤ X ≤ 45) = (45− 42) =

15.063 Summer 2003 22

Standard Deviation Standard Deviation

15.063 Summer 2003 23

Find the CDF: a b

15.063 Summer 2003 25

15.063 Summer 2003 26

You might also like

µx= 1/61 +1/6 2 +1/6 3 + 1/64+1/65+1/66 = 3.5