
Mathematical Statistics I

Module for Math 150.1

Chapter 6: Joint Distributions

Previous chapters dealt with univariate distributions. In many applications, however, we
encounter distributions involving two or more random variables.

6.1 Joint and Marginal Distributions

In order to study distributions of two random variables, we have the following definitions.

We start with 2 discrete random variables.
Definition 6.1 Let X and Y be two discrete random variables defined over the same prob-
ability space. The joint probability mass function (joint pmf ) of X and Y is the
non-negative function fX,Y : R² → R defined as

       fX,Y (x, y) = P(X = x, Y = y).

Note that by P(X = x, Y = y) we mean the probability of the event {ω ∈ Ω : X(ω) =
x and Y (ω) = y}. We also define the joint range of X and Y to be the set

       ran(X, Y ) = {(x, y) ∈ R² : P(X = x, Y = y) > 0}.

Remark: The joint pmf of X and Y should satisfy the following properties:

1. 0 ≤ fX,Y (x, y) ≤ 1

2. Σ_y Σ_x fX,Y (x, y) = 1

3. P[(X, Y ) ∈ A] = Σ_{(x,y) ∈ A ∩ ran(X,Y)} fX,Y (x, y), for any A ⊆ R².

Note that P[(X, Y ) ∈ A] denotes the probability of {ω ∈ Ω : (X(ω), Y (ω)) ∈ A}.

We state a similar definition for the continuous case.


Definition 6.2 Two random variables X and Y are said to be jointly continuous if
there exists a non-negative function fX,Y : R2 → R, such that for any subset D ⊆ R2 ,
       P[(X, Y ) ∈ D] = ∬_D fX,Y (x, y) dA.

Such a function is called the joint probability density function (joint pdf ) of X and Y .
In this case, the joint range of X and Y is defined as

ran(X, Y ) = {(x, y) ∈ R2 : fX,Y (x, y) > 0}.

Remark:

1. Just like in the discrete case, the joint pdf of X and Y must satisfy

       ∬_{R²} fX,Y (x, y) dA = ∬_{ran(X,Y)} fX,Y (x, y) dA = 1.

2. In general, two random variables do not need to be both continuous to have
   a continuous joint pdf. But for the sake of clarity and coherence, when we
   talk about a continuous joint distribution, we shall assume that both random
   variables are also continuous.


Examples.
1. An analyst is studying traffic accidents in 2 adjacent towns. The random vari-
   able X represents the number of accidents in town A and Y for town B. The
   joint pmf of X and Y is given by

       p(x, y) = e^{-2}/(x! y!),   x = 0, 1, 2, . . . ; y = 0, 1, 2, . . .

   Verifying,

       Σ_y Σ_x e^{-2}/(x! y!) = e^{-2} (Σ_{y=0}^{∞} 1/y!)(Σ_{x=0}^{∞} 1/x!) = e^{-2}(e)(e) = 1.

   Suppose we want to find the probability that there will be 1 accident in town
   A and 2 accidents in town B. This is given by p(1, 2) = e^{-2}/(1! 2!) = 1/(2e²).
2. Suppose that the joint pdf of X and Y is given by

       f (x, y) = x + cy²,   0 ≤ x, y ≤ 1
                  0,         otherwise

   Find c and P(0 ≤ X < 1/3, 0 ≤ Y ≤ 1/2).

   Solution: Since f is a joint pdf,

       1 = ∫_0^1 ∫_0^1 (x + cy²) dx dy = ∫_0^1 (1/2 + cy²) dy = 1/2 + c/3.

   Thus, c = 3/2. Next,

       P(0 ≤ X < 1/3, 0 ≤ Y ≤ 1/2) = ∫_0^{1/2} ∫_0^{1/3} (x + 3y²/2) dx dy
                                    = ∫_0^{1/2} (1/18 + y²/2) dy
                                    = 1/36 + 1/48 ≈ 0.04861

3. Consider rolling a pair of unbiased dice. Each outcome has probability 1/36. Let
   X be the smaller and Y the larger outcome on a roll. The joint pmf of X and
   Y is given by

       p(x, y) = 1/36,   1 ≤ x = y ≤ 6
                 2/36,   1 ≤ x < y ≤ 6

   The following table summarizes the joint pmf values of X and Y .

    Y \ X         1       2       3       4       5       6    row total
      1          1/36                                             1/36
      2          2/36    1/36                                      3/36
      3          2/36    2/36    1/36                              5/36
      4          2/36    2/36    2/36    1/36                      7/36
      5          2/36    2/36    2/36    2/36    1/36              9/36
      6          2/36    2/36    2/36    2/36    2/36    1/36     11/36
  column total  11/36    9/36    7/36    5/36    3/36    1/36        1

The row and column totals of the values in the table above are also listed. The
column totals are the respective probabilities that X = x, x = 1, 2, . . . , 6. Sim-
ilarly, the row totals are the respective probabilities that Y = y, y = 1, 2, . . . , 6.
Moreover, observe that all these values are less than 1, and the row totals and the
column totals each sum to 1. These give rise to the concept of marginal
probabilities.
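
As a quick numerical check of the table above, the following sketch builds the same
joint pmf by enumerating the 36 equally likely rolls and recovers the row and column
totals (this assumes Python with NumPy; the variable names are illustrative only).

    import numpy as np

    # Joint pmf of X = smaller and Y = larger outcome of two fair dice,
    # built by enumerating the 36 equally likely ordered rolls (a, b).
    p = np.zeros((7, 7))                     # p[x, y]; only indices 1..6 are used
    for a in range(1, 7):
        for b in range(1, 7):
            p[min(a, b), max(a, b)] += 1 / 36

    print(np.isclose(p.sum(), 1.0))          # True: the joint pmf sums to 1
    print(p.sum(axis=1)[1:] * 36)            # 36*pX(x) -> [11. 9. 7. 5. 3. 1.]
    print(p.sum(axis=0)[1:] * 36)            # 36*pY(y) -> [ 1. 3. 5. 7. 9. 11.]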

The joint pmf/pdf of X and Y contains all information regarding the distributions of both
X and Y , i.e., we can obtain the pmf/pdf of X and Y from their joint pmf/pdf.

Definition 6.3 Let X and Y have the joint pmf pX,Y (x, y). The respective marginal
probability mass functions (marginal pmf ) of X and Y are defined as
       pX (x) = Σ_y pX,Y (x, y)   and   pY (y) = Σ_x pX,Y (x, y).

We define an analogue for continuous random variables.

Definition 6.4 Let X and Y have the joint pdf fX,Y (x, y). The respective marginal
probability density functions (marginal pdf ) of X and Y are defined as

       fX (x) = ∫_{-∞}^{∞} fX,Y (x, y) dy   and   fY (y) = ∫_{-∞}^{∞} fX,Y (x, y) dx.

Examples.

1. Given p(x, y) = e^{-2}/(x! y!), x, y = 0, 1, 2, . . .,

       pX (x) = Σ_y e^{-2}/(x! y!) = (e^{-2}/x!) Σ_{y=0}^{∞} 1/y! = e^{-1}/x!,   x = 0, 1, . . .

   Similarly, pY (y) = e^{-1}/y!, y = 0, 1, . . .
   Hence X, Y ∼ Poi(1). (A quick numerical check of this marginal is given after
   these examples.)
2. From the column and row totals of the table in the dice example of Section 6.1, we have

       pX (x) = (12 − (2x − 1))/36,  x = 1, 2, . . . , 6   and   pY (y) = (2y − 1)/36,  y = 1, 2, . . . , 6.
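
The Poisson marginal in Example 1 can be checked numerically; this is a minimal
sketch in plain Python (the cutoff of 50 terms stands in for the infinite sum and is an
arbitrary choice).

    from math import exp, factorial, isclose

    def joint(x, y):
        # joint pmf p(x, y) = e^{-2} / (x! y!)
        return exp(-2) / (factorial(x) * factorial(y))

    def marginal_X(x, cutoff=50):
        # marginal pX(x), truncating the sum over y
        return sum(joint(x, y) for y in range(cutoff))

    for x in range(5):
        assert isclose(marginal_X(x), exp(-1) / factorial(x), rel_tol=1e-12)
    print("pX(x) agrees with the Poisson(1) pmf for x = 0, ..., 4")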
Definition 6.5 Let X and Y be random variables. The joint cumulative distribution func-
tion of X and Y , denoted by FX,Y (x, y) or simply F (x, y) is given by

F (x, y) = P(X ≤ x, Y ≤ y), for all x, y ∈ R.

Remark: If X and Y are continuous random variables with joint pdf f and joint cdf F ,
then

       F (x, y) = ∫_{-∞}^{y} ∫_{-∞}^{x} f (s, t) ds dt

and that

       f (x, y) = ∂²F (x, y)/(∂x ∂y).

Theorem 6.6 The joint cdf F (x, y) of any two random variables X and Y satisfies the
following properties:

1. F (−∞, y) = lim_{x→−∞} F (x, y) = 0, for all y.

2. F (x, −∞) = lim_{y→−∞} F (x, y) = 0, for all x.

3. F (∞, ∞) = lim_{x,y→∞} F (x, y) = 1.

4. If x1 < x2 and y1 < y2 , then

       P(x1 < X ≤ x2 , y1 < Y ≤ y2 ) = F (x2 , y2 ) − F (x2 , y1 ) − F (x1 , y2 ) + F (x1 , y1 ) ≥ 0.

5. F (x, y) is right continuous in each variable, that is,

       lim_{h→0⁺} F (x + h, y) = lim_{h→0⁺} F (x, y + h) = F (x, y).

6. FX (x) = lim_{y→∞} F (x, y) is the marginal cdf of X.

7. FY (y) = lim_{x→∞} F (x, y) is the marginal cdf of Y .
Examples:

1. Consider the experiment of tossing two regular four-sided polyhedrons, each
   with sides labelled 1 to 4. Let X denote the number on the down-turned face
   of the first tetrahedron and Y the larger of the numbers on the down-turned
   faces. Find F (2, 3).

   Solution: Assuming that the experiment is conducted fairly, each outcome (a, b)
   of the two down-turned faces has probability 1/16. We consider all outcomes
   wherein the number on the down-turned face of the 1st tetrahedron is 1 or 2,
   and the larger of the two numbers is 1, 2, or 3; these are (1, 1), (1, 2), (1, 3),
   (2, 1), (2, 2), and (2, 3). Then

       F (2, 3) = P(X ≤ 2, Y ≤ 3) = 6/16 = 3/8.

   (An enumeration check of this value is given after these examples.)
2. The joint pdf of X and Y is given by

       f (x, y) = 2e^{-x} e^{-2y},   0 < x, y < ∞
                  0,                 elsewhere

Find the following probabilities:

P(X > 1, Y < 1), P(X < Y ), and P(X < a), a > 0.

Solution:

       P(X > 1, Y < 1) = ∫_0^1 ∫_1^∞ 2e^{-x} e^{-2y} dx dy
                       = ∫_0^1 2e^{-2y} [−e^{-x}]_1^∞ dy
                       = e^{-1} ∫_0^1 2e^{-2y} dy = (1 − e^{-2})/e.

       P(X < Y ) = ∫_0^∞ ∫_0^y 2e^{-x} e^{-2y} dx dy
                 = ∫_0^∞ 2e^{-2y} (1 − e^{-y}) dy = 1/3.

       P(X < a) = ∫_0^a ( ∫_0^∞ 2e^{-x} e^{-2y} dy ) dx
                = ∫_0^a e^{-x} dx = 1 − e^{-a}.

3. Let X and Y have a joint pdf

       f (x, y) = K(x + y),   0 < x < 1, 0 < y < 1
                  0,          elsewhere

   Solve for K, fX (x), P(X < 1/2) and P(X + Y ≤ 1).

   Solution: Since f is a joint pdf, then

       1 = ∫_0^1 ∫_0^1 K(x + y) dx dy = K ∫_0^1 (1/2 + y) dy = K.

       fX (x) = ∫_0^1 (x + y) dy = [xy + y²/2]_0^1 = x + 1/2,   0 < x < 1.

       P(X < 1/2) = ∫_0^{1/2} fX (x) dx = ∫_0^{1/2} (x + 1/2) dx = 3/8.

       P(X + Y ≤ 1) = ∫_0^1 ∫_0^{1−x} (x + y) dy dx
                    = ∫_0^1 [ x(1 − x) + (1 − x)²/2 ] dx
                    = 1/3.
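
The value F (2, 3) = 3/8 in Example 1 can be confirmed by brute-force enumeration of
the 16 equally likely outcomes; the snippet below is only an illustrative sketch in Python.

    from itertools import product

    # Two fair four-sided dice; X = first face, Y = larger of the two faces.
    count = sum(1 for a, b in product(range(1, 5), repeat=2)
                if a <= 2 and max(a, b) <= 3)      # event {X <= 2, Y <= 3}
    print(count, "/ 16 =", count / 16)             # 6 / 16 = 0.375 = 3/8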

6.2 Independent Random Variables


Definition 6.7 The random variables X and Y are said to be independent if for any sets
A, B ⊆ R,
P(X ∈ A, Y ∈ B) = P(X ∈ A)P(Y ∈ B).

Theorem 6.8 Let X and Y be independent random variables. Then the following are
satisfied:

1. If FX,Y is the joint cdf of X and Y , then

       FX,Y (x, y) = P(X ≤ x, Y ≤ y) = FX (x) · FY (y),

   where FX and FY are the marginal cdf ’s of X and Y , resp.

2. If pX,Y is the joint pmf of X and Y , then

       pX,Y (x, y) = P(X = x, Y = y) = pX (x) · pY (y),

   where pX and pY are the marginal pmf ’s of X and Y , resp.

3. If fX,Y is the joint pdf of X and Y , then

       fX,Y (x, y) = fX (x) · fY (y),

   where fX and fY are the marginal pdf ’s of X and Y , resp.


Example: Let X and Y be independent uniformly distributed random variables in (0, 60).
Find P(X + 10 < Y ) + P(Y + 10 < X).

(Figure: the square (0, 60) × (0, 60), with region R1 where x + 10 < y and region R2
where y + 10 < x.)

Observe that regions R1 and R2 have the same area. Hence

P(X + 10 < Y ) + P(Y + 10 < X) = 2P(X + 10 < Y ).



Solving for the desired probability, we have

       2P(X + 10 < Y ) = 2 ∬_{x+10<y} f (x, y) dx dy
                       = 2 ∬_{x+10<y} f (x)f (y) dx dy
                       = 2 ∫_{10}^{60} ∫_0^{y−10} (1/60)² dx dy
                       = (2/60²) ∫_{10}^{60} (y − 10) dy
                       = 25/36.
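
A Monte Carlo sketch of this example (assuming NumPy; the sample size and seed are
arbitrary choices) lands close to 25/36 ≈ 0.694.

    import numpy as np

    rng = np.random.default_rng(0)
    n = 1_000_000
    x = rng.uniform(0, 60, n)                    # X ~ U(0, 60)
    y = rng.uniform(0, 60, n)                    # Y ~ U(0, 60), independent of X
    # The two events are disjoint, so the probability of the union is the sum.
    est = np.mean((x + 10 < y) | (y + 10 < x))
    print(est, 25 / 36)                          # both are about 0.694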

Exercises:

1. Identify whether X and Y are independent given the joint pdf f (x, y).

   a. f (x, y) = xy²/30,   x = 1, 2, 3, y = 1, 2

   b. f (x, y) = xy²/13,   (x, y) = (1, 1), (1, 2), (2, 2)

2. Let X, Y , Z be independent and uniformly distributed random variables on
   (0, 1). Find P(X ≥ Y Z).


The definition of joint distributions and independence can be extended to n random vari-
ables.

Definition 6.9 Let X1 , X2 , . . . , Xn be random variables.

1. The joint cdf of X1 , X2 , . . . , Xn is given by

       FX1,X2,...,Xn (x1 , x2 , . . . , xn ) = P(X1 ≤ x1 , X2 ≤ x2 , . . . , Xn ≤ xn )

2. The joint pmf (for discrete) or pdf (for continuous) of X1 , X2 , . . . , Xn , denoted
   by f (x1 , x2 , . . . , xn ), satisfies the following properties:

   i. f (x1 , x2 , . . . , xn ) ≥ 0.
   ii. The n-tuple summation (for discrete) or integral (for continuous) of
       f (x1 , x2 , . . . , xn ) over all values of xi , i = 1, 2, . . . , n is equal to 1.

3. X1 , X2 , . . . , Xn are said to be independent if

       P(X1 ≤ x1 , X2 ≤ x2 , . . . , Xn ≤ xn ) = Π_{i=1}^{n} P(Xi ≤ xi ).

6.3 Conditional Distributions


The concept of conditional probability of events can be extended to random variables,
giving rise to conditional distributions.

Definition 6.10 If X and Y are both discrete or continuous random variables with joint
pmf/pdf f (x, y), then the conditional probability mass/density function of Y given X = x,
is defined to be
       f (y|x) = f (x, y)/fX (x),
given fX (x) > 0.
Similarly, the conditional pmf/pdf of X given Y = y is

       f (x|y) = f (x, y)/fY (y),

given fY (y) > 0.
Remark. If X and Y are independent random variables, then the conditional density of
Y given X = x is just the marginal density of Y . That is,

       f (y|x) = fX (x)fY (y)/fX (x) = fY (y).

Similarly, f (x|y) = fX (x).


Definition 6.11 If X and Y are jointly distributed, then

1. the conditional cumulative distribution of Y given X = x, is defined by

       FY|X (y|x) = Σ_{j: yj ≤ y} p(yj |x),        X, Y discrete
                    ∫_{-∞}^{y} f (z|x) dz,         X, Y continuous

2.
       P[a < Y < b | X = x] = ∫_a^b f (y|x) dy

   and

       P[c < X < d | Y = y] = ∫_c^d f (x|y) dx.

Examples:

1. Let X and Y be discrete random variables with joint pmf

       p(x, y) = (x + y)/21,   x = 1, 2, 3; y = 1, 2.

   What is the conditional pmf of X given Y = 2?
   Solution:

       p(x | Y = 2) = p(x, 2)/pY (2)
                    = [(x + 2)/21] / [((1 + 2) + (2 + 2) + (3 + 2))/21]
                    = (x + 2)/12,   x = 1, 2, 3.

2. If f (x, y) = xe^{-x(y+1)}, x, y ≥ 0, find f (y|x).
   Solution:

       f (y|x) = xe^{-x(y+1)} / ∫_0^∞ xe^{-x(y+1)} dy = xe^{-x(y+1)} / e^{-x} = xe^{-xy}.
3. Suppose f (x, y) = (x + y) I_(0,1)(x) I_(0,1)(y). Find f (y|x) and F (y|x).

   Solution:

       f (y|x) = (x + y) / ∫_0^1 (x + y) dy = (x + y)/(x + 1/2).

   Then

       F (y|x) = ∫_{-∞}^{y} f (z|x) dz = ∫_0^y (x + z)/(x + 1/2) dz
               = (xy + y²/2)/(x + 1/2),   0 < y < 1.
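
The conditional density and cdf in Example 3 can be reproduced symbolically; the sketch
below assumes SymPy is installed and is only a check of the computation above.

    import sympy as sp

    x, y, z = sp.symbols('x y z', positive=True)
    f = x + y                                  # joint pdf on the unit square

    fX = sp.integrate(f, (y, 0, 1))            # marginal of X: x + 1/2
    f_cond = sp.simplify(f / fX)               # f(y|x) = (x + y)/(x + 1/2)
    print(f_cond)
    print(sp.simplify(sp.integrate(f_cond, (y, 0, 1))))   # 1: a valid conditional density
    F_cond = sp.integrate(f_cond.subs(y, z), (z, 0, y))
    print(sp.simplify(F_cond))                 # equivalent to (x*y + y**2/2)/(x + 1/2)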

Exercises:

1. Let

       f (x, y) = 6x²y,   0 < x < 1, 0 < y < 1
                  0,      elsewhere

   be the joint pdf of X and Y . Find P(0 < X < 3/4, 1/3 < Y < 1).

2. Given the joint pdf

       f (x, y) = 6(1 − x − y),   0 < x, y < 1, 0 < x + y ≤ 1
                  0,              otherwise

   find P(0 ≤ X ≤ 1/2).

3. Let X and Y have joint pdf f (x, y) = ke^{-x} e^{-y}, for all 0 < x < y < ∞. Find k
   and the marginal pdf of X, x > 0.

4. If f (x, y) = k(x² + y²) is a joint pdf of continuous random variables X and Y over
   the unit square bounded by (0, 0), (1, 0), (0, 1), and (1, 1), find P(X + Y ≥ 1).

5. The joint cdf of X and Y is F (x, y) = (3x³y + 2x²y²)/5, 0 < x, y < 1. Find
   f (1/2, 1/2).

6. If f (x, y) = e^{-(x+y)}, x, y > 0, find P(X > Y | X > 1/2).

7. If f (x, y) = 1, 0 < x < 1, 0 < y < 1, find P(Y − X > 0 | X + Y > 1/2).

   (Figure: the unit square with the lines y = x and y = 1/2 − x.)

6.4 Expectation
Now we discuss the joint, marginal, and conditional expectations of several random vari-
ables.
Definition 6.12 Given a function g(X, Y ) of jointly distributed random variables X and
Y with defined joint density function, the expectation of g(X, Y ) is defined as

       E[g(X, Y )] = Σ_y Σ_x g(x, y) p(x, y),                        X, Y discrete
                     ∫_{-∞}^{∞} ∫_{-∞}^{∞} g(x, y) f (x, y) dx dy,   X, Y continuous

To define marginal expectations, one makes use of the marginal density functions, de-
fined as follows.

Definition 6.13 Given jointly distributed random variables X and Y with defined joint
density function, the marginal expectation of X is given by

       E[X] = Σ_x x fX (x),              X, Y discrete
              ∫_{-∞}^{∞} x fX (x) dx,    X, Y continuous

Similarly,

       E[Y ] = Σ_y y fY (y),             X, Y discrete
               ∫_{-∞}^{∞} y fY (y) dy,   X, Y continuous

Examples:

1. Given f (x, y) = 6xy², 0 < x < 1, 0 < y < 1, find E(XY ).
   Solution:

       E(XY ) = ∫_0^1 ∫_0^1 xy · 6xy² dx dy = 1/2.
2. Let f (x, y) = (x + y) I_(0,1)(x) I_(0,1)(y). Find E(XY ), E(X + Y ), E(X), and
   E(Y ).
   Solution:

       E(XY ) = ∫_0^1 ∫_0^1 xy (x + y) dx dy = 1/3

       E(X + Y ) = ∫_0^1 ∫_0^1 (x + y)² dx dy = 7/6

       E(X) = ∫_0^1 x fX (x) dx = ∫_0^1 ∫_0^1 x (x + y) dy dx = 7/12.

       E(Y ) = ∫_0^1 y fY (y) dy = ∫_0^1 ∫_0^1 y (x + y) dx dy = 7/12.
3. An accident occurs at a point X that is uniformly distributed on a road of
length L. At the time of the accident, an ambulance is at a location Y that is
also uniformly distributed on the road. Assuming X and Y are independent,
find the expected distance between the ambulance and the point of accident.
   Solution: X ∼ U (0, L) and Y ∼ U (0, L). Since X and Y are independent, then
   f (x, y) = fX (x)fY (y) = 1/L². So

       E(|X − Y |) = (1/L²) ∫_0^L ∫_0^L |x − y| dy dx
                   = (1/L²) ∫_0^L [ ∫_0^x (x − y) dy + ∫_x^L (y − x) dy ] dx
                   = (1/L²) ∫_0^L ( L²/2 + x² − Lx ) dx
                   = L/3.
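
A quick simulation of Example 3 (a sketch assuming NumPy, with L = 1 chosen
arbitrarily) agrees with L/3.

    import numpy as np

    rng = np.random.default_rng(1)
    L, n = 1.0, 1_000_000
    x = rng.uniform(0, L, n)               # accident location, X ~ U(0, L)
    y = rng.uniform(0, L, n)               # ambulance location, independent of X
    print(np.mean(np.abs(x - y)), L / 3)   # both are about 0.333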

Expectation is a linear operator, and this carries over to joint distributions. We look at
the case when X and Y are jointly continuous random variables. Indeed, for a, b ∈ R,

       E(aX + bY ) = ∫_{-∞}^{∞} ∫_{-∞}^{∞} (ax + by) f (x, y) dx dy
                   = ∫_{-∞}^{∞} ∫_{-∞}^{∞} ax f (x, y) dy dx + ∫_{-∞}^{∞} ∫_{-∞}^{∞} by f (x, y) dx dy
                   = a ∫_{-∞}^{∞} x fX (x) dx + b ∫_{-∞}^{∞} y fY (y) dy
                   = aE(X) + bE(Y ).

Di
Theorem 6.14 Let X and Y be jointly distributed random variables. If X and Y are
independent, then

       E[g(X)h(Y )] = E[g(X)]E[h(Y )],

for any function g of X and h of Y .


Proof: We prove only the continuous case. Let f (x, y) be the joint pdf of X and Y . Since
X and Y are independent, we have

       E[g(X)h(Y )] = ∫_{-∞}^{∞} ∫_{-∞}^{∞} g(x)h(y) f (x, y) dx dy
                    = ∫_{-∞}^{∞} ∫_{-∞}^{∞} g(x)h(y) fX (x)fY (y) dx dy
                    = ∫_{-∞}^{∞} ( ∫_{-∞}^{∞} g(x) fX (x) dx ) h(y) fY (y) dy
                    = ∫_{-∞}^{∞} E[g(X)] h(y) fY (y) dy
                    = E[g(X)]E[h(Y )]. 


Suppose now that, for jointly distributed X and Y , we want the expectation of, say, Y
for a fixed value of X, and vice versa. This gives rise to the notion of conditional
expectation.

Definition 6.15 Let X and Y be jointly distributed random variables and g(Y ) a function
of Y . Then the conditional expectation of g(Y ) given X = x is defined as
       E[g(Y )|X = x] = Σ_y g(y) p(y|x),              X, Y discrete
                        ∫_{-∞}^{∞} g(y) f (y|x) dy,   X, Y continuous

Examples:

1. Suppose the joint pdf of X and Y is f (x, y) = k, −1 < x < 1, x² < y < 1, k
   constant. Find E(Y |X = x).
   Solution: Solving for the marginal density function of X,

       fX (x) = ∫_{x²}^{1} k dy = k(1 − x²),   −1 < x < 1.

   Then,

       E(Y |X = x) = ∫_{x²}^{1} y · f (x, y)/fX (x) dy = ∫_{x²}^{1} y · k/(k − kx²) dy
                   = (x² + 1)/2,   −1 < x < 1.
2. Suppose that the joint pdf of X and Y is given by

       f (x, y) = 10xy²,   0 < x < y < 1
                  0,       otherwise

   What is E(X²|Y = y)?
   Solution: Solving for the marginal density function of Y ,

       fY (y) = ∫_0^y 10xy² dx = 5y⁴,   0 < y < 1.

   Then,

       E(X²|Y = y) = ∫_0^y x² · (2x/y²) dx = y²/2,   0 < y < 1.
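
A symbolic check of Example 2 (assuming SymPy; an illustrative sketch only):

    import sympy as sp

    x, y = sp.symbols('x y', positive=True)
    f = 10 * x * y**2                         # joint pdf on 0 < x < y < 1

    fY = sp.integrate(f, (x, 0, y))           # marginal of Y
    cond = sp.simplify(f / fY)                # f(x|y) = 2*x/y**2
    EX2 = sp.integrate(x**2 * cond, (x, 0, y))
    print(sp.simplify(fY), sp.simplify(EX2))  # 5*y**4 and y**2/2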

Theorem 6.16 Let X and Y be jointly distributed random variables.

1. If g1 and g2 are functions of Y , then

E[g1 (Y ) + g2 (Y )|X = x] = E[g1 (Y )|X = x] + E[g2 (Y )|X = x].



2. If g is a function of Y and h is a function of X, then

E[g(Y )h(X)|X = x] = h(x)E[g(Y )|X = x].

The proof of the theorem is left as an exercise.

Now we give a theorem involving conditional expectations that will be helpful in our
discussion of the conditional variance. The proof is left as an exercise for the reader.

Theorem 6.17 Let X and Y be jointly distributed random variables, and g a function of
Y . Then

       E[E[g(Y )|X]] = E[g(Y )].

Definition 6.18 Let X and Y be jointly distributed. The variance of Y given X = x
is defined as

       Var[Y |X = x] = Σ_y [y − E(Y |x)]² p(y|x),              X, Y discrete
                       ∫_{-∞}^{∞} [y − E(Y |x)]² f (y|x) dy,   X, Y continuous

Alternatively,

       Var(Y |X = x) = E(Y²|X = x) − [E(Y |X = x)]².


Now we state a theorem that relates marginal and conditional variances.



Theorem 6.19

Var(Y ) = E[Var(Y |X)] + Var[E(Y |X)].

The proof is again left as an exercise but will use the same argument as the proof of
the previous theorem.
Example: Let X and Y be jointly distributed with pdf

f (x, y) = 2, 0 < x < y < 1.

Find E(X|Y = y) and Var(X|Y = y).

Solution: We first solve for fY (y).

       fY (y) = ∫_0^y 2 dx = 2y,   0 < y < 1.

Then

       E(X|Y = y) = ∫_0^y x (1/y) dx = y/2,   0 < y < 1.

       Var(X|Y = y) = ∫_0^y (x − y/2)² (1/y) dx = y²/12,   0 < y < 1.
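
For this example, Theorem 6.19 can be verified symbolically: the marginal variance of X
equals E[Var(X|Y )] + Var[E(X|Y )]. The sketch below assumes SymPy and is illustrative
only.

    import sympy as sp

    x, y = sp.symbols('x y', positive=True)
    f = sp.Integer(2)                          # joint pdf on 0 < x < y < 1

    # Marginal of X and its variance.
    fX = sp.integrate(f, (y, x, 1))            # 2*(1 - x)
    EX = sp.integrate(x * fX, (x, 0, 1))       # 1/3
    VarX = sp.integrate(x**2 * fX, (x, 0, 1)) - EX**2          # 1/18

    # Conditional pieces from the example: E(X|Y) = Y/2, Var(X|Y) = Y**2/12.
    fY = sp.integrate(f, (x, 0, y))            # 2*y
    E_condvar = sp.integrate((y**2 / 12) * fY, (y, 0, 1))      # E[Var(X|Y)] = 1/24
    E_condmean = sp.integrate((y / 2) * fY, (y, 0, 1))         # E[E(X|Y)] = 1/3
    V_condmean = sp.integrate((y / 2)**2 * fY, (y, 0, 1)) - E_condmean**2   # 1/72

    print(VarX, E_condvar + V_condmean)        # both equal 1/18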

Exercises:

1. Given f (x, y) = kxy, where k is constant and 0 < x < y < 2.

   a. Find k such that f is a joint pdf of X and Y .
   b. Find the marginal pdf of X and of Y .
   c. Find the marginal means of X and of Y .
   d. Find the marginal variances of X and of Y .
   e. Find the conditional expectation E[X|y].

2. Suppose that a two-dimensional RV (X, Y ) is uniformly distributed over the
   region bounded by y = x and y = x² for x ≥ 0. Find the joint pdf and marginal
   pdfs of X and Y .

6.5 Covariance and Correlation



Covariance and correlation both measure how much two random variables change together.

These measures could be positive, negative, or even 0.

Definition 6.20 Let two random variables X and Y have means µX and µY , respectively,
and be jointly distributed with pdf f (x, y). The covariance of X and Y , denoted σX,Y =
cov(X, Y ), is defined as

       cov(X, Y ) = E[(X − µX )(Y − µY )] = E(XY ) − µX µY .

Remark 6.21 Let X and Y be jointly distributed. We then have the following.

1. cov(X, Y ) = cov(Y, X)

2. If Y = X, then cov(X, Y ) = cov(X, X) = E[(X − µX )²] = Var(X).


3. If X and Y are independent, then cov(X, Y ) = E(X)E(Y ) − E(X)E(Y ) = 0.
4. For any a, b ∈ R, cov(aX, bY ) = ab cov(X, Y ).

Example. cov(X, Y ) = 0 does not imply that X and Y are independent.

Consider X ∼ U (−1, 1). Hence, E[X] = E[X³] = 0. Let Y = X². Then

       cov(X, Y ) = E[X³] − E[X²]µX = 0,

but clearly X and X² are not independent.
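
A small simulation (a sketch assuming NumPy; seed and sample size are arbitrary)
illustrates the point: the sample covariance of X and X² is essentially 0, yet the two
variables are clearly dependent.

    import numpy as np

    rng = np.random.default_rng(2)
    x = rng.uniform(-1, 1, 1_000_000)   # X ~ U(-1, 1)
    y = x**2                            # Y = X^2 is a deterministic function of X
    print(np.cov(x, y)[0, 1])           # approximately 0
    # Dependence shows up immediately in conditional behaviour:
    print(y[np.abs(x) > 0.9].mean(), y[np.abs(x) < 0.1].mean())   # about 0.90 vs 0.003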

Now we state a theorem that relates the variances of two rv’s with their covariance.

Theorem 6.22 Var(X + Y ) = Var(X) + Var(Y ) + 2cov(X, Y ).

Proof:

       Var(X + Y ) = E[(X + Y )²] − [E(X + Y )]²
                   = E(X²) + 2E(XY ) + E(Y²) − ([E(X)]² + 2E(X)E(Y ) + [E(Y )]²)
                   = E(X²) − [E(X)]² + E(Y²) − [E(Y )]² + 2(E(XY ) − E(X)E(Y ))
                   = Var(X) + Var(Y ) + 2cov(X, Y ). 



Corollary 6.23 If X and Y are independent, then Var(X + Y ) = Var(X) + Var(Y ).



Definition 6.24 The correlation coefficient of two random variables X and Y , denoted
ρX,Y , is defined as

       ρX,Y = cov(X, Y )/(σX σY ),
where σX and σY are the standard deviations of X and Y , respectively.

Examples:

1. Find cov(X, Y ) given the joint and marginal distribution values of X and Y .

    Y \ X      -1       0       1     fY (y)
      1       1/18    2/18    3/18     6/18
      0       2/18     0      3/18     5/18
     -1       3/18    2/18    2/18     7/18
    fX (x)    6/18    4/18    8/18       1

Solution:

       E(XY ) = 1[(−1)(1/18) + 0 + (1)(3/18)] + 0 + (−1)[(−1)(3/18) + 0 + (1)(2/18)] = 3/18.

       E(X) = Σ_{x=−1}^{1} x fX (x) = (−1)(6/18) + 0 + (1)(8/18) = 2/18.

       E(Y ) = Σ_{y=−1}^{1} y fY (y) = (−1)(7/18) + 0 + (1)(6/18) = −1/18.

Then,

       cov(X, Y ) = 3/18 − (2/18)(−1/18) = 3/18 + 1/162 = 14/81.

2. Let X and Y be continuous random variables with joint pdf

       f (x, y) = 2,   0 < y < 1 − x, 0 < x < 1.

   Find cov(X, Y ).
   Solution:

       E(XY ) = ∫_0^1 ∫_0^{1−x} 2xy dy dx = 1/12.

       E(X) = ∫_0^1 ∫_0^{1−x} 2x dy dx = 1/3.

       E(Y ) = ∫_0^1 ∫_0^{1−y} 2y dx dy = 1/3.

   Therefore,

       cov(X, Y ) = 1/12 − (1/3)(1/3) = −1/36.

   (A symbolic recomputation of these values follows the examples.)

3. Suppose f (x, y) = x + y, 0 < x, y < 1. Find ρX,Y .
   Solution:

       E(XY ) = 1/3,   E(X) = E(Y ) = 7/12.

       E(X²) = E(Y²) = 5/12   ⇒   Var(X) = Var(Y ) = 11/144.

   Hence,

       ρX,Y = (1/3 − 49/144) / (11/144) = −1/11.
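
The arithmetic in Example 2 is easy to slip on, so here is a symbolic recomputation of
E(XY ), E(X), E(Y ), and cov(X, Y ) over the triangle 0 < y < 1 − x (assuming SymPy;
a sketch only).

    import sympy as sp

    x, y = sp.symbols('x y', positive=True)
    f = sp.Integer(2)                    # joint pdf on 0 < y < 1 - x, 0 < x < 1

    def E(g):
        # expectation of g(X, Y) over the triangular support
        return sp.integrate(sp.integrate(g * f, (y, 0, 1 - x)), (x, 0, 1))

    EXY, EX, EY = E(x * y), E(x), E(y)
    print(EXY, EX, EY)                   # 1/12, 1/3, 1/3
    print(EXY - EX * EY)                 # covariance: -1/36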

Exercises:

1. An insurance policy pays a total medical benefit consisting of two parts for each
claim. Let X represent the part of the benefit that is to be paid to the doctor,
and let Y represent the part to be paid to the hospital. The variance of X is
5,000, while the variance of Y is 10,000, and the variance of X + Y is 17,000.
Due to increasing medical costs, the insurance company decides to increase X
by a flat amount of 100 and increase Y by 10%. Calculate the variance of the
total benefit after the revisions.
2. Suppose X ∼ U (a, b) , with b > a. Find the values of a and b, if E[X] = 2 and
   cov(X, X²) = 16/3.
3. Suppose that X and Y are jointly continuous random variables with joint pdf

       f (x, y) = k,   x ≤ y ≤ 2 − x, x ∈ [0, 1]
                  0,   elsewhere

   Find k, cov(X, Y ) and E[Y |X = c] for any c ∈ [0, 1).


4. Let X be a continuous random variable with pdf

       fX (x) = 2x,   x ∈ [0, 1]
                0,    elsewhere

   Suppose that given X = x, a random variable Y is uniformly distributed over
   [−x, x]. Find fY (y) and P(|Y | < X³).


