Professional Documents
Culture Documents
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal The normal distribution
distribution
Useful distributions
An application
PCA
Francesco Corona
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
An application
PCA
The normal distribution
Useful distributions
The normal The normal distribution
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal Motivation for the normal distribution is found in the central limit theorem
distribution
An application
PCA Normal distributions provide an important family of distributions
Applications and inference
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
Consider the integral
The multivariate
normal
distribution
0.4
An application
PCA
f (z )
Z +∞ − z 2
0.2 1
I = √ exp dz (1)
−∞ 2π 2
0
−5 0 5
z
This integral exists
The normal The normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146) Consider the relevant part of the integrand
PR (TIP8412)
2017.2 It is a positive continuous function bounded by an integrable function
The normal
distribution 3
The multivariate
normal
(2π)1/2 f (z )
distribution
2
An application − z 2
PCA exp
1 2
0
−5 0 5
z
That is,
− z 2
0 < exp < exp (−|z | + 1), −∞ < z < ∞
2
and Z ∞
exp (−|z | + 1)dz = 2e
−∞
The normal The normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution Set z = r cos θ and w = r sin θ
The multivariate
normal Then,
distribution
Z 2π Z
An application 1 ∞ 2
PCA I2 = e −r /2
r dr dθ
2π 0 0
| {z }
2 2
xe cx dx =1/(2c)e cx =1
R
Z 2π
1
= dθ
2π 0
=1
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The integrand is a PDF of a continuous random variable with support R
The normal
distribution
Denote this random variable Z , Z has the PDF
The multivariate
1 − z 2
normal
distribution
f (z ) = p exp , for − ∞ < x < ∞ (2)
(2π) 2
An application
PCA
0.4
f (z )
0.2
0
−5 0 5
z
The normal The normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
For t ∈ R, the MGF of Z can be derived by a completion of a square
The normal
distribution Z ∞ 1 1
The multivariate E [exp (tZ )] = exp (tz ) √ exp − z 2 dz
normal −∞ 2π 2
distribution
1 Z ∞ 1 h 1 i
2
An application = exp t exp − (z − t)2 dz (3)
PCA 2 −∞ 2π 2
1 Z ∞ 1 h 1 i
= exp t2 √ exp − w 2 dw
2 −∞ 2π 2
1 − z 2
f (z ) = p exp , for − ∞ < x < ∞,
(2π) 2
The normal The normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
Thus, the MGF of Z
distribution
2
The multivariate MZ (t) = e (1/2t ) , for − ∞ < x < ∞ (4)
normal
distribution
2 2
MZ′′ (t) = e (1/2t )
+ t 2 e (1/2t )
E (Z ) = 0
(5)
Var(Z ) = 1
The normal The normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
Define the continuous random variable X = bZ + a, for b > 0
The multivariate
normal • This is a 1-to-1 transformation
distribution
An application
We want to derive the PDF of X
PCA
• The inverse of the transformation is z = b −1 (x − a)
• The Jacobian is J = b −1
1 1 h 1 x − a 2 i
fX (x ) = √ exp − , for − ∞ < x < ∞
2π b 2 b
The normal The normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal By Equation (5), we have
distribution
An application E (X ) = a
PCA
Var(X ) = b 2
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
Definition 1.1
distribution Normal distribution
An application
PCA A random variable X has a normal distribution if its PDF is
1 h 1 x − µi
f (x ) = √ exp − , x ∈ (−∞, +∞) (6)
2πσ 2 σ
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution Remark
An application
Consider the random variable Z with the PDF
PCA
1 − z 2
f (z ) = p exp , for − ∞ < x < ∞
(2π) 2
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
An application
We use the relationship
PCA
X = σ Z+ µ
|{z} |{z}
b a
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
We obtain,
An application
PCA
E exp (tX ) = E exp t(σZ + µ) = exp (µt)E exp (tσZ )
1 1
= exp (µt) exp σ2 t 2 = exp µt + σ2 t 2
2 2
for −∞ < t < +∞
The normal The normal distribution (cont)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
An application
Remark
PCA Consider the relationship between Z and X
The normal The normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2 Example
The normal Consider the random variable X with the MGF
distribution
The multivariate
normal
distribution
An application 1.05
2
M (t) = e 2t +32t
PCA
M (t)
1
1 M (t) = exp µt + σ2 t 2
2
−2 0 2
t ·10−2
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
Example
The normal We can derive all the moments of the standard normal RV using its MGF
distribution
The multivariate
We can do the same for a general normal RV
normal
distribution
1 n n
(x + y)k =
Pn
x n−k y k =
Pn
k =0 k k =0 k
x k y n−k
The normal The normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
0
−5 0 5 10
x
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
There are many practical applications that involve normal distributions
An application
PCA
We need to be able to readily calculate probabilities
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
An application We can use the relation between normal and standard normal variables
PCA
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution Consider the CDF of a standard normal random variables Z
The multivariate Z z − w 2
normal
1
distribution
Φ(z ) = √ exp dw (8)
−∞ 2π 2
An application
PCA
Let X ∼ N (µ, σ2 )
UFC/DC
ATAII (CK0146)
PR (TIP8412) Suppose we are interested in the value xp such that p = FX (xp ) for some p
2017.2
Take zp = Φ−1 (p), then xp = σzp + µ
The normal
distribution
An application
PCA
0.3
0.2 1 − z 2
f (z )
Φ(z ) = p exp
(2π) 2
0.1
0
−4 −2 0 2 4
z
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
• pnorm(x,a,b), P (X ≤ x )
• dnorm(x,a,b), the PDF of X at x
The normal The normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146) Example
PR (TIP8412)
2017.2 Let the random variable X ∼ N (2, 25)
f (x |µ = 2, σ2 = 25)
The multivariate 3
normal
distribution
An application 2
PCA
0
−20 0 20
x
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
An application
1 − 2
− 8 − 2
PCA
P (−8 < X < 1) = Φ −Φ
5 5
= Φ(−0.2) − Φ(−2)
= (1 − 0.579) − (1 − 0.977) = 0.398
The normal The normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
Example
An application Let the random variable X ∼ N (µ, σ2 )
PCA
µ + 2σ − µ µ − 2σ − µ
P (µ − 2σ < X < µ + 2σ) = Φ −Φ
σ σ
(10)
= Φ(2) − Φ(−2)
= 0.977 − (1 − 0.977) = 0.954
The normal The normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
Proof
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
That is, √
The multivariate Z
normal
v 1 2
distribution G(v ) = 2 √ e (−w /2) dw , for 0 ≤ v
0 2π
An application
PCA Moreover,
G(v ) = 0, for v < 0
√
Consider a change of the integration variable, w = y
Z v 1 (−y/2)
G(v ) = √ √ e dy, for 0 ≤ v
0 2π y
The normal The normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution Hence, the PDF g(v ) = G ′ (v ) of random variable V of the continuous type
The multivariate
normal
distribution 1√ v (1/2−1) e (−v/2) , 0 < v < ∞
√
An application g(v ) = π 2
0,
PCA elsewhere
√
Then, it must be Γ(1/2) = π and therefore V ∼ χ2 (1)
The normal The normal distribution (cont.)
distribution
The multivariate
normal
Suppose that Xi ∼ N (µi , σi2 ), for i = 1, . . . , n,
distribution
An application
Pn
Let Y = ai Xi , for some constants a1 , . . . , an
i=1
PCA
P P
Then, the distribution of Y is N ( ni=1 ai µi , ni=1 ai2 σi2 )
Proof
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
Let X1 , X2 , . . . , Xn be random sample from a N (µ, σ2 )
The normal
A corollary gives the distribution of the sample mean
distribution
n
The multivariate 1X
normal X = Xi
distribution n i=1
An application
PCA
Corollary
Let X1 , . . . , Xn be IID random variables with common N (µ, σ2 ) distribution
1 Pn
Let X = i=1 Xi
n
Then, X has a N (µ, σ2 /n) distribution
Proof
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
The multivariate normal distribution for a n-dimensional random vector
An application
PCA • Examples for the bivariate case, n = 2
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
Consider the random vector Z = (Z1 , Z2 )′
2017.2
Since n = 2, we have
The normal
distribution n n
Y 1 n 1 o 1 n/2 n 1X o
fZ (z) = √ exp − zi2 = exp − z2 ,
The multivariate
normal i=1
2π 2 2π 2 i=1 i
distribution
n 2
X o
An application
PCA
= 1/(2π) exp − 1/2 zi2 = 1/(2π) exp − 1/2(z12 + z22 )
i=1
0.1
4
0 2
−4 −2 0
0 2 −2
4 −4 z2
z1
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution The Zi s have mean zero, unit variance and they are uncorrelated
An application
PCA
Then, the mean and covariance matrix of Z
E (Z) = 0
(12)
Cov(Z) = In
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
2
The multivariate
normal
The MGF of each of the Zi s evaluated at ti is e (ti /2)
distribution
An application
As the Zi s are independent, the MGF of Z
PCA
n
hY i Yn
MZ (t) = E exp {ti Zi } = E [exp {ti Zi }]
i=1 i=1
n
(13)
n1X o n1 o
= exp ti2 = exp t′ t , for all t ∈ Rn
2 i=1 2
The normal The multivariate normal distribution
distribution
UFC/DC
ATAII (CK0146) Consider the random vector Z = (Z1 , Z2 )′
PR (TIP8412)
2017.2
Since n = 2, we have
The normal n n
distribution
hY i Y
MZ (t) = E exp {ti Zi } = E [exp {ti Zi }]
The multivariate
normal i=1 i=1
distribution 2
n1X o
An application = exp ti2 = exp 1/2(t12 + t22 ) , for all (t1 , t2 ) ∈ R2
PCA 2 i=1
1.04
MZ1 ,Z2 (t1 , t2 )
1.02
1 0.2
−0.2 0
0
0.2−0.2
t2
t1
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
An application
PCA Z is said to have a multivariate normal distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
Let Σ be a n × n, symmetric and positive-definite matrix (z′ Σz > 0, ∀z)
2017.2
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
An application
PCA Σ = Γ′ ΛΓ
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
Remark
normal Eigenvalues and eigenvectors
distribution
y = Ax
yei = Aei = λi ei
λi is often called an eigenvalue of A
ei is the corresponding eigenvector
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution Aei = λi ei
The multivariate
normal
distribution
Every (n × n) matrix has n eigenvalues and up to n eigenvectors
An application
They can be jointly specified
PCA
(λ1 In − A)e1 = 0
(λ2 In − A)e2 = 0
···
(λn In − A)en = 0
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The set of n vector equations can have non-trivial solutions
The normal
distribution
The n values of λi need be solutions to the scalar equation
The multivariate
normal
distribution |λIn − A| := ∆(λ) = 0
An application
(λIn − A) is called the characteristic matrix of A
PCA
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
If the roots are all distinct, there will be n independent eigenvectors
The multivariate
normal If the roots are repeated, there may be fewer eigenvectors
distribution
An application
PCA
For a distinct eigenvalue λi , the eigenvector ei is contained in Adj(λi In − A)
Adj(λi In − A) = α1 ei α2 ei · · · αn ei
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
An application Together, the n eigenvectors form the columns of the modal matrix E
PCA
E = e1 e2 · · · en
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The modal matrix can be used to diagonalise A
The normal
distribution To transform A into a diagonal matrix (of eigenvalues)
The multivariate
normal
distribution
Ae1 = λ1 e1
An application Ae2 = λ2 e2
PCA
. .. , or AE = EΛ
.. .
Aen = λn en
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal E is non-singular because the ei are linearly independent
distribution
An application A = EΛE−1
PCA
Λ = E−1 AE
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution If A is a real, symmetric matrix, its eigenvalues are real
The multivariate • The eigenvectors are orthogonal to each other
normal
distribution There are n eigenvectors, even with repeated roots
An application
PCA
The eigenvectors corresponding to distinct eigenvalues
Adj(λi In − A) = α1 ei α2 ei · · · αn ei
d m−1
Adj(λi In − A) = ··· ei1 ei2 ··· eim ···
dλm−1 λ=λi
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
v1,1 v2,1 ··· vn,1 λ1 0 ··· 0 v1,1 v1,2 ··· v1,n
v1,2 v2,2 ··· vn,2 0 λ2 ··· 0 v2,1 v2,2 ··· v2,n
. . . . . .. . . . .
.. .. .. .. .. . .. .. .. ..
v1,n v2,n ··· vn,n 0 0 ··· λn vn,1 vn,2 ··· vn,n
| {z }| {z }
λ2 v2 ′
v2
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
Because the λi s are non-negative, we can define the diagonal matrix
distribution
p p p
The multivariate Λ1/2 = diag( λ1 , λ2 , . . . , λn )
normal 1/2
distribution λ1 0 ··· 0
An application 0
λ2 1/2 · · · 0
PCA = .. .. .. ..
. . . .
0 0 · · · λn 1/2
Σ = Γ′ Λ1/2 ΓΓ ′ 1/2
|{z} Λ Γ
I
| {z }
Λ
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution Σ = Γ′ Λ1/2 ΓΓ′ Λ1/2 Γ
|{z}
The multivariate I
normal | {z }
distribution
Λ
An application
PCA
Define the square root of the positive semi-definite matrix Σ
Suppose Σ is positive definite and all its eigenvalues are strictly positive
Then, we have
(Σ1/2 )−1 = Γ′ Λ−1 Γ = Σ−1/2 (17)
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
X = Σ1/2 Z + µ (18)
E [X] = µ
(19)
Cov[X] = Σ1/2 Σ1/2 = Σ
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution The MGF of X
An application
PCA
MX (t) = E exp t′ X = E exp {t′ Σ1/2 Z + t′ µ}
′
= exp (t′ µ)E exp Σ1/2 t Z
′ (20)
= exp (t′ µ) exp (1/2) Σ1/2 t Σ1/2 t
= exp (t′ µ) exp (1/2)t′ Σt
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution Definition
The multivariate Multivariate normal
normal
distribution
A n-dimensional random vector X has a multivariate normal distribution
An application
PCA
if its MGF is
MX (t) = exp t′ µ + (1/2)t′ Σt (21)
for all t ∈ Rn
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution The definition is for positive semi-definite positive matrices Σ
An application • Σ is usually positive definite
PCA
We can get the density of X
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal X = Σ1/2 Z + µ
distribution
An application
The transformation between X and Z is 1-to-1
PCA
Z = Σ−1/2 (X − µ)
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
1 h 1 i
fX (x) = exp − (x − µ)′ Σ−1 (x − µ) , for x ∈ Rn (22)
(2π)n/2 |Σ|1/2 2
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
A corollary gives us the marginal distributions of a multivariate normal RV
distribution
An application
PCA
Let X1 be any sub-vector of X of dimension m < n
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate In the same way, we partition mean and covariance matrix of X
normal
distribution
µ1
An application µ=
PCA
µ2
(24)
Σ11 Σ12
Σ=
Σ21 Σ22
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
We define A to be the matrix
h i
The multivariate
normal A = Im 0mp
distribution
An application
1 0 ··· 0 0 ··· ··· 0
PCA 0 1 · · · 0 0 ··· ··· 0
= . . .. .. .. .. .. ..
.. .. . . . . . .
0 0 ··· 1 0 ··· ··· 0
Then, X1 = AX
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412) We apply Theorem 2.1 to the transformation X1 = AX
2017.2
The multivariate
Corollary 2.1
normal
distribution Suppose X has a Nn (µ, Σ) distribution, partitioned as in Equation (23-24)
An application
PCA µ1
µ=
µ2
Σ11 Σ12
Σ=
Σ21 Σ22
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2 Example
The normal Consider the multivariate normal in the case when n = 2
distribution
The bivariate normal
The multivariate
normal
distribution We use common notation (X , Y ) rather than (X1 , X2 )
An application
PCA Suppose that (X , Y ) ∼ N2 (µ, Σ)
µ1
µ=
µ2
(25)
σ22 σ12
Σ=
σ21 σ22
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution Substituting ρσ1 σ2 for σ12 in Σ
The multivariate 2
normal σ2 ρσ1 σ2
Σ=
distribution
ρσ2 σ1 σ22
An application
PCA
The inverse of Σ
1 σ22 −ρσ1 σ2
Σ−1 = (26)
σ12 σ22 (1 − ρ2 ) −ρσ2 σ1 σ12
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
From this expression, the PDF of (X , Y )
distribution
1
e (−q/2) ,
An application
f (x , y) = p −∞ < x , y < ∞ (27)
PCA
2πσ1 σ2 1 − ρ2
with
1 h x − µ1 2 x − µ1 x − µ2 y − µ2 2 i
q= − 2ρ + (28)
1− ρ2 σ1 σ1 σ2 σ2
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
An application
In general, if X and Y are independent RVs, correlation coefficient is zero
PCA
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
1
The multivariate f (x , y) = p e (−q/2) , −∞ < x , y < ∞
normal
distribution
2πσ1 σ2 1 − ρ2
An application
h x − µ1 2 y − µ2 2 i
PCA q= +
σ1 σ2
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2 If two random variables are independent their covariance is zero
• The converse is not necessarily true
The normal
distribution
The multivariate
Yet, it can be shown that this is true for the multivariate normal
normal
distribution
An application
PCA
Theorem 2.2
Suppose that X ∼ Nn (µ, Σ)
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
Proof
The multivariate
normal
distribution Note that Σ12 = Σ′21
An application
PCA The joint MGF of X1 and X2
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
By Corollary (2.1),
normal
distribution
• X1 ∼ Nm (µ1 , Σ11 )
An application
PCA • X2 ∼ Np (µ2 , Σ22 )
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution n 1 ′ o
An application
exp t′1 µ1 + t′2 µ2 + t1 Σ11 t1 + t′2 Σ22 t2 + t′2 Σ21 t1 + t′1 Σ12 t2
2
PCA
n 1 ′ o
exp t′1 µ1 + t′2 µ2 + t1 Σ11 t1 + t′2 Σ22 t2
2
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution Theorem
The multivariate Suppose X has a Nn (µ, Σ) distribution, partitioned as in Equation (23-24)
normal
distribution
µ1
An application µ=
µ2
PCA
Σ11 Σ12
Σ=
Σ21 Σ22
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
Proof
The multivariate
normal
distribution Consider the joint distribution of W = X1 − Σ12 Σ−1
22 X2 and X2
An application
The joint distribution can be obtained from the transformation
PCA
W I −Σ12 Σ−1 X1
= m 22
X2 0 Ip X2
| {z }
A
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution The means
The multivariate
normal
• E (W) = µ1 − Σ12 Σ−1
22 µ2
distribution
• E (X2 ) = µ2
An application
PCA
The covariance matrix
Im −Σ12 Σ−1
22 Σ11 Σ12 Im 0′
0 Ip Σ21 Σ22 −Σ−1
22 Σ21 Ip
| {z }| {z }| {z }
A Σ A′
Σ11 − Σ12 Σ−1
22 Σ21 0′
=
0 Σ22
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
By Theorem 2.2, the random vectors W and X2 are independent
normal
distribution
• The conditional of W2 |X2 equals the marginal of W
An application
PCA W|X2 ∼ Nm µ1 − Σ12 Σ−1 −1
22 µ2 , Σ11 − Σ12 Σ22 Σ21
W + Σ12 Σ−1
22 X2 |X2
∼ Nm µ1 − Σ12 Σ−1 −1 −1
22 µ2 + Σ12 Σ22 X2 , Σ11 − Σ12 Σ22 Σ21
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2 Example
The normal Suppose that (X , Y ) ∼ N2 (µ, Σ)
distribution
The multivariate µ1
normal µ=
distribution
µ2
An application σ22 σ12
Σ=
PCA σ21 σ22
Nm µ1 − Σ12 Σ−1 −1 −1
22 µ2 + Σ12 Σ22 X2 , Σ11 − Σ12 Σ22 Σ21
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The multivariate σ2
normal E (Y |x ) = µ2 + ρ
distribution σ1
An application
PCA The conditional mean of Y , given that X = x is linear in x
Remark
This follows from the fact that the coefficient of x in a linear conditional
mean E (Y |x ) is the product of correlation coefficient and σ2 /σ1
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
Most of the probability for the distribution of (X , Y ) lies within the band
σ2 p
µ2 + ρ (x − µ1 ) ± (≈ 2.57)σ2 1 − ρ2
σ1
about the plot of the linear conditional mean
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
Theorem
The multivariate
normal Suppose X has a Nn (µ, Σ) distribution
distribution
Proof
Pn
Let W = Z′ Z = i=1 Zi2
The normal The multivariate normal distribution (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution Each Zi has a N (0, 1) distribution, for i = 1, 2, . . . , n
An application
From Theorem 1.1, it follows that Zi2 has a χ2 (1) distribution
PCA
The normal
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
An application
PCA
An application
The normal distribution
The normal
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
An application
PCA
Principal components analysis
An application
The normal Principal components analysis
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
We consider applications of the multivariate normal distribution
An application
• Principal components analysis (PCA)
PCA
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
Let the random vector X has the multivariate normal distribution Nn (µ, Σ)
The normal
distribution • Σ is positive definite
The multivariate
normal
distribution
An application
Consider the spectral decomposition of Σ = Γ′ ΛΓ
PCA The eigenvalues form the main diagonal of Λ
λ1 , λ2 , . . . , λn
v1 , v2 , . . . , vn
λ1 ≥ λ2 ≥ · · · ≥ λn > 0
The normal Principal components analysis (cont.)
distribution
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
Define the random vector Y = Γ(X − µ)
An application
PCA ΓΣΓ′ = Λ, thus Y has a Nn (0, Λ) distribution (by Theorem 2.1)
The components Y1 , Y2 , . . . , Yn are independent RVs
Yi has a N (0, λi ) distribution, for i = 1, 2, . . . , n
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution Total variation of a random vector sums the variances of its components
An application
PCA
For a random vector X, because Γ is an orthogonal matrix
n
X n
X
TV (X) = σi2 = Tr(Σ) = Tr(Γ′ ΛΓ) = Tr(ΛΓΓ′ ) = λi = TV (Y)
i=1 i=1
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
Consider the first component of Y
An application
PCA
Y1 = v1′ (X − µ)
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
Consider any other linear combination of (X − µ)
The normal
distribution
a′ (X − µ), with ||a||2 = 1
The multivariate
normal
distribution
As a ∈ Rn and because {v1 , . . . , vn } form a basis for Rn
An application
PCA We must have
n
X
a= aj vj
j =1
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution Pn
We have that Σ = Γ′ ΛΓ = i=1 λi vi vi′ and that λi > 0
The multivariate
normal
distribution
Then, the inequality
An application n
X
PCA
Var(a′ X) = a′ Cov(X)a = a′ Σa = λi (a′ vi )2
i=1
n n
(31)
X X
= λi ai2 ≤ λ1 ai2 = λ1 = Var(Y1 )
i=1 i=1
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
distribution
An application
Other components Y2 , Y3 , . . . , Yn share a similar property
PCA
UFC/DC
ATAII (CK0146)
PR (TIP8412)
2017.2
The normal
distribution
The multivariate
normal
Theorem
distribution
For j = 2, . . . , n and i = 1, 2, . . . , j − 1,
An application
PCA Var[a′ X] ≤ λj = Var(Yj )
Proof
The proof follows the lines of that for the first principal component (⋆)