Professional Documents
Culture Documents
• Objectives:
Fisher Linear Discriminant Analysis
Multiple Discriminant Analysis
Examples
• Resources:
D.H.S.: Chapter 3 (Part 2)
W.P.: Fisher
DTREG: LDA
S.S.: DFA
URL: Audio:
Component Analysis (Review)
• Previously introduced as a “whitening transformation”.
• Consider the problem of projecting data from d dimensions onto a line with
the hope that we can optimize the orientation of the line to minimize error.
n 2
• Our challenge is to find w that J0 x0 x0 xk
k 1
maximizes separation.
si 2 y m
~ ~ 2
i
yYi
1 2 ~2
• An estimate of the variance of the pooled data is: ( )(~
s1 s2 )
n
and is called the within-class scatter.
Si x - mi x - mi t
xDi
• The total scatter, Sw, is:
SW S1 S 2
xDi
w t x mi x mi t w w t Si w
xDi
~ m
m1 2
~ w tm w tm
1 2 2
w t m1 m2 m1 m2 t w
wtSBw
S B m1 m2 m1 m2 t
• This ratio is well-known as the generalized Rayleigh quotient and has the
well-known property that the vector, w, that maximizes J(), must satisfy:
S B w SW w
ST x - m x - m t
x
• The total scatter is related to the within-class scatter (derivation omitted):
c
ST SW S B S B ni (mi m)(mi m)t
i 1
• We have c-1 discriminant functions of the form:
y W t x w ti x i 1,2 ,...,c-1
• Because SB is the sum of c rank one or less matrices, and because only c-1 of
these are independent, SB is of rank c-1 or less.