Professional Documents
Culture Documents
1
HOW DO WE FIND OUT -
Answer…
Discriminant Analysis
2
DISCRIMINANT ANALYSIS – BASIC CONCEPT
D = b0 + b1 X1 + b2 X2 + … + bk Xk + … + bn Xn
3
DISCRIMINANT FUNCTION
Group2
4
If we want to draw a line to separate out the two groups here – what do we do?
5
Linear Discriminant Analysis helps in creating an axis that maximizes the separability
between categories
6
The new axis has been able to do a good job of separating out the two categories
7
How is this achieved?
8
How is this achieved?
9
How about LDA for 3 categories?
x2
x1
10
DATA TYPE
11
TERMINOLOGIES
• Analysis sample: Part of total sample that is used for estimation of the discriminant
function. Usually, 75% of the total sample constitute the analysis sample. Also called
estimation sample.
• Holdout sample: That part of total sample that is used to check the results of the
estimation sample. Usually, 25% of the total sample constitute the holdout sample. Also
called validation sample.
• Centroid: Is the mean values for the discriminant scores for a particular group. There
are as many centroids as there are groups.
12
TERMINOLOGIES
• F values : Is the ratio of the between sum of squares to the within sum of squares of
variable.
• Wilks’ Lambda: Is the ratio of the within sum of squares to the total sum of squares for
the entire set of variables in analysis. Wilks’ Lambda varies between 0 to 1. Also called
U statistics.
• Classification matrix : Is a matrix that contains the number of correctly classified and
misclassified cases.
13
ASSUMPTIONS
• Groups must be mutually exclusive, with every case belonging to only one group.
• Homogeneity of covariance/correlation
14
Simulator
15
Thank you!
16