Discriminant 5

Discriminant Analysis
Discriminant analysis is a multivariate data

analysis tool used in marketing analytic.
It is a method of constructing a linear

combination of the variables in such a way
that this newly created function optimally
discriminates among the groups.
In this analysis, two or more groups are compared and

the question is: do the groups differ, and if so, how.
For example, we might be interested in determining
the characteristics that differentiate the following:
1. Light and heavy users of a product.

2. Purchasers of our brand and those of competing
brands.
3. Customers who patronize every-day-low pricing
retail outlets vs. those who shop at high-end, service
oriented ones.
4. Good and poor sales representatives.
Objectives of Discriminant Analysis
• To find a linear combination of variables that discriminant

between categories of dependent variable in the best possible
manner.
• To find out which independent variables are relatively better
in discriminating between groups.
• To determine the statistical significance of the discriminant
function and whether any statistical difference exists among
groups in terms of predictor variables.
• To develop the procedure for assigning new objects, firms or
individuals whose profile but not the group identity are known
to one of the two groups.
• To evaluate the accuracy of classification, i.e., the percentage
of customers that it is able to classify correctly.
Discriminant Analysis Model
The discriminant analysis model involves linear combinations of

the following form:
D = b0 + b1X1 + b2X2 + b3X3 + . . . + bkXk
Where:
D = discriminant score
b 's = discriminant coefficient or weight
X 's = predictor or independent variable
• The coefficients, or weights (b), are estimated so that the groups

differ as much as possible on the values of the discriminant function.
• This occurs when the ratio of between-group sum of squares to
within-group sum of squares for the discriminant scores is at a
maximum.
Statistics Associated with
• Canonical correlation. Canonical correlation measures

the extent of association between the discriminant scores
and the groups. It is a measure of association between the
single discriminant function and the set of dummy variables
that define the group membership.
• Centroid. The centroid is the mean values for the
discriminant scores for a particular group. There are as
many centroids as there are groups, as there is one for each
group. The means for a group on all the functions are the
group centroids.
• Classification matrix. Sometimes also called confusion or
prediction matrix, the classification matrix contains the
number of correctly classified and misclassified cases.
Statistics Associated with Discriminant
Analysis
• Discriminant function coefficients. The discriminant

function coefficients (unstandardized) are the multipliers of
variables, when the variables are in the original units of
measurement.
• Discriminant scores. The unstandardized coefficients are
multiplied by the values of the variables. These products
are summed and added to the constant term to obtain the
discriminant scores.
• Eigenvalue. For each discriminant function, the Eigenvalue
is the ratio of between-group to within-group sums of
squares. Large Eigenvalues imply superior functions.
Statistics Associated with Discriminant Analysis
• F values and their significance. These are calculated from

a one-way ANOVA, with the grouping variable serving as the
categorical independent variable. Each predictor, in turn,
serves as the metric dependent variable in the ANOVA.
• Group means and group standard deviations. These are

computed for each predictor for each group.
• Pooled within-group correlation matrix. The pooled

within-group correlation matrix is computed by averaging
the separate covariance matrices for all the groups.
Statistics Associated with Discriminant Analysis
• Standardized discriminant function coefficients. The standardized

discriminant function coefficients are the discriminant function
coefficients and are used as the multipliers when the variables have
been standardized to a mean of 0 and a variance of 1.
• Structure correlations. Also referred to as discriminant loadings, the

structure correlations represent the simple correlations between the
predictors and the discriminant function.
• Total correlation matrix. If the cases are treated as if they were

from a single sample and the correlations computed, a total correlation
matrix is obtained.
• Wilks' . Sometimes also called the U statistic, Wilks' for each

 is the ratio of the within-group sum of squares to the total
predictor
sum of squares. Its value varies between 0 and 1. Large  values
of (near 1) indicate that group means do not seem to be different.
Small values of (near 0) indicate that the group means seem to be

different.

Conducting Discriminant Analysis
Formulate the Problem
Estimate the Discriminant Function Coefficients
Determine the Significance of the Discriminant Function
Interpret the Results
Assess Validity of Discriminant Analysis

Discriminant 5

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Discriminant 5

Uploaded by

Copyright:

Available Formats

Discriminant Analysis

Discriminant analysis is a multivariate data

It is a method of constructing a linear

In this analysis, two or more groups are compared and

1. Light and heavy users of a product.

• To find a linear combination of variables that discriminant

The discriminant analysis model involves linear combinations of

• The coefficients, or weights (b), are estimated so that the groups

• Canonical correlation. Canonical correlation measures

• Discriminant function coefficients. The discriminant

• F values and their significance. These are calculated from

• Group means and group standard deviations. These are

• Pooled within-group correlation matrix. The pooled

• Standardized discriminant function coefficients. The standardized

• Structure correlations. Also referred to as discriminant loadings, the

• Total correlation matrix. If the cases are treated as if they were

• Wilks' . Sometimes also called the U statistic, Wilks' for each

Formulate the Problem

Estimate the Discriminant Function Coefficients

Determine the Significance of the Discriminant Function

Interpret the Results

Assess Validity of Discriminant Analysis

You might also like