Latent Variable
A person may be asked to respond to these statements on a Likert scale,
with 1 = highly disagree, 3 = neutral and 5 = highly agree.
Basic Terminology
An identity matrix is one in which all the elements of the principal
diagonal are ones and all the other elements are zeros.
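As a small illustration of this definition, the Python sketch below builds an
identity matrix and checks the property. The function names are illustrative
only and are not part of SPSS. The idea matters later because Bartlett's test
asks whether the correlation matrix of the observed variables differs from an
identity matrix.

```python
def identity_matrix(n):
    """Return an n x n identity matrix as a list of rows."""
    return [[1 if row == col else 0 for col in range(n)] for row in range(n)]


def is_identity(matrix):
    """Check that the diagonal holds ones and every other element is zero."""
    size = len(matrix)
    return all(
        len(matrix[row]) == size
        and all(matrix[row][col] == (1 if row == col else 0) for col in range(size))
        for row in range(size)
    )


print(identity_matrix(3))

# A correlation matrix with non-zero off-diagonal values is not an identity matrix.
print(is_identity([[1.0, 0.3], [0.3, 1.0]]))
```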
Method for EFA
Do not use PCA (Principal Component Analysis), as it is just a data
reduction technique, whereas EFA is a technique to construct latent
variables.
Rotation can be simply defined as any of a variety of methods used to
further analyze the initial EFA results.
Rotation aims to make the pattern of loadings clearer and more
pronounced.
Running EFA on SPSS
Add all the observed variables to the factor analysis variable box.
Click the “Descriptives” button and, in the dialog box, tick the “KMO
and Bartlett’s test” option.
Next, click the “Extraction” button and, in the dialog box, open the
“Method” list and select “Principal Axis Factoring”. Click Continue.
Next, click the “Rotation” button and, in the dialog box, select “Direct
Oblimin”. Then click Continue and click OK.
Examining the Output
Examine the KMO and Bartlett’s test values.
[Figure: SPSS output table with the KMO value and the p-value of Bartlett’s
test highlighted]
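To make the Bartlett value less of a black box, here is a minimal Python
sketch of the standard test statistic for Bartlett's test of sphericity,
chi-square = -((n - 1) - (2p + 5) / 6) * ln|R|, with p(p - 1) / 2 degrees of
freedom. The correlation matrix and sample size are hypothetical, the function
names are illustrative, and the sketch stops at the statistic: SPSS also
reports the p-value (from the chi-square distribution) and the KMO measure,
which are not computed here.

```python
import math


def determinant(matrix):
    """Determinant by Gaussian elimination with partial pivoting."""
    a = [list(map(float, row)) for row in matrix]
    n = len(a)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if a[pivot][col] == 0.0:
            return 0.0
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            det = -det  # a row swap flips the sign of the determinant
        det *= a[col][col]
        for row in range(col + 1, n):
            factor = a[row][col] / a[col][col]
            for k in range(col, n):
                a[row][k] -= factor * a[col][k]
    return det


def bartlett_sphericity(corr, n_obs):
    """Return (chi-square statistic, degrees of freedom)."""
    p = len(corr)
    chi_square = -((n_obs - 1) - (2 * p + 5) / 6) * math.log(determinant(corr))
    dof = p * (p - 1) // 2
    return chi_square, dof


# Hypothetical correlation matrix for three observed variables.
corr = [
    [1.0, 0.6, 0.5],
    [0.6, 1.0, 0.4],
    [0.5, 0.4, 1.0],
]
chi_square, dof = bartlett_sphericity(corr, n_obs=100)
print(f"chi-square = {chi_square:.2f}, df = {dof}")
```

If the correlation matrix were an identity matrix, its determinant would be 1
and the statistic would be 0, which is why a large statistic (small p-value)
suggests the variables are correlated enough for factor analysis.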
Examine the eigenvalues. Here we have obtained 3 factors, as there are 3
eigenvalues greater than 1.
[Figure: SPSS output table with the eigenvalues highlighted]
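The "eigenvalue greater than 1" rule can be expressed in a couple of lines.
The sketch below uses hypothetical eigenvalues for six observed variables
(not the values from the SPSS output) and an illustrative function name.

```python
def count_factors(eigenvalues, threshold=1.0):
    """Count eigenvalues strictly greater than the threshold."""
    return sum(1 for value in eigenvalues if value > threshold)


# Hypothetical eigenvalues for six observed variables.
eigenvalues = [2.6, 1.5, 1.1, 0.5, 0.2, 0.1]
print(count_factors(eigenvalues))
```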
Examine the “structure matrix” for the factor loadings, irrespective of their
sign. The factor on which a variable has its highest loading is the factor to
which that variable belongs.
[Figure: structure matrix output with Factor 1 highlighted]
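The assignment rule (take the largest loading in absolute value) can be
sketched as follows. The loadings are hypothetical and the function name is
illustrative; in practice the values would be read from the structure matrix
in the SPSS output.

```python
def assign_to_factors(loadings):
    """Map each variable to the factor with its largest absolute loading."""
    assignment = {}
    for variable, row in loadings.items():
        assignment[variable] = max(row, key=lambda factor: abs(row[factor]))
    return assignment


# Hypothetical structure-matrix loadings (variable -> factor -> loading).
loadings = {
    "var1": {"Factor 1": 0.12, "Factor 2": -0.08, "Factor 3": 0.81},
    "var2": {"Factor 1": 0.20, "Factor 2": 0.15, "Factor 3": -0.77},
    "var4": {"Factor 1": 0.69, "Factor 2": 0.31, "Factor 3": 0.05},
}
print(assign_to_factors(loadings))
```

Note that var2 is assigned to Factor 3 even though its loading there is
negative, because the sign is ignored.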
Follow Up
For example, Factor 3 had var1, var2 and var3 in it; enter them in the “Items”
box in the reliability analysis dialog box.
For reliable measurement, Cronbach’s alpha should be greater than 0.7.
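For readers who want to see what the reliability figure measures, here is a
minimal Python sketch of the usual Cronbach's alpha formula,
alpha = k / (k - 1) * (1 - sum of item variances / variance of total score).
The responses are hypothetical Likert scores and the function name is
illustrative; SPSS reports the same quantity in its reliability output.

```python
from statistics import variance


def cronbach_alpha(items):
    """Cronbach's alpha for a list of item-score lists (one list per item)."""
    k = len(items)
    # Total score of each respondent across all items.
    totals = [sum(scores) for scores in zip(*items)]
    item_variance_sum = sum(variance(scores) for scores in items)
    return (k / (k - 1)) * (1 - item_variance_sum / variance(totals))


# Hypothetical Likert responses (1-5) from six respondents to three items.
var1 = [4, 5, 3, 2, 4, 5]
var2 = [4, 4, 3, 2, 5, 5]
var3 = [5, 5, 2, 1, 4, 4]
alpha = cronbach_alpha([var1, var2, var3])
print(f"alpha = {alpha:.3f}, reliable = {alpha > 0.7}")
```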
Training an LDA Model in SPSS
In the dialog box, enter the independent variables in the box labeled
“Independents” and the nominal dependent variable in the “Grouping Variable”
box.
Next, click the “Define Range” button and enter the range of numeric codes
used for the nominal variable. We will enter 1 and 3 here, as we have 3
classes in the dependent variable.
Next, click the “Statistics” button and, in the dialog box, check the “Box’s M”
option.
Next, click the “Classify” button and, in the dialog box, check the
“Combined-groups” option under Plots. This helps us to visualize the
discriminant equation and make predictions.
The classification performance of a classifier is usually checked using a
confusion matrix, which shows the correctly and incorrectly classified cases.
To generate a confusion matrix, click the “Classify” button in the
discriminant analysis dialog box and, in the dialog box that opens, check
“Summary table” under the Display section.
Examining the Output
The first thing to examine is Box’s M test for homogeneity of variance
(equality of the group covariance matrices). We can see below that the p-value
associated with this test is less than 0.05, which indicates that the
assumption of homogeneity has been violated.
[Figure: Box’s M test output with the p-value highlighted]
Here, we can see that the first function has the highest eigenvalue, so we will
use the discriminant coefficients of this function to build the final equation.
[Figure: discriminant coefficients table]
So, the final discriminant equation will be:
D = -0.422 × sepal length - 0.522 × sepal width + 0.940 × petal length
    + 0.585 × petal width
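The equation can be turned into a small scoring function. The sketch below
uses the four coefficients from the equation; the flower measurements are
hypothetical and are not taken from the dataset. It also assumes the
coefficients apply directly to the values supplied: if they are standardized
coefficients, the measurements would need to be standardized first, which the
source does not specify.

```python
# Coefficients of the first discriminant function, taken from the equation.
COEFFICIENTS = {
    "sepal_length": -0.422,
    "sepal_width": -0.522,
    "petal_length": 0.940,
    "petal_width": 0.585,
}


def discriminant_score(measurements):
    """Weighted sum of the four measurements using the coefficients above."""
    return sum(COEFFICIENTS[name] * measurements[name] for name in COEFFICIENTS)


# Hypothetical flower; these values are an assumption for illustration.
flower = {
    "sepal_length": 6.0,
    "sepal_width": 3.0,
    "petal_length": 4.5,
    "petal_width": 1.5,
}
print(round(discriminant_score(flower), 4))
```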
Evaluating Performance
This plot helps to visualize where a particular point will belong based on
its discriminant score.
Let us pick a random data point (the first data point, for simplicity) from our
dataset with
Diagonal values are correctly classified cases and off-diagonal values are
incorrectly classified cases.
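To show how the diagonal relates to overall performance, the Python sketch
below builds a confusion matrix from actual and predicted class codes and
computes accuracy as the share of cases on the diagonal. The class codes and
predictions are hypothetical, and the function names are illustrative rather
than anything produced by SPSS.

```python
def confusion_matrix(actual, predicted, labels):
    """Rows are actual classes, columns are predicted classes."""
    index = {label: position for position, label in enumerate(labels)}
    matrix = [[0] * len(labels) for _ in labels]
    for truth, guess in zip(actual, predicted):
        matrix[index[truth]][index[guess]] += 1
    return matrix


def accuracy(matrix):
    """Share of cases on the diagonal, i.e. correctly classified cases."""
    correct = sum(matrix[i][i] for i in range(len(matrix)))
    total = sum(sum(row) for row in matrix)
    return correct / total


# Hypothetical class codes 1-3 for ten cases.
actual = [1, 1, 1, 2, 2, 2, 3, 3, 3, 3]
predicted = [1, 1, 1, 2, 3, 2, 3, 3, 2, 3]
matrix = confusion_matrix(actual, predicted, labels=[1, 2, 3])
for row in matrix:
    print(row)
print(f"accuracy = {accuracy(matrix):.2f}")
```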
Hierarchical Clustering in SPSS
Cluster analysis, or clustering, is the task of grouping a set of objects in
such a way that objects in the same group (called a cluster) are more
similar (in some sense) to each other than to those in other groups
(clusters).
Next, click the “Statistics” button and select “Agglomeration schedule”
and “Proximity matrix”. Click Continue.
Next, click the “Plots” button and select “Dendrogram”. Click Continue, then
click OK.
We can see there are 3 clusters at the first level. Clusters 1 and 2 are
similar, so they are grouped together at the second level. Lastly, at the third
level, we have the third cluster together with the combined cluster of 1 and 2.
[Figure: dendrogram]
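The merging order that a dendrogram displays can be reproduced with a short
agglomerative clustering sketch. This one uses single linkage for brevity;
SPSS offers several linkage methods and the source does not say which one was
used, so treat the choice as an assumption. The three points are hypothetical
and the function name is illustrative. The returned list plays the role of an
agglomeration schedule.

```python
import math


def single_linkage(points):
    """Agglomerative clustering; returns the merge schedule.

    Each schedule entry is (cluster_a, cluster_b, distance), where clusters
    are tuples of point indices.
    """
    clusters = [(i,) for i in range(len(points))]
    schedule = []
    while len(clusters) > 1:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                # Single linkage: distance between the closest pair of members.
                dist = min(
                    math.dist(points[i], points[j])
                    for i in clusters[a]
                    for j in clusters[b]
                )
                if best is None or dist < best[0]:
                    best = (dist, a, b)
        dist, a, b = best
        merged = clusters[a] + clusters[b]
        schedule.append((clusters[a], clusters[b], dist))
        # Replace the two merged clusters with their union.
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)]
        clusters.append(merged)
    return schedule


# Hypothetical points: indices 0 and 1 are close together, index 2 is far away.
points = [(1.0, 1.0), (2.0, 1.0), (8.0, 8.0)]
schedule = single_linkage(points)
for left, right, dist in schedule:
    print(left, right, round(dist, 3))
```

As in the dendrogram description, the two similar items are merged first and
the remaining one joins them at the last step.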
K-Means Clustering in SPSS
Load the data set and go to Analyze -> Classify -> K-Means Cluster.
Add the features to the “Variables” box and set the number of clusters to 3.
Click on options and select “ANOVA table” in the dialog box. Click continue and
click ok.
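For intuition about what the procedure does with the chosen number of
clusters, here is a minimal Python sketch of the basic k-means loop (assign
each point to its nearest centre, then move each centre to the mean of its
members). The data are hypothetical, the function name is illustrative, and
using the first k points as starting centres is a simplifying assumption; SPSS
selects its initial centres differently.

```python
import math


def k_means(points, k, iterations=100):
    """Basic k-means with the first k points as initial centres."""
    centres = [tuple(p) for p in points[:k]]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: each point joins its nearest centre.
        labels = [
            min(range(k), key=lambda c: math.dist(p, centres[c])) for p in points
        ]
        # Update step: each centre moves to the mean of its members.
        new_centres = []
        for c in range(k):
            members = [p for p, label in zip(points, labels) if label == c]
            if members:
                new_centres.append(
                    tuple(sum(axis) / len(members) for axis in zip(*members))
                )
            else:
                new_centres.append(centres[c])  # keep an empty cluster's centre
        if new_centres == centres:
            break  # converged: centres no longer move
        centres = new_centres
    return centres, labels


# Hypothetical two-feature data forming three groups.
points = [
    (1.0, 1.0), (5.0, 5.0), (9.0, 1.0),
    (1.2, 0.8), (5.1, 4.9), (8.8, 1.2),
    (0.8, 1.2), (4.9, 5.1), (9.2, 0.8),
]
centres, labels = k_means(points, k=3)
print(labels)
for centre in centres:
    print(tuple(round(value, 2) for value in centre))
```

The final centres are the mean feature values of each cluster, which is the
same kind of information SPSS reports for the final cluster centres.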
The Output
[Figure: K-means output tables, annotated with the mean value of the features
in each cluster and the importance of each feature in clustering]