Professional Documents
Culture Documents
1 1 1 1
c e m p
CONFIRMATORY
FACTOR ANALYSIS:
ONE FACTOR MODELS
measurement error.
Ability to use latent variables is the primary
strength of SEM
Multi-Indicator Approach
Reduces the overall effect of measurement error of
any individual observed variable on the accuracy of
the results
We distinguish between observed variables
(indicators) and underlying latent variables or
factors (constructs)
measurement model: observed variables and the
latent variables
Error Error Error
Error Error Error VT AT ST
Var1 Var2 Var3
1 1 1
1 1 1
Verbal
Abilities
Latent
Variable
Construct
of interest
Exploratory Factor Analysis (EFA)
Problems
Communality: must know the communality
before estimation, but communality is a
function of the loadings
Number of factors
Rotation: when there are two or more
factors, the solution is not unique
Exploratory Factor Analysis (EFA)
Principal Components
Communality is set to 1
Factor is defined as the sum of the variables
Loadings chose to maximize the explanation of the
variances of the measures
Loadings are usually too “high” in that the predicted
correlations are larger than the observed correlations
Exploratory Factor Analysis (EFA)
Principal Factors
Communality iteratively estimated
Factor is a latent variable
Loadings chose to:
to maximize the explanation of the correlations between the
measures.
minimizes the sum of squared residuals
residual = observed correlation minus predicted correlation
Exploratory Factor Analysis (EFA)
Maximum Likelihood
Solution is iteratively estimated
Factor is a latent variable
Loadings chosen to maximize the explanation of the
correlations between the measures
tries harder to explain the larger correlations
Statistical tests available
Exploratory Factor Analysis (EFA)
Example: Bollen.sps
Note that Principal Components loadings generally larger than the other methods.
PA and ML fairly similar.
EFA vs. CFA
EFA is useful when the researcher does not know
how many factors there are or when it is uncertain
what measures load on what factors
EFA typically used as a “data reduction” strategy
Both EFA and CFA reduce a larger number of
observed variables into a smaller number of latent
factors
However, EFA is done with little a priori hypothesis;
CFA requires a priori specification based on
hypothesis
Assumptions of CFA
Multivariate normality
Sufficient sample size
Correct model specification
Sampling Assumptions- Simple random sample
Confirmatory Factor Analysis
Representation in SEM Error Error Error
Var1 Var2 Var3
Latent variable represented by a circle
Measured variables (indicators) represented by
1 1 1
a square
Each indicator variable has an error term
CFA Initial Specification Observed Observed Observed
Variable 1 Variable 2 Variable 3
Each variable loads on one and only one factor
Factors can (and typically are) correlated 1
Errors across indicator variables are
independent
Assumptions
Latent
The factors are uncorrelated with the Variable
measurement errors
Most (if not all) of the errors of different
indicators are uncorrelated with each other
Residuals In CFA
1 1 1 1
c e m p
G
Identification
Identification in CFA is largely determined by the
number of indicator variables used in the model
(more later).
Number of indicators
◦ 2 is the minimum
◦ 3 is safer, especially if factor correlations are weak
◦ 4 provides safety
◦ 5 or more is more than enough (If too many indicators then
combine indicators into sets or parcels.)
Identification
Overidentified model = knowns > unknowns
Number of knowns = Number of variances and covariances of observed
variables computed by k(k+1)/2, where k is the number of observed
variables
Number of unknowns (free parameters) is based on the specified model. It is
typically a sum of the number of:
exogenous variables (one variance estimated for each)
BSI2
1
e2 Number of knowns = 28
W1 V3
(7*8)/2=28
1
BSI5 e5
V1 W4
V4
1 1
BSI8 e8
Depression
V5
Factor W2 1
BSI10 e10
W5
W6 BSI14
1
e14
V6 Number of unknowns =
W3
BSI16
1
e16
V7
14
1
V8 Variance of latent factor (1)
BSI18 e18
Free factor loadings (6)
Variances of error terms (7)
e1 e2 e3 e4
c e m p
G
4 basic steps- CFA:
1. Define the factor model.
select the number of factors to determine the
nature of the paths between the factors and the
measures.
Paths can be fixed at zero, fixed at another
constant value, allowed to vary freely, or be
allowed to vary under specified constraints (such
as being equal to each other).
2. Fit the model to the data.
4 Basic Steps for CFA
3. Evaluate model adequacy.
When the factor model is fit to the data, the factor
loadings are chosen to minimize the discrepancy
between the correlation matrix implied by the model
and the actual observed matrix.
The amount of discrepancy after the best parameters
are chosen can be used as a measure of how
consistent the model is with the data.
Fit statistics
4 Basic Steps for CFA
4. Compare with other models.
To compare two nested models, examine the
difference between their 2 statistics.
Most tests of individual factor loadings can be
made as comparisons of full and reduced factor
models.
For non-nested models, you can compare the Root
mean square error of approximation (RMSEA), an
estimate of discrepancy per degree of freedom in
the model, other fit indices, and the AIC and BIC.