Professional Documents
Culture Documents
Methods
Volume 10 | Issue 2 Article 16
11-1-2011
Recommended Citation
Beauducel, André (2011) "Indeterminacy of Factor Score Estimates In Slightly Misspecified Confirmatory Factor Models," Journal of
Modern Applied Statistical Methods: Vol. 10 : Iss. 2 , Article 16.
DOI: 10.22237/jmasm/1320120900
Available at: http://digitalcommons.wayne.edu/jmasm/vol10/iss2/16
This Regular Article is brought to you for free and open access by the Open Access Journals at DigitalCommons@WayneState. It has been accepted for
inclusion in Journal of Modern Applied Statistical Methods by an authorized editor of DigitalCommons@WayneState.
Journal of Modern Applied Statistical Methods Copyright © 2011 JMASM, Inc.
November 2011, Vol. 10, No. 2, 583-598 1538 – 9472/11/$95.00
André Beauducel
University of Bonn,
Bonn, Germany
Two methods to calculate a measure for the quality of factor score estimates have been proposed. These
methods were compared by means of a simulation study. The method based on a covariance matrix
reproduced from a model leads to smaller effects of sampling error.
Key words: Confirmatory factor analysis, structural equation modeling, indeterminacy, factor score
estimates.
583
MISSPECIFICATION AND INDETERMINACY OF CONFIRMATORY FACTOR MODELS
Fan & Sivo, 2007; Beauducel & Wittmann, one, the expectation of the covariance of the
2005; Marsh, Hau & Wen, 2004; Hu & Bentler, observed variables is Σ (ε[XX´] = Σ). The
1999). As a consequence, model parameters can covariance matrix Σ can be decomposed by
be over- and/or under-estimated not only
because of sampling error, but also because of a Σ = ΛΦΛ´ + Ψ2, (2)
difference between the model parameters and
the population parameters. where Φ represents the q by q factor correlation
There is a discussion on the size of matrix and Ψ2 the p by p covariance matrix
difference between model and data that might be between the observed variables X and the error
regarded as acceptable (Marsh, et al., 2004; scores E (Cov[X, E]= Ψ2) and Ψ2 also
Barrett, 2007), but a small difference between represents the covariance matrix of the error
the covariance matrix implied by the model and
scores E (Cov[E, E]= Ψ2). Ψ2 is generally
the empirical covariance matrix is accepted by
assumed to be a diagonal matrix and it will be
many researchers in structural equation
assumed herein that it contains only positive
modeling. A difference between model and data
values. In order to investigate CFA modelling as
could also occur in exploratory factor analysis,
it often occurs in empirical research, it was,
but the only way to obtain model
however, decided also to allow for some non-
misspecification in this context is over- or
diagonal elements of Ψ2.
under-extraction of factors. Nevertheless, this
article focuses on factor score indeterminacy as The factor score indeterminacy ρ, the
it is calculated from CFA with correctly and multiple correlation of the variables with the
misspecified model parameters, because factor can be described on the basis of
indeterminacy has rarely been evaluated in this Thurstone’s (1935) regression score estimate,
context. A simulation study was performed in which is the best linear factor score estimate
order to investigate the effects of sampling error (Krijnen, Wansbeek & Ten Berge, 1996). The
and model misspecification on factor score covariances of the factors with the best linear
indeterminacy. factor score estimates are given by
^
The Calculation of ρ or ρ² diag( F F´ ) = diag( FX´ Σ −1 ΛΦ ) . (3)
It should be noted that there are two
different ways to calculate indeterminacy, often
It follows from equation 1 that it is possible to
referred to as ρ, the correlation between the
variables and the factor (Grice, 2001). In order insert ΦΛ´ for FX´ into equation 3. Moreover, it
is possible to standardize the covariances of the
to present the calculation of ρ or ρ², the common
factors with the best linear factor score estimates
factor model is described first. The common
in order to obtain the correlations. This yields
factor model assumes that the observations are
generated by ^
diag( F F´)
X = ΛF + E, (1)
= diag(ΦΛ´Σ −1ΛΦ)diag(ΦΛ´Σ −1 ΛΦ)−1/ 2
where X is the random vector of observations of = diag(ΦΛ´Σ −1 ΛΦ)1/ 2
order p, F the random vector with factor scores
(4)
of order q, E the unobservable random error
vector of order p, and Λ the factor pattern matrix
so that the diagonal elements in the left hand
of order p by q. The observations X, the factor
side of equation 4 contain the correlations of the
scores F, and the error vectors E are assumed to
best linear factor score estimates with the
have an expectation zero (ε[X] = 0, ε[F] = 0, factors. Standardizing F is not necessary,
ε[E] = 0). The covariance between the factor because it has by definition a standard deviation
scores and the error scores is assumed to be zero of one. Because the best linear factor score
(Cov[F, E] = 0). The standard deviation of F is estimate is the best linear combination of the
584
ANDRÉ BEAUDUCEL
585
MISSPECIFICATION AND INDETERMINACY OF CONFIRMATORY FACTOR MODELS
correlations of all residuals were fixed to zero in The misspecified models were again estimated
the misspecified models (the variance of the by means of maximum likelihood estimation.
residuals was freely estimated). For the The variances of the factors were
orthogonal models the correlations between the constrained to be one, the non-salient loadings
factors were fixed to zero, for the oblique were fixed to zero, the unconstrained salient
models they were freely estimated. loadings were freely estimated, and the
Table 1 contains the correctly specified covariance matrix of the error terms was
and the misspecified population loadings for the constrained to be diagonal (there were no
0.40/0.60 (moderate loadings) condition and for correlated residuals in these models, but the
the 0.60/0.80 (large loadings) condition for the variances of the residuals were freely estimated).
orthogonal two-factor models based on the For the orthogonal models the correlations
population covariance matrices including between the factors were fixed to zero, for the
correlated residuals (the correlations between the oblique models they were freely estimated. The
residuals are presented at the bottom of Table 1). misspecification for the two-factor model was
Table 2 contains the corresponding introduced by means of equality constraints for
parameters for the oblique models. The each of the smaller loadings of the variables v1-
misspecified models would be accepted v4 on the first factor with each of the larger
according to conventional cut-off criteria for fit loadings v13-v16 on the second factor. For the
indices (e.g., Hu & Bentler, 1999). It was four- and eight-factor models, similar equality
intended to generate small and generally constraints were imposed on the loadings of
accepted amounts of model-misspecification, so each pair of factors.
that even the misspecified models investigated Table 3 contains the correctly specified
here represent models as they might be and the misspecified population loadings for the
published in empirical research. Nevertheless, 0.40/0.60 (moderate loadings) condition and for
the omission of the correlated residuals leads to the 0.60/0.80 (large loadings) condition for the
small errors with respect to the loading size both orthogonal two-factor models. The equality of
in the orthogonal and in the oblique model (see loadings resulting from the equality constraints
Tables 1 and 2). The population parameters for was not perfect in the completely standardized
the orthogonal and oblique four- and eight-factor solutions (it was perfect in the unstandardized
models would be identical to the corresponding solutions). Not surprisingly, the fit of the
parameters presented in Table 1 and 2 so that correctly specified population models was
they are not presented. perfect, but even the misspecified models fit the
Another type of model misspecification data very well (see Table 3). The misspecified
with an impact on the loading size and thereby population model would not be rejected
on the coefficients of indeterminacy occurs according to conventional fit criteria (Hu &
when equality constraints are imposed on Bentler, 1999). The population loadings were
loadings that are unequal in the population. In the same for the four- and eight-factor models
order to base the results of the present and are therefore not presented.
simulation study on more than one type of The population loadings for the
model misspecification, misspecifications correctly specified and the misspecified oblique
resulting from equality constraints on the two-factor models are presented in Table 4. As
loadings were also investigated. Again, the before, the model misspecification was
parameters of the correctly specified models introduced by means of equality constraints on
were fixed in the first step and then the loadings that were not equal in the population
corresponding population covariance matrices (see Table 4). Again an evaluation of the model
were calculated from the model parameters. fit of the misspecified models would not lead to
Finally, these population covariance matrices model rejection for conventional criteria (Hu &
were used for CFA modeling with misspecified Bentler, 1999).
parameters. Again, the model parameters were
chosen in a way to ensure that the reproduced
covariance matrices were correlation matrices.
586
ANDRÉ BEAUDUCEL
F1 F2 F1 F2 F1 F2 F1 F2
x1 .400 - .414 - .600 - .607 -
x2 .400 - .414 - .600 - .607 -
x3 .400 - .392 - .600 - .596 -
x4 .400 - .392 - .600 - .596 -
x5 .600 - .584 - .800 - .793 -
x6 .600 - .584 - .800 - .793 -
x7 .600 - .637 - .800 - .816 -
x8 .600 - .637 - .800 - .816 -
x9 - .400 - .414 - .600 - .607
x10 - .400 - .414 - .600 - .607
x11 - .400 - .392 - .600 - .596
x12 - .400 - .392 - .600 - .596
x13 - .600 - .584 - .800 - .793
x14 - .600 - .584 - .800 - .793
x15 - .600 - .637 - .800 - .816
x16 - .600 - .637 - .800 - .816
Correlated Residuals
x1 with x2 .126 .000 .096 .000
x7 with x8 .096 .000 .054 .000
x9 with x10 .126 .000 .096 .000
x15 with x16 .096 .000 .054 .000
Notes: a The model fit for the population model without misspecification is perfect by definition: χ²(100) =
0.00; b The χ²-test for the misspecified model with moderate loadings is non-significant even for the largest
sample size used in the simulation study (N=750): χ²(104) = 50.93; Comparative Fit Index = 0.99; Root Mean
Square Error of Approximation = 0.026; Standardized Root Mean Square Residual = 0.012. c The χ²-test for
the misspecified model with large loadings is non-significant even for the largest sample size used in the
simulation study (N=750): χ²(104)= 51.18; Comparative Fit Index = 0.99; Root Mean Square Error of
Approximation = 0.026; Standardized Root Mean Square Residual = 0.012.
587
MISSPECIFICATION AND INDETERMINACY OF CONFIRMATORY FACTOR MODELS
F1 F2 F1 F2 F1 F2 F1 F2
x1 .400 - .414 - .600 - .607 -
x2 .400 - .414 - .600 - .607 -
x3 .400 - .392 - .600 - .596 -
x4 .400 - .392 - .600 - .596 -
x5 .600 - .585 - .800 - .793 -
x6 .600 - .585 - .800 - .793 -
x7 .600 - .636 - .800 - .815 -
x8 .600 - .636 - .800 - .815 -
x9 - .400 - .414 - .600 - .607
x10 - .400 - .414 - .600 - .607
x11 - .400 - .392 - .600 - .596
x12 - .400 - .392 - .600 - .596
x13 - .600 - .585 - .800 - .793
x14 - .600 - .585 - .800 - .793
x15 - .600 - .636 - .800 - .815
x16 - .600 - .636 - .800 - .815
Interfactor-
.300 .289 .300 .297
Correlation
Correlated Residuals
x1 with x2 .126 .000 .096 .000
x7 with x8 .096 .000 .054 .000
x9 with x10 .126 .000 .096 .000
x15 with x16 .096 .000 .054 .000
Notes: a The model fit for the population model without misspecification is perfect by definition: χ²(99) =
0.00; b The χ²-test for the misspecified model with moderate loadings is non-significant even for the largest
sample size used in the simulation study (N=750): χ²(103) = 51.40; Comparative Fit Index = 0.97; Root Mean
Square Error of Approximation = 0.026; Standardized Root Mean Square Residual = 0.017. c The χ²-test for
the misspecified model with large loadings is non-significant even for the largest sample size used in the
simulation study (N=750): χ²(103)= 51.35; Comparative Fit Index = 0.99; Root Mean Square Error of
Approximation = 0.026; Standardized Root Mean Square Residual = 0.012.
588
ANDRÉ BEAUDUCEL
F1 F2 F1 F2 F1 F2 F1 F2
589
MISSPECIFICATION AND INDETERMINACY OF CONFIRMATORY FACTOR MODELS
F1 F2 F1 F2 F1 F2 F1 F2
590
ANDRÉ BEAUDUCEL
591
MISSPECIFICATION AND INDETERMINACY OF CONFIRMATORY FACTOR MODELS
a z-standardized common random variable v can order to allow for a combined analysis of
be described as orthogonal and oblique models. The conditions
for this analysis were computation method of
oi = 0.300.5 v + 0.700.5 zi, for i = 1 to 8. indeterminacy (according to equations 4 and 5),
(14) orthogonality (orthogonal versus oblique),
number of factors (2, 4 and 8 factors), loading
Eight oblique population factor scores size (moderate versus large loadings), and
were computed as a basis for the oblique two-, number of cases or sample size (250, 500 and
four- and eight-factor models. It follows from 750 cases).
equation 14 that the interfactor-correlations were For each of these 36 conditions 500
0.30 in the population, according to the weight samples were analyzed by means of CFA so that
of the common variable v (see Beauducel & the first simulation study was based on 18,000
Wittmann, 2005 for more details on the samples. For each sample one CFA with correct
aggregation of random variables). The oblique model specification and one CFA with incorrect
factor scores oi were inserted instead of the model specification was performed. For analysis
orthogonal factor scores fi into equations 6-9 in of the correctly and misspecified models based
order to generate the variables for the oblique on population data without correlated residuals,
factor models with moderate loadings and the population data sets 3 and 4 were combined
correlated residuals and in equations 10-13 in in order to allow for a combined analysis of
order to generate the variables for the oblique orthogonal and oblique models. The conditions
models with large loadings and correlated (computation method, orthogonality, number of
residuals. factors, loading size and number of cases) were
The two-factor models were based on o1 exactly as in the analysis of the models with
and o2, the four-factor models on o1-o4, and the correlated residuals.
eight factor models on o1-o8. For the orthogonal For the correctly specified models, the
models without correlated residuals, the 64 difference between the population ρ² of the
variables were generated only on the basis of fi correctly specified models and the samples ρ² of
and ei, without the common terms ck, so that the the corresponding correctly specified models
equations for the models contained only the (same number of factors, same loading size, etc.)
weights as in equations 7, 8, 11 and 12 (see was calculated and averaged across factors.
Table 3, for the corresponding loadings). For the For the misspecified models, the
oblique models without correlated residuals the difference between the population ρ² of the
equations were based on the random variables oi misspecified models and the samples ρ² of the
and ei and they had also the same weights as corresponding misspecified models (same
equations 7, 8, 11 and 12 (see Table 4, for the number of factors, same loading size, etc.) was
corresponding loadings). calculated and averaged across factors. The ρ²-
Subsamples of variables were analyzed differences were calculated for both computation
for the two- and four-factor models. The two- methods (see equation 4 and 5) and entered into
factor models were based on the variables x1-x16 repeated measures ANOVA.
(see Table 1), the four-factor models were based In order to limit the results to those that
on the variables x1-x32 and the eight-factor are interesting in the present context, only main-
models were based on the 64 variables. The two effects and interactions involving the factor
types of models and their corresponding Computation-method are reported. Due to the
misspecifications (omitted correlations between very large sample size (6,000 cases) all reported
residuals, specification of equal loadings) were effects were significant at p < 0.001 and only
analyzed separately, in order to allow for a effects with large effect sizes (partial η² > 0.20)
separate interpretation of the results. are reported. The effect sizes of the within-
For the analysis of the correctly and subjects effects were based on Greenhouse-
misspecified models based on population data Geisser corrected univariate effects.
with correlated residuals, the results from the
population data sets 1 and 2 were combined in
592
ANDRÉ BEAUDUCEL
Model Type Loading Size Specification Odd Factors Even Factors Odd Factors Even Factors
Correctly
.738 .738 .751 .751
Specified
.40
With Misspecified .761 .761 .761 .761
Correlated
Correctly
Residuals .897 .897 .903 .903
Specified
.60
Misspecified .906 .906 .906 .906
Correctly
.751 .751 .751 .751
Specified
.40
Without Misspecified .791 .697 .904 .623
Correlated
Correctly
Residuals .903 .903 .903 .903
Specified
.60
Misspecified .922 .883 1.011 .823
Notes: The column odd factors contains the mean ρ² for the factors with odd numbers, the column even factors
contains the mean ρ² for the factors with even numbers.
593
MISSPECIFICATION AND INDETERMINACY OF CONFIRMATORY FACTOR MODELS
occurred for Computation method (η²= 0.94). (η²= 0.83). This three-way-interaction occurs
The mean ρ²-difference was 0.081 (SD= 0.068) because the size of the two-way interaction
when based on equation 4 and 0.171 (SD= computation method x Number of factors is
0.094) when based on equation 5. Thus, the larger for the small samples (250 cases) than for
mean difference between ρ² in the population the large samples (750 cases). In fact, the mean
and in the samples was about twice as large difference between the ρ²-differences for the two
when it was based on equation 5. This indicates computation methods is only 0.018 for the two-
that the empirical covariance matrix (used in factor models based on 750 cases and it is 0.304
equation 5) introduces a substantial amount of for the eight-factor models based on 250 cases.
sampling error into ρ². Finally, the interaction of computation
A large effect size occurred for the method with Obliqueness is of relevant size (η²=
interaction between computation method and 0.43). The difference between the computation
number of factors (η²= 0.94). This interaction is methods is smaller for the orthogonal models
mainly due to a larger increase of the ρ²- than for the oblique models. Although there is a
difference with number of factors when ρ² is substantial main effect for misspecification (η²=
computed according to equation 5 (see Figure 0.47), the size of the interaction between
1a). Another large effect size occurs for the computation method and misspecification is
interaction of computation method and number moderate (η²= 0.16) and the interaction is
of cases (η²= 0.81). This interaction is mainly extremely small in terms of mean differences:
due to a larger increase of the ρ²-difference with The difference between the ρ²-differences for
Number of cases when ρ² is computed according the two computation methods is 0.092 for the
to equation 5 (see Figure 1b). Moreover, a large correctly specified models and it is 0.089 for the
three-way interaction computation method x misspecified models; thus, misspecification had
number of factors x number of cases occurred no relevant effect on the difference between the
computation methods.
Figure 1: ρ²-Differences for the Two Computation Methods Based on the Data Sets with Correlated Residuals:
a) for 2-, 4-, and 8-factor models; b) for 250, 500, and 750 cases
Δρ² Δρ²
eq. 4 eq. 4
eq. 5 eq. 5
594
ANDRÉ BEAUDUCEL
The differences between the population models than for oblique models (Computation
ρ² and the corresponding samples ρ² based on method x Obliqueness; η² = 0.92). The effect of
the models without correlated residuals were model misspecification on the ρ²-differences for
entered into a repeated measures ANOVA with the two methods was, however, moderate (η² =
the same factors as the ρ²-differences for the 0.17). For the correctly specified models the
models based on correlated residuals. Again, a difference between the computation methods
large main effect occurred for computation was slightly larger (0.125) than for the
method (η² = 0.97), indicating that the mean ρ²- misspecified models (0.122).
difference between population and sample ρ²
was considerably smaller when ρ² was computed Conclusion
according to equation 4. This study compared two calculation methods of
The mean ρ²-difference was only 0.01 the indeterminacy coefficient ρ² (or ρ) that
(SD = 0.01) when ρ² was computed according to allows for the evaluation of factor score
equation 4 and it was 0.14 (SD = 0.09) when ρ² estimates. Thereby it should be investigated
was computed according to equation 5. A which method should be preferred when a CFA
substantial interaction of computation method model is slightly misspecified, as is often the
with number of factors occurred (η² = 0.97). An case. Therefore, the two calculation methods for
inspection of this interaction reveals that the indeterminacy were compared in correctly and
computation methods had similar ρ²-differences misspecified CFA models.
for the two-factor models, but that the Correctly specified and misspecified
computation method based on equation 5 models based on data sets with correlated
yielded much larger ρ²-differences in the eight- residuals as well as on data sets without
factor models (see Figure 2a). Another correlated residuals were investigated. For the
substantial interaction occurred for computation models based on data sets with correlated
method and number of cases (η²= 0.77), residuals, the correlated residuals were not
indicating that the ρ²-differences increased more specified in order to generate misspecified
models in addition to the correctly specified
with decreasing sample size when ρ² was
models. For the models based on data sets
computed according to equation 5 (see Figure
without correlated residuals misspecified models
2b).
were generated by means of equality constraints
The effect size of the three-way
imposed on unequal loadings.
interaction Computation method x Number of
Two computation methods for
factors x Number of cases was also substantial
coefficients of indeterminacy were investigated:
(η²= 0.83). This relation of Number of factors
The first method is based on the correlations or
and Number of cases with the Computation
covariances of the observed variables
method can be described by the following result:
reproduced from the model (equation 4), the
The mean ρ²-differences were rather similar for second method (equation 5) is based on the
both Computation methods when based on the empirical correlations or covariances of the
two-factor models with 750 cases (their observed variables. Because both the
difference was 0.033). The mean differences
computation of ρ² by means of the reproduced
were, however, very different for the
covariance matrix (McDonald, 1974; Mulaik &
computation methods when based on the eight-
McDonald, 1978) and the computation of ρ² by
factor models with 250 cases (their difference
means of the sample covariance matrix
was 0.333). The ρ²-differences based on
(Gorsuch, 1983; Grice, 2001; Heermann, 1963)
equation 5 were larger than the ρ²-differences have been proposed, an investigation of the
based on equation 4 when the size of the differences between these methods was regarded
loadings was larger (Computation method x as important. Moreover, in case of model
Loading-size; η² = 0.59). The ρ²-differences misspecification, it is clear that the covariance
based on equation 5 were also larger than the ρ²- matrix reproduced from the model (Σ) contains
differences based on equation 4 for orthogonal some error. The errors due to model
595
MISSPECIFICATION AND INDETERMINACY OF CONFIRMATORY FACTOR MODELS
Figure 2: ρ²-Differences for the Two Computation Methods Based on the Data Sets Without Correlated Residuals:
a) for 2-, 4-, and 8-factor models; b) for 250, 500, and 750 cases
Δρ² Δρ²
eq. 4 eq. 4
eq. 5 eq. 5
misspecification are not present in the empirical misspecification were not investigated.
covariance matrix (S), so that the computation Nevertheless, the results of the simulation study
based on S might have been expected to work shed some light on the effects of sampling error
well for misspecified models. Therefore, the two on ρ² for different types of correctly and
computation methods were investigated both in misspecified CFA models.
correctly as well as in misspecified models. The difference between ρ² computed
However, the model misspecifications were from the population and the samples was
moderate in order to represent models that might substantially smaller when ρ² was computed
be accepted according to conventional fit criteria according to equation 4 (as can be seen from the
(Hu & Bentler, 1999). The reason for the main effect of Computation method). This result
investigation of models with small amounts of can be interpreted as a larger effect of sampling
misspecification was that this allows some error on ρ² when computed according to
insight into the effects of model misspecification equation 5, as might be expected from using the
on ρ² that might occur in empirical research with sample covariance matrix S in equation 5 instead
a given amount of accepted misfit. Sample size of the population covariance matrix Σ.
(250, 500, 750 cases), number of factors (2, 4, 8 The interpretation that the use of S for
factors), obliqueness (orthogonal versus the computation of ρ² introduces some sampling
correlated factors), and size of salient loadings error into the coefficient is also supported by the
(0.40/0.60 versus 0.60/0.80) were manipulated interaction of computation method with sample
in the simulation study. The main limitations of size, indicating that the difference between the
the present simulation study are that only two population ρ² and the sample ρ² was larger for
types of model misspecification were explored
smaller sample sizes, especially when ρ² was
and that the effects of severe model
596
ANDRÉ BEAUDUCEL
computed according to equation 5 (based on S). Beauducel, A., & Rabe, S. (2009).
Even in the misspecified models, when Σ suffers Model-related factor score estimates for
from the misspecification, due to its being confirmatory factor analysis. British Journal of
reproduced from the (misspecified) model Mathematical and Statistical Psychology, 62,
parameters, the mean differences between the 489-506.
populations ρ² and the samples ρ² was smaller Beauducel, A., & Wittmann, W. W.
when ρ² was computed on the basis of Σ (2005). Simulation study on fit indices in confir-
(equation 4). matory factor analysis based on data with
Although the model misspecifications slightly distorted simple structure. Structural
used in the present study were not very large, it Equation Modeling, 12, 41-75.
is still possible that advantages of using S for the Fan, X., & Sivo, S. A. (2007).
computation of ρ² (equation 5) might occur for Sensitivity of fit indices to model
extreme amounts of model misspecification. On misspecification and model types. Multivariate
the other hand, it seems rather unlikely that Behavioral Research, 42, 509-529.
severely misspecified models would generally Gorsuch, R. L. (1983). Factor analysis
be accepted according to fit indexes and it might (2nd Ed.). Hillsdale, NJ: Lawrence Erlbaum
be regarded as problematic to base the results of Associates.
a simulation study on models that should not Green, B. F. (1976). On the factor score
occur in empirical research. The results of the controversy. Psychometrika, 41, 263-266.
present study are therefore taken as support for a Guttman, L. (1955). The determinacy of
computation of ρ² by means of the reproduced factor score matrices with applications for five
correlation or covariance matrix (equation 4). other problems of common factor theory. British
Moreover, it was found for the population Journal of Mathematical and Statistical
models that effects of misspecification can result Psychology, 8, 65-82.
in serious over-estimation of ρ², so that the Grice, J. W. (2001). Computing and
validity of factor score predictors might be over- evaluation of factor scores. Psychological
estimated, just because the respective models Methods, 6, 430-450.
were incorrectly specified. Heermann, E. F. (1963). Univocal or
Nevertheless, the effect of sampling orthogonal estimators of orthogonal factors.
Psychometrika, 28, 161-172.
error and model misspecification on ρ² found in
Hu, L., & Bentler, P. M. (1999). Cutoff
this study should not discourage researchers to
criteria for fit indexes in covariance structure
report indeterminacy coefficients when factor
analysis: Conventional criteria versus new
score estimates are computed from CFA models.
alternatives. Structural Equation Modeling, 6, 1-
It is necessary to report indeterminacy
55.
coefficients – otherwise the validity of the factor
Krijnen, W. P. (2006). Some results on
score estimates remains unknown. Of course,
mean square error for factor score prediction.
indeterminacy coefficients might be even more
Psychometrika, 71, 395-409.
biased than reported here when a model is more
Krijnen, W. P., Wansbeek, T., & Ten
seriously misspecified; the case of extreme
Berge, J. M. F. (1996). Best linear predictors for
misspecification was not investigated in this
factor scores. Communications in Statistics:
study because factor score estimates should not
Theory and Methods, 25, 3013-3025.
at all be computed for seriously misspecified
Marsh, H. W., Hau, K.-T., & Wen, Z.
CFA models, thus the question of the validity of
(2004). In search of golden rules: comment on
such scores is irrelevant.
hypothesis-testing approaches to setting cutoff
values for fit indexes and dangers in
overgeneralizing Hu and Bentler’s (1999)
References findings. Structural Equation Modeling, 11,
Barrett, P. (2007). Structural equation 320–341.
modeling: Adjudging model fit. Personality and
Individual Differences, 42, 815-824.
597
MISSPECIFICATION AND INDETERMINACY OF CONFIRMATORY FACTOR MODELS
598