Professional Documents
Culture Documents
Structural
Equation
Modeling
Third Edition
Structural
Equation
Modeling
Third Edition
Randall E. Schumacker
The University of Alabama
Richard G. Lomax
The Ohio State University
For permission to photocopy or use material electronically from this work, please access www.
copyright.com (http://www.copyright.com/) or contact the Copyright Clearance Center, Inc.
(CCC), 222 Rosewood Drive, Danvers, MA 01923, 978-750-8400. CCC is a not-for-profit organiza-
tion that provides licenses and registration for a variety of users. For organizations that have been
granted a photocopy license by the CCC, a separate system of payment has been arranged.
Trademark Notice: Product or corporate names may be trademarks or registered trademarks, and
are used only for identification and explanation without intent to infringe.
Schumacker, Randall E.
A beginner’s guide to structural equation modeling / authors, Randall E.
Schumacker, Richard G. Lomax.-- 3rd ed.
p. cm.
Includes bibliographical references and index.
ISBN 978-1-84169-890-8 (hardcover : alk. paper) -- ISBN 978-1-84169-891-5
(pbk. : alk. paper)
1. Structural equation modeling. 2. Social sciences--Statistical methods. I.
Lomax, Richard G. II. Title.
QA278.S36 2010
519.5’3--dc22 2010009456
1 Introduction................................................................................................. 1
1.1 What Is Structural Equation Modeling?........................................ 2
1.2 History of Structural Equation Modeling..................................... 4
1.3 Why Conduct Structural Equation Modeling?............................. 6
1.4 Structural Equation Modeling Software Programs..................... 8
1.5 Summary.......................................................................................... 10
References................................................................................................... 11
3 Correlation................................................................................................. 33
3.1 Types of Correlation Coefficients.................................................. 33
3.2 Factors Affecting Correlation Coefficients.................................. 35
3.2.1 Level of Measurement and Range of Values.................. 35
3.2.2 Nonlinearity....................................................................... 36
3.2.3 Missing Data....................................................................... 38
3.2.4 Outliers................................................................................ 39
3.2.5 Correction for Attenuation............................................... 39
3.2.6 Nonpositive Definite Matrices......................................... 40
3.2.7 Sample Size......................................................................... 41
3.3 Bivariate, Part, and Partial Correlations...................................... 42
3.4 Correlation versus Covariance...................................................... 46
3.5 Variable Metrics (Standardized versus Unstandardized)......... 47
3.6 Causation Assumptions and Limitations.................................... 48
3.7 Summary.......................................................................................... 49
References................................................................................................... 51
vii
4 SEM Basics................................................................................................. 55
4.1 Model Specification......................................................................... 55
4.2 Model Identification........................................................................ 56
4.3 Model Estimation............................................................................ 59
4.4 Model Testing.................................................................................. 63
4.5 Model Modification........................................................................ 64
4.6 Summary.......................................................................................... 67
References................................................................................................... 69
5 Model Fit..................................................................................................... 73
5.1 Types of Model-Fit Criteria............................................................ 74
5.1.1 LISREL–SIMPLIS Example............................................... 77
5.1.1.1 Data...................................................................... 77
5.1.1.2 Program............................................................... 80
5.1.1.3 Output.................................................................. 81
5.2 Model Fit........................................................................................... 85
5.2.1 Chi-Square (χ 2)................................................................... 85
5.2.2 Goodness-of-Fit Index (GFI) and Adjusted
Goodness-of-Fit Index (AGFI).......................................... 86
5.2.3 Root-Mean-Square Residual Index (RMR)..................... 87
5.3 Model Comparison......................................................................... 88
5.3.1 Tucker–Lewis Index (TLI)................................................. 88
5.3.2 Normed Fit Index (NFI) and Comparative Fit
Index (CFI).......................................................................... 88
5.4 Model Parsimony............................................................................ 89
5.4.1 Parsimony Normed Fit Index (PNFI).............................. 90
5.4.2 Akaike Information Criterion (AIC)............................... 90
5.4.3 Summary............................................................................. 91
5.5 Parameter Fit.................................................................................... 92
5.6 Power and Sample Size.................................................................. 93
5.6.1 Model Fit............................................................................. 94
5.6.1.1 Power.................................................................... 94
5.6.1.2 Sample Size......................................................... 99
5.6.2 Model Comparison.......................................................... 108
5.6.3 Parameter Significance.....................................................111
5.6.4 Summary............................................................................113
5.7 Two-Step Versus Four-Step Approach to Modeling.................114
5.8 Summary.........................................................................................116
Chapter Footnote......................................................................................118
Standard Errors.........................................................................................118
Chi-Squares................................................................................................118
References................................................................................................. 120
xv
Approach
This book presents a basic introduction to structural equation modeling
(SEM). Readers will find that we have kept to our tradition of keeping
examples rudimentary and easy to follow. The reader is provided with
a review of correlation and covariance, followed by multiple regression,
path, and factor analyses in order to better understand the building blocks
of SEM. The book describes a basic structural equation model followed by
the presentation of several different types of structural equation models.
Our approach in the text is both conceptual and application oriented.
Each chapter covers basic concepts, principles, and practice and then
utilizes SEM software to provide meaningful examples. Each chapter also
features an outline, key concepts, a summary, numerous examples from
a variety of disciplines, tables, and figures, including path diagrams, to
assist with conceptual understanding. Chapters with examples follow the
conceptual sequence of SEM steps known as model specification, identifi-
cation, estimation, testing, and modification.
The book now uses LISREL 8.8 student version to make the software and
examples readily available to readers. Please be aware that the student
version, although free, does not contain all of the functional features as a
full licensed version. Given the advances in SEM software over the past
decade, you should expect updates and patches of this software package
and therefore become familiar with any new features as well as explore the
excellent library of examples and help materials. The LISREL 8.8 student
version is an easy-to-use Windows PC based program with pull-down
menus, dialog boxes, and drawing tools. To access the program, and/or
if you’re a Mac user and are interested in learning about Mac availability,
please check with Scientific Software (http://www.ssicentral.com). There
is also a hotlink to the Scientific Software site from the book page for A
Beginner’s Guide to Structural Equation Modeling, 3rd edition on the Textbook
Resources tab at www.psypress.com.
The SEM model examples in the book do not require complicated pro-
gramming skills nor does the reader need an advanced understanding of
statistics and matrix algebra to understand the model applications. We have
provided a chapter on the matrix approach to SEM as well as an appendix
on matrix operations for the interested reader. We encourage the under-
standing of the matrices used in SEM models, especially for some of the
more advanced SEM models you will encounter in the research literature.
xvii
book in class with our students. As a result of those experiences, the third
edition represents a more useable book for teaching SEM. As such it is an
ideal text for introductory graduate level courses in structural equation
modeling or factor analysis taught in departments of psychology, educa-
tion, business, and other social and healthcare sciences. An understand-
ing of correlation is assumed.
The third edition offers several new surprises, namely:
1. Our instruction and examples are now based on freely available
software: LISREL 8.8 student version.
2. More examples presented from more disciplines, including input,
output, and screenshots.
3. Every chapter has been updated and enhanced with additional
material.
4. A website with raw data sets for the book’s examples and exer-
cises so they can be used with any SEM program, all of the book’s
exercises, hotlinks to related websites, and answers to all of the
exercises for instructors only. To access the website visit the book
page or the Textbook Resource page at www.psypress.com.
5. Expanded coverage of advanced models with more on multiple-
group, multi-level, and mixture modeling (Chs. 13 and 15), second-
order and dynamic factor models (Ch. 14), and Monte Carlo
methods (Ch. 16).
6. Increased coverage of sample size and power (Ch. 5), including
software programs, and reporting research (Ch. 11).
7. New journal article references help readers better understand
published research (Chs. 13–17).
8. Troubleshooting tips on how to address the most frequently
encountered problems are found in Chapters 3 and 11.
9. Chapters 13 to 16 now include additional SEM model examples.
10. 25% new exercises with answers to half in the back of the book
for student review (and answers to all for instructors only on the
book and/or Textbook Resource page at www.psypress.com).
11. Added Matrix examples for several models in Chapter 17.
12. Updated references in all chapters on all key topics.
Overall, we believe this third edition is a more complete book that can
be used to teach a full course in SEM. The past several years have seen an
explosion in SEM coursework, books, websites, and training courses. We
are proud to have been considered a starting point for many beginner’s
to SEM. We hope you find that this third edition expands on many of the
programming tools, trends and topics in SEM today.
Acknowledgments
The third edition of this book represents more than thirty years of inter-
acting with our colleagues and students who use structural equation
modeling. As before, we are most grateful to the pioneers in the field of
structural equation modeling, particularly to Karl Jöreskog, Dag Sörbom,
Peter Bentler, James Arbuckle, and Linda and Bengt Muthèn. These indi-
viduals have developed and shaped the new advances in the SEM field as
well as the content of this book, plus provided SEM researchers with soft-
ware programs. We are also grateful to Gerhard Mels who answered our
questions and inquiries about SEM programming problems in the chap-
ters. We also wish to thank the reviewers: James Leeper, The University
of Alabama, Philip Smith, Augusta State University, Phil Wood, the
University of Missouri–Columbia, and Ke-Haie Yuan, the University of
Notre Dame.
This book was made possible through the encouragement of Debra
Riegert at Routledge/Taylor & Francis who insisted it was time for a third
edition. We wish to thank her and her editorial assistant, Erin M. Flaherty,
for coordinating all of the activity required to get a book into print. We
also want to thank Suzanne Lassandro at Taylor & Francis Group, LLC
for helping us through the difficult process of revisions, galleys, and final
book copy.
Randall E. Schumacker
The University of Alabama
Richard G. Lomax
The Ohio State University
Key Concepts
Latent and observed variables
Independent and dependent variables
Types of models
Regression
Path
Confirmatory factor
Structural equation
History of structural equation modeling
Structural equation modeling software programs
models, latent growth curve models, and Monte Carlo studies. Chapter 17
presents matrix notation for one of our SEM applications, covers the differ-
ent matrices used in structural equation modeling, and presents multiple
regression and path analysis solutions using matrix algebra. We include
an introduction to matrix operations in the Appendix for readers who
want a more mathematical understanding of matrix operations. To start
our journey of understanding, we first ask, What is structural equation
modeling? Then, we give a brief history of SEM, discuss the importance of
SEM, and note the availability of SEM software programs.
also specified entirely with observed variables, but the flexibility allows
for multiple independent observed variables and multiple dependent
observed variables—for example, export sales, gross national product,
and NASDAQ index influence consumer trust and consumer spending
(dependent observed variables). Path models, therefore, test more com-
plex models than regression models. Confirmatory factor models con-
sist of observed variables that are hypothesized to measure one or more
latent variables (independent or dependent); for example, diet, exercise,
and physiology are observed measures of the independent latent variable
“fitness.” An understanding of these basic models will help in under-
standing structural equation modeling, which combines path and factor
analytic models. Structural equation models consist of observed variables
and latent variables, whether independent or dependent; for example, an
independent latent variable (home environment) influences a dependent
latent variable (achievement), where both types of latent variables are
measured, defined, or inferred by multiple observed or measured indica-
tor variables.
Some years later, Charles Spearman (1904, 1927) used the correlation
coefficient to determine which items correlated or went together to create
the factor model. His basic idea was that if a set of items correlated or
went together, individual responses to the set of items could be summed
to yield a score that would measure, define, or infer a construct. Spearman
was the first to use the term factor analysis in defining a two-factor con-
struct for a theory of intelligence. D.N. Lawley and L.L. Thurstone in 1940
further developed applications of factor models, and proposed instru-
ments (sets of items) that yielded observed scores from which constructs
could be inferred. Most of the aptitude, achievement, and diagnostic
tests, surveys, and inventories in use today were created using factor ana-
lytic techniques. The term confirmatory factor analysis (CFA) is used today
based in part on earlier work by Howe (1955), Anderson and Rubin (1956),
and Lawley (1958). The CFA method was more fully developed by Karl
Jöreskog in the 1960s to test whether a set of items defined a construct.
Jöreskog completed his dissertation in 1963, published the first article on
CFA in 1969, and subsequently helped develop the first CFA software pro-
gram. Factor analysis has been used for over 100 years to create measure-
ment instruments in many academic disciplines, while today CFA is used
to test the existence of these theoretical constructs. In an example study,
CFA was used to confirm the “Big Five” model of personality by Goldberg
(1990). The five-factor model of extraversion, agreeableness, conscientious-
ness, neuroticism, and intellect was confirmed through the use of multiple
indicator variables for each of the five hypothesized factors.
Sewell Wright (1918, 1921, 1934), a biologist, developed the third type of
model, a path model. Path models use correlation coefficients and regres-
sion analysis to model more complex relationships among observed
variables. The first applications of path analysis dealt with models of
animal behavior. Unfortunately, path analysis was largely overlooked
until econometricians reconsidered it in the 1950s as a form of simultane-
ous equation modeling (e.g., H. Wold) and sociologists rediscovered it in
the 1960s (e.g., O. D. Duncan and H. M. Blalock). In many respects, path
analysis involves solving a set of simultaneous regression equations that
theoretically establish the relationship among the observed variables in
the path model. In an example path analysis study, Walberg’s theoretical
model of educational productivity was tested for fifth- through eighth-
grade students (Parkerson et al., 1984). The relations among the follow-
ing variables were analyzed in a single model: home environment, peer
group, media, ability, social environment, time on task, motivation, and
instructional strategies. All of the hypothesized paths among those vari-
ables were shown to be statistically significant, providing support for the
educational productivity model.
The final model type is structural equation modeling (SEM). SEM mod-
els essentially combine path models and confirmatory factor models;
that is, SEM models incorporate both latent and observed variables. The
early development of SEM models was due to Karl Jöreskog (1969, 1973),
Ward Keesling (1972), and David Wiley (1973); this approach was initially
known as the JKW model, but became known as the linear structural rela-
tions model (LISREL) with the development of the first software program,
LISREL, in 1973. Since then, many SEM articles have been published; for
example, Shumow and Lomax (2002) tested a theoretical model of paren-
tal efficacy for adolescent students. For the overall sample, neighborhood
quality predicted parental efficacy, which predicted parental involvement
and monitoring, both of which predicted academic and social-emotional
adjustment.
Jöreskog and van Thillo originally developed the LISREL software pro-
gram at the Educational Testing Service (ETS) using a matrix command
language (i.e., involving Greek and matrix notation), which is described
in chapter 17. The first publicly available version, LISREL III, was released
in 1976. Later in 1993, LISREL8 was released; it introduced the SIMPLIS
(SIMPle LISrel) command language in which equations are written
using variable names. In 1999, the first interactive version of LISREL was
released. LISREL8 introduced the dialog box interface using pull-down
menus and point-and-click features to develop models, and the path dia-
gram mode, a drawing program to develop models. Karl Jöreskog was rec-
ognized by Cudeck, DuToit, and Sörbom (2001) who edited a Festschrift
in honor of his contributions to the field of structural equation modeling.
Their volume contains chapters by scholars who address the many top-
ics, concerns, and applications in the field of structural equation model-
ing today, including milestones in factor analysis; measurement models;
robustness, reliability, and fit assessment; repeated measurement designs;
ordinal data; and interaction models. We cover many of these topics in
this book, although not in as great a depth. The field of structural equa-
tion modeling across all disciplines has expanded since 1994. Hershberger
(2003) found that between 1994 and 2001 the number of journal articles
concerned with SEM increased, the number of journals publishing SEM
research increased, SEM became a popular choice amongst multivariate
methods, and the journal Structural Equation Modeling became the primary
source for technical developments in structural equation modeling.
When you click on the icon, an empty dialog box will appear that should
look like this:
N ote: Nothing appears until you open a program file or data set using
the File or open folder icon; more about this in the next chapter.
We do want to mention the very useful HELP menu. Click on the ques-
tion mark (?), a HELP menu will appear, then enter Output Questions in
the search window to find answers to key questions you may have when
going over examples in the Third Edition.
1.5 Summary
In this chapter we introduced structural equation modeling by describ-
ing basic types of variables—that is, latent, observed, independent, and
dependent—and basic types of SEM models—that is, regression, path,
confirmatory factor, and structural equation models. In addition, a brief
history of structural equation modeling was provided, followed by a dis-
cussion of the importance of SEM. This chapter concluded with a brief
listing of the different structural equation modeling software programs
and where to obtain the LISREL 8.8 student version for use with examples
in the book, including what the dialog box will first appear like and a very
useful HELP menu.
In chapter 2 we consider the importance of examining data for issues
related to measurement level (nominal, ordinal, interval, or ratio), restric-
tion of range (fewer than 15 categories), missing data, outliers (extreme
values), linearity or nonlinearity, and normality or nonnormality, all of
which can affect statistical methods, and especially SEM applications.
Exercises
1. Define the following terms:
a. Latent variable
b. Observed variable
c. Dependent variable
d. Independent variable
2. Explain the difference between a dependent latent variable and
a dependent observed variable.
3. Explain the difference between an independent latent variable
and an independent observed variable.
4. List the reasons why a researcher would conduct structural
equation modeling.
5. Download and activate the student version of LISREL: http://
www.ssicentral.com
6. Open and import SPSS or data file.
References
Anderson, T. W., & Rubin, H. (1956). Statistical inference in factor analysis. In
J. Neyman (Ed.), Proceedings of the third Berkeley symposium on mathemati-
cal statistics and probability, Vol. V (pp. 111–150). Berkeley: University of
California Press.
Cudeck, R., Du Toit, S., & Sörbom, D. (2001) (Eds). Structural equation modeling:
Present and future. A Festschrift in honor of Karl Jöreskog. Lincolnwood, IL:
Scientific Software International.
Delucchi, M. (2006). The efficacy of collaborative learning groups in an under-
graduate statistics course. College Teaching, 54, 244–248.
Goldberg, L. (1990). An alternative “description of personality”: Big Five factor
structure. Journal of Personality and Social Psychology, 59, 1216–1229.
Hershberger, S. L. (2003). The growth of structural equation modeling: 1994–2001.
Structural Equation Modeling, 10(1), 35–46.
Howe, W. G. (1955). Some contributions to factor analysis (Report No. ORNL-1919).
Oak Ridge National Laboratory, Oak Ridge, Tennessee.
Jöreskog, K. G. (1963). Statistical estimation in factor analysis: A new technique and its
foundation. Stockholm: Almqvist & Wiksell.
1 Introduction