Professional Documents
Culture Documents
research-article2019
ADHXXX10.1177/1523422319869915Advances in Developing Human ResourcesFlatt and Jacobs
Article
Advances in Developing Human
Resources
Principle Assumptions of 2019, Vol. 21(4) 484–502
© The Author(s) 2019
Regression Analysis: Testing, Article reuse guidelines:
sagepub.com/journals-permissions
Techniques, and Statistical DOI: 10.1177/1523422319869915
https://doi.org/10.1177/1523422319869915
journals.sagepub.com/home/adh
Reporting of Imperfect Data
Sets
Abstract
The Problem.
Journal pages are filled with articles that scarcely mention the assumptions behind
the chosen statistical techniques and models. Based on questionable foundations,
the ultimate conclusions are intended to shape academia and guide practitioners.
Violations of the underlying assumptions can result in biased and misleading forecasts,
confidence intervals, and scientific insights.
The Solution.
The field of human resource development (HRD) is equipped to present these
assumptions clearly and concisely to ensure the integrity of statistical analysis and
subsequent conclusions. Testing the principle assumptions of regression analysis is a
process. As such, the presentation of this process in a systems framework provides
a comprehensive plan with step-by-step guidelines to help determine the optimal
statistical model for a particular data set. The goal of this article is to provide
practitioners a Regression Development System that can be adapted to organizational
performance as well as information that can be used to evaluate the strength of
journal articles.
The Stakeholders.
Quantitative researchers, practitioners, instructors, and students.
Keywords
regression, quantitative, assumptions
Corresponding Author:
Candace Flatt, Economist, Illinois Department of Employment Security, 607 E. Adams, 8th Floor,
Springfield, IL 62701, USA.
Email: candace.flatt@illinois.gov
Flatt and Jacobs 485
Introduction
Journal pages are filled with articles that scarcely mention the assumptions behind the
chosen statistical techniques and models. Based on questionable foundations, the ulti-
mate conclusions are intended to shape academia and guide practitioners. To gain a
sense of pervasiveness, Ernst and Albers (2017) systematically reviewed clinical psy-
chology journals in 2013 for discussions on the underlying assumptions of linear
regression. Out of the 172 articles, only 2% of the articles both addressed and correctly
tested the assumptions of linear regression. Another 6% of articles addressed the
assumptions, but incorrectly tested for them. The field of human resource develop-
ment (HRD) fares slightly better. Garavan et al. (2019) conducted a literature review
of 219 articles examining the relationship between training and organizational out-
comes since 2007. They found 14% addressed independence, 5% reported issues
related to variance, 34% referenced normality, and 14% commented on linearity.
Unfortunately, the reason for the omission is most likely due to lack of basic statis-
tical knowledge. Hoekstra, Kiers, and Johnson (2012) asked 30 PhD students to ana-
lyze data sets with familiar statistical techniques as well as complete a questionnaire
on their approach. Approximately, 90% of the participants were unfamiliar with the
normality assumption of regression, and 70% of the participants were unfamiliar with
the assumption of constant variances.
Violations of the underlying assumptions can result in biased and misleading forecasts,
confidence intervals, and scientific insights (Nau, 2018). Readers are unable to trust or
replicate the results of a study if the model assumptions are violated. Nontransparency in
the analysis of the underlying assumptions reduces the informational value of the study
(Ernst & Albers, 2017). Today’s research standards require a thorough description and
defense of the chosen research design. “Good designs have a beneficial side effect: they
typically lend themselves to a simple explanation and empirical methods and a straight-
forward presentation of results” (Angrist & Pischke, 2010, p. 17).
Each assumption can be analyzed in two ways—graphically or statistically. Graphical
analysis offers a visual overview of the data set. This overview hints at the general
relationships and provides clues to determine the optimal statistical model. However, a
graphical analysis stops short of definitive conclusions. Graphical methods show viola-
tions of assumptions rather than proof of adherence (Smith, 2012). A statistical test
provides an objective determination of the hypothesis. A thorough evaluation of
assumptions includes both types of analyses—graphical and statistical. This article pro-
vides a systematic approach to testing the assumptions of regression analysis. The first
three assumptions describe the requirements of the error terms—statistical indepen-
dence, constant variance, and normal distribution. The last assumption specifies the
linear and additive relationships between dependent and independent variables.
Literature Review
Journal articles, course notes, statistical resources, textbooks, and statistical software
guides were reviewed for content relating to the assumptions of regression analysis.
486 Advances in Developing Human Resources 21(4)
Note. SAS: Statistical Analysis System; SPSS: Statistical Package for Social Sciences
This literature review is divided into four sections—each section focusing on a spe-
cific assumption of regression analysis. Each section outlines the potential problems
arising from a violation, the common tests, and the possible remedies.
signals a fundamental structural problem with the model. A violation of this assumption
is occasionally due to a violation of the linearity assumption. In these cases, the trans-
formations described under Assumption 4 are possible remedies (Nau, 2018).
In some cases, ordinary least squares (OLS) may not be the best choice. Linear
mixed models (LMM) are statistical models that can be applied to continuous outcome
variables characterized by normally distributed residuals that may not be independent
(violation of Assumption 1) or have constant variances (violation of Assumption 2). In
particular, LMM is an appropriate choice for study designs using longitudinal data
sets. Longitudinal data are data sets with dependent variables repeatedly measured
over time for each unit of analysis (West, Welch, & Galecki, 2015). If the chosen
model does not address a nested data set, then the risk of rejecting a null hypothesis
(Type I error) is increased (Nimon, 2012). Turner (2015) provides detailed instructions
and uses of LMM in the field of HRD.
constant variance. Although the test is not readily available in SPSS, IBM (2016)
provides instructions on SPSS code to produce the White test.
Although severe heteroscedasticity is problematic, the parameter estimates are not
biased. In other words, OLS estimation can still be used without the risk of distortion.
When heteroscedasticity is a concern, there are two ways to address the problem. First,
heteroscedasticity is often due to a violation of another assumption. There are several
tests available to determine if a model is mis-specified (see Assumption 4). A model
re-specification or transformation of variables may eliminate heteroscedasticity.
Second, robust standard errors can be used to obtain unbiased standard errors. Robust
standard errors relax the assumption that errors are independent and identically distrib-
uted (Williams, 2015).
Graphical Stem-and-leaf plot A stem-and-leaf plot presents the shape of the data by displaying the
numbers (i.e., error terms) in two columns. The left column (stem)
contains the largest place value with a different value in each row.
The right column (leaf) contains the remaining portion of each
number. This graph is useful for small sample sizes.
Histogram A histogram displays the observations (i.e., error terms) proportionally
by intervals or categories. According to Smith (2012), a histogram
is not a good way to check for normality because different intervals
influence the shape of the graph.
Probability– The P-P plot is a comparison of an empirical distribution function with
Probability plot a theoretical cumulative distribution function (i.e., normal distribution
(P-P plot) function).
Quantile–Quantile The Q-Q plot is a quantile comparison of a probability distribution
plot (Q-Q plot) with a specific theoretical distribution (i.e., normal distribution).
Numerical Skewness The right tail is longer for a positively skewed probability distribution.
The left tail is longer for a negatively skewed probability distribution.
Skewness is a descriptive statistic that captures the direction and
magnitude a probability distribution deviates from normality.
Kurtosis Kurtosis measures the peakedness of the distribution. A normal
distribution has a kurtosis of 3. A distribution with kurtosis greater
than 3 is leptokurtic. A distribution with kurtosis less than 3 is
platykurtic.
Normality tests The Shapiro–Wilk test can be performed for sample sizes ≥ 7 and
≤ 2,000. It is the ratio of the best estimator for the sum of squares
estimator of the variance. The value of 1 indicates normality. The test
is available in SAS, Stata, and SPSS.
The Shapiro–Francis test is a modification of the Shapiro–Wilk test. It
can be used for sample sizes ≥ 5 and ≤ 5,000. The test is available
in Stata.
The Kolmogorov–Smirnov test compares the empirical distribution
function of a sample (i.e., error terms) with a cumulative distribution
function of a specific distribution (i.e., normal distribution). The test
is available in SAS, SPSS, and Python. This test is useful when the
sample size is greater than 2,000.
The Anderson–Darling test is considered one of the most powerful
tools for determining if the error term is normally distributed.
Compared with the Cramer–von Mises test, the Anderson–Darling
test places greater weight on the distribution tails. Both of these
tests belong to the class of quadratic empirical distribution functions
and are available in SAS. These tests are useful when the sample size
is greater than 2,000.
The Jarque–Bera test determines if the skewness and kurtosis of
the error term match a normal distribution. This test is a good
alternative for large sample sizes. Other tests tend to reject the null
hypothesis (the error term is normally distributed) with large sample
sizes. The Jarque–Bera test can be performed in Stata using the
.sktest command or Python using jarque_bera.
490 Advances in Developing Human Resources 21(4)
Implications
The consequences of ignoring the assumptions of regression analysis impact both
practitioners and scholars. This section outlines the implications for practice, research,
and theory development.
Is the data nested? The error terms of longitudinal or other nested data sets most likely
violate the first two assumptions—independence and homoscedasticity. Hierarchical
structures are naturally occurring in organizations. For instance, employees are nested
within departments and departments are nested within organizations. An example of
this type of study is Choi, Lee, and Jacobs (2015). To address the violations, the
authors used LMM to examine the relationship between employee traits, organiza-
tional traits, and structured on-the-job (S-OJT) training.
Table 4. Tests for Linear and Additive Relationships Between Variables.
and Opportunity Act of 2014 (WIOA) places greater emphasis on performance mea-
sures and evidence-based practices than the previous WIA. An increasing number of
studies will predict employment probabilities to meet WIOA requirements.
Does the dependent variable represent income? In HRD, workforce development studies
that measure income typically take the logarithm of income to address violation of
Assumption 4—linearity. The transformation captures the diminishing utility of
income and simplifies the interpretation of results, which are similar to elasticities
(Flatt & Jacobs, 2018).
Is there an intervening variable? Song and Lim (2015) examine the use of mediation
analysis in HRD. The increasing complexity of organizations has resulted in the need
for more sophisticated models. A mediating variable is an intervening variable that
alters the relationship between dependent and independent variables. Based on studies
published between 2000 and 2014, they found 66 articles in HRDQ, 17 articles in
HRDI, and one article in ADHR that were mediation studies. A specific example of a
mediation study was conducted by Fletcher (2016). The author found personal role
engagement to be a strong mediator of the relationship between training perceptions
and task proficiency. In addition, both work engagement and personal role engage-
ment served as mediators between training perceptions and task proactivity. Failure to
include a mediating relationship in a model would result in a model mis-specification
and possibly violation of Assumption 4.
Is there a moderating variable? Whereas a mediator alters the relationship itself, mod-
erators alter the scale of the relationship between independent and dependent vari-
ables. If an interaction term is not included to capture a moderating relationship, then
the additivity requirement of Assumption 4 is violated. Ismail (2016) provides an
example of a moderating relationship. The author found that the relationship between
training and organizational commitment was higher for individuals oriented toward
higher learning goals. In other words, learning goal orientation was found to be a mod-
erator between training and organizational commitment.
Second, the error terms do not have a constant variance. The heteroscedasticity can
be seen through the negative shape of the residual versus predicted value graph. In
496 Advances in Developing Human Resources 21(4)
9 23.94 .0044
(continued)
497
498
Table 5. (continued)
Note. OLS = ordinary least squares; BLUE = best linear unbiased estimators; GLMM = generalized linear mixed models
Flatt and Jacobs 499
Pictorially, the results of GLMM produce a unique line for each training participant
through the random effects (random intercept and random slope).
The construction of a GLMM involves choosing a link function. The link function
is chosen based on theory and the distribution of the data. The distribution of the ran-
dom component of the dependent variable determines the type of GLMM and the link
function. Common link functions include logit, probit, logarithm, and multinomial
logit (Liao, 1994). For this particular example, a logistic link function and a binomial
distribution would be a good option to accommodate the dependent variable (the prob-
ability of good job performance).
Conclusion
The field of HRD is uniquely equipped with the theory and models to present statisti-
cal methods clearly, concisely, and completely. Based on systems theory, the Regression
Development System provides academics and practitioners a comprehensive over-
view of model development as well as the details in each development phase. This
system provides authors and practitioners a tool to ensure that each principle assump-
tion of OLS is analyzed. In addition, the system is designed to assist with developing
models and identifying limitations.
The intent of the study and the severity of the assumption violations determine if
OLS is the appropriate type of regression. Minor violations of the assumptions might
be acceptable. For example, heteroscedastic models (violation of Assumption 2) are
problematic in forecasting, but coefficient estimates remain unbiased. Similarly, the
assumption of normality (Assumption 3) is technically only required for forecasting.
If the purpose of a study is to examine relationships, then violations of Assumptions 2
and 3 will not impact the results of the study. In such cases, the test of the assumptions
should still be performed, and the reason the violations are acceptable should be noted.
A minimum reporting requirement should include a list of assumptions, the diag-
nostic tools, and the criteria. If the manuscript length is an issue, then the statistical
checks can be provided as supplemental information (Ernst & Albers, 2017). The
Regression Development System provides a tool to promote standard testing of OLS
assumptions and to adhere to the minimum reporting requirements. The principle
assumptions form the basis of OLS regression analysis. An increase in the awareness
and understanding of these assumptions by both authors and audience is necessary to
improve the quality of research.
Acknowledgment
C.F. thanks Eastern Illinois University’s Provost Office and Library.
Funding
The author(s) received no financial support for the research, authorship, and/or publication of
this article.
References
Anderson, C., Verkuilen, J., & Johnson, T. (2012). Applied generalized linear mixed models:
Continuous and discrete data. Retrieved from https://education.illinois.edu/docs/default-
source/carolyn-anderson/edpsy587/GLM_GLMM_LMM.pdf
Angrist, J., & Pischke, J. (2010). The credibility revolution in empirical economics: How better
research design is taking the con out of Econometrics. Journal of Economic Perspectives,
24, 3-30. doi:10.1257/jep.24.2.3
Bhalla, D. (2015). Detect non-linear and non-monotonic relationship between variables. Retrieved
from https://www.listendata.com/2015/03/detect-non-linear-and-non-monotonic.html
Chen, X., Ender, P., Mitchell, M., & Wells, C. (2003a). Regression with SAS. https://stats.idre.
ucla.edu/stat/sas/webbooks/reg/default.htm
Chen, X., Ender, P., Mitchell, M., & Wells, C. (2003b). Regression with Stata. https://stats.idre.
ucla.edu/stat/stata/webbooks/reg/default.htm
Chen, X., Ender, P., Mitchell, M., & Wells, C. (2003c). Regression with SPSS. https://stats.idre.
ucla.edu/stat/spss/webbooks/reg/default.htm
Choi, Y. J., Lee, C., & Jacobs, R. (2015). The hierarchical linear relationship among struc-
tured on-the-job training activities, trainee characteristics, trainer characteristics, training
environment characteristics, and organizational characteristics of workers in small and
medium-sized enterprises. Human Resource Development International, 18, 499-520. doi:
10.1080/13678868.2015.1080046
Ernst, A., & Albers, C. (2017). Regression assumptions in clinical psychology research prac-
tice—A systematic review of common misconceptions. PeerJ, 5, e3323. Retrieved from
https://peerj.com/articles/3323.pdf
Flatt, C. (2019). Selected works of Candace Flatt. Retrieved from https://works.bepress.com/
candace-flatt/
Flatt, C., & Jacobs, R. (2018). The relationship between participation in different types of
training programs and gainful employment for formerly incarcerated individuals. Human
Resource Development Quarterly, 29, 263-286. doi:10.1002/hrdq.21325
Fletcher, L. (2016). Training perceptions, engagement, and performance: Comparing work
engagement and personal role engagement. Human Resource Development International,
19, 4-26. doi:10.1080/13678868.2015.1067855
Garavan, T., McCarthy, A., Sheehan, M., Lai, Y., Saunders, M., Clarke, N., & Shanahan, V.
(2019). Measuring the organizational impact of training: The need for greater methodologi-
cal rigor. Human Resource Development Quarterly, 2019, 1-19.
Hoekstra, R., Kiers, H., & Johnson, A. (2012). Are assumptions of well-known statistical tech-
niques checked, and why (not)? Frontiers in Psychology, 3, 137. Retrieved from https://
www.ncbi.nlm.nih.gov/pmc/articles/PMC3350940/pdf/fpsyg-03-00137.pdf
IBM. (2016). Retrieved from http://www-01.ibm.com/support/docview.wss?uid=swg21476748
Ismail, H. (2016). Training and organizational commitment: Exploring the moderating role of
goal orientation in the Lebanese context. Human Resource Development International, 19,
152-177. doi:10.1080/13678868.2015.1118220
Flatt and Jacobs 501
Jacobs, R. L. (2014). Chapter two: System theory and HRD. In N. Chalofsky, T. Rocco, & L.
Morris (Eds.), Handbook of human resource development (pp. 21-39). New York, NY:
Wiley.
Liao, T. (1994). Interpreting probability models: Logit, probit, and other generalized linear
models. Thousand Oaks, CA: SAGE publications.
Moore, R., & Gorman, P. (2009). The impact of training and demographics in WIA program
performance: A statistical analysis. Human Resource Development Quarterly, 20, 381-396.
doi:10.1002/hrdq.20029
Nau, R. (2018). Statistical forecasting: Notes on regression and time series analysis. Retrieved
from http://people.duke.edu/~rnau/testing.htm
Nimon, K. (2011). Improving the quality of quantitative research reports: A call for action.
Human Resource Development Quarterly, 22, 387-393. doi:10.1002/hrdq.20091
Nimon, K. (2012). Statistical assumptions of substantive analyses across the general linear
model: A mini review. Frontiers in Psychology, 3, 322. doi:10.3389/fpsyg.2012.00322
Nimon, K. (2017). HRDQ submissions of quantitative research reports: Three common com-
ments in decision letters and a checklist. Human Resource Development Quarterly, 28,
281-298. doi:10.1002/hrdq.21290
Osborne, J., & Waters, E. (2002). Four assumptions of multiple regression that researchers
should always test. Practical Assessment, Research, & Evaluation, 8(2), 1-5. Retrieved
from https://pareonline.net/getvn.asp?n=2&v=8
Osbourne, J. (2013). Is data cleaning and the testing of assumptions relevant in the 21st century?
Frontiers in Psychology, 4, 370. doi:10.3389/fpsyg.2013.00370
Park, H. (2008). Univariate analysis and normality test using SAS, Stata, and SPSS (Working
Paper). Retrieved from http://download.brookerobertshaw.com/normality_testing_sas_
spss.pdf
Perktold, J., Seabold, S., & Taylor, J. (2018). Regression diagnostics and specification tests.
Retrieved from https://www.statsmodels.org/dev/diagnostic.html
Smith, M. (2012). Common mistakes in using statistics. Retrieved from https://web.ma.utexas.
edu/users/mks/statmistakes/TOC.html
Song, J. H., & Lim, D. H. (2015). Mediating analysis approaches: Trends and implications for
advanced applications in HRD research. Advances in Developing Human Resources, 17,
57-71. doi:10.1177/1523422314559807
Turner, J. (2015). Hierarchical linear modeling: Testing multilevel theories. Advances in
Developing Human Resources, 17, 88-101. doi:10.1177/1523422314559808
West, B., Welch, K., & Galecki, A. (2015). Linear mixed models: A practical guide using sta-
tistical software. Baca Raton, FL: CRC Press.
Williams, R. (2015). Nonlinear relationships. Retrieved from https://www3.nd.edu/~rwilliam/
stats2/l61.pdf
Wooldridge, J. (2013). Introductory econometrics: A modern approach. Mason, OH:
South-Western, Cengage Learning. Retrieved from http://economics.ut.ac.ir/docu-
ments/3030266/14100645/Jeffrey_M._Wooldridge_Introductory_Econometrics_A_
Modern_Approach__2012.pdf
Zientek, L., Nimon, K., & Hammack-Brown, B. (2016). Analyzing data from a pretest-post-
test control group design: The importance of statistical assumptions. European Journal of
Training and Development, 40, 638-659. doi:10.1108/EJTD-08-2015-0066
502 Advances in Developing Human Resources 21(4)
Author Biographies
Candace Flatt, PhD, is an Economist with the Illinois Department of Employment Security and
an instructor at Millikin University. She earned her doctorate from the University of Illinois at
Urbana–Champaign in Education Policy, Organization, & Leadership with concentrations in
Human Resource Development and Higher Education.
Ronald L. Jacobs, PhD, is a professor of human resource development at the University of
Illinois at Urbana–Champaign.