Professional Documents
Culture Documents
Whether spanking is helpful or harmful to children continues to be the source of considerable debate
among both researchers and the public. This article addresses 2 persistent issues, namely whether effect
sizes for spanking are distinct from those for physical abuse, and whether effect sizes for spanking are
robust to study design differences. Meta-analyses focused specifically on spanking were conducted on a
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
total of 111 unique effect sizes representing 160,927 children. Thirteen of 17 mean effect sizes were
This document is copyrighted by the American Psychological Association or one of its allied publishers.
significantly different from zero and all indicated a link between spanking and increased risk for
detrimental child outcomes. Effect sizes did not substantially differ between spanking and physical abuse
or by study design characteristics.
Around the world, most children (80%) are spanked or other- As this body of work on spanking and physical punishment has
wise physically punished by their parents (UNICEF, 2014). The accumulated, several nagging questions about the quality, consis-
question of whether parents should spank their children to correct tency, and generalizability of the research have persisted. Two
misbehaviors sits at a nexus of arguments from ethical, religious, primary concerns that have been raised about past meta-analyses
and human rights perspectives both in the U.S. and around the are that spanking has been confounded with potentially abusive
world (Gershoff, 2013). Several hundred studies have been con- parenting behaviors in some studies and that spanking has only
ducted on the associations between parents’ use of spanking or been linked with detrimental outcomes in methodologically weak
physical punishment and children’s behavioral, emotional, cogni- studies (Baumrind, Larzelere, & Cowan, 2002; Ferguson, 2013;
tive, and physical outcomes, making spanking one of the most Larzelere & Kuhn, 2005). The goal of the current article is to
studied aspects of parenting. What has been learned from these address these two concerns with a new set of meta-analyses using
hundreds of studies? Several efforts have been made to synthesize the most recent research studies to date. Because the social science
this large body of research, first in narrative form (Becker, 1964; theories regarding why spanking might be linked with child out-
Larzelere, 1996; Steinmetz, 1979; Straus, 2001) and later through comes have been summarized extensively elsewhere (Donnelly &
meta-analyses (Ferguson, 2013; Gershoff, 2002; Larzelere & Straus, 2005; Gershoff, 2002), we will not repeat them here and
Kuhn, 2005; Paolucci & Violato, 2004). Each of these four meta- instead will focus in this paper on key questions about the research
analyses included a different set of articles and came to varied conducted to date.
conclusions, namely that physical punishment is largely ineffective The terms “corporal punishment,” “physical punishment,” and
and harmful (Gershoff, 2002), that physical punishment is effec- “spanking” are largely synonymous in American culture. The
tive under certain circumstances (Larzelere & Kuhn, 2005), and majority of the studies discussed in our literature review use the
that physical punishment is linked with children’s cognitive, emo- term physical punishment which we define as noninjurious, open-
tional, and behavioral problems but only modestly (Ferguson, handed hitting with the intention of modifying child behavior. In
2013; Paolucci & Violato, 2004). These competing conclusions our meta-analyses, however, we focused on the most common
have left both social science researchers and the public at large form of physical punishment which is known in the U.S. as
confused about what outcomes can and cannot be attributed to spanking, and which we define as hitting a child on their buttocks
spanking. or extremities using an open hand.
453
454 GERSHOFF AND GROGAN-KAYLOR
“the use of physical force with the intention of causing a child behavior than disciplinary alternatives. Customary physical
to experience pain but not injury for the purposes of correction punishment was found to predict more detrimental outcomes
or control of the child’s behavior” (per Straus, 2001, p. 4) and when children’s initial levels of child misbehavior were statis-
excluded any methods that would “knowingly cause severe tically controlled, d ⫽ ⫺.19, but was generally not significantly
injury to the child” (Gershoff, 2002, p. 543). All 11 meta- different from other disciplinary tactics, including reasoning,
analyses were significant and all but one indicated an undesir- taking away privileges, and time out, in the strength or direction
able association. Specifically, physical punishment was associ- of its associations with child outcomes. The severe and pre-
ated with more immediate compliance (d ⫽ 1.13) but was dominant categories of physical punishment were consistently
also associated with lower levels of moral internalization associated with detrimental outcomes, such as less compliance,
(d ⫽ ⫺.33), quality of the parent– child relationship lower conscience, lower positive behavior, and higher antiso-
(d ⫽ ⫺.58), and mental health in childhood (d ⫽ ⫺.49) and cial behavior (Larzelere & Kuhn, 2005). The authors concluded
adulthood (d ⫽ ⫺.09), as well as with higher levels of aggres- that, in general, physical punishment was no worse than other
sion in childhood (d ⫽ .36) and adulthood (d ⫽ .57), antisocial disciplinary techniques. This is of course also to say that
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
behavior in childhood (d ⫽ .42) and adulthood (d ⫽ .42), risk physical punishment was no better than other disciplinary tech-
This document is copyrighted by the American Psychological Association or one of its allied publishers.
Spanking Has Been Confounded With Harsher Forms Spanking Has Only Been Linked With Negative Child
of Physical Punishment Outcomes in Cross-Sectional or Methodologically
Weak Studies
The main criticism of the Gershoff (2002) meta-analysis has
been that it included harsh and potentially injurious behaviors, The primary standard for determining causal relations among
such as hitting children with objects, in its definition of physical variables has been the randomized controlled experiment because
punishment (Baumrind et al., 2002; Benjet & Kazdin, 2003; al- potentially confounding selection factors that might distinguish
though note that this criticism applies to the Paolucci & Violato, naturally occurring groups (e.g., spankers and nonspankers) are
2004 meta-analysis as well). This broad definition of physical eliminated through randomization (Shadish, Cook, & Campbell,
punishment included parent behaviors that most professionals and 2001). However, parents’ use of spanking is not easily or ethically
most parents would agree were abusive and that may be linked studied through an experimental design, as children cannot be
with negative outcomes while spanking is not (Kazdin & Benjet, randomly assigned to parents with varying predispositions to
2003). Baumrind, Larzelere, and Cowan (2002) reanalyzed the spank, nor can parents typically be randomly assigned to spank or
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
data from Gershoff (2002), separating out what they deemed harsh not spank. There are a small handful of experimental studies that
This document is copyrighted by the American Psychological Association or one of its allied publishers.
tions (American Academy of Pediatrics, 2012) and intergovern- outcome of interest; and (d) The study included appropriate sta-
mental and human rights organizations (Committee on the Rights tistics for calculating effect sizes. The reasons for exclusion of all
of the Child, 2006), there is a need for definitive conclusions about 1,499 studies are listed the Appendix. Only 75 studies met all four
the potential consequences of spanking for children. The purpose criteria and were retained for the meta-analyses.
of the current study was to conduct a new set of meta-analyses to
address the two unresolved debates described above and to do so Inclusion and Exclusion of Studies From Past
while incorporating an additional 13 years of literature since the Meta-Analyses
first meta-analysis was published (Gershoff, 2002). The present
study is distinguished from the previous meta-analyses by focusing All of the 162 unique studies used in the four previously
exclusively on parents’ use of spanking, by including only peer- published meta-analyses were considered for inclusion, but only
reviewed journal articles, by using random effects meta-analyses, 36 met all of our criteria. Of the 88 studies in Gershoff (2002), 23
and by incorporating several dozen new studies not included in were included in the present study. Paolucci and Violato (2004)
previous meta-analyses. analyzed 70 studies; 16 were included here. Of the 26 studies in
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
Coding of studies involved a two-step process. In the initial step, where sdpooled was calculated as
冑
the titles, abstracts, or full text of the 1,574 studies identified
((n1 ⫺ 1) * sd21) ⫹ ((n2 ⫺ 1) * sd22)
through the sources above were subjected to an initial screening. sdpooled ⫽
Studies were excluded at this stage if they were not relevant to or n1 ⫹ n2 ⫺ 2
usable in the meta-analyses; examples of studies excluded at this Calculation of Cohen’s d was straightforward when an article
stage were literature reviews, studies of beliefs about rather than reported the sample size, mean and standard deviation of a group
use of spanking, and studies that were not available in English. exposed to spanking and one that had never been spanked. For
This initial screening process eliminated 1,016 studies and retained articles that did not report effects as group comparisons, we
558 potential studies. utilized formulas found in Borenstein, Hedges, Higgins, and Roth-
In the second step of coding, each of these 558 potential studies stein (2009) and Johnson (1993) to convert quantitative measures
was coded independently by each of the authors. Any disagree- of association such as correlations and differences of proportions
ments in coding were resolved through follow-up discussion. Stud- to Cohen’s d effect sizes. For each study, we also calculated the
ies were coded as to whether they met several criteria: (a) the study standard error of the estimate of Cohen’s d utilizing formulas
was published in a peer-reviewed journal; all book chapters, un- given in Sterne (2009).
published dissertations, and unpublished conference papers were
excluded, even if they had been included in any of the previously Selection or Aggregation of Single Effect Sizes
published meta-analyses; (b) the study included a measure of
From Studies
parents’ use of customary, noninjurious spanking (or slapping or
hitting) that was intended to be a correction of a child’s misbe- Because meta-analyses are focused on simple effects, only bi-
havior. The terms “spank” or “smack” were used alone or in variate comparisons or correlations can be used (Borenstein et al.,
combination with other general terms (e.g., slap) in 63% of studies. 2009); thus, bivariate associations such as standardized differences
The remaining studies measured corporal punishment as “physical of means or correlations were selected over adjusted coefficients
punishment” or “physical discipline” (19%), “corporal punish- from multivariate models. When both longitudinal and cross-
ment” (10%), and “slap or hit” (8%); (c) the study reported a sectional results were available, the appropriate longitudinal effect
bivariate association between parents’ spanking and the child sizes were use in the meta-analyses in order to obtain the most
SPANKING META-ANALYSES 457
methodologically robust effect size. If a study reported multiple included data from a total of 160,927 unique children. The study-
effect sizes for the same outcome, such as when bivariate associ- level effect sizes, confidence intervals, and sample sizes are listed
ations were reported for subgroups but not the whole sample, the in Table 1. For between-subjects designs, the subsample sizes for
weighted average of these subgroup effect sizes was used as the the subgroup that were spanked and the subgroup that was not
effect size for that study for that outcome. We allowed studies that spanked are presented, whereas for within-subjects designs a sin-
reported effect sizes for more than one of our target outcomes to gle sample size is presented. As a means of graphically represent-
contribute to each appropriate meta-analyses; however, each study ing the effect sizes, this table also includes bar graphs of the effect
(or dataset, in the case of multiple articles from one dataset) was sizes and their corresponding confidence intervals both for the
permitted to contribute only one effect size to each analysis for a individual studies and for the random effects mean effect size for
specific outcome, so that a single individual was only counted once each outcome category. For the purposes of comparison and ag-
in any given meta-analysis for a specific outcome. gregation across meta-analyses, all of the study-level effect sizes
were coded so that larger positive values corresponded to more
Coding of Study-Level Moderators detrimental child outcomes. This meant that for studies in which
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
Seven study characteristics were coded for each study to be the effect sizes were recoded so that higher values reflected ad-
used in moderator analyses: (a) study design (experimental, verse outcomes rather than beneficial outcomes (e.g., low con-
longitudinal, cross-sectional, or retrospective); (b) measure of science).
spanking (observation, parent report, child report, child retro- As the effect sizes and bar graphs in Table 1 indicate, the
spective, or both parent and child reports); (c) index of spanking findings across studies were highly consistent. Of the 111 individ-
(when used [either observed or in an experiment], frequency, ual effect sizes, 102 were in the direction of a detrimental outcome
frequency and severity, ever in time period, or ever in life); (d) with 78 of these statistically significant. In contrast, nine of the
independence of the raters of spanking and the child outcome effect sizes were in the direction of a beneficial outcome but only
(same rater or different raters); (e) time period in which spank- one (Tennant, Detels, & Clark, 1975) was statistically different
ing was administered (observed, last week, last month, last from zero. Thus, among the 79 statistically significant effect sizes,
year, ever, hypothetical, specific time period, or not specified); 99% indicated an association between spanking and a detrimental
(f) the country in which the study was conducted (U.S. or other child outcome.
than U.S.); and (g) the age range of children at the time of Table 2 summarizes the mean weighted effect sizes and confi-
spanking (less than 2-years-old, 2- to 5-years-old, 6- to 10- dence intervals for each outcome along with a Z test for significant
years-old, and 11- to 15-years-old). The authors independently difference from zero and an I2 statistic that estimates the amount of
coded these characteristics for each study. Any discrepancies variation in the mean weight effect size that was attributable to
were resolved through discussion. underlying study heterogeneity. Spanking was significantly asso-
ciated with 13 of the 17 outcomes examined. In each case, spank-
Meta-Analytic Procedure ing was associated with a greater likelihood of detrimental child
outcomes. In childhood, parental use of spanking was associated
Once all study effect sizes had been converted to the metric of with low moral internalization, aggression, antisocial behavior,
Cohen’s d, effect sizes were combined in a meta-analysis. Each externalizing behavior problems, internalizing behavior problems,
study was entered into the model, weighted by its precision (1/sed), mental health problems, negative parent– child relationships, im-
and combined into a weighted average of effect sizes for the paired cognitive ability, low self-esteem, and risk of physical
respective outcome domain. The meta-analyses reported in this abuse from parents. In adulthood, prior experiences of parental use
paper utilized the random effects model (Borenstein et al., 2009; of spanking were significantly associated with adult antisocial
DerSimonian & Laird, 1986) using the Stata command metan behavior, adult mental health problems, and with positive attitudes
(Bradburn, Deeks, & Altman, 2009). The random effects model for about spanking. The remaining four meta-analyses were not sig-
meta-analysis does not assume that there is a single underlying nificantly different from zero. The 13 statistically significant mean
effect size of the studies being analyzed and rather allows effect effect sizes ranged in size from .15 to .64. The overall mean
sizes to differ across studies to account for the fact that study weighted effect size across all of the 111 study-level effect sizes
samples differ by characteristics such as age, gender, race, ethnic- was d ⫽ .33, with a 95% confidence interval of .29 to .38; this
ity and nationality. The random effects model calculates the mean mean effect was statistically different from zero, Z ⫽ 14.84, p ⬍
effect size, an estimate of statistical significance, and a measure of .001.
the heterogeneity of effect sizes in terms of their variation around
the estimated mean effect size. We conducted a separate meta-
analysis for each child outcome as well as an overall meta-analysis Moderator Analyses Comparing Spanking
for all of the studies together. With Physical Abuse
To address the concern that the findings of negative outcomes
Results associated with spanking in past research were a result of the
confounding of spanking with overly harsh or potentially abusive
Main Meta-Analyses methods, we identified seven studies that reported bivariate asso-
ciations for both spanking and physical abuse. The latter was
A total of 111 unique effect sizes were derived from data defined variously as “hitting with fist or object, beating up, kick-
representing 204,410 child measurement occasions; these studies ing, or biting” (Bugental, Martorell, & Barraza, 2003), “beaten to
458 GERSHOFF AND GROGAN-KAYLOR
Table 1
Study-Level Effect Sizes for Spanking by Child Outcome
95%
Spank No spank Confidence Beneficial Detrimental
Individual studies by outcome n n d interval outcomes outcomes
Table 1 (continued)
95%
Spank No spank Confidence Beneficial Detrimental
Individual studies by outcome n n d interval outcomes outcomes
(table continues)
460 GERSHOFF AND GROGAN-KAYLOR
Table 1 (continued)
95%
Spank No spank Confidence Beneficial Detrimental
Individual studies by outcome n n d interval outcomes outcomes
Table 1 (continued)
95%
Spank No spank Confidence Beneficial Detrimental
Individual studies by outcome n n d interval outcomes outcomes
Adult support for physical punishment 1,016 177 .38 .15 .61
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
Note. “Spank n” refers to the subsample in between-subjects designs that reported spanking, or to the entire sample in within-subjects designs. “No spank
n” refers to the subsample in between-subjects designs that did not report spanking.
injury” (Lau, Chan, Lam, Choi, & Lai, 2003), “been injured from used abusive methods of discipline. Two of the studies contrib-
a beating” (Lau et al., 2005), “frequent or severe physical punish- uted more than one effect size, yielding a total of 10 pairs of
ment” (Fergusson, Boden, & Horwood, 2008), “use of a weapon, effect sizes for spanking and physical abuse. The effect sizes
punching, or kicking” (Lynch et al., 2006), “severe physical as- are presented in Table 3. In three cases, the effect size for
sault” (Miller-Perrin, Perrin, & Kocur, 2009), and “physical abuse spanking was larger than that for physical abuse. The weighted
leading to bruising” (Schweitzer, Zavar, Pavlicova, & Fallon, mean effect size for spanking was d ⫽ .25, while for physical
2011). Each of these studies employed a within-subjects design; abuse it was d ⫽ .38. Both were significantly different from
in each case, the same respondent (either a parent or the adult zero and both were positive in sign, indicating that both spank-
child recalling the behavior) reported both how often the parent ing and physical abuse were associated with greater levels of
used spanking and, in a separate question, how often the parent detrimental child outcomes. The magnitude of the mean effect
Table 2
Summary of Spanking Meta-Analyses by Outcome
Table 3
Effect Sizes for Studies That Reported Effect Sizes Separately for Spanking and Physical Abuse
⫺1 0 1 2
Lau et al. (2003) child externalizing behavior spanking .15 ⫺.30 .60
problems physical abuse .65 .19 1.10
Lau et al. (2005) child externalizing behavior spanking .19 .16 .22
problems physical abuse .33 .30 .37
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
Bugental, Martorell, and child mental health problems spanking 1.23 .77 1.69
Barraza (2003) ⫺.21
This document is copyrighted by the American Psychological Association or one of its allied publishers.
Lau et al. (2003) child mental health problems spanking .42 ⫺.03 .87
physical abuse .62 .17 1.07
Lau et al. (2003) child low self-esteem spanking .00 ⫺.45 .45
physical abuse .37 ⫺.08 .82
Fergusson et al. (2008) adult antisocial behavior spanking .45 .33 .57
physical abuse .25 .13 .37
Lynch et al. (2006) adult antisocial behavior spanking .10 .00 .20
physical abuse .51 .41 .62
Fergusson et al. (2008) adult mental health problems spanking .21 .09 .33
physical abuse .55 .43 .66
Miller-Perrin, Perrin, and adult mental health problems spanking .04 ⫺.55 .63
Kocur (2009) physical abuse .58 ⫺.01 1.17
Schweitzer, Zafar, Pavlicova, adult mental health problems spanking 1.12 .38 1.86
and Fallon (2011) physical abuse .96 .23 1.70
size for spanking was 65% of the magnitude of the mean effect (Baumrind et al., 2002); for the studies examine here, no evidence
size for physical abuse. was found that the magnitude or direction of effect sizes was
smaller in longitudinal than cross-sectional studies. Average effect
Moderator Analyses by Study Characteristics sizes also did not significantly vary based on how spanking was
measured, how it was indexed, whether the raters of spanking and
We examined whether study-level effect sizes varied across outcome were independent, the time period over which spanking
seven study-level characteristics using meta-regression to calculate was measured, the country of the study, or the age group of the
average effect sizes by study subgroup (Borenstein et al., 2009; children studied.
Harbord & Higgins, 2009). The results from these moderator
analyses are presented in Table 4. All of the comparisons were
Tests for Publication Bias
nonsignificant, indicating that the effect sizes did not vary by study
characteristic. The finding that the average effect size for longitu- One potential threat to the validity of meta-analyses is what is
dinal studies was the same as that for cross-sectional studies referred to as publication bias, or commonly “the file drawer
( ⫽ ⫺.07, ns) is important in light of the criticism that previous effect:” Namely, how likely is it that there are many studies with
meta-analyses were overly influenced by cross-sectional studies contradictory findings that were not published that would under-
SPANKING META-ANALYSES 463
Table 4
Effect Sizes Moderated by Study Characteristics
mine the conclusions of the meta-analysis? For each meta-analysis, nitive ability, and lower self-esteem. The largest effect size was for
we conducted the publication bias test developed by Egger, Davey physical abuse; the more children are spanked, the greater the risk
Smith, Schneider, and Minder (1997) and implemented in Stata that they will be physically abused by their parents.
(Harbord, Harris, & Sterne, 2009; Steichen, 1998). None of the Three of the four adult outcomes were significantly associated
tests were significant, indicating that the risk of publication bias with a history of spanking from parents: adult antisocial behavior,
for any of our mean effect sizes is small. adult mental health problems, and adult support for physical pun-
ishment. While these findings suggest that there may be lasting
Discussion impacts of spanking that reach into adulthood, they are only
suggestive, as adults who engage in antisocial behavior or who are
The goal of this article was to address two major concerns about
experiencing mental health problems may focus on negative mem-
past meta-analyses of the association of parents’ use of spanking
ories of their childhoods and report more spanking than they
and a range of child outcomes. We will discuss each in turn, but
actually received. The finding that a history of received spanking
begin with a summary of the overall findings from this new set of
is linked with more support for spanking of children as an adult
meta-analyses.
may be an example of intergenerational transmission of spanking,
or it may be an example of adults selectively remembering their
Spanking Is Associated With Higher Risk for past as a way of rationalizing their current beliefs. Only one of the
Detrimental Outcomes 20 effect sizes for outcomes in adulthood was from a prospective
Thirteen of the 17 child outcomes examined were found to be longitudinal study (McCord, 1991). More longitudinal studies are
significantly associated with parents’ use of spanking. Among the needed to confirm the direction of effect.
outcomes in childhood, spanking was associated with more ag- An important observation about the meta-analyses is that the
gression, more antisocial behavior, more externalizing problems, individual studies are highly consistent: 71% of all of the effect
more internalizing problems, more mental health problems, and sizes, and 99% of the significant effect sizes, indicated a signifi-
more negative relationships with parents. Spanking was also sig- cant association between parental spanking and detrimental child
nificantly associated with lower moral internalization, lower cog- outcomes. The only study that found a significant association with
464 GERSHOFF AND GROGAN-KAYLOR
a beneficial outcome, Tennant, Detels, and Clark (1975), had a studies was significantly associated with detrimental outcomes
unique sample (U.S. Army soldiers stationed in West Germany in with an effect size d ⫽ .25, while the mean effect size for physical
1971 and 1972, most of whom were White [77%]), and found that abuse for these studies was significant and d ⫽ .38 (see Table 3).
soldiers who recalled being spanked were less likely to report However, the mean effect size for all studies from Table 2, d ⫽
using amphetamines or opiates. While this is clearly a beneficial .33, is closer to the mean effect size for physical abuse in Table 3
outcome, the uniqueness of the sample limits the generalizability than it is to the mean effect size for spanking from these 10 select
of this finding and may explain why this study is an outlier, as it effect sizes, indicating that spanking and physical abuse have
was in the Larzelere and Kuhn (2005) meta-analyses. relations with child outcomes that are similar in magnitude and
Three outcomes in childhood were not significantly associated identical in direction.
with spanking in the meta-analyses: immediate defiance, child That spanking and physical abuse may have similar associations
alcohol or substance abuse, and low-self regulation. The failure to with child outcomes is consistent with previous literature. Both
reach significance for immediate defiance appears to result from behaviors involve parents intentionally hitting (and hurting) chil-
the small n (150), while for child alcohol or substance abuse and dren, albeit to different degrees (Gershoff, 2013), and most in-
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
low self-regulation the cause appears to be heterogeneity in effect stances of substantiated physical abuse (75%), like all instances of
This document is copyrighted by the American Psychological Association or one of its allied publishers.
sizes. The finding that spanking was not linked with immediate spanking, begin as responses to children’s misbehavior (Durrant et
defiance was unexpected given the opposite findings in the Ger- al., 2006). In addition, many researchers have argued that spanking
shoff (2002) meta-analyses. The disparity arose because we coded and physical abuse are on a continuum of violence against chil-
the effect sizes from the three experimental studies of compliance dren, and that spanking can escalate into physical abuse (Straus,
(Bean & Roberts, 1981; Day & Roberts, 1983; Roberts & Powers, 2001), an argument supported by our finding that spanking was
1990) differently. Unlike Gershoff (2002) in which effect sizes for significantly associated with physical abuse (Table 2; d ⫽ .64).
each study were calculated by subtracting the rate of compliance Clearly not all parents who spank their children also administer
among children in the spanking condition from the rate of com- more severe punishment; as with all of the meta-analyses pre-
pliance among children in the comparison condition, we calculated sented, the association only indicates that milder and more severe
the within condition difference in pre- and postintervention com- corporal punishment are linked, and that the former may increase
pliance rates for the spank and no-spank groups and then sub- the risk that children will also be physically abused.
tracted these two difference scores from each other. Because there
were baseline differences between the treatment and control Spanking Effect Sizes Are Similar Across Study
groups in each study, our effect sizes thus captured the extent to
Characteristics
which spanking was associated with decreases in immediate defi-
ance over baseline. From these five studies, it appears that children A major concern raised about the spanking literature in general
are as likely to defy their parents when they spank as comply with and previous meta-analyses in particular is that their reliance on
them, but future research will be needed to substantiate this con- cross-sectional designs may mask what are truly child-elicitation
clusion. effects (Baumrind et al., 2002; Ferguson, 2013; Larzelere et al.,
Taken together, these meta-analyses support the conclusion that 2004). In other words, associations between spanking and prob-
parents’ use of spanking is associated with detrimental child out- lematic behavior may reflect the fact that difficult children elicit
comes. As most of the included studies were correlational or more spanking from parents, not that spanking causes the prob-
retrospective (72%), causal links between spanking and child lematic behavior in the first place (Baumrind et al., 2002; Larzel-
outcomes cannot be established by these meta-analyses. That said, ere et al., 2004). Longitudinal or experimental designs are needed
given that a correlational association is a necessary condition for a to isolate the direction of effect, and several were available for
causal relationship (Shadish et al., 2001), we can conclude that the inclusion in the meta-regression moderation analyses. While it was
data are consistent with a conclusion that spanking is associated indeed true that the majority of studies (70%) were cross-sectional
with undesirable outcomes. or retrospective in nature, the effect sizes for the longitudinal and
experimental studies were not significantly different from the
Spanking Alone Is Associated With Detrimental effect sizes for the cross-sectional studies (see Table 4). This
finding indicates that methodologically stronger studies did not
Outcomes, but in Similar Ways to Physical Abuse
find significantly smaller effect sizes than methodologically
Our first research question was whether spanking would be weaker studies, lending more confidence to the findings from the
associated with detrimental child outcomes when studies relying main meta-analyses that include both. The mean effect size for
on harsh and potentially abusive methods were removed. The spanking also did not vary by any of the other six study charac-
answer to this question is: Yes, it is. As noted above, all of the teristic moderators. The association between spanking and detri-
mean effect sizes indicated that even when a restricted definition mental child outcomes did not depend on how spanking was
of spanking is used, spanking is associated with detrimental child assessed, who reported the spanking, the country where the study
outcomes. The mean effect size across all studies, d ⫽ .33, was was conducted, or what age children were the focus of the study.
smaller than the overall mean effect size reported by Gershoff Across all categories, methodologically stronger study designs
(2002), d ⫽ .40, but still statistically significant. identified the same risk for negative outcomes as did weaker study
In order to better compare the findings for spanking with those designs, suggesting that the associations between spanking and
for abuse, we identified seven studies that reported effect sizes for child outcomes are robust to study design.
spanking and for physical abuse for the same child outcomes and We were surprised that none of the moderators was significant
conducted a meta-analyses of this set of studies. Spanking in these given that most of the I2 values in Table 2 indicate high levels of
SPANKING META-ANALYSES 465
heterogeneity. There is little guidance in the literature about how time (Gershoff, Ansari, Purtell, & Sexton, 2016). More studies
to interpret significant I2 values when paired with significant and capitalizing on experimental designs are needed.
consistent mean effect sizes. We suspect that the I2 tests are By looking at change over time and by accounting for potential
picking up on expected variability in our independent variable. alternative explanations through statistical methods or by capital-
Unlike clinical trials, for which the I2 was developed, in which izing on data from experimental designs, such studies support the
treatment is systematized, it is nearly impossible to manipulate the conclusion that there is a significant association of parents’ use of
amount or frequency of exposure to spanking and thus every spanking with later child outcomes, over and above children’s
participant’s experience with our independent variable is different. initial behavior and child elicitation effects. None of these studies
Given that 13 of the 17 mean effect sizes were significantly using advanced statistical methods found evidence that the path-
different from zero and that nearly three quarters (71%) of the way is entirely one of selection or child elicitation, or that spanking
studies yielded effect sizes in the direction of detrimental out- predicts improvements in children’s behavior over time, as critics
comes, including nearly all of the significant effect sizes, we of the literature on spanking have contended (Baumrind et al.,
suspect that the I2 is picking up on heterogeneity in spanking itself 2002; Ferguson, 2013; Kazdin & Benjet, 2003; Larzelere et al.,
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
rather than in its associations with child outcomes. 2004). Rather, these studies with strong designs provide more, not
This document is copyrighted by the American Psychological Association or one of its allied publishers.
ⴱ
Barnes, J. C., Boutwell, B. B., Beaver, K. M., & Gibson, C. L. (2013). mal Child Psychology, 11, 141–152. http://dx.doi.org/10.1007/
Analyzing the origins of childhood externalizing behavior problems. BF00912184
ⴱ
Developmental Psychology, 49, 2272–2284. http://dx.doi.org/10.1037/ Deb, S., & Adak, M. (2006). Corporal punishment of children: Attitude,
a0032061 practice and perception of parents. Social Science International, 22,
Baumrind, D., Larzelere, R. E., & Cowan, P. A. (2002). Ordinary physical 3–13.
punishment: Is it harmful? Comment on Gershoff (2002). Psychological DerSimonian, R., & Laird, N. (1986). Meta-analysis in clinical trials.
Bulletin, 128, 580 –589. http://dx.doi.org/10.1037/0033-2909.128.4.580 Controlled Clinical Trials, 7, 177–188. http://dx.doi.org/10.1016/0197-
ⴱ
Bean, A. W., & Roberts, M. W. (1981). The effect of time-out release 2456(86)90046-2
contingencies on changes in child noncompliance. Journal of Abnormal Donnelly, M., & Straus, M. A. (Eds.). (2005). Corporal punishment of
Child Psychology, 9, 95–105. http://dx.doi.org/10.1007/BF00917860 children in theoretical perspective. New Haven, CT: Yale University
Beauchaine, T. P., Webster-Stratton, C., & Reid, M. J. (2005). Mediators, Press. http://dx.doi.org/10.12987/yale/9780300085471.001.0001
moderators, and predictors of 1-year outcomes among children treated ⴱ
Durrant, J. E. (1994). Sparing the rod: Manitobans’ attitudes toward the
for early-onset conduct problems: A latent growth curve analysis. Jour- abolition of physical discipline and implications for policy change.
nal of Consulting and Clinical Psychology, 73, 371–388. Canada’s Mental Health, 41, 2– 6.
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
Becker, W. C. (1964). Consequences of different models of parental Durrant, J., Trocmé, N., Fallon, B., Milne, C., Black, T., & Knoke, D.
This document is copyrighted by the American Psychological Association or one of its allied publishers.
discipline. In M. L. Hoffman & L. W. Hoffman (Eds.), Review of child (2006). Punitive violence against children in Canada. CECW Informa-
development research (Vol. 1, pp. 169 –208). New York, NY: Sage. tion Sheet #41E. Toronto, Canada: University of Toronto, Faculty of
Benjet, C., & Kazdin, A. E. (2003). Spanking children: The controversies, Social Work. Retrieved from www.cecw-cepb.ca/DocsEng/
findings, and new directions. Clinical Psychology Review, 23, 197–224. PunitiveViolence41E.pdf
http://dx.doi.org/10.1016/S0272-7358(02)00206-4 Egger, M., Davey Smith, G., Schneider, M., & Minder, C. (1997). Bias in
ⴱ
Berlin, L. J., Ispa, J. M., Fine, M. A., Malone, P. S., Brooks-Gunn, J., meta-analysis detected by a simple, graphical test. British Medical
Brady-Smith, C., . . . Bai, Y. (2009). Correlates and consequences of Journal, 315, 629 – 634. http://dx.doi.org/10.1136/bmj.315.7109.629
spanking and verbal punishment for low-income White, African Amer- ⴱ
Eisenberg, N., Chang, L., Ma, Y., & Huang, X. (2009). Relations of
ican, and Mexican American toddlers. Child Development, 80, 1403– parenting style to Chinese children’s effortful control, ego resilience,
1420. http://dx.doi.org/10.1111/j.1467-8624.2009.01341.x
and maladjustment. Development and Psychopathology, 21, 455– 477.
Borenstein, M., Hedges, L. V., Higgins, J. P. T., & Rothstein, H. (2009).
http://dx.doi.org/10.1017/S095457940900025X
Introduction to meta-analysis. West Sussex, UK: Wiley. http://dx.doi
Ferguson, C. J. (2013). Spanking, corporal punishment and negative long-
.org/10.1002/9780470743386
ⴱ term outcomes: A meta-analytic review of longitudinal studies. Clinical
Boutwell, B. B., Franklin, C. A., Barnes, J. C., & Beaver, K. M. (2011).
Psychology Review, 33, 196 –208.
Physical punishment and childhood aggression: The role of gender and ⴱ
Fergusson, D. M., Boden, J. M., & Horwood, L. J. (2008). Exposure to
gene-environment interplay. Aggressive Behavior, 37, 559 –568.
childhood sexual and physical abuse and adjustment in early adulthood.
Bradburn, M. J., Deeks, J. J., & Altman, D. G. (2009). Metan—A com-
Child Abuse & Neglect: The International Journal, 32, 607– 619. http://
mand for meta-analysis in Stata. In J. A. C. Sterne (Ed.), Meta-analysis
dx.doi.org/10.1016/j.chiabu.2006.12.018
in Stata: An updated collection from the Stata journal (pp. 3–28). ⴱ
Flynn, C. P. (1999). Exploring the link between corporal punishment and
College Station, TX: Stata Press.
ⴱ children’s cruelty to animals. Journal of Marriage and the Family, 61,
Buehler, C., & Gerard, J. M. (2002). Marital conflict, ineffective parent-
971–981.
ing, and children’s and adolescents’ maladjustment. Journal of Marriage ⴱ
Foshee, V. A., Ennett, S. T., Bauman, K. E., Benefield, T., & Suchindran,
and the Family, 64, 78 –92.
ⴱ C. (2005). The association between family violence and adolescent
Bugental, D. B., Martorell, G. A., & Barraza, V. (2003). The hormonal
costs of subtle forms of infant maltreatment. Hormones and Behavior, dating violence onset: Does it vary by race, socioeconomic status, and
43, 237–244. http://dx.doi.org/10.1016/S0018-506X(02)00008-9 family structure? Journal of Early Adolescence, 25, 317–344.
ⴱ
ⴱ
Burton, R. V., Maccoby, E. E., & Allinsmith, W. (1961). Antecedents of Frias-Armenta, M. (2002). Long-term effects of child punishment on
resistance to temptation in four-year-old children. Child Development, Mexican women: A structural model. Child Abuse & Neglect, 26,
32, 689 –710. 371–386.
ⴱ
ⴱ
Choe, D. E., Olson, S. L., & Sameroff, A. J. (2013). The interplay of Gagné, M. H., Tourigny, M., Joly, J., & Pouliot-Lapointe, J. (2007).
externalizing problems and physical and inductive discipline during Predictors of adult attitudes toward corporal punishment of children.
childhood. Developmental Psychology, 49, 2029 –2039. http://dx.doi Journal of Interpersonal Violence, 22, 1285–1304.
.org/10.1037/a0032054 Gershoff, E. T. (2002). Corporal punishment by parents and associated
Cohen, J. (1988). Statistical power analysis for the behavioral sciences. child behaviors and experiences: A meta-analytic and theoretical review.
Hillsdale, NJ: Erlbaum. Psychological Bulletin, 128, 539 –579.
ⴱ Gershoff, E. T. (2013). Spanking and child development: We know enough
Christie-Mizell, C. A., Pryor, E. M., & Grossman, E. R. B. (2008). Child
depressive symptoms, spanking and emotional support: Differences be- now to stop hitting our children. Child Development Perspectives, 7,
tween African American and European American youth. Family Rela- 133–137. http://dx.doi.org/10.1111/cdep.12038
tions, 57, 335–350. Gershoff, E. T., Ansari, A., Purtell, K. M., & Sexton, H. R. (2016).
Committee on the Rights of the Child. (2006). General comment No. 8 Changes in parents’ spanking and reading as mechanisms for Head Start
(2006): The right of the child to protection from corporal punishment impacts on children. Journal of Family Psychology. Advance online
and or cruel or degrading forms of punishment (articles 1, 28(2), and 37, publication. http://dx.doi.org/10.1037/fam0000172
ⴱ
inter alia), 42nd Sess., U. N. Doc. CRC/C/GC/8. Retrieved from http:// Gershoff, E. T., Grogan-Kaylor, A., Lansford, J. E., Chang, L., Zelli, A.,
www.ohchr.org/english/bodies/crc/docs/co/CRC.C.GC.8.pdf Deater-Deckard, K., & Dodge, K. A. (2010). Parent discipline practices
ⴱ
Coyl, D., Roggman, L., & Newland, L. (2002). Stress, maternal depres- in an international sample: Associations with child behaviors and mod-
sion, and negative mother-infant interactions in relation to infant attach- eration by perceived normativeness. Child Development, 81, 487–502.
ment. Infant Mental Health Journal, 23, 145–163. http://dx.doi.org/10.1111/j.1467-8624.2009.01409.x
ⴱ ⴱ
Day, D. E., & Roberts, M. W. (1983). An analysis of the physical Gest, S. D., Freeman, N. R., Domitrovich, C. E., & Welsh, J. A. (2004).
punishment component of a parent training program. Journal of Abnor- Shared book reading and children’s language comprehension skills: The
SPANKING META-ANALYSES 467
moderating role of parental discipline practices. Early Childhood Re- child psychopathology in Outer Mongolia. Child Psychiatry and Human
search Quarterly, 19, 319 –336. Development, 35, 163–181.
ⴱ ⴱ
Gershoff, E. T., Lansford, J. E., Sexton, H. R., Davis-Kean, P., & Lansford, J. E., Wager, L. B., Bates, J. E., Pettit, G. S., & Dodge, K. A.
Sameroff, A. J. (2012). Longitudinal links between spanking and chil- (2012). Forms of spanking and children’s externalizing behaviors. Fam-
dren’s externalizing behaviors in a national sample of White, Black, ily Relations, 61, 224 –236. http://dx.doi.org/10.1111/j.1741-3729.2011
Hispanic, and Asian American families. Child Development, 83, 838 – .00700.x
843. http://dx.doi.org/10.1111/j.1467-8624.2011.01732.x Larzelere, R. E. (1996). A review of the outcomes of parental use of
ⴱ nonabusive or customary physical punishment. Pediatrics, 98, 824 – 828.
Graziano, A. M., Lindquist, C. M., Kunce, L. J., & Munjal, K. (1992).
ⴱ
Physical punishment in childhood and current attitudes: An exploratory Larzelere, R. E., Klein, M., Schumm, W. R., & Alibrando, S. A., Jr.
comparison of college students in the United States and India. Journal of (1989). Relations of spanking and other parenting characteristics to
Interpersonal Violence, 7, 147–155. self-esteem and perceived fairness of parental discipline. Psychological
ⴱ Reports, 64, 1140 –1142.
Grinder, R. E. (1962). Parental child-rearing practices, conscience, and
resistance to temptation of sixth-grade children. Child Development, 33, Larzelere, R. E., & Kuhn, B. R. (2005). Comparing child outcomes of
803– 820. physical punishment and alternative disciplinary tactics: A meta-
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
Grogan-Kaylor, A. (2005). Relationship of corporal punishment and anti- analysis. Clinical Child and Family Psychology Review, 8, 1–37. http://
This document is copyrighted by the American Psychological Association or one of its allied publishers.
ences on the maternal and child psychological correlates of physical uptake inhibitor treatment. Journal of Clinical Psychopharmacology, 31,
discipline in African American families. Journal of Family Psychology, 365–368. http://dx.doi.org/10.1097/JCP.0b013e31821896c3
ⴱ
21, 165–175. Sears, R. R. (1961). Relation of early socialization experiences to aggres-
ⴱ
Medina, A. M., Mejia, V. Y., Schell, A. M., Dawson, M. E., & Margolin, sion in middle childhood. Journal of Abnormal and Social Psychology,
G. (2001). Startle reactivity and PTSD symptoms in a community 63, 466 – 492.
sample of women. Psychiatry Research, 101, 157–169. ⴱ
Slade, E. P., & Wissow, L. S. (2004). Spanking in early childhood and
ⴱ
Miller-Perrin, C. L., Perrin, R. D., & Kocur, J. L. (2009). Parental physical later behavior problems: A prospective study of infants and young
and psychological aggression: Psychological symptoms in young adults. toddlers. Pediatrics, 113, 1321–1330.
Child Abuse & Neglect, 33, 1–11. http://dx.doi.org/10.1016/j.chiabu Shadish, W. R., Cook, T. D., & Campbell, D. T. (2001). Experimental and
.2008.12.001 quasi-experimental designs for generalized causal inference. New York,
ⴱ
Minton, C., Kagan, J., & Levine, J. A. (1971). Maternal control and NY: Houghton Mifflin.
obedience in the two-year-old. Child Development, 42, 1873–1894. Sheehan, M. J., & Watson, M. W. (2008). Reciprocal influences between
ⴱ
Mulvaney, M. K., & Mebert, C. J. (2007). Parental corporal punishment maternal discipline techniques and aggression in children and adoles-
predicts behavior problems in early childhood. Journal of Family Psy- cents. Aggressive Behavior, 34, 245–255. http://dx.doi.org/10.1002/ab
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
Appendix
Number and Percent of Studies Excluded from the Meta-Analyses by Exclusion Code
Number of studies
Reason for exclusion from meta-analyses excluded Percent
Spanking not linked with child outcomes (e.g., prevalence only). 238 16
Not an empirical article (e.g., a literature review). 221 15
Definition of physical punishment included harsh methods of physical punishment beyond spanking,
slapping, or hitting. 194 13
Spanking was not measured in the study. 171 11
Study was an unpublished dissertation. 104 7
Article was not relevant. 85 6
Attitudes toward, and not use of, physical punishment was assessed. 82 5
This article is intended solely for the personal use of the individual user and is not to be disseminated broadly.
Study did not include a bivariate association between spanking and the child outcome. 61 4
Study was of an intervention to reduce physical punishment. 47 3
Available statistics were unclear, insufficient, or inappropriate for the meta-analyses. 46 3
Spanking was combined with yelling or some form of psychological aggression. 44 3
Study was not available in English. 32 2
Spanking was combined with other types of discipline. 30 2
Study was published as a book chapter or conference presentation. 23 2
Study used same dataset as another study in the meta-analysis. 23 2
Dependent variable did not fit into other outcome categories. 11 1
Spanking was of animals, not children. 5 ⬍1
Article was unavailable through interlibrary loan. 3 ⬍1
Spanking measure included threats of spanking. 3 ⬍1
Physical punishment measure was nontraditional (i.e., aversive noise; washing mouth out with soap). 2 ⬍1
Study involved a special population of children (chromosomal abnormality). 1 ⬍1
Total number of excluded studies 1,499 100%