Professional Documents
Culture Documents
Therapeutic Effectiveness
Author(s): Scott O. Lilienfeld, Lorie A. Ritschel, Steven Jay Lynn, Robin L. Cautin and
Robert D. Latzman
Source: Perspectives on Psychological Science , July 2014, Vol. 9, No. 4 (July 2014), pp.
355-387
Published by: Sage Publications, Inc. on behalf of Association for Psychological Science
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide
range of content in a trusted digital archive. We use information technology and tools to increase productivity and
facilitate new forms of scholarship. For more information about JSTOR, please contact support@jstor.org.
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at
https://about.jstor.org/terms
Sage Publications, Inc. and Association for Psychological Science are collaborating with JSTOR
to digitize, preserve and extend access to Perspectives on Psychological Science
Abstract
The past 40 years have generated numerous insights regarding errors in human reasoning. Arguably, clinical practice
is the domain of applied psychology in which acknowledging and mitigating these errors is most crucial. We address
one such set of errors here, namely, the tendency of some psychologists and other mental health professionals to
assume that they can rely on informal clinical observations to infer whether treatments are effective. We delineate
four broad, underlying cognitive impediments to accurately evaluating improvement in psychotherapy - naive realism,
confirmation bias, illusory causation, and the illusion of control. We then describe 26 causes of spurious therapeutic
effectiveness (CSTEs), organized into a taxonomy of three overarching categories: (a) the perception of client change
in its actual absence, (b) misinterpretations of actual client change stemming from extratherapeutic factors, and
(c) misinterpretations of actual client change stemming from nonspecific treatment factors. These inferential errors
can lead clinicians, clients, and researchers to misperceive useless or even harmful psychotherapies as effective. We
(a) examine how methodological safeguards help to control for different CSTEs, (b) delineate fruitful directions for
research on CSTEs, and (c) consider the implications of CSTEs for everyday clinical practice. An enhanced appreciation
of the inferential problems posed by CSTEs may narrow the science-practice gap and foster a heightened appreciation
of the need for the methodological safeguards afforded by evidence-based practice.
Keywords
psychotherapy, effectiveness, science-practice gap, confirmation bias, illusory correlation, spontaneous remission,
placebo effect, regression to the mean, effectiveness, efficacy
A clinically depressed client obtains psychotherapy; 2 the error of concluding that a treatment worked when
months later, she is free of serious symptoms. Was her the evidence for this inference is insufficient. They can
improvement due to the treatment? commit this mistake when evaluating the effectiveness of
The correct answer is "We don't know." On the one treatment for a given client, the effectiveness of a specific
hand, ample data demonstrate that scientifically sup- school or modality of psychotherapy, or both.
ported psychotherapies can alleviate many mental health This error in reasoning can be found in published
difficulties (Barlow, 2004), so the client's improvementresearch as well. In numerous articles, authors have inter-
may well stem at least partly from the intervention. On preted client improvement following an intervention -
the other hand, as most mental health professionals
even in the absence of differences from a no-treatment
know, we cannot draw valid conclusions regarding control
a group - as evidence for treatment efficacy (e.g.,
treatment's effectiveness in the absence of methodologi-
cal safeguards against errors in inference, such as well-
Corresponding Author:
validated outcome measures, randomized control groups, Scott O. Lilienfeld, Department of Psychology, Room 473, Emory
and blinded observations (Gambrill, 2012). Yet even sea-University, 36 Eagle Row, Atlanta, GA 30322
soned clinicians and researchers can easily fall prey toE-mail: slilien@emory.edu
Illusory causation 1975). For example, when money is at stake, most people
prefer to select a lottery ticket or roll a die themselves
Scottish philosopher David Hume (1748) maintained that
rather than leave these actions to others, even though the
humans are prone to perceiving causal relations in their outcomes in all scenarios do not exceed chance. This
absence. Two centuries later, Michotte (1945) argued that
illusion may predispose therapists to believe that they
our propensity to perceive causal relations between possess more causal power over client outcomes than
events, even those that are causally unrelated, comes to
they do. The illusion of control is especially likely when
us as naturally as does our propensity to perceive color.
the individual in question (a) is personally involved in
Research on illusory causation , or the propensity to per- the behaviors, (b) is familiar with the situation at hand,
ceive a spurious causal relation between two associated (c) is aware of the desired outcome, and (d) has a history
variables, bears out these contentions.
of previous success at the task (Thompson, 1999). Most
Laboratory evidence for illusory causation dates at
or all of these criteria presumably apply to the modal
least to the work of Koffka (1935), who showed observ-
psychotherapist. Indeed, when interventions are consis-
ers two points of light in a dark room. When the points
tently followed by improvement, treatment providers
moved apart, perceivers tended to attribute causality to may conclude that they are the active causal agents when
the dot on which they happened to be focusing, even if they are not (Matute, Yarritu, & Vadillo, 2011).
it was stationary. Koffka's findings suggest that we are
more likely to attach causal significance to the object of
our attention while ignoring competing evidence. Later Implications of cognitive impediments
research demonstrated that illusory causation extends tofor clinicians 9 self-perceptions and
social interactions. When observers are positioned physi-
predictions
cally so as to attend primarily to one partner in a two-
person conversation, they regard him or her as more These four broad cognitive impediments may help to
interpersonal^ influential than the other partner (Taylor explain why some therapists overestimate their positive
& Fiske, 1975; see also McArthur & Solomon, 1978). client outcomes. In this respect, they appear to be no dif-
There are two potential, nonmutually exclusive expla- ferent from professionals in many other fields, including
nations for illusory causation (McArthur, 1980). The first college professors (Cross, 1977), physicians (Hodges,
is perceptual: Individuals tend to attribute causality to Regehr, & Martin, 2001), and political pundits (Tetlock,
whatever stimulus is most vivid and prominent in their 2005), all of whom tend to hold an overly charitable view
visual fields and to accord less causal import to what lies of their effectiveness (Dunning, Heath, & Suis, 2004). In
in the visual background (Lassiter, Geers, Munhall, a sample of 129 therapists in private practice (26.4% psy-
Ploutz-Snyder, & Breitenbecher, 2002). The second is chologists), the average clinician rated him- or herself at
cognitive: Individuals recall more information about stim- the 80th percentile of all therapists in terms of effective-
uli that are prominent in their visual foregrounds than in ness and skills; 25% of respondents placed themselves at
their visual backgrounds (Taylor & Fiske, 1978). With the the 90th percentile. None rated themselves as below
aid of an availability heuristic, by which we gauge the average. Moreover, the typical therapist in the sample
probability of an event by using its accessibility in mem- estimated the rate of client deterioration in his or her
ory (Tversky & Kahneman, 1974), we come to view the caseload to be 3 7% (Walfish, McAlister, O'Donnell, &
former stimuli as more influential. Lambert, 2012). In fact, numerous studies have indicated
Because of illusory causation, therapists, researchers, that about 10% of clients become worse following psy-
clients, and external observers may leap to the conclu- chotherapy (Boisvert & Faust, 2002; Lilienfeld, 2007).
sion that a treatment exerted a causal effect on the client Other evidence dovetails with these results. In a sam-
when it did not (Sloman, 2005). The client's improvement ple of 49 psychotherapists in college counseling centers,
clinicians markedly overestimated their rates of positive
within therapy sessions is plainly visible to the clinician,
whereas rival explanations for this improvement (e.g.,client outcomes (91%) relative to their actual positive out-
events occurring to the client outside of sessions, placebocomes (40%), as ascertained by a standardized symptom
effects, changes in cognitive biases over the course ofmeasure. Furthermore, although therapists predicted that
treatment) rarely are. As a consequence, these explana- only 3 out of a total of 550 clients (0.5%) in their collec-
tions may be assigned less weight. tive caseloads would deteriorate, outcome data revealed
that 40 (7.3%) did so (Hannan et al., 2005). Taken
together, these findings suggest that many or most psy-
Illusion of control chotherapists perceive improvements in clients in their
A related error is the illusion of control , or the propensityabsence and fail to perceive deterioration in their
to overestimate our ability to influence events (Langer,presence.
1. Illusory placebo effects Perceived improvement occurring in the absence of (none additional)
genuine improvement
2. Palliative benefits Feeling better about one's signs and symptoms (none additional)
without tangible improvements in them
3. Confusing insight with Mistaking apparent understanding of one's problem (none additional)
improvement with improvement in that problem
4. Retrospective rewriting of Belief that one has improved arising from a tendency Measures of pretreatment functioning
pre treatment functioning to remember one's pretreatment functioning as
worse than it was
5. Response-shift bias Change in one's evaluation standard with respect Measures of pretreatment functioning
to an outcome dimension as a consequence of
treatment
6. Reduction in cognitive biases Declines in cognitive biases tied to pretreatment Measures of cognitive biases throug
reporting of symptoms treatment
7. Demand characteristics Tendency of clients to report improvement in accord Outcom
with what they believe to be the therapist's or
researcher's hypotheses
8. The therapist's office error Confusion of client's in-session behavioral Out-of-
presentation with out-of-session improvement improvement
9. Test-retest artifacts Tendency of scores on psychopathology measures to Outco
decline spuriously on their second administration skip-out structu
10. Unknowable outcomes in Lack of information regarding what would have Comparis
the control condition occurred had the treatment not been administered condition
11. Selective attrition Tendency of clients who drop out of therapy to Intent-to-treat analyses. Measur
improve less than other clients of pretreatment differences between
treatment completers versus dropouts
12. Compliance bias Tendency for client adherence to treatment Measures of treatment compliance (e.g.,
recommendations to be confounded with variables completion of homework assignments),
that predict improvement Examination of compliance in the
control condition
13. Selective attention to client Tendency of individuals to unwittingly "cherry- Blinding of observers. Explicit a priori
outcomes pick" the outcome variables on which clients are predictions concerning client outcomes
improving
14. Selective memory for client Tendency of individuals to preferentially recall Blinding of observers
outcomes indications of improvement as opposed to those of
no improvement or worsening
15. Selective interpretation of Tendency of individuals to interpret ambiguous Blinding of observers
client outcomes changes in signs or symptoms as indications of
improvement
Category 2 CSTEs: All Category 2 CSTEs: Randomization to
Misinterpretations of actual treatment conditions
client change stemming from
extratherapeutic factors
16. Spontaneous remission Tendency of some psychological conditions to
improve on their own
17. History Widely shared events transpiring outside of Repeated measurements throughout
treatment treatment. Tracking of life events
outside of treatment
18. Cyclical nature of some Tendency of some psychological conditions to go up Long-term follow-ups
disorders and down
19. Self-limiting nature of Tendency of individuals w
disorder episodes conditions to improve once episodes have run
their natural course
(continued)
Table 1. (continued)
Note: CSTEs in each category have one safeguard in common and then, usually, additional specific safeguards.
objectively unchanged
adequately functioning clients use psychotherapy ses- (M
Widlocher, 1991).
sions as opportunities to express their pent-up negative
emotions (see Nichols & Efran, 1985).
7. Demand characteristics. Demand characteristics In other cases, however, the therapist's office error
may contribute to overestimates of treatment effective-
occur when clients or research participants adjust their
behavior, including self-reported behavior, in accordness. For example, clients with social anxiety disorder
(social phobia) involving apprehension of interpersonal
with what they believe to be the therapists' or investiga-
tors' hypotheses (Orne, 1962). The treatment rationale
rejection who are initially anxious in treatment may grow
provided by clinicians can convey potent demand char-more comfortable with the therapist over time, leaving
the therapist (and perhaps clients themselves) with the
acteristics to patients regarding treatment and thereby
shape their attributions, expectations, emotions,misleading
and impression that they are experiencing
improvement in social anxiety symptoms. Yet these cli-
actions (Addis & Carpenter, 2000; McReynolds & Tori,
ents may merely be exhibiting stimulus discrimination,
1972). In one study, participants informed that thoughts
precede affect in response to images (i.e., a cognitive
learning to respond less anxiously to the psychotherapist
or others who provide them with unconditional accep-
therapy rationale) were more likely to report thoughts
first compared with participants informed that affecttance
pre-but not to the very people they find most interper-
cedes thoughts. Differences between the two rationales sonal^ threatening. Indeed, studies of behavior therapy
were especially apparent in response to highly arousingfor anxiety disorders sometimes point to a stimulus gen-
images (Kanter, Kohlenberg, & Loftus, 2004) and were eralization gradient from the therapist's office to the out-
side world, reflecting marked improvements in the former
maintained at a 1-week follow-up (Busch, Kanter, Sedivy,
& Leonard, 2007). setting followed by decrements upon treatment termina-
tion (Gruber, 1971; see Lynch, Chapman, Rosenthal, Kuo,
Moreover, clients are often motivated to tell their ther-
apists what they believe their therapists want to hear;& Linehan, 2006, for a discussion of real-world general-
they may also be motivated to persuade themselvesization that strategies in dialectical behavior therapy). These
they have improved. Hathaway (1948) referred to findings
the underscore the need to ensure that the client's
anxiety-provoking behaviors are addressed in real-world
"hello-goodbye" effect as clients' propensity to present
settings during treatment.
themselves as worse than they actually were at the outset
The therapist's office error may pose a particular chal-
of treatment and better than they actually are at the con-
clusion of treatment. As a consequence of this phenom-lenge for psychoanalytic therapies, which rely heavily on
enon, therapists and other observers may conclude the therapist-client transference as the engine of change.
that
client improvement occurred in its absence. In many respects, one can conceptualize transferences as
reflecting interpersonal expectancies (Westen, 1998).
Similarly, hypnosis researchers have identified a "hold-
Accordingly,
back effect" when participants are tested sequentially in if clients do not generalize their positive
transference reactions toward the therapist to others, their
nonhypnosis and hypnosis conditions. One of the implicit
demands of hypnosis is to behave as a "good" hypnoticlong-term improvements may be limited (Holmes, 1971).
subject should, or at least as this role is understood by
9. Retest artifacts. The retest artifact (Loranger,
the participant (Orne, 1962). The holdback effect can
Lenzenweger, Gartner, & Susman, 1991) is the tendency
arise when participants are not hypnotized during an ini-
tial baseline trial but know they will be hypnotized in of
thescores on psychopathology indices to decline spuri-
ously upon their second administration. This artifact may
following trial. In such cases, they may deliberately "hold
back" from fully responding when they are not hypno- be especially likely with measures characterized by a
tized to demonstrate gains on the later hypnosis trial, skip-out structure, such as many structured and semis-
thereby presenting themselves as good hypnotic subjects tructured interviews. Clients may realize that if they say
(Braffman & Kirsch, 1999; Zamansky, Scharf, & Brightbill, "no" to many questions, they will have a much briefer
1964). and less emotionally distressing experience than if they
say "yes" to them, generating the false appearance of
8. The therapist's office error. What we term the ther- improvement. In other cases, clients may deny more
apist's office error is the mistake of confusing clients' symptoms during the second assessment if they learn
in-session behavioral presentation with out-of-session that the questions concern sensitive behaviors, like drug
improvement. Clients' behavior within the cloistered use or antisocial activities. Indeed, evidence suggests that
confines of the therapist's office may not reflect their this artifact may be especially pronounced for measures
behavior or functioning outside of treatment (Holmes, of socially undesirable characteristics (Jorm, Duncan-
1971; Magaret, 1950). This error may sometimes lead cli- Jones, & Scott, 1989). Although the test-retest artifact has
nicians to underestimate treatment effectiveness, as when not received the research attention it merits, data suggest
22. Multiple treatment interference. When clients seek In the case of medication, some research suggests that
out a treatment, they often obtain other interventions up to 80% of the effects of antidepressants on clinical
simultaneously (Kendall, Butcher, & Holmbeck, 1999), depression,
a especially when it mild or moderate, may be
confound known as multiple treatment interferenceattributable
or to placebo effects (Kirsch, 2005; Kirsch &
co-intervention bias. Some of these adjunctive interven-Sapirstein, 1998; but see Coyne, 2012; Klein, 1998, for dif-
tions may be formal treatments, such as antidepressants ferent views). Placebos generally exert their most potent
or marital therapy. Others may be informal "treatments,"effects on subjective reports, such as depression, pain,
such as exercise, which has generally been found in con-and nausea, rather than on largely objective indices, such
as assays of cancer, heart disease, or other organic ill-
trolled studies to be effective for alleviating depression
(Fremont & Craighead, 1987; Penedo & Dahn, 2005), or nesses (Hróbjartsson & Gotzsche, 2001).
confiding in trusted friends or religious figures. Multiple Placebo effects appear to play an important role in the
treatment interference renders it difficult or impossibleefficacy
to of psychotherapy, too. Estimates of placebo
effects in psychotherapy, typically obtained by compar-
attribute client change conclusively to the active ingredi-
ents of the intervention of choice. ing treatment outcomes from attention-placebo control
groups with those of wait-list control groups, are on the
order of d = 0.40, or about half of the typical effect size
23 ■ Initial misdiagnosis. Even the best trained diagnos-
yielded by active therapies (Grissom, 1996; Lambert,
ticians are fallible (Beyerstein, 1997; Garb, 1998; Groop-
man, 2007). For example, relatively normal individuals 2005; Lambert & Ogles, 2004). Moreover, meta-analyses
Protecting
Research Methods
WelUvalidated outcome indicators . Well-validated
Against Causes of
and largely objective outcome measures help to rule out
Therapeutic Effect
all Category 1 CSTEs, because these CSTEs can engender
A key pointthe false appearance of change
that is in its not
absence. For exam-
em
tion in ple, well-validated indicators
clinical psycholoof depression or anxiety
systematic help research
to exclude - although not eliminate - illusory
des pla-
subject and cebo effects and palliative effects in controlled trials of
single-subje
mize CSTEs major as
depression rival
and anxiety disorders. To be effective
hypo
(Lilienfeld et
safeguards al,
against Category 2008;
1 CSTEs, well-validated out-
respects, come indicators
the should be sensitive not only to client
existence o
raison symptoms for
d'être but also to client impairment.
evidenc Such indicators
knowledge are also useful as protections
this crucialagainst Category 1 CSTEs inp
itly (but controlled single-subject
see Lilienfeld designs. In contrast, demand e
2011, for characteristics can be especially difficult to rule
discussions of out as
dence-based sources of erroneous clinical inference. Nevertheless,
practice).
outcome without
Specifically, measures that are low in reactivityrand (Weiss &
controlled Weisz, 1990), such as extrasession behavioral data or
quasi-experim
subject unobtrusive behavioral observations,
designs, and are at least partial
oth
safeguardsantidotes against this CSTE. Collateral reports
against CSTE from out-
whether side informants (e.g., friends,
client change significant others), which w
opposed tocan supply
a "social
wealth validation" (Kazdin, 1977), can
of be use- e
controlled ful in ruling out the confusion
trials are of insight
notwith improve- s
not ment, retrospective
remove all rewriting of pretreatment functioning,
potential
response shift
Nevertheless, bias, the therapist's office error, and
analyses of similar t
treatment CSTEs. Specifically, these reports can assist
designs based clinicians and
yield more investigators
replicable with excluding the hypothesisresu that client-
perceived change
experimental orin symptoms is (a) limited to behaviors
natural
probably atwithin therapy sessions, (b)
least in illusory, or (c) both.
part b
nate more CSTEs as rival e
As a Pretreatment measures . Collecting
consequence of measures of client
thei
randomized psychological status at pretreatment is especially helpfult
controlled
designs for ruling
justifiably out one specific Category 1 CSTE, namely, ret-
occup
tiary rospective
certainty rewriting of pretreatment
in functioning.
the Specifi-ev
(Ghaemi, cally, such measures can Neverthe
2009). assist in excluding the hypothesis
archy, suchthat clients
as are merely misremembering their initial
quasi-expe
ods, can adjustmentplay
also as worse than it actually was, thereby leading
valuabl
they help to
to spurious inferences
protect of improvement. If these measures
inv
(Wachtel, do not rely exclusively on Moreove
2010). self-report ratings, they can
pensable in also help
the to eliminate response-shift
early biases as explana-
ph
as they tions
allow for apparent improvement.
researchers
can shape the developm
turn, Blinding
these of observers. Blinded observations in con-
intervention
empiricallytrolled clinical trials control partially for several additional c
promising,
ously Category
controlled 1 CSTEs, especially
trials.those stemming from con-
In the firmationsection,
next bias and illusory correlation (i.e., selective
we
attention, memory, and interpretation
methodological procedu of client outcomes).
research When external evaluators
help to are fully blinded,
elimi they cannot
controlled
subtly and selectively perceive, recall, trials do not eliminate
or interpret ambig- Category 2 CSTEs,
uous symptom changes as a function whichof
still arise in these assign-
treatment investigations and can deceive
observers in the
ment. For example, blinded observers inabsence of randomized controlled
a randomized
controlled trial of cognitive-behavior therapy
groups. Nevertheless, theversus a
randomization process helps to
wait-list control for generalizedexclude
anxiety disorder
Category 2 CSTEs asare
rivalless
explanations for thera-
peutic effectiveness,
likely to differentially elicit or cherry-pick because of
indicators these CSTEs are equally
improvement (e.g., reports of less frequent
likely worrying)
in sizeable experimental and in
control groups. Given
the treatment condition. the law of large numbers, these CSTEs should no longer
Nevertheless, these Category 1 CSTEs may be difficult account for between-group differences in randomized
to eliminate entirely. Because therapy outcome studies controlled trials provided that clinical trials are adequately
cannot be strictly double-blinded (i.e., clients and clini- powered (Hsu, 1989). For example, in a randomized con-
cians know who is receiving treatment), confirmation trolled trial, spontaneous remission, history, regression to
bias can still affect ratings of improvement by clients and the mean, maturation, and multiple treatment interfer-
clinicians. Moreover, even the blinding of external observ- ence occur frequently among individuals assigned ran-
ers in psychotherapy trials is rarely infallible, as these domly to both treatment and no-treatment (or alternative
evaluators can often surmise treatment assignment at treatment) conditions. Nevertheless, proper randomiza-
above-chance levels (Carroll, Rounsaville, & Nich, 1994). tion ensures that these CSTEs tend to be equalized across
Assessing potential violations of blinding by asking eval-the active treatment and comparison arms.
uators to guess treatment conditions and using this vari-
able as a covariate in analyses can be a helpful safeguard Repeated measurements. In both between-subject
against selective perception, memory, and interpretation and controlled single-subject experiments, repeated mea-
of client change. Nevertheless, such covariate analyses surements across the course of treatment can help to rule
may underestimate treatment differences (especiallyout history and other extratherapeutic influences as
when based on guesses made at the conclusion of treat- sources of improvement in therapy (Laurenceau, Hayes,
ment), because above-chance guessing could stem from & Feldman, 2007). If one observes changes in treatment
evaluators' accurate observations of differential improve-at multiple time points rather than at only one time point
ment across conditions (Carroll et al., 1994; Rickeis, following an extratherapeutic event (e.g., initiation of a
Lipman, Fisher, Park, & Uhlenhuth, 1970). romantic relationship), ¿he likelihood that such events -
rather than the therapeutic intervention - are contribut-
Intent-to-treat analyses. Intent-to-treat (ITT) analyses ing to improvement is minimized (such observations are
(Hollis & Campbell, 1999) help to rule out one key Cat- also useflil for ruling out novelty effects, a Category 3
egory 1 CSTE, namely, selective attrition. By examiningCSTE). In the context of single-subject designs, multiple
outcomes of all participants enrolled in clinical trials, baseline designs - especially those in which the interven-
including dropouts, ITT analyses minimize erroneoustion is applied to different behaviors in a temporal
inferences of improvement stemming from the fact that sequence - can help to rule out history and other extra-
clients who leave treatment prematurely are often unrep- therapeutic factors as rival explanations for change dur-
resentative of those who initially enrolled (Tehrani et al., ing treatment (Engel & Schutt, 2012; Nock, Michel, &
1996). In contrast to clients who remain in treatment, Photos, 2007). If one consistently observes change in dif-
those who drop out of treatment tend to be lower func-ferent behaviors at different time points, the likelihood
tioning and more psychologically disturbed (Swift &that extratherapeutic factors account for the improve-
Greenberg, 2012), although in a minority of cases they ment is minimized. Finally, long-term follow-up measure-
comprise clients who have improved and no longer per- ments can be helpful in excluding CSTEs arising from the
ceive themselves as requiring treatment (Baekeland & cyclical and self-limiting nature of certain disorders, as
Lundwall, 1975; Tehrani et al., 1996). As a consequence such assessments can ensure that improvements in signs
of the selection biases introduced by client dropout, ITTand symptoms are not transient.
analyses help to avoid misestimating - and typically over-
estimating - treatment effects. Minimizing and estimating measurement error.
The use of pre- and posttreatment indicators with high
reliability will minimize regression to the mean, as this
Protecting against Category 2 CSTEs
statistical phenomenon is most probable when measures
Randomization to treatment conditions. Random- contain substantial amounts of nonsystematic (random)
ization to treatment conditions helps to address the measurement
infer- error. Particularly in quasi-experimental
ential errors generated by Categoiy 2 CSTEs, treatmentwhich studies, investigators should be circumspect in
produce changes stemming from extraneous factors out-
their use of extreme-groups designs (in which partici-
side of treatment. To be clear, well-executed randomized
pants are selected on the basis of very high pretreatment
demoralization"
to minimize illusory placebo effects; (Cook can
they & Campbell,
solicit 1979) as a conse-
information from informants quence
regarding clients'
of not receiving out-of-
treatment afforded to other indi-
viduals,
session behaviors to minimize the or both. Nevertheless,
therapist's office the evidence for this
error;
they can attend diligently to all relevant
assertion client
is mixed (e.g., outcomes
S. A. Elliott & Brown, 2002). Still
other authors
to avoid inadvertent cherry-picking of argue that "treatment
signs and symp- as usual" conditions,
toms; they can be alert to the which
fact that
often serve client
as control improve-
groups in psychotherapy
ments over time may reflect outcome
regression
designs, are to the mean,
best conceived of as "intent to fail"
conditions,
history, and other artifacts; they candefined
attend (perhaps
totendentiously)
potential as "pseudo-
treatments designed
client characteristics that may moderate the specifically
likelihood as control
of groups to
CSTEs; and so on. In this respect,
prove they canofadopt
the superiority a scien-
the investigator's preferred treat-
ment and that
tific mind-set while bearing in mind that have various
no theoretical sources
rationale or are deliv-
of inferential error cannot be ered by graduate students
completely who know they are
eliminated.
Hence, the local clinical scientist model,
administering although
treatment not
that is not an to work"
supposed
adequate substitute for scientist-practitioner or If
(Westen & Bradley, 2005, p. 267). clinical
these critics are cor-
science models of training (see Baker
rect, et al.,
some standard 2008),
psychotherapy is adesigns may
outcome
helpful reminder that clinicians should
overestimate continually
the efficacy oper- or gen-
of beneficial treatments
ate as "detectives" who strive toerate identify
the mistaken conclusion that inefficacious
potential rival treat-
sources of improvement and who
ments (a) minimize these
are efficacious.
sources when they can and (b) bear
Third, them
some readers mightin mind
contend that ouras
core argu-
ments
inferential constraints when they are rendered effectively moot by the Dodo Bird
cannot.
verdict, named after the Dodo Bird in Lewis Carroll's
Adventures of Alice in Wonderland , who proclaimed fol-
Limitations of our analysis
lowing a race that "Everybody has won, and all must
Our analysis is limited in at least three respects. First, our
have prizes." This verdict posits that all psychotherapies
review leaves unresolved the question of how often each are (a) effective and (b) equivalent in their effectiveness,
of the 26 CSTEs we have identified contributes to errone- both overall and for all disorders (Luborsky, Singer, &
ous inferences in actual clinical practice. As in many Luborsky, 1975; Shedler, 2010; Wampold et al., 1997). If
domains of psychology, one must distinguish "can" from the Dodo Bird verdict is correct, the reasoning continues,
"does" in discussions of causality (McCall, 1977). The fact CSTEs are of little or no concern because all treatments
that a CSTE can lead to incorrect inferences of therapeu- work, and work equally well (see Stewart et al., 2011).
tic effectiveness does not tell us how often it does so. Nevertheless, the Dodo Bird verdict has historically
Research examining therapists' knowledge of and under-referred to a rough equivalence in the effectiveness of
standing of CSTEs, both in the abstract and in real-world different schools of therapy (e.g., psychodynamic, cogni-
practice, would be a useful starting point in addressingtive-behavioral) rather than to a precise equivalence in
this question. the efficacy of all specific treatments (e.g., Smith, Glass,
Second, we have focused only on inferential errors & Miller, 1980). Moreover, the assertion that all therapies
that apply to everyday clinical practice. We have not are of equal efficacy, either overall (a main effects hypoth-
examined the many methodological decisions that can esis) or for all conditions (an interactional hypothesis), is
generate spurious inferences of treatment effectiveness in difficult to sustain (Lilienfeld, 2014; but see Wampold,
research studies of all kinds. For example, the file-drawer 2001, for a more sanguine perspective on the Dodo Bird
effect (Rosenthal, 1979), which is the bias against submit- verdict). For example, well-replicated data indicate that
ting negative results for publication, and outcome report- exposure-based therapies are more efficacious than other
ing bias (Chan & Altman, 2005), which is the propensitytreatments for at least some anxiety-related disorders
to cherry-pick data on dependent measures that yield (e.g., obsessive-compulsive disorder) and that behavioral
positive results, can lead to overestimates of treatmenttherapies are more efficacious than nonbehavioral thera-
efficacy. Recent data also raise the possibility of a discon- pies for child and adolescent behavioral problems
certingly high prevalence of "p-hacking," that is, analyz- (Chambless & Ollendick, 2001; Hunsley & Di Giulio,
ing data - or peeking repeatedly at already collected2002). A meta-analysis by Tolin (2010) similarly revealed
data - until alpha levels fall just below .05 (Masicampo &that behavioral and cognitive-behavior therapies are
Lalande, 2012). More directly relevant to clinical practice, more efficacious than other therapies for anxiety and
some researchers also contend that the use of wait-list mood disorders. Further calling into question the Dodo
control groups contributes to overestimates of psycho- Bird verdict are findings that at least some interventions,
therapy efficacy, because clients in these groups maysuch as CISD, are at best ineffective and perhaps harmful
deteriorate as they await treatment, experience "resentful(Lilienfeld, 2007; McNally et al., 2003). Even Bruce
Wampold, Notes
a prominent p
dict, acknowledges
1. Note that in a case in which the client'sthatdistress concerning
"bona-fide" therapies,
her self-centeredness and hostility was itself a treatmentna focus,
chological principles,
the response-shift bias would not be considered a CSTE. del
therapists,2. A and
related cognitive error
laidis the subset fallacy
out (Dawes, 2001).
publicationsA treatment may not
(Wampold differ in its effects from those of a control
&
treatment;
Furthermore, yet on a post
our hoc basis, investigators
discu may identify
a subset of clients within the treatment group who displayed
only to "schools" of psy
positive outcomes, leading them to conclude that the treatment
therapeutic techniques,
was efficacious for that subset. Yet if there was no mean differ-
m
treatment modalities. Th
ence between the treatment and control groups, mathematically
to false
inferences regardi
the individuals in the treatment group outside of the subset
techniques must
delivered with
have become worse following the intervention.
interpretation of
3. Nevertheless, especially a with
among clients clien
marked anxiety
imparted to a
and depression, client,
memory or
biases may run in the opposite direc-
ent and clinician. Hence,
tion, predisposing them to selectively recall threatening or sad
Dodo Bird information
verdict, (Coles & Heimberg, 2002; R. Elliott,
the Rubinsztein,
i
CSTEs in Sahakian, &psychother
the Dolan, 2002).
References
Closing thoughts
Addis, M. E., & Carpenter, K. M. (2000). The treatment rationale
The challenges posed
in cognitive-behavioral by
therapy: Psychological mechanisms
simism, letand clinical
alone guidelines. Cognitivenihilis
Therapy and Research ,
practice or 24, 313-326.
research sett
associated with them are to some extent surmountable.
Alexander, F. F., & French, T. M. (1946). Psychoanalytic ther-
Nevertheless, CSTEs underscore the pressing need to apy: Principles and application. Oxford, England: Ronald
Press.
inculcate humility in clinicians, researchers, and students
(McFall, 1991). We are all prone to neglecting CSTEs, notAlpert, J. (2012, April 22). In therapy forever? Enough already.
The New York Times , p. SR5.
because of a lack of intelligence but because of inherent
American Psychiatric Association. (2013). Diagnostic and sta-
limitations in human information processing (Kahneman,
tistical manual of mental disorders (5th ed.). Washington,
2011). As a consequence, all mental health professionals DC: American Psychiatric Association.
and consumers should be skeptical of confident procla-American Psychological Association Task Force on Evidence-
mations of treatment breakthroughs in the absence of rig- Based Practice. (2006). Report of the 2005 Presidential
orous outcome data (Dawes, 1994; Lilienfeld et al., 2003). Task Force on Evidence-Based Practice. Washington, DC:
CSTEs are potent reminders that although our intuitions American Psychological Association.
are at times accurate, they can be misleading. When eval-Axsom, D., & Cooper, J. (1985). Cognitive dissonance and
uating treatment effectiveness, our intuitions may fail to psychotherapy: The role of effort justification in inducing
account for numerous rival hypotheses for change that weight loss. Journal of Experimental Social Psychology , 21 ,
149-160. doi: 10. 1016/0022-1031(85)90012-5
are difficult or impossible to detect without the aid of
Bachrach, H. M., Galatzer-Levy, R., Skolnikoff, A., & Waldron,
finely honed research safeguards. As a consequence,
S. (1991). On the efficacy of psychoanalysis. Journal of the
CSTEs highlight the inherent limits of our knowledge as American Psychoanalytic Association, 39 , 871-916.
applied to the individual client and should impel us to beBaekeland, F., & Lundwall, L. (1975). Dropping out of
mindful of our propensities toward overconfidence. treatment: A critical review. Psychological Bulletin , 82,
Science, which is a systematic approach to reducing 738-783.
uncertainty in our inferences (McFall & Treat, 1999;Baker, T. B., McFall, R. M., & Shoham, V. (2008). The current
O'Donohue & Lilienfeld, 2007), is ultimately our best pre- status and future of clinical psychology: Towards a scien-
scription against being deceived by inadequate evidence. tifically principled approach. Psychological Science in the
Public Interest, 9, 67-103.
Acknowledgments Baldwin, S. A., Wampold, B. E., & Imel, Z. E. (2007). Untangling
the alliance-outcome correlation: Exploring the relative
The authors thank Sean Carey and Ben Johnson for their valu-
importance of therapist and patient variability in the alli-
able assistance with compiling references.
ance. Journal of Consulting and Clinical Psychology , 75,
842-852 . doi : 10. 1037/0022-006X.75 .6.842
Declaration of Conflicting Interests Barkham, M., Hardy, G. E., & Mellor-Clark, J. (2010). Developing
The authors declared that they had no conflicts of interest with and delivering practice-based evidence. Chichester,
respect to their authorship or the publication of this article. England: Wiley.
Chapman, L.American
J., Psychologist,& 55, 264-268. doi: 10. 1037/0003-
Chapma
but erroneous066X.55.2.264 psychodiag
Abnormal Dehue, T. (2005). History of the control group. In B. Everitt
Psychology, 72 & ,
Chwalisz, K. D. Howell (Eds.), Encyclopedia of statistics Evide
(2003). in the behav-
for ioral science (do. 829-836). Chichester, England: Wilev.
twenty-first-century
Counseling DeRubeis, R. J., Brotman, M. A., & Gibbons, C. J. (2005). A
Psychologist, 3
Coles, M. E., &
conceptual Heimberg
and methodological analysis of the nonspecifics
the anxiety argument. Clinical Psychology: Science and Practice, 12,
disorders: Cur
Review, 22 ,
174-183. 587-627.
doi: 10. 1093/clipsy/bpi022 doi:
Conway, M.,DeRubeis,& R. J., & Feeley,
Ross, M. (1990). DeterminantsM.of change (1
revising what in cognitive therapy
you for depression. Cognitive
had. Therapy and Jo
Psychology , Research,
47, 14, 469-482. doi: 10. 1007/BF01 172968
738-748. d
Cook, T. D.,Diefenbach,
& G. J., Diefenbach,
Campbell, D., Baumeister, A., & West, M. D
Design and (1999). Portrayal of lobotomy in the popular press:
analysis 1935-
for
McNally. 1960. Journal of the History of the Neurosciences, 8, 60-69.
Cooper, J. (1980). Reducing fears and increasing assertiveness: doi: 10. 1076/jhin.8. 1 .60. 1766
The role of dissonance reduction. Journal of Experimental Dimidjian, S., & Hollon, S. D. (2010). How would we know if
Social Psychology, 16, 199-213. doi: 10. 1016/0022-1031 psychotherapy were harmful? American Psychologist, 65,
(80)90064-5 21-33.
Cooper, J., & Axsom, D. (1982). Effort justification in psycho- D'Silva, K., Duggan, C., & McCarthy, L. (2004). Does treatment
therapy. In G. Weary & H. Mireis (Eds.), Integrations of really make psychopaths worse? A review of the evidence.
clinical and social psychology (pp. 98-121). New York, NY: Journal of Personality Disorders, 18, 163-177. doi: 10. 1521/
Oxford University Press. pedi. 18.2. 163.32775
Coover, J. E., & Angell, F. (1907). General practice effect of spe- Dunning, D., Heath, C., & Suis, J. M. (2004). Flawed self-
cial exercise. American Journal of Psychology, 18, 328-340. assessment: Implications for health, education, and the
doi: 10.2307/1412596 workplace. Psychological Science in the Public Interest, 5,
Coren, S. (2003). Sensation and perception. In D. K. Freedheim 69-106. doi:10.1111/j.l529-1006.2004.00018.x
(Ed.), Handbook of psychology: History of psychology (Vol. Edens, J. F., Skeem, J. L., Cruise, K. R., & Cauffman, E. (2001).
1, pp. 85-108). Hoboken, NT: Wiley. Assessment of "juvenile psychopathy" and its association
Coronary Drug Project Research Group. (1975). Clofibrate and with violence: A critical review. Behavioral Sciences & the
niacin in coronary heart disease. Journal of the American Law, 19, 53-80. doi:10.1002/bsl.425
Medical Association, 231, 360-381. Elliott, R., Rubinsztein, J. S., Sahakian, B. J., & Dolan, R. J.
Corsini, R. J., & Wedding, D. (2010). Current psychotherapies. (2002). The neural basis of mood-congruent processing
Pacific Grove, CA: Wadsworth. biases in depression. Archives of General Psychiatry, 59,
Costello, E. J., & Janiszewski, S. (1990). Who gets treated? 597-604. doi: 10. 1001/archpsyc.59.7.597
Factors associated with referral in children with psy- Elliott, S. A., & Brown, J. L. (2002). What are we doing to wait-
chiatric disorders. Acta Psychiatrica Scandinavica , 81, ing list controls? Behaviour Research and Therapy, 40,
523-529. 1047-1052. doi:10.10l6/S0005-7967(01)00082-l
Coyne, J. (2012). The antidepressant wars, a sequel: How theEllis, A. (2003). Helping people get better rather than merely
media distort findings, and do harm to patients. PLOSBlogs. feel better. Journal of Rational-Emotive & Cognitive-
Retrieved from http://blogs.plos.org/mindthebrain/tag/ Behavior Therapy, 21, 169-182.
placebo-washout/ Engel, R. J., & Schutt, R. K. (2012). The practice of research in
Cromer, A. (1993). Uncommon sense: The heretical nature of social work. New York, NY: Sage.
science. New York, NY: Oxford University Press. Ernst, E., & Resch, K. L. (1995). Concept of true and perceived
Cross, K. P. (1977). Not can, but will college teaching be placebo effects. British Medical Journal, 311 , 551-553.
improved? New Directions for Higher Education, 19 77(17),Eysenck, H. J. (1952). The effects of psychotherapy: An eval-
1-15. uation. Journal of Consulting Psychology, 16, 319-324.
Crumlish, N., & Kelly, B. D. (2009). How psychiatrists think. doi:10.1037/h0063633
Advances in Psychiatric Treatment, 15, 72-79. Finocchiaro, M. (1981). Fallacies and the evaluation of reason-
Dawes, R. M. (1986). Representative thinking in clini- ing. American Philosophical Quarterly, 18, 13-22.
cal judgment. Clinical Psychology Review , 6, 425-441. Fraenkel, J. R., & Wallen, N. E. (1993). How to design and
doi: 10. 1016/0272-7358(86)90030-9 evaluate research in education (2nd ed.). New York, NY:
Dawes, R. M. (1994). House of cards: Psychology and psycho- McGraw-Hill.
therapy built on myth. New York, NY: Free Press. Frank, J. D., & Frank, J. B. (1961). Persuasion and healing.
Dawes, R. M. (2001). Everyday irrationality: How pseudo-scien- Baltimore, MD: Johns Hopkins University Press.
tists, lunatics , and the rest of us systematically fail to thinkFrazier, P., Tennen, H., Gavian, M., Park, C., Tomich, P., &
rationally. Boulder, CO: Westview Press. Tashiro, T. (2009). Does self-reported posttraumatic growth
Dehue, T. (2000). From deception trials to control reagents: reflect genuine positive change? Psychological Science , 20,
The introduction of the control group about a century ago. 912-919.
clinical-statisticalL.
Fremont, J., & Craighead, controversy.
W. Psychology,
(1987).Public Policy , Aerobi
cognitive therapy in and
the Law, 2, 293-323.
treatmentdoi: 10. 1037/1076-8971. 2.2.293
of dyspho
Gruber, R.Research,
Cognitive Therapy and P. (1971). Behavior therapy: Problems
11 in,general- 241-251. d
BF01 183268 ization. Behavior Therapy, 2, 361-368. doi:10.10l6/S0005-
Gambrill, E. (2012). Critical thinking in clinical prac- 7894(71)80070-9
tice: Improving the quality of judgments and decisions. Hall, H. (2011). Evidence-based medicine, tooth-fairy science,
Hoboken, NJ: Wiley. and Cinderella medicine. Skeptic , i Xl), 4-5.
Garb, H. N. (1998). Studying the clinician: Judgment research Hamilton, D. L., & Gifford, R. K. (1976). Illusory correlation in
and psychological assessment. Washington, DC: American interpersonal perception: A cognitive basis of stereotypic
Psychological Association, doi: 10. 1037/10299-002 judgments. Journal of Experimental Social Psychology, 12,
Garb, H. N. (2005). Clinical judgment and decision making. 392-407. doi:10.10l6/S0022-1031(76)80006-6
Annual Review of Clinical Psychology , 1, 67-89. Hannan, C., Lambert, M. J., Harmon, C, Nielsen, S., Smart, D.
Garb, H. N., Lilienfeld, S. O., & Fowler, K. A. (2008). Psychological W., Shimokawa, K., & Sutton, S. W. (2005). A lab test and
assessment and clinical judgment. In J. E. Maddux & B. A. algorithms for identifying clients at risk for treatment failure.
Winstead (Eds.), Psychopathology: Foundations for a con- Journal of Clinical Psychology, 61, 155-163. doi: 10. 1002/
temporary understanding (2nd ed., pp. 103-124). New jclp.20108
York, NY: Routledge. Harrington, R., Whittaker, J., Shoebridge, P., & Campbell, F.
Garfield, S. (1994). Research on client variables in psychother- (1998). Systematic review of efficacy of cognitive behaviour
apy. In A. E. Bergin & S. Garfield (Eds.), Handbook of psy- therapies in childhood and adolescent depressive disorder.
chotherapy and behavior change (4th ed., pp. 190-228). British Medical Journal, 316, 1559-1563.
Oxford, England: Wiley. Harrison, S. (2011, August 15). "Beyond scared straight" producer
Geers, A. L., Weiland, P. E., Kosbab, K., Landry, S. J., & Heifer, shares secret s of A&E's hit series. Channel Guide Magazine.
S. G. (2005). Goal activation, expectations, and the pla- Available from http://www.channelguidemagblog.com
cebo effect. Journal of Personality and Social Psychology , Hartman, S. E. (2009). Why do ineffective therapies seem help-
89, 143-159. ful? A brief review. Chiropractic and Osteopathy, 17, 1-7.
Gehan, E. A., & Lemak, N. A. (1994). Statistics in medical doi: 10. 1 186/1746-1340-17-10
research: Developments in clinical trials. New York, NY: Hathaway, S. R. (1948). Some considerations relative to nondi-
Plenum. rective counseling as therapy. Journal of Clinical Psychology,
Ghaemi, S. N. (2009). A clinician's guide to statistics and epi- 4, 226-231. doi:10.1002/1097-4679(194807)4:3<226::AID-
demiology in mental health: Measuring truth and uncer- JCLP2270040303>3.0.CO;2-V
tainty. Cambridge, England: Cambridge University Press. Healy, D. (2002). Randomized controlled trials: Evidence biased
Gigerenzer, G., & Gaissmaier, W. (2011). Heuristic decision psychiatry. Retrieved from http://www.ahrp.org/COI/
making. Annual Review of Psychology, 62, 451-482. healy0802.php
Gilovich, T. (1991). How we know what isn't so: The fallibility Herbert,
of J. D. (2003). The science and practice of empirically
human reason in everyday life. New York, NY: Free Press. supported treatments. Behavior Modification, 27, 412-430.
Goldfried, M. R. (2010). The future of psychotherapy inte- Hodges, B., Regehr, G., & Martin, D. (2001). Difficulties in
gration: Closing the gap between research and practice. recognizing one's own incompetence: Novice physicians
Journal of Psychotherapy Integration, 20, 386-396. who are unskilled and unaware of it. Academic Medicine,
Green, D., & Latchford, G. (2012). Practice-based evidence. In 76C Suppl. 10), S87-S89.
D. Green & G. Latchford (Eds.), Maximising the benefits Hofmann, S. G. (2008). Cognitive processes during fear acquisi-
of psychotherapy: A practice-based evidence approach (pp. tion and extinction in animals and humans: Implications for
87-108). Chichester, United Kingdom: Wiley-Blackwell. exposure therapy of anxiety disorders. Clinical Psychology
Greenwald, A. G., Spangenberg, E. R., Pratkanis, A. R., & Review, 28, 199-210.
Eskenazi, J. (1991). Double-blind tests of subliminal Hollis, S., & Campbell, F. (1999). What is meant by intention to
self-help audiotapes. Psychological Science, 2, 119-122. treat analysis? Survey of published randomised controlled
doi: 10. 1 1 1 1/j. 1467-9280. 1991 .tbOOl 12 .x trials. British Medical Journal, 319, 610-61 A. doi: 10. 1136/
Grissom, R. J. (1996). The magical number .7±.2: Meta-meta- bmj. 3197211.670
analysis of the probability of superior outcome in compari- Holmes, D. S. (1971). Round Robin therapy: A technique
sons involving therapy, placebo, and control. Journal of for implementing the effects of psychotherapy. Journal
Consulting and Clinical Psychology, 64, 973-982. of Consulting and Clinical Psychology, 37, 324-331.
Grodstein, F., Clarkson, T. B., & Manson, J. E. (2003). doi : 1 0 . 1 037/h003 1 968
Understanding the divergent data on postmenopausal hor- Horney, K. (1945). Our inner conflicts. Oxford, England: W.W.
mone therapy. New England Journal of Medicine, 348, Norton.
645-650. Horowitz, M. J., Wilner, N., & Alvarez, W. (1979). Impact of
Groopman, J. (2007). How doctors think. Boston, MA: Houghton Event Scale: A measure of subjective stress. Psychosomatic
Mifflin. Medicine , 41, 209-218.
Grove, W. M., & Meehl, P. E. (1996). Comparative efficiency Horowitz, S. (2012). New perspectives on the placebo effect:
of informal (subjective, impressionistic) and formal Implications for research and clinical practice. Alternative
(mechanical, algorithmic) prediction procedures: The and Complementary Therapies , 18, 130-135.
Horvath, A.symptoms
O., Del
and personality. Re,
Psychological Medicine, 19, A
(2011). 487-493. doi: 10.
Alliance in 1017/S003329170001 25 14
individuai
48 , 9-16. Kahneman, D. (1965). Control of spurious association and the
Hoshmand, L. T., & Polkinghorne, D. E. (1992). Redefining reliability of the controlled variable. Psychological Bulletin,
the science-practice relationship and professional training. 64, 326-329. doi:10.1037/h0022529
American Psychologist, 47 , 53-66. Kahneman, D. (2011). Thinking, fast and slow. New York, NY:
Howard, G. S. (1980). Response-shift bias: A problem in evalu- Farrar, Straus and Giroux.
ating interventions with pre/post self-reports. Evaluation Kahneman, D., & Klein, G. (2009). Conditions for intuitive
Review, 4 , 93-106. doi: 10. 1177/0193841X8000400105 expertise: A failure to disagree. American Psychologist, 64,
Howard, G. S., & Dailey, P. R. (1979). Response-shift bias: A 515-526. doi: 10. 1037/a00 16755
source of contamination of self-report measures. Journal Kanter, J. W., Kohlenberg, R. J., & Loftus, E. F. (2004).
of Applied Psychology , 54, 144-150. doi: 10. 1037/002 1- Experimental and psychotherapeutic demand characteris-
9010.64.2.144 tics and the cognitive therapy rationale: An analogue study.
Howard, K. I., Kopta, S., Krause, M. S., & Orlinsky, D. E. (1986). Cognitive Therapy and Research, 28, 229-239.
The dose-effect relationship in psychotherapy. American Kazdin, A. E. (1977). Assessing the clinical or applied importance
Psychologist , 159-164. doi:10.1037/0003-066X.4l.2.159 of behavior change through social validation. Behavior
Hróbjartsson, A., & Gotzsche, P. C. (2001). Is the placebo pow- Modification , 1, 427-452. doi: 10. 1177/014544557714001
erless? An analysis of clinical trials comparing placebo Kazdin, A. E. (2007). Systematic evaluation to improve the qual-
with no treatment. New England Journal of Medicine, 344 , ity of patient care: From hope to hopeful. Pragmatic Case
1594-1602. doi: 10. 1056/NEJM200105243442 106 Studies in Psychotherapy, 3(4), 37-49.
Hsu, L. M. (1989). Random sampling, randomization, and equiv- Kazdin, A. E., & Nock, M. K. (2003). Delineating mechanisms
alence of contrasted groups in psychotherapy outcome of change in child and adolescent therapy: Methodological
research. Journal of Consulting and Clinical Psychology , issues and research recommendations. Journal of Child
57, 131-137. doi: 10. 1 037/002 2-006X. 57. 1.131 Psychology and Psychiatry, 44, 1116-1129.
Huck, S. W., & Sandler, H. M. (1979). Rival hypotheses: Kelly, J. F., & Moos, R. (2003). Dropout from 12-step self-help
Alternative interpretations of data based conclusions. New groups: Prevalence, predictors, and counteracting treat-
York, NY: Harper & Row. ment influences. Journal of Substance Abuse Treatment, 24,
Hume, D. (1748). Enquiry concerning human understanding. 241-250. doi: 10. 10l6/S0740-5472(03)0002 1-7
London, England: P. F. Collier & Son. Kendall, P. C., Butcher, J. N., & Holmbeck, G. N. (1999).
Hunsley, J., & Di Giulio, G. (2002). Dodo bird, phoenix, or Handbook of research methods in clinical psychology (2nd
urban legend? The question of psychotherapy equiva- ed.). Hoboken, NJ: Wiley.
lence. Scientific Review of Mental Health Practice: Objective Kienle, G. S., & Kiene, H. (1997). The powerful placebo effect:
Investigations of Controversial and Unorthodox Claims in Fact or fiction? Journal of Clinical Epidemiology , 50, 1311-
Clinical Psychology, Psychiatry, and Social Work , 1, 11-22. 1318.
Ilardi, S. S., & Craighead, W. (1999). Rapid early response, Kirsch, I. (2005). Placebo psychotherapy: Synonym or oxymo-
cognitive modification, and nonspecific factors in cogni- ron? Journal of Clinical Psychology, 61, 791-803.
tive behavior therapy for depression: A reply to Tang and Kirsch, I. (2013). The placebo effect revisited: Lessons learned to
DeRubeis. Clinical Psychology: Science and Practice, 6, date. Complementary Therapies in Medicine, 21, 102-104.
295-299. Kirsch, I., & Sapirstein, G. (1998). Listening to Prozac but
Ioannidis, J. A. (2005). Contradicted and initially stronger hearing placebo: A meta-analysis of antidepressant medi-
effects in highly cited clinical research. Journal of the cation. Prevention & Treatment, 7(2), Article 0002a.
American Medical Association, 294, 218-228. doi:10.1001/ doi: 10. 1037/1 522-3736. 1 . 1 . 12a
jama.294.2.218 Klein, D. F. (1998). Listening to meta-analysis but hearing bias.
Jacobson, N. S., & Christensen, A. (1996). Studying the effec- Prevention & Treatment, 1(2), Article 6c. doi: 10. 1037/1522-
tiveness of psychotherapy: How well can clinical tri- 3736.1.1.16c
als do the job? American Psychologist, 51, 1031-1039. Koffka, K. K. (1935). Principles of Gestalt psychology. Oxford,
doi: 10. 1037/0003-066X.5 1 . 10. 103 1 England: Harcourt, Brace.
James, F. E. (1992). Insulin treatment in psychiatry. History ofKoocher, G. P., Norcross, J. C., & Hill, S. S. (Eds.). (2005).
Psychiatry, 3, 221-235. Psychologists' desk reference (2nd ed.). New York, NY:
Jones, K. (2000). Insulin coma therapy in schizophrenia. Journal Oxford University Press.
of the Royal Society of Medicine, 93, 147-149. Kraemer, H. C., Wilson, G. T., Fairburn, C. G., & Agras, W. S.
Jopling, D. A. (2001). Placebo insight: The rationality of insight- (2002). Mediators and moderators of treatment effects in
oriented psychotherapy. Journal of Clinical Psychology , randomized clinical trials. Archives of General Psychiatry,
57, 19-36. doi: 10. 1002/1097-4679(200101)57 : 1<19:: AID- 59, 877-883.
JCLP4>3.0.CO;2-Z Kruger, J., Savitsky, K., & Gilovich, T. (1999). Superstition and
Jopling, D. A. (2008). Talking cures and placebo effects. Oxford, the regression effect. Skeptical Inquirer, 23, 24-29.
England: Oxford University Press. Kunda, Z. (1990). The case for motivated reasoning.
Jorm, A. F., Duncan-Jones, P. P., & Scott, R. R. (1989). An analy- Psychological Bulletin, 108, 480-498. doi: 10. 1037/0033-
sis of the re-test artefact in longitudinal studies of psychiatric 2909.108.3.480
McArthur, L. Z. (1980). Illusory causation and illu- Mintz, J., Drake, R. E., & Crits-Christoph, P. (1996). Efficacy
sory correlation: Two epistemological accounts. and effectiveness of psychotherapy: Two paradigms,
Personality and Social Psychology Bulletin, 6 , 507-519- one science. American Psychologist, 51, 1084-1085.
doi: 10.1 177/0l46l6728064003 doi: 10. 1037/0003-066X.51 . 10. 1084
McArthur, L. Z., & Solomon, L. K. (1978). Perceptions of anMorgado, A., Smith, M., Lecrubier, Y., & Widlocher, D. (1991).
aggressive encounter as a function of the victim's salience Depressed subjects unwittingly overreport poor social
and the perceiver's arousal. Journal of Personality and adjustment which they reappraise when recovered .Journal
Social Psychology , 36, 1278-1290. doi: 10. 1037/0022- of Nervous and Mental Disease , 1 79, 614-619.
3514.36.11.1278 Morgan, C. D., & Murray, H. A. (1935). A method for investigat-
McCall, R. B. (1977). Challenges to a science of develop- ing fantasies: The Thematic Apperception Test. Archives of
mental psychology. Child Development , 48, 333-344. Neurology and Psychiatry, 34, 289-306.
doi: 10.2307/1 128626 Morrison, J. R. (1997). When psychological problems mask medi-
McCauley, R. (2011). Why religion is natural and science is not. cal disorders: A guide for psychotherapists. New York, NY:
Oxford. England: Oxford University Press. Guilford Press.
McCullough, J. P. (2000). Treatment for chronic depression: Nathan, P. E., Stuart, S. P., & Dolan, S. L. (2003). Research on
Cognitive Behavioral Analysis System of Psychotherapy psychotherapy efficacy and effectiveness: Between Scylla
(CBASP). New York, NY: Guilford Press. and Charybdis? In A. E. Kazdin (Ed.), Methodological issues
McDonald, C. J., Mazzuca, S. A., & McCabe, G. P. (1983). How & strategies in clinical research (3rd ed., pp. 505-546).
much of the placebo "effect" is really statistical regression? Washington, DC: American Psychological Association.
Statistics in Medicine, 2, 417-427. Nichols, M. P., & Efran, J. S. (1985). Catharsis in psychother-
McFall, R. M. (1991). Manifesto for a science of clinical psychol- apy: A new perspective. Psychotherapy: Theory , Research,
ogy. The Clinical Psychologist , 44(6), 75-88. Practice, Training, 22, 46-58.
McFall, R. M., & Treat, T. A. (1999). Quantifying the information Nickerson, R. S. (1998). Confirmation bias: A ubiquitous phe-
value of clinical assessments with signal detection theory. nomenon in many guises. Review of General Psychology, 2,
Annual Review of Psychology, 50, 215-241. doi:10.1l46/ 175-220. doi: 10. 1037/1089-2680.2.2.175
annurev.psych.50.1.215 Nisbett, R. E., & Ross, L. (1980). Human inference: Strategies
McFarland, C., & Alvaro, C. (2000). The impact of motivation and shortcomings of social judgment. Englewood Cliffs, NJ:
on temporal comparisons: Coping with traumatic events Prentice-Hall.
by perceiving personal growth. Journal of Personality and Nisbett, R. E., & Wilson, T. D. (1977). Telling more than we can
Social Psychology, 79, 327-343. know: Verbal reports on mental processes. Psychological
McHugh, P. M. (1994). Psychotherapy awry. Amerìcan Scholar , Review, 84, 231-259.
63, 17-30. Nock, M. K., Michel, B. D., & Photos, V. I. (2007). Single-case
McHugh, R. K., & Barlow, D. H. (2010). The dissemination and research designs. In D. McKay (Ed.), Handbook of research
implementation of evidence-based psychological treatments: methods in abnormal and clinical psychology (pp. 337-
A review of current efforts. American Psychologist, 65, 73-84. 350). New York, NY: Sage.
McLeod, J. (2001). An administratively created reality: Some Norcross, J. C., Beutler, L. E., & Levant, R. F. (2007). Evidence-
problems with the use of self-report questionnaire mea- based practices in mental health. Washington, DC:
sures of adjustment in counselling/psychotherapy outcome American Psychological Association.
research. Counselling & Psychotherapy Research, 1, 215-226. Norcross, J. C., & Lambert, M. J. (2006). The therapy relation-
McNally, R. J., Bryant, R. A., & Ehlers, A. (2003). Does early psy- ship. In J. C. Norcross, L. E. Beutler, & R. F. Levant (Eds.),
chological intervention promote recovery from posttrau- Evidence-based practices in mental health: Debate and
matic stress? Psychological Science in the Public Interest , 4, dialogue on the fundamental questions (pp. 208-218).
45-79. doi: 10. 1 1 1 1/1529-1006.0142 1 Washington, DC: American Psychological Association.
Posternak, M. A., & Zimmerman, M. (2000). Short-term spon- Samuel, D. B., Hopwood, C. J., Ansell, E. B., Morey, L. C.,
taneous improvement in depressed outpatients. Journal of Sanislow, C. A., Markowitz, J. C., . . . Grilo, C. M. (2011).
Nervous and Mental Disease, 188, 799-804. Comparing the temporal stability of self-report and inter-
Poulsen, S., Lunn, S., Daniel, S. I., Folke, S., Mathiesen, B. B., view assessed personality disorder. Journal of Abnormal
Katznelson, H., & Fairburn, C. G. (2014). A randomized Psychology, 120, 670-680. doi:10.1037/a0022647
controlled trial of psychoanalytic psychotherapy or cog- Segali, M. H., Campbell, D. T., & Herskovits, M. J. (1966). The
nitive-behavioral therapy for bulimia nervosa. American influence of culture on visual perception. Oxford, England:
Journal of Psychiatry , 1 71, 109-1 16. Bobbs-Merrill.
Prasad, V., Cifu, A., & Ioannidis, J. P. (2012). Reversals of estab- Seligman, M. P. (1995). The effectiveness of psychotherapy:
lished medical practices. Journal of the American Medical The Consumer Reports study. American Psychologist, 50,
Association , 30 7, 37-38. 965-974. doi: 10. 1037/0003-066X.50. 12.965
(pp. 1252-1254). New York, NY: Wiley. Thompson, S. C. (1999). Illusions of control: How we overestimate
Stewart, R. E., & Chambless, D. L. (2007). Does psychotherapy our personal influence. Current Directions in Psychological
research inform treatment decisions in private practice? Science , 8 , 187-190. doi:10.1 11 1/1467-872 1.00044
fournal of Clinical Psychology , 63, 267-281. doi: 10. 1002/ Tolin, D. F. (2010). Is cognitive-behavioral therapy more
jclp. 20347 effective than other therapies?: A meta-analytic review.
Stewart, R. E., Chambless, D. L., & Baron, J. (2011). Theoretical Clinical Psychology Review, 30, 710-720. doi: 10. 1016/
and practical barriers to practitioners' willingness to seek j.cpr.2010.05.003