Professional Documents
Culture Documents
http://rsw.sagepub.com/
Applying Randomized Controlled Trials and Systematic Reviews in Social Work Research
Haluk Soydan
Research on Social Work Practice 2008 18: 311 originally published online 17 October 2007
DOI: 10.1177/1049731507307788
Published by:
http://www.sagepublications.com
Additional services and information for Research on Social Work Practice can be found at:
Subscriptions: http://rsw.sagepub.com/subscriptions
Reprints: http://www.sagepub.com/journalsReprints.nav
Permissions: http://www.sagepub.com/journalsPermissions.nav
Citations: http://rsw.sagepub.com/content/18/4/311.refs.html
Haluk Soydan
University of Southern California
This article elaborates on the centrality of interventions for social work practice and the importance of under-
standing the effects of interventions for a more efficient, harmless, transparent, and ethical social work practice.
Low-bias research designs and meta-analyses are important means of generating the best possible evidence on
what works in social work practice. An evidence-based practice model is promising in terms of translating and
implementing scientific evidence that is uncertain, volatile, and incomplete and might be difficult to access.
Originally this article was written for an international con- Also, limitations of randomized controlled studies are dis-
ference, titled What Works? Modernizing the Knowledge cussed and related to the issue of uncertainty of knowing
Base of Social Work, that took place in Bielefeld, in general. Finally, evidence-based practice is described
Germany, in November 2005. The conference was struc- and presented as a model of practice to implement gener-
tured as a platform to debate the merits and deficits of pro- alized scientific estimates of effectiveness in contextual
duction of scientific evidence and related research designs social work practice situations and to handle the lack and
for the advancement of evidence-based social work prac- scarcity of pertinent knowledge as well as the uncertainty
tice and the controversies in relation to other approaches of knowing as related to social work practice.
to scientific knowledge growth in social work.
The article defines social work and argues that the
purpose of social work practice is to infuse change in SOCIAL WORK DEFINED
the lives of individuals and in the community to reduce
or eradicate social problems and enhance betterment. As we well know, the profession’s understanding and
Thus, interventions are at the core of social work prac- definition of social work as professional and practical
tice. To understand whether social work interventions activity has, over time, changed frequently and substan-
work or might be harmful is imperative to the profession tially. On one extreme, a Swedish researcher has, for
for a variety of reasons, including ethical aspects. Then, instance, suggested that the history of social work is as old
the utility of high-quality randomized controlled stud- as human history, and the profession is defined as taking
ies, whenever they are possible to conduct and accessi- care of fellow human beings based on a sense of sociality
ble, is discussed, and real-life examples are given to and mutuality (Swedner, 1985). For example, he has noted,
illustrate the practical and policy importance of know-
ing what works and what is harmful in social work. As early as in the oldest well documented societies—the
Sumerian Empire in the Iraq of today, the Egypt of the Pharaohs,
ancient China, ancient Greece and the Roman Empire—there is a
division of labor that points towards the development of profes-
sions specializing in care and social welfare, principally doctors
Author’s Note: Portions of this article were previously presented at the con-
and midwives, but also people responsible for the material well-
ference What Works? Modernizing the Knowledge Base of Social Work,
sponsored by the Center for Social Service Studies at the University of being of the population. (Swedner, 1985, pp. 10-11)
Bielefeld, Germany, November 10-12, 2005. Correspondence concerning this
article may be addressed to Haluk Soydan, PhD, School of Social Work, Similarly, but in a less exaggerated fashion, a more
Hamovitch Center for Science in the Human Services, University of Southern
recent account of the history of the definition of social
California, Los Angeles, CA 90089-0411; e-mail: Soydan@usc.edu. This
article was accepted by the editor. work suggests that the profession of social work has
existed for some 400 years (Holosko, 2003, p. 271).
Research on Social Work Practice, Vol. 18 No. 4, July 2008 311-318
DOI: 10.1177/1049731507307788 Elsewhere, I have argued that a more reasonable and
© 2008 Sage Publications productive definition of social work practice must be
311
Downloaded from rsw.sagepub.com at UNIV OF TEXAS on February 23, 2011
312 RESEARCH ON SOCIAL WORK PRACTICE
understood in the context of the genesis and develop- relationships, legislation, clinical experience, and orga-
ment of modern and empirical social scientific research nizations. However, there is an emphasis on social work
methods followed by the development of the idea that practice as intervention. In the core of social work prac-
empirical research results could and should be used to tice are interventions for the betterment of clients,
infuse social change and betterment (Soydan, 1999). whether they be individuals, groups, or communities. In
Several articles in a special issue of Research on other words, it is sensible to suggest that the essence of
Social Work Practice (“Evaluating,” 2003) give a com- social work practice is intervention.
prehensive account of how definitions have shifted over In general terms, an intervention is any interference
the years. The definition developed and adopted by the that would modify a process or situation. In social work,
International Federation of Social Workers (IFSW) is the purpose of an intervention is to induce change to
very useful for the purposes of this article. It reads, slow down or eradicate risk factors, stimulate and acti-
vate protective factors, reduce or eliminate harm, or if
The social work profession promotes social change, problem possible, introduce betterment beyond harm control. So
solving in human relationships and the empowerment and lib-
understanding the effects of social work interventions is
eration of people to enhance well-being. Utilizing theories of
human behavior and social systems, social work intervenes at of essential importance to the social work profession.
the points where people interact with their environments. However, the profession’s history of conducting and
Principles of human rights and social justice are fundamental using results from high-quality effectiveness studies is
to social work. (IFSW, 2000, p. 1) shaky. One major problem of the social work profession
that has been and remains is that interventions are made
Furthermore, without regard or access to rigorous evidence on
whether an intervention harms, works, or does not have
Social work bases its methodology on a systematic body of
evidence-based knowledge derived from research and practice
any effect at all. There are cases in which services are
evaluation, including local and indigenous knowledge specific provided in spite of existing evidence on the ineffective-
to its context. (IFSW, 2000, p. 1) ness or harmfulness of the intervention. There are other
cases where evidence exists for the effectiveness of an
Similarly, the code of ethics of the National intervention but the services are either not provided or
Association of Social Workers (NASW) in the United insufficiently provided. This problem leads to little or
States prescribes the following as regards to social no progress and potential harm in social work. Let us
workers’ professional skills: see a few real-life examples.
The Drug Abuse Resistance Education program
Social workers should provide services in substantive areas or (DARE) is a well-known example of an intervention
use intervention techniques or approaches that are new to
program that does not work but is widely used by school
them only after engaging in appropriate study, training, con-
sultation, and supervision from people who are competent in districts in the United States. The program was devel-
those techniques. (NASW, 2007, p. 6) oped by the Los Angeles Police Department and the Los
Angeles Unified School District in 1983 and was imple-
So, what do we learn from these definitions? mented by more than 80% of school districts in the
United States by 2001 (www.dare.org). The intervention
1. We learn that the social work profession promotes social program aims to prevent the use of drugs, alcohol, and
change, problem solving in human relationships, and the
empowerment and liberation of people from hardship to tobacco among youth and is implemented in the class-
enhance well-being. room. However, repeated evaluations and meta-analyses
2. We learn that the principles of human rights and social of effectiveness studies all indicated its ineffectiveness.
justice are fundamental to the social work profession.
3. We learn that it is a universal imperative that social work-
In 1994, a meta-analysis of eight effectiveness studies
ers must use high-quality knowledge and skills to learn showed that the program had small effects on self-
and understand whether social work interventions work, reported drug use when compared with control groups
if they may cause harm to the client, and most desirably, (Ennett, Tobler, Ringwalt, & Flewelling, 1994). The U.S.
are effective in the betterment of the client’s situation.
General Accounting Office (GAO) reviewed the long-
terms effects of the program and concluded that the
SOCIAL WORK AND INTERVENTIONS program was ineffective in preventing long-term drug use
when the youth becomes an adolescent (U.S. GAO,
The scope of research on social work practice is often 2003). A recent meta-analysis 10 years later showed even
understood in very broad terms because social work smaller effects of DARE (West & O’Neal, 2004). Yet the
practice involves several core factors, such as values, program remains widely used and prevalent.
Poverty is a major problem. In the United States, social work and social policy and thus enhance well-
poverty rates are high: In 2004, an average of 12.7% of the being and client safety. Despite the fact that intervention
population was living in poverty. The rates for children and studies (or effectiveness studies) are not abundant in
for single female-headed families were even higher: 17.8% social work and related human service areas, their abil-
and 30.5% in 2004, respectively (McKernan & Ratcliffe, ity to detect what works and what is harmful creates the
2006). There are a few U.S. programs for low-income potential for great opportunity.
families that are shown to be effective, but they are not
widely implemented, and the nation’s poverty rates remain
high (Blank, 2002; McKernan & Ratcliffe, 2006). RESEARCH IMPLICATIONS
A promising intervention program is the Nurse–
Family Partnership Home Visiting Program (NFP; www Obviously, research on social work practice has been
.nursefamilypartnership.org). Effectiveness studies show impacted by the methodological controversies of the
that this program improves maternal and child health for social sciences. The history of social sciences is very
mothers and children visited by NFP-trained nurses. The much a history of paradigm wars. Elsewhere, I have
improvements include reduction in prenatal cigarette argued that it is time to abandon the dichotomous lan-
smoking and in prenatal hypertensive disorders, reduction guage of qualitative and quantitative methods and recog-
in children’s health care visits for injuries, fewer unin- nize that all research designs are good for the types of
tended subsequent pregnancies, increase in father involve- scientific questions for which they are tailored (Soydan,
ment and women’s employment, reduction in families’ use 2007). It might be correct that phenomenological research
of welfare and food stamps, and increase in children’s designs are the best fit for understanding events such as the
school readiness (Olds et al., 1998, 2004). social or cultural meaning of a conversation between a
Violence is an important public health problem among social worker and a client. Or ethnographic and partici-
adolescents in the United States. One particular type of pant observations might be the best methods of depicting
intervention that is widely used in the United States and behavior patterns of populations of social institutions
elsewhere is Scared Straight and similar awareness such as prisons and hospitals, networks of street gangs, or
programs. However, a Campbell Collaboration systematic victims of commercial sexual exploitation.
review on Scared Straight and other juvenile awareness However, when it comes to measure the effects of
programs for preventing juvenile delinquency (Petrosino, social work interventions, experimental studies, espe-
Petrosino-Turpin, & Buehler, 2002) showed that these cially when randomized, conducted very carefully, and
intervention programs do not work. These intervention large enough to generate statistical power, are the
programs involve organized visits to prisons by juvenile designs that are best fit for the purpose. From a scien-
delinquents or children at risk for criminal behavior and are tific point of view, the study of social work interventions
designed to deter juveniles from future offending. These is the measurement of the efficacy and effectiveness of
studies remain in worldwide use despite studies that show the intervention. Efficacy measurement involves effects of
that they do not work and are likely to have a harmful a social work intervention in well-controlled environments
effect. These programs should not be used! On the more where the intervention is delivered with high fidelity and
positive side, Life Skills Training programs seem to outcomes are measured to compare results of the experi-
have promising results (Botvin, Griffin, & Nichols, mental group with that of the control group(s). Results of
2006; Fraguela, Martin, & Trinanes, 2003; Griffin, efficacy studies are assumed to be strongly related to the
Botvin, & Nichols, 2006). Life Skills Training programs circumstances of the specific testing environment. If effi-
are school based and instruct students in the necessary cacy studies generate positive results, the intervention is
skills to resist social (peer) pressure to smoke, drink, considered promising and, it is assumed, sensible to test
and use drugs. They help students to develop greater for effectiveness. Effectiveness studies take place in sites
self-esteem and self-confidence, enable them to effec- where the clients are under less controlled but realistic
tively cope with anxiety, increase their knowledge of the conditions. Repeated effectiveness testing generates infor-
immediate consequences of substance abuse, and mation to understand whether the intervention works
enhance cognitive and behavioral competency to reduce under real-life conditions, to what extent, and under what
and prevent a variety of health risk behaviors. These diverse conditions.
programs should be used! In experimental studies of social work interventions,
These examples illustrate the utility of effectiveness eligible clients or entities such as mental health clinics,
studies in understanding effects of social work interven- dormitories, neighborhoods, and villages are randomly
tions. Rigorous evidence can bring sustained progress to allocated to each of the two or more treatment conditions.
The treatment conditions may be a social work inter- empirical evidence from this literature can be used in
vention program and a nontreatment control group. One the context of training and welfare programs to improve
or more social work intervention programs may be non-experimental research designs, it cannot on its own,
tested in one and the same study. The groups that are justify the use of such designs” (Glazerman et al., 2003,
constructed by random allocation do not differ system- p. 63). For a thoughtful account of this problem, see
atically. However, they may differ by the chance factor. Boruch (2007).
Alternative (nonrandomized) designs to randomized Certainly, randomized controlled effect studies, when
controlled studies are used for a number of reasons, conducted properly, generate best possible or less biased
including practical problems of implementation, ethical estimates of effects of social work interventions. However,
concerns, budget and time restrictions, and unwilling- it is not always possible to conduct controlled experi-
ness to use randomized controlled studies. Alternative ments. At times, it can be difficult to randomize individu-
designs often include before-and-after comparisons and als but possible to randomize social entities, such as
time-series analysis but also designs highly dependent school classes or schools, hospitals clinics or hospitals,
on mathematical (re)constructions. neighborhoods, and villages. This type of randomization
The best way of empirically understanding the rela- is called “place-based” or “cluster” randomization and cir-
tively biased estimates produced by alternative designs in cumvents some of the ethical and practical problems asso-
comparison to the estimates of randomized controlled ciated with random allocation of individuals.
studies is to compare results of different designs. To Other times, all types of randomization might be
understand the differences of bias produced by random- impossible because of ethical concerns, budget con-
ized controlled experiments with those of nonexperimen- straints, research practicalities, and several other reasons.
tal or “quasiexperimental” designs, researchers usually Then, we will have to use nonrandomized controlled
make “between-study” and “within-study” comparisons. research designs. In a later section, I will come back to
In between-study comparisons of experimental and non- practice and policy implications of depending on less rig-
experimental studies, researchers include multiple studies orous effect studies.
conducted with different research designs. The bias in esti-
mations is calculated by looking at the relationship
between the design and the estimates of effect. Reynolds BEYOND SINGLE EXPERIMENTAL
and Temple (1995) compared three studies, Shadish and STUDIES: SYSTEMATIC REVIEWS
Ragsdale (1996) compared dozens of studies, and Lipsey
and Wilson (1993) compared 74 randomized and nonran- From the early 1980s, social scientists began to develop
domized studies. All of these studies show mixed results. research synthesis and meta-analytic methods to synthesize
One major problem with these types of studies is that they results (effect sizes) of multiple effect studies. The primary
are not capable of distinguishing whether the difference driving force of this development was perhaps the acceler-
between estimates is because of design or some other ating number of scientific publications. This made it diffi-
factor (Glazerman, Levy, & Myers, 2003). cult and complicated to access most or all publications in
Within-study comparisons estimate an intervention one specific specialty, to control the scientific quality of the
program’s effect by using a randomized control group and accessed publication, and to have a reasonable overview of
one or several nonrandomized comparison groups. These the literature (state of the art) in a specific specialty.
studies use design replication as a method, which is a rees- The development of systematic reviews took off very
timation of the effect by using one or several comparison strongly from the mid-1990s, fueled by an increasing
groups. This type of study is capable of explaining that the awareness among professionals and decision makers, and
estimated differences between randomized and nonexperi- subsequently among the general public, of the importance
mental study design are because of the differences in of high-quality evidence in professional practice and policy
design and not other factors such as investigator bias, dif- making. Later, inception and development of the Cochrane
ferences in treatment environments, or implementation Collaboration (www.cochrane.org) of the health-related
itself. Glazerman et al. (2003) conducted a within-study sciences and practices by the mid-1990s and the Campbell
comparison of 12 labor market–related studies. They found Collaboration (www.campbellcollaboration.org) of the
that nonexperimentally measured estimates sometimes behavioral and social sciences from early 2000 established
came close to replicating experimentally generated results the science and technology of systematic research reviews
but often produced estimates that differed with margins of and meta-analysis.
importance for policy making. This is considered an esti- The aim of systematic reviews is to generate scien-
mate of bias. The researchers concluded that “although the tific generalizations by integrating empirical research.
The systematic research review is a broader concept uniform standards of evidence, specifies rigorous pro-
than the meta-analysis. Systematic research reviews cedures to avoid bias in the screening of studies and
may focus on outcomes of effect studies as well as the- producing of synthesis, employs advanced statistical
ories, methods, or applications. Meta-analysis, on the methods, continuously updates reviews, collaborates with
other hand, is used to integrate quantitative estimates of end-user networks, and is multidisciplinary. Currently,
effects of interventions. Thus, a systematic research respected knowledge clearinghouses and knowledge data-
review may or may not include a meta-analysis. bases benchmark themselves with Campbell Collabo-
Already by the mid-1970s, Glass (1976) launched the ration standards, even if they are not able to limit
term meta-analysis and defined it as “the statistical themselves to the Campbell standards at the time being.
analysis of a large collection of analysis results from As David Sackett, who is one of the foremost pio-
individual studies for the purpose of integrating the find- neers of evidence-based practice puts it, the value of
ings” (p. 3). Later, important publications on meta- systematic high quality reviews is undisputable:
analysis and systematic research reviews include Glass,
McGraw, and Smith (1981); Rosenthal (1984); Hunter Because the randomized trial, and especially the systematic
review of several randomized trials, is so much more likely to
and Schmidt (1990); Cooper and Hedges (1994); and
inform us and so much less likely to mislead us, it has become
Lipsey and Wilson (2001). the “gold standard” for judging whether a treatment does
With the progress of the Cochrane Collaboration, more good than harm. (Sackett, Richardson, Rosenberg, &
which develops, maintains, and distributes systematic Haynes, 1997, pp. 4-5)
research reviews of health-related interventions, the
term systematic was coined to designate the systematic
nature of the research reviews. The Cochrane Handbook INSECURITY OF KNOWING
for Systematic Reviews of Interventions (www.cochrane
.org/resources/handbook/) describes and prescribes Nevertheless, it is good science to recognize the lim-
procedures that make a research review systematic. This its of any gold standard, because a major problem in
includes planning and formatting of a review, transparency science is that it is impossible to know with 100% cer-
and ethical requirements, peer control, problem formula- tainty what the truth is in any given research question.
tion, locating and selecting of studies for reviews (e.g., use This assumption has two consequences of importance
of electronic databases, hand-searching, and unpublished for the purpose of this article. The first one is a scientific
studies), study quality assessment, study inclusion and perspective urging more empirical evidence to under-
exclusion procedures, data coding, analysis and meta- stand merits of various research designs, a question that
analytic standards, result interpretation, and updating of I will look at in this section. The second consequence is
reviews. Because the Cochrane systematic reviews the implications of this uncertainty for the professional
include primarily and exclusively randomized con- practice, a question that I will look at in the next section.
trolled studies (if not indicated otherwise) and use Simply, scientific research designs are devices that
advanced methods to control all known biases, these the human mind uses to filter and organize experience,
reviews are considered the highest standards of research an idea that was powerfully argued for and established
reviews. by the Austrian British scientist Karl Popper. The merit
With the inception of the Campbell Collaboration (aka (or deficit) of any scientific research design is thus
C2) in social and behavioral sciences in 2000, social work related to the ability of this design to test the falsifica-
acquired an abode for itself. The international Campbell tion of a hypothesis that we have about a specific phe-
Collaboration produces, maintains, and disseminates sys- nomenon. Popper first published his The Logic of
tematic reviews of the studies of effects of behavioral and Scientific Discovery (Logik der Forschung) in 1934 and
social interventions (including social work interventions). basically revolutionized the entire idea of the nature of
These reviews are primarily based on randomized con- growth in science. Popper famously emphasized that
trolled studies and on nonrandomized studies if there are “no matter how many instances of white swans we may
no randomized controlled studies of a specific interven- have observed, this does not justify the conclusion that
tion of interest. The potential value of the Campbell all swans are white” (Popper, 1934/1972, p. 27).
Collaboration—and that of its older sibling in health care, Popper (1934/1972) concluded that scientific theories
the Cochrane Collaboration—is partly in its dedication to are only hypotheses and may be falsified and replaced
“gold-standard” research reviews. The Campbell Collabo- any day. Consequently, what is important for the growth
ration is also unique in social and behavioral sciences at of science (evidence) is not the confirmation but the
the international level because it adopts transparent and attempted falsification of theories. For the practitioner
scientist, this means that she or he should always con- know with 100% certainty whether a social work inter-
tinue to test complex ramifications of theories and test vention works or is harmful, should we then stop inter-
them in as many different types of situations as possible. vening at all? Or how can this uncertainty of knowing be
In Karl Popper’s own words, handled within the framework of social work practice?
This will be the issue I address in the next section.
I shall certainly admit a system as empirical or scientific only
if it is capable of being tested by experience. These consider-
ations suggest that not the verifiability but the falsifiability of EVIDENCE-BASED SOCIAL
a system is to be taken as a criterion of demarcation. In other
WORK PRACTICE
words: I shall not require of a scientific system that it shall be
capable of being singled out, once and for all, in a positive
sense; but I shall require that its logical form shall be such that Seen in a historical perspective, contemporary mod-
it can be singled out, by means of empirical tests, in a nega- els of evidence-based practice and policy making were
tive sense: it must be possible for an empirical scientific developed largely to manage concerns related to the
system to be refuted by experience. (pp. 40-41)
uncertainty of knowing how an intervention would work
in a case that takes place in real-life situations. Yes,
In August 2005, John Ioannidis published a remark-
well-conducted and robust randomized controlled stud-
able study with the provocative title “Why Most
ies and well-conducted meta-analyses of a set of high-
Published Research Findings Are False.” He summa-
quality effectiveness studies say a lot as to whether an
rized his finding as follows:
intervention works or is harmful on a generalized level
There is increasing concern that most current published by providing us with impact estimates. But will an inter-
research findings are false. The probability that a research vention work in individual cases? Or will an interven-
claim is true may depend on study power and bias, the number tion work in a social context for which it was not tested?
of other studies on the same question, and, importantly, the Here comes the uncertainty! Originally, evidence-based
ratio of true to no relationships among the relationships practice was conceived as a model for medical practice.
probed in each scientific field. In this framework, a research
finding is less likely to be true when the studies conducted in
It was defined, not incidentally but purposefully, as “the
a field are smaller; when effect sizes are smaller; when there conscientious, explicit and judicious use of current best evi-
is greater number and lesser pre-selection of tested relation- dence in making decisions about the care of individual
ships; where there is greater flexibility in designs, definitions, patients” (Sackett et al., 1997, p. 2). The term current best
outcomes, and analytic modes; when there is greater financial evidence is the most explicit expression of the recognition
and other interests and prejudice; and when more teams are
that we cannot know with 100% certainty and thus the gold
involved in a scientific field in chase of statistical signifi-
cance. Simulations show that for most study designs and set- standard is unattainable in this sense. Therefore, the original
tings, it is more likely for a research claim to be false than developers of evidence-based medicine built their model of
true. Moreover, for many scientific fields, claimed research intervention on a platform where three fundamental factors
findings may often be simply accurate measures of the pre- intersect: current best evidence; physicians’ professional
vailing bias. (Ioannidis, 2005, p. 696) expertise; and patients’ predicaments, rights, and prefer-
ences. This basic model was later transported to other fields
Given that the scientific community is confronted of human services, including social work practice.
with this major problem, what can be done to improve It is didactic to read some more on how the idea of
the situation? Ioannidis suggests that scientists should evidence-based medicine was formulated by the originators:
try to obtain better powered evidence from large studies
and low-bias meta-analyses; be aware of the fact that it The practice of evidence-based medicine means integrating indi-
is misleading to emphasize the statistically significant vidual clinical expertise with the best available external clinical
findings of any single research team because what evidence from systematic research. By individual clinical exper-
matters is the totality of evidence; and instead of chas- tise, we mean the proficiency and judgment that individual clini-
cians acquire through clinical experience and clinical practice.
ing statistical significance, scientists should improve our
Increased expertise is reflected in many ways, but especially in
understanding of the range of R values, that is, the ratio more effective and efficient diagnosis and in the more thoughtful
of the number of “true relationships” to “no relation- identification and compassionate use of individual patients’
ships” among those tested in the field. predicaments, rights and preferences in making clinical decisions
If this is the state of the art of scientific evidence at about their care. By best available external clinical evidence,
present, it is most likely that we cannot attain the gold we mean clinically relevant research, often from the basic
sciences of medicine, but especially from patient-centered clini-
standard of evidence and will have to live with the cal research into the accuracy and precision of diagnostic tests
awareness of insecurity of knowing. So what does this (including the clinical examination), the power of prognostic
mean for a profession such as social work? If we cannot markers and the efficacy and safety of therapeutic, rehabilitative