Professional Documents
Culture Documents
2 (Fall)
seem to offer no rules at all. What ner, 1956/1972, p. 113). Skinner was
guidance, then, is to be offered the stu- enthusiastic about the approach; he re-
dent embarking on a research career? ported that "the possibility of using
The answer may be found in three large groups of animals greatly im-
critical notions in the passages quoted proves upon (our) method ... since
from Sidman's (1960) book: experi- tests of significance are provided for
ence, experimental control, and judg- and properties of behavior not apparent
ment. These are recurring themes in in single cases may be more easily de-
Sidman's treatment of research tactics tected" (Skinner, 1956/1972, p. 113).
and, indeed, throughout the historical But Skinner's enthusiasm soon faded:
development of behavior analysis. Un- In actual practice that is not what happened....
derstanding the role they can play in You cannot easily make a change in the condi-
scientific research is the key to appre- tions of an experiment when twenty-four appa-
ciating why statistical inference has not ratuses have to be altered. Any gain in rigor is
been and need not become a major fac- more than matched by a loss in flexibility. We
tor in the experimental analysis of be- were forced to confine ourselves to processes
which could be studied with the baselines al-
havior. ready developed in earlier work. We could not
move on to the discovery of other processes or
Experience even to a more refined analysis of those we were
working with. No matter how significant might
Behavior analysts' interest in single- be the relations we actually demonstrated, our
statistical Leviathan had swum aground. (Skin-
subject as opposed to group-statistical ner, 1956/1972, pp. 113-114)
research may be regarded as the result
of an inductive process arising from in- Skinner, the consummate tinkerer,
tense interactions with data and various was quite willing to scout about for
practical considerations, rather than de- new ways to conduct experiments. He
ductions from a well-developed philos- rejected group-statistical methods not
ophy of science. The sophisticated because they collided with his radical
philosophical justification for single- behaviorist epistemology, but rather
subject research came later. because his experience revealed that
Behavior-analytic methods, of they insulated the investigator from the
course, derive from the work of Skin- behavior of the subject. The ongoing
ner, whose graduate training antedated interaction between experimenter and
the widespread adoption of the group- data that had characterized his earlier
statistical approach made possible by work-and led to his innovations in
Fisher (1925). Skinner's early research apparatus, measurement, and theory-
involved single-subject designs; most could not be sustained in group-statis-
of the experiments reported in his sem- tical research. Skinner returned to the
inal work, The Behavior of Organisms experimental analysis of individual be-
(Skinner, 1938), used only 4 rats. But havior, and directed his energies to de-
as large-group methods gained favor veloping stronger methods of experi-
within psychology in the late 1930s, mental control that would obviate the
Skinner, then an assistant professor at need for statistical inference.
the University of Minnesota, gave
them a try. He and Heron built a set of Experimental. Control
24 operant chambers and cumulative The tension between group-statisti-
recorders, interconnected so that the re- cal and single-subject methods is cre-
corders displayed mean performances ated by the relative roles played by ex-
for the entire group of 24 rats, as well perimental control in the two ap-
as subgroups of 12 and 6. Skinner said proaches. For Skinner and other advo-
that he and Heron "thus provided for cates of single-subject research,
the design of experiments according to group-statistical methods are ill suited
the principles of R. A. Fisher, which to the development of strong forms of
then were coming into vogue" (Skin- experimental control over behavior, in
112 MICHAEL PERONE
part because the group methods are un- When a monetary reinforcer was pre-
wieldy and in part because the nature sented (an occasional event given the
of statistical analysis reduces the in- intermittent nature of the schedule), the
vestigator's motivation to establish subject repositioned himself and
such control. The sensitivity of a sta- pressed a button on the console re-
tistical test is a direct function of the quired to collect the reinforcer, then re-
number of subjects, and weak control sumed the rocking motion. This topog-
can be tolerated if the number is large raphy was wholly compatible with the
enough. Averaging data across many monetary schedule, which involved
subjects can hide a multitude of sins: only the plunger response, but the in-
The experimental treatment may fail to vestigators worried that it would inter-
affect the behavior of some subjects, fere with the acquisition of the observ-
and may even lead to contrary effects ing response, because the observing
in others. As a consequence, statisti- keys would usually be out of the sub-
cally significant results based on large ject's reach. To block the chair-rocking
sample sizes are not persuasive. Given topography, the investigators replaced
a sufficiently large sample, statistical the chair with a wheeled stool. The
significance is assured. Meehl (1967) subject reacted by sitting on the floor,
pointed out that the only question is tying his bootlace to the plunger and
whether the direction of the statistical pulling the other end, and occasionally
difference will support the investiga- standing up to collect reinforcers. The
tor's hypothesis. Under these circum- new topography was no better than the
stances, the probability of support is a old one. Finally, the investigators
lofty .5-hardly a rigorous experimen- placed a limited hold on the collection
tal challenge. button: Once a monetary reinforcer
In single-subject research, by com- was earned, the subject had just 1 s to
parison, treatment effects are clarified get up and collect it before it was can-
not by increasing statistical sensitivity celed. This contingency was effective
but rather by improvements in experi- in moving the subject ornto the stool in
mental control. Individual differences front the console, with the collection
are not averaged into obscurity as sta- button and the observing keys within
tistical error, but instead are regarded easy reach. When the critical phase of
as revealing the limits of the control the experiment finally commenced, the
being exercised. subject acquired the observing re-
As a case in point, consider a situ- sponse and his data fell in line with
ation encountered in the course of an those of the other subjects.
experiment on "observing behavior" The close interaction between inves-
in adult humans (Perone & Baron, tigator and subject fostered by the sin-
1980). The main response was pulling gle-subject approach allowed a poten-
a plunger mounted underneath a table. tial disaster to be identified and avert-
On the table was a console with col- ed. The troublesome individual differ-
ored stimulus lamps and several re- ence was not relegated to a statistical
sponse keys. In the critical conditions, error term, but was eliminated by suit-
pressing the "observing" keys on the able adjustment in the experimental
console would turn on colored lights procedure. What would have happened
correlated with the schedules of mon- in a group experiment? Perhaps the ab-
etary reinforcement associated with the sence of a conditioned reinforcement
plunger response. During preliminary effect in the problem subject would
training, 1 subject adopted an unusual have been overlooked, if it did not ap-
response topography: He tied one end preciably affect the group mean. Or, if
of his bootlace to the plunger and the detected, the negative result might
other end to the leg of his chair, put his have been attributed to the regrettable
feet on the table, and executed the re- but inevitable appearance in the sample
sponse by rocking back and forth. of a recalcitrant subject whose person-
EXPERIMENTAL CONTROL 113
have their place in psychology, and son, single-subject methods put inves-
perhaps even in behavior analysis. But tigator and subject into repeated con-
there is no room for the unthinking tact, and force the investigator to iden-
methodological orthodoxy that often tify and control variables relevant to
accompanies statistical inference. Per- the object of study. Thus, the methods
haps the trouble started when Camp- are ideal for shaping-and maintain-
bell and Stanley (1963) proclaimed ing-the kind of experimental practic-
that the only "true experiment" is one es that will ensure the continued suc-
with random assignment of subjects to cess of behavior analysis.
treatment groups. Campbell and Stan-
ley directed their monograph to field REFERENCES
researchers in education, and it seems
unlikely that they intended to dismiss Bakan, D. (1966). The test of significance in
single-subject experiments (or, for that psychological research. Psychological Bulle-
matter, virtually all natural science be- tin, 66, 423-437.
fore 1925) as invalid. But by parroting Baron, A., & Perone, M. (1998). Experimental
design and analysis in the laboratory study of
Campbell and Stanley's monograph human operant behavior. In K. A. Lattal & M.
with insufficient thought or circum- Perone (Eds.), Handbook of research methods
spection, several generations of text- in human operant behavior (pp. 45-91). New
books on psychological research meth- York: Plenum.
ods have surely had that unfortunate Campbell, D. T, & Stanley, J. C. (1963). Ex-
perimental and quasi-experimental designs
effect. for research. Chicago: Rand McNally.
Whatever methods are adopted by Cohen, J. (1994). The earth is round (p < .05).
behavior analysts, let us ask that they American Psychologist, 49, 997-1003.
be adopted thoughtfully. The cookbook DeProspero, A., & Cohen, S. (1979). Inconsis-
recipes sometimes associated with sta- tent visual analysis of intrasubject data. Jour-
nal of Applied Behavior Analysis, 12, 573-
tistical inference are easy to criticize, 579.
but more thoughtful statistical appli- Dixon, P. (1998). Why scientists value p values.
cations may be welcome. In the same Psychonomic Bulletin & Review, 5, 390-396.
vein, it must be recognized that the de- Fisher, R. A. (1925). Statistical methods for re-
mand for cookbooks is not altogether search workers. Edinburgh, UK: Oliver &
Boyd.
absent from the behavior-analytic com- Hagen, R. L. (1997). In praise of the null hy-
munity. Sidman (1960), as he wrote his pothesis statistical test. American Psycholo-
Tactics, was perhaps the first to feel the gist, 52, 15-24.
demand. His response was to stead- Hagen, R. L. (1998). A further look at wrong
fastly refuse to offer any recipes. In- reasons to abandon statistical testing. Ameri-
can Psychologist, 53, 801-803.
stead, he asked his readers to think an- Kazdin, A. E. (1982). Single-case research de-
alytically about their research ques- signs: Methods for clinical and applied set-
tions, to explore new procedures, and tings. New York: Oxford University Press.
to learn from experience-in short, to Knapp, T. (1983). Behavior analysts' visual ap-
develop good experimental judgment. praisal of behavior change in graphic display.
Behavioral Assessment, 5, 155-164.
The present view, derived from the Lykken, D. T (1968). Statistical significance in
insights and advice offered by Sidman psychological research. Psychological Bulle-
and Skinner, is that in a science of be- tin, 70, 151-159.
havior good judgment is shaped by in- Matyas, T. A., & Greenwood, K. M. (1990). Vi-
tensive interplay between investigator sual analysis of single-case time series: Ef-
fects of variability, serial dependence, and
and subject in the course of experi- magnitude of intervention effects. Journal of
mental analysis. Group-statistical Applied Behavior Analysis, 23, 341-35 1.
methods seem ill suited to the task, McGrath, R. E. (1998). Significance testing: Is
tending to insulate the investigator there something better? American Psycholo-
from the immediate results of experi- gist, 53, 796-797.
Meehl, P. E. (1967). Theory testing in psychol-
mental operations and reducing the ogy and physics: A methodological paradox.
motivation for seeking and exercising Philosophy of Science, 34, 103-115. (Reprint-
strong forms of control. By compari- ed in D. E. Morrison & R. E. Henkel, Eds.,
116 MICHAEL PERONE
The significance test controversy, pp. 252- correlated with extinction or increased effort.
266. Chicago: Aldine, 1970) Journal of the Experimental Analysis of Be-
Morrison, D. E., & Henkel, R. E. (Eds.). (1970). havior, 34, 239-261.
The significance test controversy. Chicago: Sidman, M. (1960). Tactics of scientific re-
Aldine. search. New York: Basic Books.
Parsonson, B. S., & Baer, D. M. (1992). The Skinner, B. F (1938). The behavior of organ-
visual analysis of data, and current research isms. New York: Appleton-Century.
into the stimuli controlling it. In T. R. Skinner, B. F (1972). A case history in scien-
Kratochwill & J. R. Levin (Eds.), Single-case tific method. In B. F Skinner (Ed.), Cumula-
research design and analysis: New directions tive record (3rd ed., pp. 101-124). New York:
for psychology and education (pp. 15-40). Appleton-Century-Crofts. (Original work
Hillsdale, NJ: Erlbaum. published 1956)
Perone, M. (1991). Experimental design in the Thompson, B. (1998). In praise of brilliance:
Where that praise really belongs. American
analysis of free-operant behavior. In I. H. Iver- Psychologist, 53, 799-800.
sen & K. A. Lattal (Eds.), Techniques in the Tryon, W. W. (1998). The inscrutable null hy-
behavioral and neural sciences: Vol. 6. Ex- pothesis. American Psychologist, 53, 796.
perimental analysis of behavior, Part I (pp. Wilcox, R. R. (1998). How many discoveries
135-171). Amsterdam: Elsevier. have been lost by ignoring modem statistical
Perone, M., & Baron, A. (1980). Reinforcement methods? American Psychologist, 53, 300-
of human observing behavior by a stimulus 314.