Professional Documents
Culture Documents
Baron, 2005. Positive and Negative Reinforcement PDF
Baron, 2005. Positive and Negative Reinforcement PDF
Baron, 2005. Positive and Negative Reinforcement PDF
2 (Fall)
ing. Presentation and removal also lier label for positive reinforcement),
have come to define different areas of the response not only produces a stim-
research. On the one hand is a set of ulus but produces a stimulus that
phenomena and issues whose study evokes pleasure or satisfaction. By
most often involves positive reinforce- comparison, escape-avoidance training
ment. Within the animal laboratory, in- (negative reinforcement) involves ar-
vestigations of schedules, choice, and rangements in which the response re-
stimulus control usually present food duces pain, anxiety, or some other
as the reinforcer. By comparison, in- forms of discomfort or distress.
vestigations of the aversive control of From the start, behavior-analytic dis-
behavior most often involve negative cussions of the reinforcement process
reinforcers. The subject is able to shied away from such interpretations
terminate intense stimuli (usually of reinforcement on the grounds that
shocks), or, in the case of avoidance, they assign causal status to events that
escape from situations in which such are ill defined and not easily observ-
events can occur. able (Skinner, 1938). The more desir-
Occasionally, writers have com- able alternative is to couch the distinc-
mented on similarities rather than dif- tion strictly in terms of the stimulus
ferences between the two forms of re- change that follows the response (Kel-
inforcement (e.g., Baron, 1991; Hine- ler & Schoenfeld, 1950). Thus, in his
line, 1984), for example, the parallel authoritative glossary of behavior-ana-
effects of postponing the stimulus lytic terms, Catania (1998) provides
change. Also, a few researchers have the reader with this current definition:
sought paradigms that might bridge the "A stimulus is a positive reinforcer if
gap, such as procedures in which re- its presentation increases the likelihood
sponding produces a period of time-out of responses that produce it, or a neg-
from avoidance (Perone & Galizio, ative reinforcer if its removal increases
1987; Verhave, 1962) or avoids a pe- the likelihood of responses that termi-
riod of time-out from positive rein- nate or postpone it" (p. 405). As we
forcement (Baron & Kaufman, 1966; will see, a definition in these terms, al-
Stone, 1961). Nevertheless, treatments though avoiding the pitfalls of moti-
of operant conditioning continue to vational interpretations, has some
place behavior maintained by positive problems of its own.
and negative reinforcers under separate
headings (e.g., Catania, 1998; Iversen Michael's Objection
& Lattal, 1991; Mazur, 2002; Pierce &
Cheney, 2004). What might be regarded as a water-
The purpose of this article is to re- shed in behavior-analytic discussions
view the current status of these two of positive and negative reinforcement
forms of reinforcement. In particular, was Michael's (1975) article, to which
we reconsider Michael's (1975) call to he gave the provocative title, "Positive
abandon the distinction. Although Mi- and Negative Reinforcement, a Dis-
chael's views have not been refuted to tinction That Is No Longer Necessary;
our knowledge, there are few signs that or a Better Way to Talk about Bad
his recommendations are being heeded. Things." His discussion forcefully
brought the ambiguity of the distinc-
Traditional Bases for the Distinction tion to the attention of behavior ana-
lysts. The matter was not, however,
Traditional treatments sharpened the completely new; definitional problems
distinction between positive and nega- had previously been considered by
tive reinforcement by introducing mo- both behavior-analytic and more moti-
tivational variables (e.g., Hilgard & vationally oriented writers (e.g., Cata-
Marquis, 1940; Mowrer, 1960; Thorn- nia, 1973; D'Amato, 1969; Mowrer,
dike, 1911). In reward training (an ear- 1960).
POSITIVE AND NEGATIVE REINFORCEMENT 87
Michael (1975) identified two is- periment cited by Catania (1998).
sues, the first of which pertained to a Weiss and Laties (1961) observed that
longstanding confusion about the dif- rats kept in a cold chamber would
ference between negative reinforce- press a lever that turned on a heat
ment and punishment. He observed lamp. This outcome can be regarded as
that the term negative reinforcement the product of positive reinforcement
had been used by a number of writers because the lamp's onset adds heat to
(including Skinner himself in The Be- the environment. But the behavior also
havior of Organisms, 1938) not only to reduces the extent to which the envi-
refer to stimulus termination but also ronment is cold and thus can be re-
to punishment, that is, consequences garded as an instance of negative re-
that suppress responding. Happily, this inforcement. Similar questions can be
ambiguity has been laid to rest in that raised about any reinforcement proce-
current usage reserves the term punish- dure. Although food is usually regard-
ment for suppressive operations. How- ed as a positive reinforcer, its presen-
ever, a link between reinforcement and tation also serves to reduce a state of
punishment remains, in that it has be- deprivation (negative reinforcement).
come customary to use the presenta- Similarly, the negatively reinforcing
tion-removal difference developed properties of escape from shock can be
originally for reinforcement to also dis- attributed to the onset of the stimuli
tinguish between two types of punish- correlated with safety (positive rein-
ment (e.g., Catania, 1998; Mazur, forcement). Moreover, the problem is
2002). Thus, responding can be sup- by no means confined to unconditioned
pressed not only by delivery of a shock forms of reinforcement. Delivery of
(positive punishment) but also by with- money contingent on some behavior
drawal of food (negative punishment). has the consequence of ending a mo-
Michael's (1975) second issue is the neyless period, and escape from con-
one that concerns us here. His point ditioned aversive stimuli produces sit-
was that the reinforcing functions of an uations in which the stimuli are absent.
event, either its presentation or its re- Faced with these ambiguities, Mi-
moval, hinge on the context in which chael (1975) concluded that there is no
the event occurs. The response-contin- good basis for continuing to describe
gent presentation of a stimulus re- reinforcers as positive or negative. In
quires, of necessity, that the response his view, communication is not imped-
terminate a previous period when the ed if the focus is on the stimulus
stimulus was absent. By the same to- changes that strengthen behavior rather
ken, the response-contingent termina- than on the onset or the offset of stim-
tion of a stimulus cannot be accom- uli. If a distinction is to be made, it
plished unless the response has been should be between processes of rein-
preceded by a period when the stimu- forcement and processes of punish-
lus was present. The argument, then, is ment, that is, between environmental
that positive and negative reinforce- changes that strengthen and environ-
ment are changes from one stimulus mental changes that suppress.
condition to another, not the simple
presentation or removal of a stimulus. Michael's Analysis
Without this essential clarification, the
claim that a reinforcer is exclusively By most standards, Michael's (1975)
positive or negative always can be analysis is quite cogent. Nevertheless,
challenged by the assertion that the al- even a cursory survey of textbooks in-
ternative form is the true basis for the dicates that the positive-negative dis-
reinforcing effect. tinction continues to be taught to psy-
The conundrum for those who want chology students, and this might sug-
to stick to the presentation-removal gest that Michael's views have been
distinction is well illustrated by an ex- discredited. Perhaps new research find-
88 ALAN BARON & MARK GALIZIO
ings or new theories have bolstered the fects. This feature of reinforcement
distinction he believed should be aban- was discussed by Weiss and Laties
doned. Or, perhaps, the original rea- (1961) in their study of heat reinforce-
sons for abandoning the distinction ment (or, if you like, cold termination).
were faulty. Instead, Michael's classic They commented that heat reinforce-
article is not always cited in textbooks, ment appeared to produce more reli-
even those that address operant condi- able effects than reinforcers that re-
tioning in detail. When his views have quire eating or drinking, and they at-
been given some attention (e.g., Cata- tributed the difference to the "long
nia, 1998; Pear, 2001; Pierce & Che- chain of processes that intervene be-
ney, 2004), the message is somewhat tween behavior and the ultimate ef-
mixed. Although the validity of his ar- fect" of eating and drinking. By com-
gument may be acknowledged, the dis- parison, the effect of heat "is practi-
tinction that he argued against contin- cally instantaneous" (Weiss & Laties,
ues to be used as a way of classifying 1961, p. 1344).
both different operant procedures and Perhaps the case can be made that
different research areas. stimulus change usually is more abrupt
Given the persistence of this pecu- for negative than for positive reinforce-
liar state of affairs, it seems worthwhile ment (cf. termination and food con-
to reconsider Michael's discussion in sumption). But even assuming that
the light of developments since his ar- negative contingencies produce more
ticle was published. He considered and reliable conditioning (we do not know
rejected three possible justifications for of experiments that have demonstrated
the distinction. this), this does not mean that the dif-
1. The strengthening effects of posi- ference is fundamental. A feature of
tive and negative reinforcers may dif- stimulus change is that rates of onset
fer in such regards as their temporal and offset can vary, thus introducing
properties, their relation to other in- differing degrees of delay before the
dependent variables, or their role in event is fully present or fully absent.
the development of discriminations. Although such time-dependent differ-
Michael (1975) could not find a ences play an important role in operant
good basis for such a conclusion. Al- conditioning, they are better viewed as
though the environmental changes that a parameter of reinforcement (delay of
function as reinforcers have unique reinforcement) than as a difference in
properties, "these properties seem just the reinforcement process itself. For
as relevant to the distinctions among example, if Weiss and Laties's (1961)
the various kinds of positive reinforce- procedure had entailed a slow onset of
ments as between positive and negative the heat lamp, we would expect that
reinforcement" (p. 41). Consistent conditioning would be retarded by
with this interpretation, our review of comparison with food- and water-de-
the literature on aversive control (Bar- livery procedures. Or consider the re-
on, 1991) led us to conclude (as had inforcing functions of drugs: The re-
Hineline, 1984) that similarities be- inforcing effect of a given dose of co-
tween positive and negative reinforce- caine varies as a function of the mode
ment effects are more apparent than of delivery (oral, intravenous, or intra-
differences. Most notably, the well- nasal) because of differences in speed
known parameters of positive rein- of onset of drug effects.
forcement (the magnitude, delay, and 2. There are differences in the phys-
schedule of stimulus presentations) iological structures or processes that
have similar influences on responses underlie positive and negative rein-
maintained by negative reinforcement. forcement.
A possible difference, one not men- Michael (1975) concluded that phys-
tioned by Michael (1975), pertains to iological information did not help to
the rapidity of the strengthening ef- clarify the distinction. However, the
POSITIVE AND NEGATIVE REINFORCEMENT 89
many advances in neuroscience since system of the brain (and in particular
his article warrant a reconsideration of the nucleus accumbens) has been
the literature. The literature on the linked to the actions of rewarding stim-
physiological substrates of the rein- ulation (Kiyatkin, 1995; Vaccarino,
forcement process is much too broad Schiff, & Glickman, 1989; Wise &
to survey in this paper. However, we Bozarth, 1987), and an amygdala-hy-
will briefly consider research in the pothalamic-central gray system to pro-
three areas that seem most relevant: cesses of pain and fear (Davis, Cam-
pharmacological investigations, re- peau, Kim, & Falls, 1995; Panksepp,
search on the neurobiology of rein- Sacks, Crepeau, & Abbot, 1991). How-
forcement, and investigations of psy- ever, these neurobiological distinctions
chophysiological changes that may ac- do not correspond in straightforward
company reinforcement. ways to the distinction between posi-
Research from the behavioral phar- tive and negative reinforcement. Re-
macology laboratory has been guided search on dopaminergic mesolimbic
by a search for drugs.to counteract syn- involvement has largely involved pro-
dromes that are triggered by aversive cedures that would be classified as pos-
events (tension, anxiety), and negative- itive reinforcers. For example, the ev-
reinforcement baselines have been idence for dopamine release in the nu-
used to model these syndromes. Early cleus accumbens comes from experi-
experiments within this framework ments in which animals responded for
suggested unique links between classes such events as food, opportunity for
of drugs and type of reinforcement. For sexual interaction, or stimulant drugs
example, antipsychotic and anxiolytic (Kiyatkin, 1995). However, these re-
drugs appeared to have different ef- sults cannot be viewed as showing a
fects on shock-avoidance and food-re- unique link between dopamine and
inforced baselines. However, a key ex- positive reinforcement. Parallel exper-
periment by Kelleher and Morse iments with avoidance schedules have
(1964) indicated otherwise. They ad- indicated similar patterns of dopamine
ministered drugs with opposing phar- release in the nucleus accumbens
macological properties (either amphet- during shock avoidance (e.g., Mc-
amine or chlorpromazine) to monkeys Cullough, Sokolowski, & Salamone,
responding on identical schedules of 1993). To complicate matters, other re-
positive and negative reinforcement search has raised doubts about the
(stimulus-shock termination vs. food functional role of dopamine in the nu-
presentation). Their major finding was cleus accumbens. Insofar as mesolim-
that drug effects depended consider- bic dopamine is implicated in the neu-
ably more on the response rates con- robiology of reinforcement, the best
trolled by the schedules than on wheth- evidence appears to suggest that it en-
er the reinforcer was positive or nega- compasses both forms of reinforce-
tive. Although subsequent researchers ment-negative as well as positive (see
continued to search for relations be- Salamone, Correa, Mingote, & Weber,
tween drugs and baseline reinforcers, 2003; Salamone, Cousins, & Snyder,
the evidence does not support a phar- 1997, for reviews).
macological basis for distinguishing Finally, on a more behavioral level,
between performances under schedules evidence has not been forthcoming to
of positive and negative reinforcement support Mowrer's (1960) original con-
(for reviews, see Barrett & Katz, 1981; tention that characteristic emotional
Dworkin, Pitts, & Galizio, 1993). processes are evoked by aversive and
Research on the neurobiology of re- appetitive stimuli. Behavior analysts
ward and punishment also has sought have not formed a consensus about the
physiological distinctions between pos- proper way to view the process of
itive and negative reinforcement. For emotion (for a recent exchange of
example, the dopaminergic mesolimbic views, see Friman, Hayes, & Wilson,
90 ALAN BARON & MARK GALIZIO
tania (1998) acknowledged this ambi- inforced by them have more effectively learned
guity, and his last word was that "re- and remembered where and how to get them and
therefore have been more likely to survive and
inforcement always involves changes transmit this susceptibility to the species. It has
in the organism's situation and inevi- often been pointed out that competition for a
tably leads to differences in responding mate tends to select the more skillful and pow-
before and after the change" (p. 101). erful members of a species, but it also selects
those more susceptible to sexual reinforcement.
As a result, the human species, like other spe-
Feelings of Reinforcement cies, is powerfully reinforced by sugar, salt, and
sexual contact. This is very different from say-
A different approach to reinforce- ing that these things reinforce me because they
ment focuses on what Skinner (1976) taste or feel good. (Skinner, 1976, pp. 52-53)
referred to as "the feelings of rein-
forcement." Skinner observed that Skinner's discussion of rewards and
"feelings have dominated the discus- punishment provides a clue as to why
sion of rewards and punishment for the conception of two forms of rein-
centuries" (p. 53), and he did not ques- forcement has persisted despite the
tion that different forms of reinforce- lack of a clear operational basis for de-
ment may evoke distinctively different termining which is which. Perhaps the
feelings. By "feelings," Skinner was feelings of positive and negative rein-
referring to private events that are re- forcement are different. Consider the
vealed by "asking the subject how he labels given these states. Beginning
'feels' about certain events" (1953, p. with Thorndike (1911), positive rein-
82). Indeed, surveys have been devel- forcers have been described in such
oped with the objective of identifying terms as satisfying, pleasing, and plea-
the relative strength of different rein- surable. Descriptive terms for the feel-
forcing events and activities. Cautela's ing of negative reinforcement are more
(1972) reinforcement survey schedule difficult to find, and Thorndike used
asks respondents to rate events in terms the term satisfier to refer to both pos-
of how much "joy or pleasurable feel- itive and negative reinforcement.
ings" each provides (the survey in- Mowrer (1960), however, coined the
cludes such diverse items as "eating term relief to refer to states that accom-
ice cream," "playing basketball," and pany the withdrawal of aversive stim-
"dogs"). The resulting information has uli, and the same sense is conveyed by
been used in programs of behavior the dictionary definition: "Relief-an
therapy and research (e.g., Baron, easing as of pain, discomfort, or anxi-
DeWaard, & Galizio, 1981). ety" (Webster's New World Dictio-
Skinner's treatment of the feelings of nary, 1994). These commonsense
reinforcement followed his philosoph- views of the difference between feel-
ical views about the role of private ings of pleasure and feelings of relief
events in general. He did not take issue are paralleled by textbook treatments
with descriptions that include referenc- of positive and negative reinforcement
es to feelings-that a person's respons- in separate sections-one in which the
es are accompanied by the presentation reinforcers involve presentation of
or removal of events said to be liked such events as food, praise, or money,
or disliked. "But this does not mean and one in which the reinforcers are
that his feelings are causally effective; termination of electric shock, loud
his answer reports a collateral effect" sounds, or expressions of disapproval.
(1976, p. 53). In these writings and Paralleling Skinner's (1976) consid-
elsewhere (Skinner, 1986), he turned to eration of the place of feelings in the
other mechanisms for a causal account reinforcement process is his discussion
of the origins of reinforcement, in par- of the feelings of accompanying states
ticular, evolutionary processes: of motivation (feelings of "conditions
Salt and sugar are critical requirements, and in- of the body"). For both positive and
dividuals who were especially likely to be re- negative reinforcement, these states are
94 ALAN BARON & MARK GALIZIO