Professional Documents
Culture Documents
2 - Schwartz (1982) - Failure To Produce Response Variability With Reinforcement
2 - Schwartz (1982) - Failure To Produce Response Variability With Reinforcement
Two experiments attempted to train pigeons to produce variable response sequences. In the
first, naive pigeons were exposed to a procedure requiring four pecks on each of two keys
in any order, with a reinforcer delivered only if a given sequence was different from the
preceding one. In the second experiment, the same pigeons were exposed to this procedure
after having been trained successfully to alternate between two specific response sequences.
In neither case did any pigeon produce more than a few different sequences or obtain more
than 50% of the possible reinforcers. Stereotyped sequences developed even though stereo-
typy was not reinforced. It is suggested that reinlorcers have both hedonic and informative
properties and that the hedonic properties are responsible for sterotyped repetition of re-
inforced responses, even when stereotypy is negatively related to reinforcer delivery.
Key words: complex operants, differential reinforcement, stereotypy, variability, pigeons
In the course of being trained to emit oper- each key four times, so that the bottom right
ant responses for reinforcers, animals typically matrix light was illuminated, a reinforcer was
develop response topographies that are both delivered. Thus, to obtain reinforcers pigeons
stereotyped and efficient. Although the move- had to peck each of the keys four times, in any
ments that compose the rat's lever press or the order. If they pecked either key a fifth time, the
pigeon's key peck may be variable and uneco- trial ended without reinforcement. In all,
nomical early in training, as training proceeds, there were 70 different sequences of responses
extraneous movement tends to drop out, and that would result in reinforcement.
variability tends to diminish. By the time The consistent finding in these studies was
training is complete, the operant is usually that despite the wide range of sequence vari-
highly stereotyped with regard to properties ability permitted by the reinforcement con-
such as force, duration, and location (e.g., tingency, stereotyped sequences developed. For
Herrnstein, 1961; Notterman & Mintz, 1965; each pigeon, one sequence came to dominate
Schwartz, 1977). all others, occurring on 50 to 90% of trials.
Recently, the study of the development of Though the dominant sequence varied from
stereotyped response topographies has been ex- pigeon to pigeon, for each pigeon it was highly
tended to situations in which the required re- stereotyped from trial to trial and session to
sponse is more complex than a single key peck session.
or lever press (Schwartz, 1980, 1981a, 1981b; This reliable finding led Schwartz (1980, Ex-
Vogel & Annau, 1973). In these experiments, periment 4) to ask whether sequence stereo-
the required operant was a sequence of re- typy could be prevented if the reinforcement
sponses. Pigeons were exposed to trials that be- contingency required variability. Thus, pi-
gan with two response keys and the top left geons were exposed to the sequence task just
light of a 5 by 5 matrix of lights illuminated. described, with the reinforcement contingency
When they pecked one of the keys, the illumi- modified so that a given sequence was only re-
nated matrix light moved down one row, and inforced if it differed from the sequence that
when they pecked the other key, it moved had occurred in the preceding trial. Pigeons
across one column. When they had pecked were exposed to this procedure after a lengthy
period of training with the regular sequence
This research was supported by NSF grant BNS 78- task, which had resulted in the development of
15461 and by Swarthmore College Faculty Research
Grants to the author. Reprint requests should be sent stereotyped sequences. Schwartz found that in
to Barry Schwartz, Department of Psychology, Swarth- 40 sessions of exposure to this contingency re-
more College, Swarthmore, Pennsylvania 19081. quiring sequence variability, only one of eight
171
172 BARRY SCHWARTZ
pigeons showed an appreciable increase in se-
quence variability. In the initial sessions of the EXPERIMENT 1
procedure, the pigeons obtained reinforcers in METHOD
about 32% of trials; at the end of the proce-
dure, they obtained reinforcers in about 36% Subjects
of trials. In contrast, when the sequence task Six experimentally naive White Carneaux
contained no requirement of sequence vari- pigeons were maintained at 80% of their free-
ability, pigeons obtained reinforcers on 70 to feeding weights.
95% of trials.
This finding led Schwartz to speculate that Apparatus
perhaps reinforcement was not an effective Four Gerbrands pigeon chambers (G7313)
procedure for producing variations in response contained three-key pigeon intelligence panels.
topography. Perhaps reinforcement led inex- The keys were Gerbrands normally closed
orably to stereotyped responses. Schwartz keys, requiring a force of .1 N to operate. They
showed in another experiment (Schwartz, 1980, were spaced 7.5 cm apart, center-to-center, and
Experiment 5) that response sequences were were located 21 cm above the grid floor. A
not completely unmodifiable. If sequences grain hopper was directly below the center
were required to begin with two left key pecks, key, 5.5 cm above the grid floor, and a pair of
or with a left-right alternation, pigeons modi- houselights was located in the ceiling of the
fied their sequences accordingly. However, chamber. The houselights were illuminated
when they were simply required to vary se- throughout experimental sessions, except dur-
quences, behavior was not modified effectively. ing 4-sec feeder operations, when a light in the
However, it would be premature to suggest feeder was illuminated.
that reinforcement cannot produce sequence On the left wide wall of each chamber was
variability. There were two features of mounted a 5 by 5 matrix of red lights, spaced
Schwartz's procedure that might have worked 2 cm apart. The lights were .84 cm in diameter
against the variability requirement. First, the and .04 amp (Dialco No. 507-3917-1471-60D).
pigeons were highly trained and were emitting The top row of lights was 20 cm from the grid
stereotyped sequences when the procedure be- floor, and the right column (closest to the in-
gan. Perhaps with naive pigeons that had not telligence panel) was 4 cm from the panel.
yet developed stereotyped sequences, the vari- Scheduling of experimental events, data col-
ability contingency could effectively control lection, and data analysis were accomplished
behavior. Second, in the Schwartz experiment, with a Digital Equipment Corporation PDP
the interval between trials was 10 sec. Thus, to 8/E digital computer using interfacing and
master the contingency, pigeons had to bridge software provided by State Systems Incorpo-
a 10-sec delay between what they had just done rated, Kalamazoo, Michigan.
and what they would do next. Although one
can imagine strategies that would effectively Procedure
bridge the delay (e.g., the pigeon could spend Pretraining. The pigeons were trained to eat
the entire intertrial period next to the key on from the food magazine, after which they were
which the next sequence should begin), such exposed to a modified autoshaping procedure
a long intertrial period certainly does not opti- (Brown & Jenkins, 1968). Each session con-
mize the chances for successful performance. sisted of 50, 6-sec trials, separated by a variable
The present experiments were designed to intertrial interval (X = 40 sec). Each of three
explore further whether pigeons could learn to trial types was equiprobable: either the left
vary response sequences. Experiment 1 re- key was illuminated with white light, or the
moved both of the obstacles that were present right key was illuminated with white light, or
in the initial experiment: the pigeons were both keys were. These three types of trials oc-
naive at the start of training, and the intertrial curred in random order. After 6 sec, the key-
interval was reduced to .5 sec. Experiment 2 light (s) was extinguished and the feeder oper-
attempted to produce sequence variability by ated. Key pecks were recorded but had no
first explicitly establishing a repertoire that programmed consequence. Each pigeon was
contained two distinct sequences that occurred exposed to the autoshaping procedure for five
with appreciable frequency. full sessions after the one in which pecking be-
RESPONSE VARIABILITY AND REINFORCEMENT 173
gan. At the end of pretraining, all pigeons sions. It was then reintroduced for 50 sessions
were reliably pecking both keys (when illumi- and removed again for 30 sessions. Thus, the
nated). present experiment differed from the one re-
Sequence Training Procedure. Daily sessions ported in Schwartz (1980) in two respects: the
consisted of 50 trials, separated by an intertrial ITI was .5 rather than 10 sec, and the variabil-
interval (ITI) of .5 sec. At the beginning of ity requirement was imposed before stereo-
each trial, the two side keys were illuminated typed sequences could develop.
with white light and the top left matrix light
was lit. Each peck on the left key extinguished RESULTS AND DISCUSSION
the currently illuminated matrix light and lit In previous experiments of this type, there
the one to its right; each peck on the right key have been three dependent variables of pri-
also extinguished the currently illuminated mary interest: the number of reinforcers ob-
matrix light and lit the one beneath it. Four tained, the number of different sequences
left key pecks were required to move the ma- emitted, and the frequency of the sequence
trix light from extreme left to extreme right, that became dominant. Figure 1 shows the
and four right key pecks were required to number of reinforcers obtained per session by
move the matrix light from extreme top to ex- each pigeon, averaged across the first and last
treme bottom. To obtain a reinforcer it was five sessions of each procedure. The procedure
necessary to move the matrix light from the requiring sequence variability is labeled "DIF"
top left to the bottom right, that is, to peck (for different sequences) in the figure, and the
each key four times. A fifth peck on either key standard sequence procedure is labeled "SEQ."
terminated a trial immediately, without rein- One hundred sessions of exposure to the vari-
forcement. In all, there were 70 different se- ability contingency did not result in mastery of
quences of left and right key pecks that could the contingency. Only Pigeon A9 was obtain-
satisfy the reinforcement contingency. ing more than 20 reinforcers per session by the
All pigeons were placed on the sequence end of training. That this represents ineffec-
procedure immediately after the autoshaping tive performance in contrast to what one ob-
pretraining described above. Generally, this serves on the standard sequence task is clear
pretraining was sufficient to ensure that pi- from the second pair of bars for each pigeon.
geons would peck both keys on most sequence Within five sessions of exposure to the stan-
trials. If they were pecking both keys, they dard procedure, five of the pigeons were ob-
tended to obtain enough reinforcers in early taining 40 or more reinforcers a session. By the
sessions to keep them pecking until they mas- end of 30 sessions, all pigeons were performing
tered the contingency. However, one pigeon effectively. When the variability contingency
(Al0) tended to peck exclusively on one key
and thus obtained no reinforcers at all. For so
Z 20 .L7 B
Variability Requirement
Aside from having to peck each key four
times, the pigeons were required to vary their ODf 335 DiV SuA DIir Su air
V Su VUSt DIV SEO
30
-.i
06
FFL mm m mm
87 Ba
R
RET0
BR
LR
R
RL
R
RE
97
L
LB
L
RL
GROUP
FI
RUFT R
GENERAL DISCUSSION
Mu T AFT LR RL
The results of these experiments confirm
UL J
OTT Ri
that the idea of informatiWe but nonreinforc- Lepper, M. R-., & Greene, D. (Eds.). The hidden costs of
ing consequences has much to do with the cen- reward. Hillsdale, N.J.: Erlbaum, 1978.
Lepper, M. R., Greene, D., & Nisbett, R. E. Under-
tral ideas of behavior analysis over its history. mining children's intrinsic interest with extrinsic
To the extent that this distinction is impor- reward: A test of the "overjustification" hypothesis.
tant, it may indicate significant incompleteness Journal of Personality and Social Psychology, 1973,
in the traditional concepts of behavior anal- 28, 129-137.
ysis. Notterman, J. M., & Mintz, D. E. Dynamics of re-
sponse. New York: Wiley, 1965.
Olton, D. S. Mazes, maps and memory. American Psy-
chologist, 1979, 34, 583-596.
REFERENCE NOTES Olton, D., & Samuelson, R. J. Remembrance of places
1. Schwartz, B. Interval and ratio reinforcement of a passed: Spatial memory in rats. Journal of Experi-
complex, sequential operant in pigeons. Manuscript mental Psychology: Animal Behavior Processes, 1976,
submitted for publication. 2, 97-116.
2. Schwartz, B. Stereotypy without automaticity in pi- Pryor, K. W., Haag, R., & O'Reilly, J. The creative
geons. Manuscript in preparation. porpoise: Training for novel behavior. Journal of
the Experimental Analysis of Behavior, 1969, 12,
653-661.
REFERENCES Schick, K. Operants. Journal of the Experimental
Analysis of Behavior, 1971, 15, 413-423.
Blough, D. S. The reinforcement of least-frequent in- Schwartz, B. Studies of operant and reflexive key pecks
terresponse times. Journal of the Experimental Anal- in the pigeon. Journal of the Experimental Analysis
ysis of Behavior, 1966, 9, 581-591. of Behavior, 1977, 27, 301-313.
Brown, P. L., & Jenkins, H. M. Auto-shaping of the Schwartz, B. Development of complex, stereotyped be-
pigeon's key-peck. Journal of the Experimental havior in pigeons. Journal of the Experimental
Analysis of Behavior, 1968, 11, 1-8. Analysis of Behavior, 1980, 33, 153-166.
Deci, E. L. Intrinsic motivation. New York: Plenum, Schwartz, B. Reinforcement creates behavioral units.
1975. Behaviour Analysis Letters, 1981, 1, 33-41. (a)
Feingold, B. D., & Mahoney, M. J. Reinforcement ef- Schwartz, B. Control of complex, sequential operants
fects on intrinsic interest: Undermining the overjus- by systemnatic visual information in pigeons. Journal
tification hypothesis. Behavior Therapy, 1975, 6, 367- of Experimental Psychology: Animal Behavior Pro-
377. cesses, 1981, 7, 31-44. (b)
Goetz, E. M., & Baer, D. M. Social control of form di- Schwartz, B. Reinforcement induced behavioral stereo-
versity and the emergence of new forms in children's typy: How not to teach people to discover rules.
blockbuilding. Journal of Applied Behavior Analy- Journal of Experimental Psychology: General, 1982,
sis, 1973, 6, 209-217. 111, in press.
Gollub, L. Conditioned reinforcement: Schedule ef- Skinner, B. F. The generic nature of the concepts of
fects. In W. K. Honig & J. E. R. Staddon (Eds.), stimulus and response. Journal of General Psychol-
Handbook of operant behavior. Englewood Cliffs, ogy, 1935, 12, 40-65.
N.J.: Prentice-Hall, 1977. Vogel, R-., & Annau, Z. An operant discrimination task
Herrnstein, R. J. Stereotypy and intermittent rein- allowing variability of reinforced response pattern-
forcement. Science, 1961, 133, 2067-2069. ing. Journal of the Experimental Analysis of Behav-
Horton, R. African traditional thought and Western ior, 1973, 20, 1-6.
science. Africa, 1967, 37, 50-71 & 155-187.
LaBerge, D., & Samuels, S. J. Toward a theory of auto-
matic information processing in reading. Cognitive Received June 19,1981
Psychology, 1974, 6, 293-323. Final acceptance October 15, 1981