Professional Documents
Culture Documents
Reinforcement
Psyc 220
Schedules of
Reinforcement
• Acquisition of new behavior and changes
in the rate and pattern of performance are
partly due to changes in reinforcement
contingencies.
• The rule describing the delivery of
reinforcement is called a schedule of
reinforcement.
Acquisition of new behavior.
Measuring Learning
• Learning to ride a bike
A rat may be trained to press a lever for food. After shaping the behavior, the experimenter may switch to
a schedule in which every third lever press is reinforced. This would be a FR3 schedule.
This kind of schedule results in high, steady rates of responding. Organisms are persistent in responding
because of the hope that the next response might be one needed to receive reinforcement.
Free Drink cards or Lottery Games (a Three strike rule would be fixed ration punishment schedules)
Noncontingent
Reinforcement
Schedules Fixed time or Variable
time schedules are
contingent on time rather
than Bx.
Stretching the Ratio
• It is like shaping, but in this case the trainer shapes persistence.
• The trainer might start with a continuous reinforcement schedule
and, when the animal is working at a steady rate, increase the ratio to
FR 3; when this schedule has been in force a while, the experimenter
may go to FR 5, then FR 8, FR 12, FR 20, FR 30, and so on.
• Rats will press levers and pigeons will peck disks hundreds of times for
a single reinforcer, even if that reinforcer is a small amount of food.
People have also been known to work steadily on very thin schedules
—schedules that require many responses for each reinforcement.
The tendency of behavior
that has been maintained
on an intermittent
schedule to be more Partial
resistant to extinction Reinforcement
than behavior that has
been on continuous Effect (PRE)
reinforcement.
Hypotheses About the PRE
• Discrimination Hypothesis
• Extinction takes longer after intermittent reinforcement because it is harder to distinguish (or
discriminate) between extinction and an intermittent schedule than between extinction and
continuous reinforcement. The frustration and sequential hypotheses are both variations of the
• Frustration Hypothesis discrimination hypothesis.
• Nonreinforcement of previously reinforced behavior is frustrating. Frustration is an aversive
emotional state, so anything that reduces frustration will be reinforcing.
• Sequential Hypothesis
• Attributes the PRE to differences in the sequence of cues during training.
• Response Unit Hypothesis
• This approach says that to understand the PRE we must think differently about the behavior
on intermittent reinforcement.
Complex Schedules
• Two or more simple schedules can be combined to form various kinds of complex schedules.
• When the schedules alternate and each is identified by a particular stimulus, this is a multiple
schedule.
• When the schedules alternate but there is no signal, this is a mixed schedule.
• In a chain schedule, reinforcement occurs only on completion of the last in a series of
reinforcement schedules, with each schedule change signaled by a change in stimulus.
• Tandem schedules resemble chain schedules except that there is no signal.
• A cooperative schedule makes reinforcement contingent on the behavior of two or more
individuals.
• In concurrent schedules, two or more schedules are available simultaneously, so that the
individual must choose among them.