Professional Documents
Culture Documents
➔ Negative Reinforcement
- It consists of removing an aversive stimulus following a response, which
leads to an increase in the future strength of that response.
- For example, if we wave our hands at a bothersome wasp and the wasp
flies away, we will likely repeat that action with the next wasp that annoys
us.
- Negative reinforcement is associated with two types of behaviour:
1. Escape behaviour
- in which the performance of the behaviour terminates the
aversive stimulus
- For example, we escape from the rain when we run indoors
after it has started.
2. Avoidance Behaviour
- occurs when the performance of the behaviour prevents the
aversive stimulus from occurring.
- For example, we avoid the rain when we head indoors
before it has started.
- Typically, one first learns to escape from an aversive
stimulus and then to avoid it.
2. Operant Conditioning
- in which moving away from the CS is negatively reinforced
by a reduction in fear.
Light (SD): Climb over the barrier (R) -> Reduction in fear
(SR)
➔ One-Process Theory
- Alternative theory to explain avoidance behaviour
- Suggests the act of avoidance is negatively reinforced by a lower rate of
aversive stimulation.
Does not acknowledge internal states (covert behaviours)
The attractive aspect of this theory is that it does away with any
reference to an internal state of fear, the existence of which has to
be inferred. The overall reduction in aversive stimulation that
accompanies avoidance is instead regarded as a sufficient
explanation for the behaviour.
➔ Avoidance conditioning
- Phobia
- Obsessive- Compulsive Disorder
➔ Stampfl, 1987:
- Experimental Avoidance Procedure
Stop conveyor belt for 3 minutes
➢ After a two-hour session:
➢ FR1 to FR10
➢ Rat learned to run past photocells when the conveyor belt
started
➢ Avoided black sidewalls in more than 1000 consecutive trials
➢ Avoided shock after only one exposure, just like humans
- Compulsions
Repetitive actions in response to obsessions
- (E.g., compulsively checking grades online)
- Associated with a decrease in anxiety (Anxiety down)
➔ OCD
- Exposure and Response Prevention (ERP)
Method of treating OCD that involves prolonged exposure to the
anxiety-arousing event while not engaging in the compulsive
behaviour pattern that reduces anxiety.
- Combination of systemic desensitization and flooding
procedures (^90 minute sessions)
- Advised to disengage from compulsive behaviour pattern
although Mowrer’s two-process theory has proven quite useful in enhancing our
understanding and treatment of OCD, it does not provide an entirely adequate
explanation for it.
➔ Approach-Avoidant Conflict (Or I can’t believe i volunteered for this)
- Explanation for events that are sometimes rewarding (appetitive) but
sometimes punishing (aversive).
- Distance affects the value of consequences
- The immediacy of a reinforcer or punisher can affect motivation.
➔ Punishment
- Involves the weakening of a behavior through the application of an
aversive stimulus or the removal of an appetitive stimulus.
- Positive Punishment
The presentation of a stimulus following the occurrence of a target
behaviour which decreases the likelihood the target behaviour will
occur again in the future.
Adding a stimulus
- Negative Punishment
The removal of a stimulus following the occurrence of a target
behaviour which decreases the likelihood the target behaviour will
occur in the future.
Removing a stimulus
Time out
- Loss of access to positive reinforcers for a brief period of
time following an instance of target behaviour.
➢ Need to also reinforce appropriate behaviour.
➢ Can be effective after a short duration (1 minute); is
not recommended for a long period of time- less than
10 minutes.
Response Cost
- Removal of a specific reinforcer following the occurrence of
a problem behaviour
- Item must be a reinforcer to have an impact on the
behaviour.
➔ Types of Punishment
1. Intrinsic Punishment
- Target behaviour is inherently punishing and less likely to be
repeated.
- Behaviour itself is punishing.
-
2. Extrinsic Punishment
- Target behaviour is not inherently punishing, and the punisher is
externally delivered following a target behaviour.
3. Primary (Unconditioned) punisher
- Innately punishing events
- Events that we are born to dislike
4. Secondary (Conditioned) Punisher
- Events that become punishing due to their association with another
punisher
- Tone (NS): Shock (US) -> Fear (UR)
- Tone (CS) -> Fear (CR)
- Running in a wheel (R) -> Tone (SP)
5. Generalized (Secondary) Punisher
- Event that has become punishing because it has, in the past, been
associated with many other punishers.
➔ Benefits of Punishment
- Can lead to an increase in social behaviour
- Sometimes results in mood improvement
- Can Increase attention to the environment
➔ Theories of Punishment
1. Conditioned Suppression Theory
- Punishment does not weaken behaviour, it produces an emotional
response that interferes with behaviour occurrence.
Does not produce a lasting change in the target behaviour
2. Avoidance Theory of Punishment
- Involves type of avoidance conditioning where the avoidance
response is any behaviour other than the behaviour being
punished.
Agrees with the suppression theory that punishment does
not directly weaken behaviour; it replaces the behaviour with
other avoidance responses
3. The Premack Approach to Punishment
- A low probability behaviour (low-p) can be used to punish a
high-probability behaviour (high- p).
Implies that punishment (weakens behaviour) is the opposite
of reinforcement (strengthens behaviour)
- Noncontingent Punishment
- What if the aversive event was essentially uncontrollable (even
unpredictable), such that whatever you do, you are unable to
influence your exposure to that event?
➔ Learned Helplessness
- Decrement in learning ability resulting from repeated exposure to
uncontrollable aversive events (e.g., depression after series of
uncontrollable aversive events).
- Cure? People will eventually recover the ability to escape if repeatedly
forced to escape the aversive stimulus prior to exposure.
- Prevention? The history of successfully overcoming minor adversities
protects against depression when later confronted by more serious
difficulties.
VR 20 ↓ ↓VR 50
Food Food
➔ For example,
Key pec
↓ VI60-s
Food
Delay (2-sec)
Key peck
VI 30-sec ↓
Food
2. Overmatching
➢ In overmatching, the proportion of responses on the richer
schedule versus the poorer schedule is more different than would
be predicted by matching (to remember this, think of overmatching
as more different).
➢ For example, the matching law predicts that the proportion of
responses should be .75 on the richer VI 20-sec schedule and .25
on the poorer VI 60-sec schedule. If we instead find proportions of
.85 and .15, respectively, then overmatching has occurred. There is
more of a difference in responding between the richer and poorer
schedules than would be predicted by matching.
➢ Overmatching can occur when the cost of moving from one
alternative to another is very high.
➢ For example, Baum (1974) found that overmatching occurred
when a pigeon had to walk around a partition and climb across a
wooden hurdle to switch from one response key to another. The
pigeon switched less often and spent more time on the richer
alternative than the matching law would predict. Similarly, a
predator that has to cross a deep, fast-moving river to move from
one food patch to another might make the trip only infrequently and
spend considerably more time in the richer food patch than
predicted by matching.
Self-Control
➔ We know what to do...but have trouble with the execution.
➔ Lack of Self Control? Homework!!!! Studying but not
➔ self-control seems to require a certain mental faculty called willpower.
➔ What is “Willpower”?
- Self-control and willpower seem to have circular reasoning…
- If A then B <-> If B then A
- ability to proceed to your plan.
➔ Skinner on Self-Control
➢ Skinner (1953) viewed self-control, or “self-management,” not as an issue
of willpower but as an issue involving conflicting outcomes.
➢ For example, drinking alcohol can lead to both positive outcomes (e.g.,
increased confidence and feelings of relaxation) and negative outcomes
(e.g., a hangover along with that horrible tattoo you found on your arm the
next morning).
➢ Skinner also proposed that managing this conflict involves two types
of responses:
1. A Controlling response
- Alters frequency of controlled response (e.g. reducing the
amount of money you take out)
2. Controlled Response
- Result of controlled response (e.g. Spending less money)
➔ self-control
- consists of choosing a larger later reward over a smaller sooner reward;
the opposite of self-control, known as impulsiveness, consists of
choosing a smaller sooner reward over a larger later reward.
- The task of choosing between such alternatives is known as a delay of
gratification task because the individual must forgo the smaller sooner
reward (i.e., the subject has to “delay gratification”) to obtain the larger
later reward.
➔ Delay of Reinforcement:
➢ FI10-sec compared to FI15-sec = slightly stronger preference for the
FI-10sec schedule
- 5-second difference
➢ - FI10-sec compared to FI5-sec= moderately stronger preference for the
FI0-sec compared to FI5-sec
- 5-second difference
➔ Relative values
➢ Preference Reversals:
- At an early time, the larger later reward (LLR) is preferred
- As time passes the smaller sooner reward (SSR) becomes
imminent
- The value of SSR increases higher than the value of the LLR
- The SSr outweighs the value of the LLR.
➔ Behaviour Contract
➢ Outlines the target behaviour for changes and the consequences obtained
for the omission or engagement of the target behaviour.
➔ Small-But-Cumulative Effect Model
➢ Each individual choice of self-control task has a small but cumulative
effect on our likelihood of obtaining the desired long-term outcome.
- Helps explain why self-control is difficult
● Example, =Hey, it’s been a tough day, so why not indulge
just this once?
- The small effect, in the long run, accounts for the maintenance and
increase of the SSR value as the time delay for reward delivery
decreases.
● No immediate punishment contingency
➢ To improve Self-control
- Make salient that individuals' choices are not isolated events, but
rather parts of a whole.
- Have a relapse prevention plan.
- Establish rules that clearly distinguish between acceptable and
unacceptable behaviours.