9/27/2010
Psyc100, Chapter 4, Lecture 1
Classical Conditioning
Operant Conditioning
Different Perspectives of learning ◦ Behavioural ◦ Cognitive ◦ Ethological perspective
Learning - any process through which experience at one time can alter an individual’s behavior at a future time ◦ E.g. Habituation ◦ A reduction in response to a stimulus after repeated presentations – e.g. startle response
The attempt to understand observable activity in terms of observable stimuli and observable responses
John B. Watson (1913)
◦ “Give me a dozen healthy infants and my own specified world to bring them up in and I’ll
guarantee to take any one at random and produce
teachers, lawyers …”
B.F. Skinner (1938)
Training procedure
Neutral stimulus elicits reflexive response through being paired with another stimulus that already elicits this reflexive response
Reflex = Stimulus-response sequence mediated by CNS
Stimulus Response = action that automatically follows an event
NO REACTION |
REFLEX ACTION |
REFLEX ACTION |
CONDITIONED |
RESPONSE |
1
9/27/2010
UCS |
![]() |
|||
(drug) |
||||
UCR |
||||
(nausea) |
||||
CS |
![]() |
|||
(waiting |
||||
room) |
UCS |
![]() |
||
(drug) |
||||
UCR |
||||
(nausea) |
||||
CS |
![]() |
|||
(waiting |
||||
room) |
CR |
|||
(nausea) |
2
9/27/2010
The conditioned stimulus is not truly lost during extinction, but is Inhibited.
Eye-blink reflex studies (i.e. Tone + Puff of Air) in rabbits has shown that conditioning and extinction involve different sets of neurons
Neurons involved in conditioning excite neurons that control eye-blinks.
Neurons involved in extinction inhibit neurons that control eye-blinks
After conditioning, stimuli that resemble the CS will elicit the response even when they have never been paired before.
Depends on the degree of similarity between new stimuli & conditioned stimuli.
Further a new tone is away from the original tone, the less the dog salivated.
Generalisation can be abolished if the response to one is reinforced while the response to the other is extinguished
E.g. Conditioning to black square is generalised to grey square.
Grey square extinguished (no pairings)
Eventually conditioned the dog to discriminate a black square from a grey that was (almost) imperceptibly different from the black.
Allows investigations of sensory capacities
Generalisation occurs in both physically and semantically similar stimuli (Razran, 1939)
Paired words with lemon juice squirts
Style, urn, freeze, surf = Salivate
Generalised to fashion, vase, chill and wave
Did not generalise to homophones or orthographically similar words (e.g. Serf, stile, etc)
These associations must be encoded deeply (i.e. Semantic rather than surface forms)
John B. Watson and Little Albert
◦ first psychologist to explain human behavior in terms of Pavlovian conditioning
◦ Fear not seen as feeling but observable behavior, catching breath, stiffening body turning away
◦ This emotion can be thought of as a reflex and is therefore amenable to scientific investigation through CC
3
9/27/2010
We don’t just react to stimuli, we often behave in ways that produce certain changes
Actions that result in a particular goal are known as operant responses
On first trial cat engaged in many different behaviors until accidentally opening box
After 20 – 30 trails cat could open box almost immediately after entering it.
Learning = Trial and Error
4
9/27/2010
Responses that produce satisfying effects in particular situations become more likely to occur again in that situation, and responses that produce discomforting effects become less likely to occur again
Pavlovian condition: Animal = passive agent
Thorndike: Animal = active agent that emits behaviour from its own accord
Did not like the term “satisfying” ◦ Used term “reinforcer” for any event that follows a
behavior AND
strengthens the behavior
Invented better apparatus: the Skinner box ◦ Animals could be kept in the box for the whole duration of the experimental session whilst multiple conditioning trials could take place
Consequences ◦ positive and negative reinforcement ◦ positive and negative punishment
Punished behavior is not forgotten, it's suppressed--behavior returns when punishment is no longer present
Causes increased aggression- shows that aggression is a way to cope with problems
Creates fear that can generalize to undesirable behaviors, e.g., fear of school
Does not necessarily guide toward desired behavior--reinforcement tells you what to do, punishment only tells you what not to do
Punishment teaches how to avoid punishment
5
9/27/2010
Allows complex behaviours to be conditioned
Process reinforces gradually more desired responses until final response is achieved
◦ E.g. The grading system in musical instrument learning
◦ Reinforcement stays consistent (certificate etc.) required behaviour becomes more complex
Continuous Reinforcement ◦ reinforcing the desired response each time it occurs ◦ learning occurs rapidly ◦ extinction occurs rapidly
Partial Reinforcement ◦ reinforcing a response only part of the time ◦ learning occurs slowly ◦ resistance to extinction
Continuous: 1 to 1 ratio, a prize every time
|
|
◦ |
fixed: 1 to ?, a prize every ? time |
◦ variable: ? to ?, maybe a prize, maybe not! |
|
|
◦ fixed: announced examination ◦ variable: pop quiz
Ratio Schedules = higher response rates - the more you respond the faster you reach the set threshold and receive the UCS
Variable Schedules = greater resistance to extinction as you are never sure when a reward will be presented so you don’t know when is a good time to give up - it could always be “just about to pay out” (fruit machines)
Indicate when a reinforcer is potentially available
E.g. a lever press will only result in a food pellet when a red light is illuminated ◦ Red light = discriminative stimulus
E.g. waiting for the “ready” green light to flash on a camera before taking a picture
E.g. waiting until people are in a good mood before asking them a favour
6
9/27/2010
Primary Reinforcer ◦ An innate reinforcer ◦ Satisfies a biological need
Secondary Reinforcer ◦ A conditioned reinforcer ◦ An event that gains its reinforcing power through its association with a primary reinforcer
Tokens are secondary reinforces
◦ Reinforcing value related to previous learning experiences ◦ E.g. Work > Money > Buy Food
Chimpanzees will work for tokens and save them for use later when grape vending machine is removed from enclosure
Clicker training
The most famous example was Azrin's token economy in psychiatric hospital
Tokens could be exchanged for cosmetics, candy, cigarettes, clothing, bedside tables, use of the TV, stereo, & sleeping late
Tokens were obtained by attending work and therapy as well as for good grooming, appropriate meal time behaviors, and minor housekeeping chores
Reduction in bizarre behaviors and increased normal behaviors and social skills
Advantages of Tokens
◦ Potent reinforcers big changes in behavior ◦ Bridge the delays between target responses and back up reinforcers
◦ Backed up by a variety of items and hence are less subject to satiating.
◦ Administration does little to disrupt on going behavior
◦ Can be used with many individuals all with different back up reinforcer preferences
◦ Can be accumulated towards valuable goals
Disadvantages of Tokens ◦ Not usually seen in classrooms nor are food snacks or other unusual back up reinforcers ◦ Tokens, except for money and grades, are unavailable in the normal environment ◦ In some environments, people will use unauthorized means such as force/theft to obtain
7
In groups
Think of some real world (human or non- human animal) examples
9/27/2010
8
Much more than documents.
Discover everything Scribd has to offer, including books and audiobooks from major publishers.
Cancel anytime.