Chap 5 Learning New
Chapter 5: Learning
Contents
Definition and Nature of Learning
Factors and Types of Learning
1. Classical Conditioning
2. Operant Conditioning
i) Reinforcement
ii) Punishment
iii) Schedule of Reinforcement
3. Latent Learning
4. Insightful Learning
Definition of Learning
Learning is a relatively permanent change in
immediate or potential behavior that results from
experience.
This definition has 3 critical aspects:
• Habituation:
• Desensitization:
Types of Learning
Associative Learning: Occurs when an organism
makes connections between two stimuli or events that
occur together in the environment.
Classical Conditioning: Organisms learn to associate
events (or stimuli) that repeatedly happen together.
Operant Conditioning: Organisms learn to associate
events – a behavior and consequence (reinforcement &
punishment).
Observational Learning: The process of watching
others and then imitating what they do.
Classical Conditioning
Classical conditioning is a type of learning in which a
neutral stimulus comes to bring about a response
after it is paired with a stimulus that naturally brings
about that response.
It originated from the experiments of Ivan Pavlov (1849-1936,
Russian physiologist), who won the Nobel Prize in
1904 for his work on digestion.
It is sometimes called Pavlovian conditioning.
Pavlov (1927) attached a tube to the salivary gland of a
dog, allowing him to measure precisely the dog’s
salivation.
Classical Conditioning (Cont’d)
a. Before Conditioning
Sound of Bell (Neutral Stimulus-NS) → No Response or Irrelevant Response
Meat (Unconditioned Stimulus-UCS) → Salivation (Unconditioned Response-UCR)
b. During Conditioning
Sound of Bell + Meat (NS + UCS) → Salivation (UCR)
c. After Conditioning
Sound of Bell (Conditioned Stimulus-CS) → Salivation (Conditioned Response-CR)
Figure: Basic Processes of Classical Conditioning
Basic Processes of Classical Conditioning
a. Before Conditioning
The ringing of a bell does not bring about salivation, making
the bell a neutral stimulus (NS).
In contrast, meat naturally brings about salivation, making the
meat an unconditioned stimulus (UCS) and salivation an
unconditioned response (UCR).
b. During Conditioning
The bell (NS) is rung just before the presentation of the meat
(UCS).
The goal is for the dog to associate the bell (NS) with the meat
(UCS), so that the bell alone comes to bring about salivation.
c. After Conditioning
After a number of pairings of the bell (NS) and meat (UCS),
the bell (conditioned stimulus-CS) alone causes the dog to
salivate (conditioned response-CR).
The previously neutral stimulus of the bell is now considered a
conditioned stimulus that brings about the conditioned
response of salivation.
Stimulus and Response in Classical Conditioning
Stimulus/Response | Description
Neutral Stimulus (NS) | A stimulus (e.g., sound of bell) that, before conditioning, does not naturally bring about the response of interest.
Unconditioned Stimulus (UCS) | A stimulus (e.g., meat) that naturally brings about a particular response without having been learned.
Unconditioned Response (UCR) | A response that is natural and needs no training (e.g., salivation at the smell of meat).
Conditioned Stimulus (CS) | A once-neutral stimulus (e.g., sound of bell) that has been paired with an unconditioned stimulus to bring about a response formerly caused only by the unconditioned stimulus.
Conditioned Response (CR) | The learned or acquired response to a stimulus that did not evoke the response originally (e.g., salivation at the ringing of a bell).
Some Phenomena in Classical Conditioning
Phenomenon | Description
Extinction | Occurs when a previously conditioned response decreases in frequency and eventually disappears.
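As a rough illustration of how a conditioned response can build up over bell-meat pairings and then fade during extinction, the sketch below uses the Rescorla-Wagner update rule. This model is not part of these slides; it is swapped in here as one simple assumed account. The function name, the learning rate `alpha`, and the number of trials are all hypothetical choices.

```python
def rescorla_wagner(trials, alpha=0.3, v0=0.0):
    """Return the associative strength of the CS after each trial.

    trials: iterable of booleans. True means the CS (bell) is followed by
            the UCS (meat); False means the CS is presented alone.
    alpha:  assumed learning rate between 0 and 1 (illustrative value).
    v0:     assumed starting strength (0 = bell is still a neutral stimulus).
    """
    v = v0
    history = []
    for ucs_present in trials:
        # The UCS supports a maximum strength of 1; its absence supports 0.
        target = 1.0 if ucs_present else 0.0
        # Move the current strength a fraction alpha toward the target.
        v += alpha * (target - v)
        history.append(v)
    return history


acquisition = [True] * 10   # bell paired with meat
extinction = [False] * 10   # bell presented alone
strengths = rescorla_wagner(acquisition + extinction)

for trial, v in enumerate(strengths, start=1):
    phase = "acquisition" if trial <= 10 else "extinction"
    print(f"trial {trial:2d} ({phase}): strength = {v:.2f}")
```

In this sketch the strength rises toward 1 while the bell is paired with meat and falls back toward 0 once the bell is presented alone, which mirrors the description of extinction above.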
Operant Conditioning
• A type of learning in which behavior is
strengthened if followed by reinforcement or
diminished if followed by punishment.
Operant Conditioning
Operant conditioning is a type of learning in which a
voluntary response is strengthened or weakened,
depending on its favorable or unfavorable consequences.
Thorndike (1898) explored how animals (cats) learn to
solve problems. He built a special cage, called a puzzle
box, that could be opened from the inside by pulling a string
or stepping on a lever.
Thorndike (1911) called this instrumental
learning because an organism’s behavior is instrumental in
bringing about certain outcomes.
Thorndike proposed the law of effect, which states that, in a
given situation, a response followed by a “satisfying”
consequence will become more likely to occur and a response
followed by an “unsatisfying” consequence will become less
likely to occur.
Operant Conditioning (Cont’d)
Skinner coined the term “operant conditioning” because the
organism produces a consequence by operating on its
environment.
Operant behaviors are behaviors that are emitted rather
than elicited by the environment.
Skinner studied rats in a chamber called the Skinner
box, a special chamber used to study operant conditioning
experimentally.
The basic idea behind operant conditioning is that behavior
is controlled by its consequences.
There are two types of environmental consequences that
produce operant conditioning:
1. Reinforcement, which strengthens the behavior
2. Punishment, which weakens the behavior
Operant Conditioning
Skinner box.
Positive Reinforcement
• Strengthens a response by presenting a stimulus after a
response.
Negative Reinforcement
• Strengthens a response by reducing or removing an
aversive stimulus.
Operant Conditioning (Cont’d)
Reinforcer
• Any event that STRENGTHENS the behavior it
follows.
1. Positive Reinforcer
A positive reinforcer is any stimulus whose presentation
increases the probability that a behavior will occur.
Positive reinforcers are pleasant stimuli that strengthen a
response or behavior by their presentation.
For example, if food, water, money, or praise is provided
after a response or behavior, it is more likely that that
response will occur again in the future.
Operant Conditioning (Cont’d)
1.1 Primary Reinforcer
A primary reinforcer is a stimulus that an organism naturally finds
reinforcing because it satisfies biological needs.
For example, food for a hungry person, water for a thirsty person,
warmth for a cold person, etc.
1.2 Secondary Reinforcer
A secondary reinforcer is a stimulus that acquires reinforcing
properties through its association with a primary reinforcer.
For example, money, praise, performance feedback, and grades
are secondary reinforcers that are crucial in everyday life.
2. Negative Reinforcer
A negative reinforcer is any stimulus whose removal increases
the probability that a behavior will occur.
Negative reinforcers are aversive or unpleasant stimuli that
strengthen a response or behavior by their removal.
For example, if you have an itchy rash (unpleasant stimulus) that
is relieved when you apply a certain brand of ointment, you are
more likely to use that ointment the next time you have the itchy
rash.
Primary Reinforcer
• An innately reinforcing stimulus
Conditioned (Secondary) Reinforcer
• A stimulus that gains its reinforcing power
through its association with a primary reinforcer.
• Money is also a GENERALIZED
REINFORCER!
Operant Conditioning
Punishment
• An event that DECREASES the behavior that it follows.
Two Types of Punishment
• Positive Punishment
• Adding something unpleasant
• Ex.: Spanking
• Negative Punishment or Omission Training
• Taking away something pleasant
• Ex.: Taking car away for bad grades
Operant Conditioning (Cont’d)
Punishment
Punishment is the process by which a stimulus decreases
the probability that a previous behavior will occur again.
Punishment occurs when a response or behavior is
weakened by an outcome that follows it.
Punisher
A punisher is any stimulus whose presentation or removal
decreases the probability that a preceding behavior will
occur.
In other words, a punisher is a stimulus that weakens a response or
behavior.
For example: if a rat presses a bar and is shocked
immediately afterward, the behavior of bar pressing will
occur less often.
There are two types of punisher: positive and negative.
Operant Conditioning (Cont’d)
1. Positive Punisher
A positive punisher is any stimulus whose presentation
decreases the probability that a behavior will occur.
Positive punishers are aversive or unpleasant stimuli that
weaken a response or behavior by their presentation.
For example: spanking a child for misbehaving or spending
ten years in jail for committing a crime.
2. Negative Punisher
A negative punisher is any stimulus whose removal
decreases the probability that a behavior will occur.
Negative punishers are pleasant stimuli that weaken a
response or behavior by their removal.
For example: demoting an employee with a cut in pay
after a poor job evaluation, or taking away TV for one week
from two siblings who fought over a toy.
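The four kinds of consequence described so far differ on two questions: is a stimulus presented or removed, and does the behavior become more or less likely? The sketch below encodes that two-by-two scheme. The function name and the string labels are hypothetical and chosen only for illustration.

```python
def classify_consequence(stimulus_change, behavior_change):
    """Name the operant consequence from two observations.

    stimulus_change: "presented" or "removed" (what happens to the stimulus
                     after the behavior).
    behavior_change: "increases" or "decreases" (what happens to the
                     probability of the behavior).
    """
    if stimulus_change not in ("presented", "removed"):
        raise ValueError("stimulus_change must be 'presented' or 'removed'")
    if behavior_change not in ("increases", "decreases"):
        raise ValueError("behavior_change must be 'increases' or 'decreases'")

    # "Positive" means a stimulus is presented; "negative" means it is removed.
    sign = "positive" if stimulus_change == "presented" else "negative"
    # Reinforcement strengthens behavior; punishment weakens it.
    kind = "reinforcement" if behavior_change == "increases" else "punishment"
    return f"{sign} {kind}"


# Examples taken from the slides:
print(classify_consequence("presented", "increases"))  # praise after a response
print(classify_consequence("removed", "increases"))    # ointment relieves an itchy rash
print(classify_consequence("presented", "decreases"))  # shock after a bar press
print(classify_consequence("removed", "decreases"))    # no TV after a fight
```

Note that "positive" and "negative" describe whether a stimulus is added or taken away, not whether the outcome is pleasant.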
Operant Conditioning (Cont’d)
Continuous RS vs. Partial/Intermittent RS
• Continuous RS: quick acquisition, quick extinction
Partial Reinforcement
• Reinforcing a response only part of the time.
• The acquisition process is slower.
• Greater resistance to extinction.
• There are two main types of intermittent schedules: ratio and interval.
• Variable-ratio schedule: a schedule of reinforcement that reinforces a response after an unpredictable number of responses.
Pop Quizzes
Operant Conditioning
• Fixed-ratio (FR) schedules require that a set number of responses be made before a reinforcer is delivered.
• Variable-ratio (VR) schedules require that the participant perform differing numbers of responses to obtain a reinforcer.
Operant Conditioning
• With a fixed-interval (FI) schedule, the time interval is constant.
• The time interval changes after each reinforcer is delivered when a variable-interval (VI) schedule is used.
• Ratio schedules generally produce higher rates of responding than interval schedules.
Operant Conditioning (Cont’d)
Type of Schedule | When Reinforcers Are Delivered | Effect on Rate of Behavior
Fixed ratio | After a fixed number of behaviors | High rate of behavior with a small pause after each reinforcer
Variable ratio | After a variable number of behaviors | High and steady rates of behavior
Fixed interval | After a fixed amount of time | Low rates of behavior at the beginning of the interval and high rates toward the end
Variable interval | After a variable amount of time | Slow and steady rates of behavior
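The delivery rules of the four schedules can be sketched as a small simulation that decides which responses earn a reinforcer. This is a minimal illustration under stated assumptions, not a standard implementation: the function name, the `seed` parameter, and the way variable requirements are drawn (uniformly around the average `value`) are hypothetical choices.

```python
import random


def reinforced_responses(response_times, kind, value, seed=0):
    """Return the indices of the responses that earn a reinforcer.

    response_times: increasing times (in seconds) at which responses occur.
    kind:  "FR", "VR", "FI", or "VI".
    value: responses required (ratio schedules, an integer) or seconds
           required (interval schedules). For variable schedules this is
           the average requirement.
    """
    rng = random.Random(seed)

    def next_requirement():
        if kind in ("FR", "FI"):
            return value                          # constant requirement
        if kind == "VR":
            return rng.randint(1, 2 * value - 1)  # assumed: averages `value`
        if kind == "VI":
            return rng.uniform(0, 2 * value)      # assumed: averages `value`
        raise ValueError("kind must be 'FR', 'VR', 'FI', or 'VI'")

    reinforced = []
    requirement = next_requirement()
    count = 0        # responses made since the last reinforcer
    last_time = 0.0  # time at which the last reinforcer was delivered
    for i, t in enumerate(response_times):
        count += 1
        if kind in ("FR", "VR"):
            earned = count >= requirement          # ratio: count responses
        else:
            earned = t - last_time >= requirement  # interval: measure time
        if earned:
            reinforced.append(i)
            count = 0
            last_time = t
            requirement = next_requirement()
    return reinforced


# One response per second for 20 seconds.
times = [float(t) for t in range(1, 21)]
for kind in ("FR", "VR", "FI", "VI"):
    print(kind, reinforced_responses(times, kind, 5))
```

On ratio schedules, responding faster brings reinforcers sooner, whereas on interval schedules extra responses before the interval has elapsed earn nothing. This is consistent with the higher response rates listed for ratio schedules above.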
Classical versus Operant Conditioning
Concept | Classical Conditioning | Operant Conditioning
Basic principle | Building associations between a conditioned stimulus and conditioned response. | Reinforcement increases the frequency of the behavior preceding it; punishment decreases the frequency of the behavior preceding it.
Nature of behavior | Based on involuntary, natural, innate behavior. Behavior is elicited by the unconditioned or conditioned stimulus. | Organism voluntarily operates on its environment to produce a desirable result. After behavior occurs, the likelihood of the behavior occurring again is increased or decreased by the behavior’s consequences.
Order of events | Before conditioning, an unconditioned stimulus leads to an unconditioned response. After conditioning, a conditioned stimulus leads to a conditioned response. | Reinforcement leads to an increase in behavior; punishment leads to a decrease in behavior.
Insightful Learning: happens all of a sudden
through understanding the relationships among the
various parts of a problem rather than through
trial and error.