Week 3: Learning
Week 3: Learning
1
Learning Objectives:
What is learning?
Learning results from our experience and interaction with the objects and the
events of our environment. It is defined as ‘as a process through which experience
produces a lasting change in behavior or mental processes’.
Without the ability to form new knowledge our lives would rely entirely on
reflexes and on the innate (inborn) behaviors (instincts). More complex learning
behaviors are evolutionary advances over instincts, allowing prompt adjustments to
the environment (Zimbardo, Weber, & Johnson, 2003, pp. 206).
2
Mere exposure effect is the tendency to prefer familiar stimuli (this is a
technique highly used in advertising).
The experiment:
Pavlov and his team used an untrained dog. In intervals, a tone was sounded,
and the dogs were given food. The first reaction to the tone was an orienting
response (stretching of the ears and turning the head towards the source of the
sound). Repeated pairings between the tone and the food, the orienting
response stopped, and the dogs began to salivate in response to the tone alone.
3
Main components of Classical Conditioning:
Unconditioned stimulus (UCS): stimulus that activates a reflexive
response (e.g. food provokes salivation reflex)
Unconditioned response (UCR): the response elicited by an
unconditioned stimulus without prior learning (e.g. salivation at the sight of
food). No learning is involved between UCS and UCR
Acquisition: is the first stage of classical conditioning. A neutral stimulus
(e.g. a tone) is repeatedly paired with a UCS. After several trials, the
neutral stimulus elicits the same response as does the UCS
Conditioned Stimulus (CS): a (previously) neutral stimulus which elicits a
conditioned response (e.g. the tone produces salivation).
Conditioned Response (CR): a response elicited by a previously neutral
stimulus that has become associated with the unconditioned stimulus.
In Classical Conditioning, proper timing of the pairing between the two stimuli is
essential. In most cases, the CS and UCS must be presented contiguously (closely
together) so that the connection can be made (Zimbardo, Weber, & Johnson, 2003,
pp. 209).
Classical Conditioning
CS (Bell) CR (Salivating)
4
Figure 1 Illustrates Pavlov’s Classical Conditioning experiment. Step one food was
presented to the dog (causing salivation). Food was presented, while the tuning
fork was struck. This act was paired and repeated several times. The tuning fork
was struck without the food presented and salivation occurred. Conditioning was
retrieved from https://psychpics.com/2015/12/11/behaviorism-i-historical-progress/
5
Generalization
Generalization is a process where the conditioned response occurs to stimuli
similar with the CS (e.g. dogs salivate to sounds similar to the conditioned
stimulus).
Discrimination
Discrimination is the opposite of generalization. An organism learns to respond to
one stimulus but not to stimuli that are similar (e.g. the dogs distinguish between
different sounds). Practically, this was done by presenting one tone where food
followed, whereas another tone was not.
(Zimbardo, Weber, & Johnson, 2003, pp. 208-212)
2.1.2 Applying Classical Conditioning
Pavlov’s experiments referred to appetitive conditioning (pleasant stimulus e.g.
food). Classical conditioning also involves unpleasant or aversive stimuli such as
painful electric shocks.
6
the rat alone. After the establishment of the learned fear, the researchers showed
that his fear was generalized to other furry objects (e.g. Santa Claus mask).
(Zimbardo, Weber, & Johnson, 2003, pp. 213)
7
cats’ behavior was shaped. The researcher believed that his observations were
equally applied to human learning.
Trial and Error: The learner gradually discovers the correct response by
attempting many behaviours and noting which ones produce the desired
consequences
Figure 2 shows Thorndike’s Puzzle Box. Initially, a fish was put outside the
box. The cat pushed the bar, unlocking the gate, then pulled the string managed
to unlock the gate. Finally, the cat pushes the gate down
https://psychpics.com/2015/12/11/behaviorism-i-historical-progress/
8
‘operant conditioning’ because it focuses on operant responses (operant is an
observable, voluntary behavior that an organism emits to ‘operate’ or have an
effect on the environment).
(Zimbardo, Weber, & Johnson, 2003, pp. 218)
3.2.1 Reinforcement
Reinforcer is a condition (either by presenting or removing a stimulus), that occurs
after a response and affects the appearance of the behavior.
9
Reinforcer
s
10
3.2.4 Scheduling Reinforcement
Reinforcement has two main forms. The ratio schedule rewards a subject after a
certain number of responses, whereas the interval schedule provides a reward after
a certain time interval.
Ratio Schedules: Rewards are distributed based on the number of response (e.g.
employees get paid based on the amount of work they perform).
This is distinguished into two types: fixed ratio and variable ratio schedules
Fixed ratio Schedules: Programs by which reinforcement is provided
after a certain number of responses (e.g. an employer who pays the
workers a dollar for every ten cartons of products they manufacture).
Variable Ration Schedules: Reinforcement programs by which the
number of responses required for reinforcement varies from trial to trial
(e.g. the slot machine players).
11
Summary
Schedule of Reinforcement Advantages and Disadvantages
12
Negative Punishment: The removal of an attractive stimulus after a
response (e.g. taking the toy away from a misbehaving child).
Negative Reinforcement
Positive Reinforcement
Example: Your headache disappears after
Example: An employee gets a bonus for
taking an aspirin. You take an aspirin again
good work and is motivated to keep
the next time you have a headache
working hard
Negative Punishment
Positive Punishment
Example: A child stayed out late and
Example: A speeder gets a traffic ticket,
misses dinner. Next time the child comes
then drives away more slowly
home on time
13
2.5.6 Consequences of punishment
Societies rely on punishment and the threat of punishment to keep people in order
(e.g. putting people in jail, giving people a fine). Punishment often has an
immediate result and a change in behavior.
Then why do punished children continue to misbehave? Why do employees
continue to arrive late at work? Punishment does not always work the way it is
intended to.
14
Effective Punishment:
Zimbardo,
3.1 The Cognitive 2003, pp. 225)
Revolution
Cognitive psychologists disagreed with the idea of learning being a simple
connection between a stimulus and a response as supported through Classical and
Operant conditioning. A new perspective of learning emerged through a more
cognitive point of view. On a neural level, learning involves a physical change that
strengthens the synapses in groups of nerve cells (a process called long-term
potentiation).
(Zimbardo, Weber, & Johnson, 2003, pp. 236)
15
3.1.2 Gestalt Psychology (Kohler):
The Gestalt theory supports that perception (the way a stimulus is being received
and interpreted), affects the responses. In line with this basic idea, the Gestalt
theorists argue that people use their insight in order to solve a problem or
determine their response to stimuli (in contrast with behaviorists). Wolfgang
Kohler observed chimpanzees and identified that some apes learned more quickly
than other, how to reach their food. Despite the fact that the chimpanzees used trial
and error strategies, these attempts were not merely enhanced by the reinforcement
of their prior attempt. The researcher believed that the animals used their prior
attempts to determine their next attempt. This means that the apes used insight to
solve the problem (Blosser, 1973 in http://hci.liacs.nl/files/gestalt-excerpt.pdf).
16
In brief, in Bandura’s classic experiments, children watched videos with adults
punching, hitting and kicking an inflated plastic clown (BoBo Doll Experiment).
Children showed similar aggressive behavior towards the doll. Significantly, these
children were more aggressive than those in a control condition who had not
observed the aggressive models. According to Bandura’s study, imitative learning
happens in situations where individuals cannot rely on their personal experiences
(Zimbardo, Weber, & Johnson, 2003, pp. 237).
17
References
Bandura, A. (1971). Social Learning Theory. General Learning Corporation.
Additional Reading
Gormezano, I., Prokasy, W. F., & Thompson, R. F. (1987). Classical conditioning.
Psychology Press. Available at https://books.google.com
18
Munoz, L. Behaviorist Learning Theory. Available at:
https://lynnmunoz.wordpress.com/learning-theories/behaviorist-learning-theory/
19