PSY101
WEEK 3 LEARNING CONT.
DR. RUSSELL RODRIGO
Outline
Operant Conditioning: Reinforcement of Learning
Biological and cognitive components of learning
Observational learning
How often should we
reinforce?
§ Do we need to give a reward every single time? Or
is that even best?
§ B.F. Skinner experimented with the effects of giving
reinforcements in different patterns or “schedules”
to determine what worked best to establish and
maintain a target behavior.
§ In continuous reinforcement (giving a reward after
the target behavior every single time), the subject acquires
the desired behavior quickly.
§ In partial/intermittent reinforcement (giving
rewards part of the time), the target behavior takes
longer to be acquired/established but persists
longer without reward.
Different Schedules of Partial/Intermittent Reinforcement
We may schedule our reinforcements based on an interval of time that has gone by.
§ Fixed interval schedule: reward the first target behavior after a set amount of time has passed (e.g., every hour)
§ Variable interval schedule: reward after a changing/random amount of time passes
We may plan for a certain ratio of rewards per number of instances of the desired behavior.
§ Fixed ratio schedule: reward after a set number of target behaviors (e.g., every five)
§ Variable ratio schedule: reward after an unpredictable number of target behaviors
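The four schedules above can be read as simple rules for deciding whether a given response earns a reward. The Python sketch below is only an illustration under stated assumptions, not a model from the lecture: the function names, the `run` helper, and the choice to draw the variable requirements from a uniform range around the average are all hypothetical.

```python
import random


def fixed_ratio(n):
    """Reinforce every n-th response."""
    count = 0

    def respond(t):
        nonlocal count
        count += 1
        if count >= n:
            count = 0
            return True
        return False

    return respond


def variable_ratio(mean_n, rng):
    """Reinforce after an unpredictable number of responses.

    Assumption: the required count is drawn uniformly from
    1 .. 2*mean_n - 1, so it averages mean_n.
    """
    count = 0
    needed = rng.randint(1, 2 * mean_n - 1)

    def respond(t):
        nonlocal count, needed
        count += 1
        if count >= needed:
            count = 0
            needed = rng.randint(1, 2 * mean_n - 1)
            return True
        return False

    return respond


def fixed_interval(interval):
    """Reinforce the first response made after `interval` time units."""
    last_reward = 0.0

    def respond(t):
        nonlocal last_reward
        if t - last_reward >= interval:
            last_reward = t
            return True
        return False

    return respond


def variable_interval(mean_interval, rng):
    """Reinforce the first response after an unpredictable wait.

    Assumption: each wait is drawn uniformly from 0 .. 2*mean_interval.
    """
    last_reward = 0.0
    wait = rng.uniform(0, 2 * mean_interval)

    def respond(t):
        nonlocal last_reward, wait
        if t - last_reward >= wait:
            last_reward = t
            wait = rng.uniform(0, 2 * mean_interval)
            return True
        return False

    return respond


def run(schedule, response_times):
    """Return, for each response time, whether that response was rewarded."""
    return [schedule(t) for t in response_times]


if __name__ == "__main__":
    # Ten lever presses under "reward every fifth press".
    print(run(fixed_ratio(5), range(1, 11)))
    # Responses at various minutes under "reward once an hour has passed".
    print(run(fixed_interval(60), [10, 30, 59, 60, 61, 119, 120]))
```

Note the design difference the sketch makes visible: the ratio schedules count responses and ignore the clock, while the interval schedules watch the clock and ignore how many responses were made in between.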
Which Schedule of Reinforcement is This?
Ratio or Interval? Fixed or Variable?
1. Rat gets food every third time it presses the lever
2. Getting paid weekly no matter how much work is done
3. Getting paid for every ten boxes you make
4. Hitting a jackpot sometimes on the slot machine
5. Winning sometimes on the lottery you play once a day
6. Checking cell phone all day; sometimes getting a text
7. Buy eight pizzas, get the next one free
8. Fundraiser averages one donation for every eight houses visited
9. Kid has tantrum, parents sometimes give in
10. Repeatedly checking mail until paycheck arrives
Results of the different schedules of reinforcement
Which reinforcements produce more “responding” (more target behavior)?
[Figure: fixed interval schedule, showing rapid responding near the time for reinforcement]
§ Fixed interval: slow,
unsustained responding
If I’m only paid for my
Saturday work, I’m not going
to work as hard on the other
days.
§ Variable interval: slow,
consistent responding
If I never know which day my
lucky lottery number will pay
off, I better play it every day.
[Figure: variable interval schedule, showing steady responding]
Effectiveness of the ratio schedules of Reinforcement
§ Fixed ratio: high rate of
responding
Buy two drinks, get one free? I’ll buy a lot
of them!
§ Variable ratio: high, consistent
responding, even if
reinforcement stops (resists
extinction)
If the slot machine sometimes pays, I’ll
pull the lever as many times as possible
because it may pay this time!
[Figure: responding under variable ratio and fixed ratio schedules]
Operant Effect: Punishment
Punishment has the opposite effect of reinforcement: its consequences make the target behavior less likely to occur in the future.
+ Positive Punishment: You ADD something unpleasant/aversive (ex: spank the child).
- Negative Punishment: You TAKE AWAY something pleasant/desired (ex: no TV time, no attention). MINUS is the “negative” here.
→ Positive does not mean “good” or “desirable” and negative does not mean “bad” or “undesirable.”
When is punishment
effective?
§ Punishment works best in natural settings when we encounter punishing consequences from actions such as reaching into a fire; in that case, operant conditioning helps us to avoid dangers.
§ Punishment can be effective when we artificially create punishing consequences for others’ choices; these work best when the consequences happen as they do in nature.
→ Severity of punishments is not as helpful as making the punishments immediate and certain.
Applying operant conditioning to
parenting
Problems with Physical Punishment
§ Punished behaviors may restart when the
punishment is over; learning is not lasting.
§ Instead of learning behaviors, the child may learn to
discriminate among situations, and avoid those in
which punishment might occur.
§ Instead of behaviors, the child might learn an
attitude of fear or hatred, which can interfere with
learning. This can generalize to a fear/hatred of all
adults or many settings.
§ Physical punishment models aggression and control
as a method of dealing with problems.
Don’t think about the beach
Don’t think about the waves, the sand,
the towels and sunscreen, the
sailboats and surfboards. Don’t think
about the beach.
Are you obeying the instruction?
Would you obey this instruction more
if you were punished for thinking
about the beach?
Problem:
Punishing focuses on what NOT to do, which
does not guide people to a desired behavior.
§ Even if undesirable behaviors do stop,
another problem behavior may emerge
that serves the same purpose, especially if
no replacement behaviors are taught and
reinforced.
Lesson:
In order to teach desired
behavior, reinforce what’s
right more often than
punishing what’s wrong.
More effective forms of operant conditioning
The Power of Rephrasing
§ Positive punishment: “You’re playing video games
instead of practicing the piano, so I am justified in
YELLING at you.”
§ Negative punishment: “You’re avoiding practicing, so I’m
turning off your game.”
§ Negative reinforcement: “I will stop staring at you and
bugging you as soon as I see that you are practicing.”
§ Positive reinforcement: “After you practice, we’ll play a
game!”
Summary: Types of Consequences
Adding stimuli | Subtracting stimuli | Outcome
Positive (+) Reinforcement (You get candy) | Negative (–) Reinforcement (I stop yelling) | Strengthens target behavior (You do chores)
Positive (+) Punishment (You get spanked) | Negative (–) Punishment (No cell phone) | Reduces target behavior (cursing)
Uses desirable stimuli: positive reinforcement, negative punishment
Uses unpleasant stimuli: negative reinforcement, positive punishment
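The two-by-two summary above amounts to a lookup on two questions: is a stimulus added or taken away, and does the target behavior get stronger or weaker? The short Python sketch below encodes that lookup. The function names and the string labels ("add", "remove", "strengthens", "reduces") are hypothetical choices made for this illustration.

```python
def classify_consequence(stimulus_change, behavior_effect):
    """Name the type of consequence.

    stimulus_change: "add" or "remove" (is a stimulus added or taken away?)
    behavior_effect: "strengthens" or "reduces" (effect on the target behavior)
    """
    signs = {"add": "positive", "remove": "negative"}
    kinds = {"strengthens": "reinforcement", "reduces": "punishment"}
    return f"{signs[stimulus_change]} {kinds[behavior_effect]}"


def stimulus_is_desirable(stimulus_change, behavior_effect):
    """Return True if the stimulus involved is a desirable one.

    Adding a stimulus strengthens behavior only when the stimulus is
    desirable; removing a stimulus reduces behavior only when the
    stimulus was desirable. The other two cells use unpleasant stimuli.
    """
    return (stimulus_change == "add") == (behavior_effect == "strengthens")


if __name__ == "__main__":
    for change in ("add", "remove"):
        for effect in ("strengthens", "reduces"):
            label = classify_consequence(change, effect)
            valence = "desirable" if stimulus_is_desirable(change, effect) else "unpleasant"
            print(f"{change:6} + {effect:11} -> {label} (uses {valence} stimuli)")
```

The sketch keeps "positive/negative" tied strictly to adding or removing a stimulus, which matches the point that positive does not mean "good" and negative does not mean "bad."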
B.F. Skinner’s Legacy
B.F. Skinner’s View
§ The way to modify behavior is through consequences.
§ Behavior is influenced only by external feedback, not by thoughts and feelings.
§ We should intentionally create consequences to shape the behavior of others.
§ Humanity improves through conscious reinforcement of positive behavior and the punishment of bad behavior.
Critique
§ This leaves out the value of instruction and modeling.
§ Adult humans have the ability to use thinking to make choices and plans.
§ Natural consequences are more justifiable than manipulation of others.
§ Humanity improves through free choice guided by wisdom, conscience, and responsibility.
Applications of Operant Conditioning
School: long before tablet computers, B.F. Skinner proposed machines that would reinforce students for correct responses, allowing students to improve at different rates and work on different learning goals.
Sports: athletes improve most in the shaping approach in which they are reinforced for performance that comes closer and closer to the target skill (e.g., hitting pitches that are progressively faster).
Work: some companies make pay a function of performance or company profit rather than seniority; they target more specific behaviors to reinforce.
More Operant Conditioning Applications
Parenting
1. Rewarding small improvements toward desired behaviors works
better than expecting complete success, and also works better than
punishing problem behaviors.
2. Giving in to temper tantrums stops them in the short run but
increases them in the long run.
Self-Improvement
Reward yourself for steps you
take toward your goals. As you
establish good habits, then make
your rewards more infrequent
(intermittent).
Contrasting Types of Conditioning
Organism associates events.
Aspect | Classical Conditioning | Operant Conditioning
Basic Idea | Associating events/stimuli with each other | Associating chosen behaviors with resulting events
Response | Involuntary, automatic reactions such as salivating | Voluntary actions “operating” on our environment
Acquisition | NS linked to US by repeatedly presenting NS before US | Behavior is associated with punishment or reinforcement
Extinction | CR decreases when CS is repeatedly presented alone | Target behavior decreases when reinforcement stops
Spontaneous Recovery | Extinguished CR starts again after a rest period (no CS) | Extinguished response starts again after a rest (no reward)
Generalization | CR is triggered by stimuli similar to the CS | Responding with behaviors similar to the reinforced behavior
Discrimination | Distinguishing between a CS and an NS not linked to the US | Distinguishing what will get reinforced and what will not
Operant vs. Classical Conditioning
If the organism is learning associations between events that it does not control, it is... classical conditioning
If the organism is learning associations between its behavior and the resulting events, it is... operant conditioning
Role of Biology in Conditioning
Classical Conditioning
§ John Garcia and others found it was easier to learn
associations that make sense for survival.
§ Food aversions can be acquired even if the UR
(nausea) does NOT immediately follow the NS.
When acquiring food aversions during pregnancy
or illness, the body associates nausea with
whatever food was eaten.
§ Males in one study were more likely to see a
pictured woman as attractive if the picture had a
red border.
§ Quail can have a sexual response linked to a fake
quail more readily and strongly than to a red light.
Role of Biology in Conditioning
Operant Conditioning
§ Can a monkey be trained to peck with its nose? No, but a pigeon can.
§ Can a pigeon be trained to dive underwater? No, but a dolphin can.
§ Operant conditioning encounters biological tendencies and limits that are difficult to override.
§ What can we most easily train a dog to do based on natural tendencies?
  § detecting scents?
  § climbing and balancing?
  § putting on clothes?
Cognitive Processes
In classical conditioning
§ When the dog salivates at the bell, it may be due to cognition (learning to predict, even expect, the food).
§ Conditioned responses can alter attitudes, even when we know the change is caused by conditioning.
§ However, knowing that our reactions are caused by conditioning gives us the option of mentally breaking the association, e.g., deciding that nausea associated with a food aversion was actually caused by an illness.
§ Higher-order conditioning involves some cognition; the name of a food may trigger salivation.
In operant conditioning
§ In fixed-interval reinforcement, animals do more target behaviors/responses around the time that the reward is more likely, as if expecting the reward.
§ Expectation as a cognitive skill is even more evident in the ability of humans to respond to delayed reinforcers such as a paycheck.
§ Higher-order conditioning can be enabled with cognition; e.g., seeing something such as money as a reward because of its indirect value.
§ Humans can set behavioral goals for self and others, and plan their own reinforcers.
Latent Learning
§ Rats appear to form cognitive maps. They can learn a
maze just by wandering, with no cheese to reinforce
their learning.
§ Evidence of these maps is revealed once the cheese is
placed somewhere in the maze. After only a few trials,
these rats quickly catch up in maze-solving to rats who
were rewarded with cheese all along.
§ Latent learning refers to skills or knowledge gained from
experience, but not apparent in behavior until rewards are
given.
Learning, Rewards, and Motivation
§ Intrinsic motivation refers to
the desire to perform a
behavior well for its own sake.
The reward is internalized as
a feeling of satisfaction.
§ Extrinsic motivation refers
to doing a behavior to receive
rewards from others.
§ Intrinsic motivation can
sometimes be reduced by
external rewards, and can be
prevented by using
continuous reinforcement.
§ One principle for maintaining
behavior is to use as few
rewards as possible, and
fade the rewards over time.
What might happen
if we begin to
reward a behavior
someone was
already doing and
enjoying?
Summary of factors affecting learning
Learning by Observation
§ Can we, like the rats exploring the maze with no reward, learn new
behaviors and skills without a direct experience of conditioning?
§ Yes, and one of the ways we do so is by observational learning:
watching what happens when other people do a behavior and learning
from their experience.
§ Skills required: mirroring, being able to picture ourselves doing the
same action, and cognition, noticing consequences and associations.
Modeling
The behavior of others serves as a model, an example of how to
respond to a situation; we may try this model regardless of
reinforcement.
Vicarious
Conditioning
§ Vicarious: experienced indirectly, through others
§ Vicarious reinforcement and punishment means our choices
are affected as we see others get consequences for their
behaviors.
Observational Learning Processes
Albert Bandura’s Bobo Doll Experiment (1961)
§ Kids saw adults punching an inflated doll while narrating their
aggressive behaviors such as “kick him.”
§ These kids were then put in a toy-deprived situation… and acted out
the same behaviors they had seen.
Mirroring in the Brain
§ When we watch others doing or feeling something,
neurons fire in patterns that would fire if we were doing
the action or having the feeling ourselves.
§ These neurons are referred to as mirror neurons; they fire both when we
perform an action and when we watch others perform it.
From Mirroring to Imitation
§ Humans are prone to spontaneous imitation of both behaviors and emotions
(“emotional contagion”).
§ This includes even overimitating, that is, copying adult behaviors that have no
function and no reward.
§ Children with autism are less likely to cognitively “mirror,” and less likely to
follow someone else’s gaze than a neurotypical toddler.
Mirroring Plus Vicarious Reinforcement
§ Mirroring enables observational learning; we cognitively practice a
behavior just by watching it.
§ Combined with vicarious reinforcement, imitation becomes even more
likely.
§ Monkey A saw Monkey B getting a banana after pressing four
symbols. Monkey A then pressed the same four symbols (even
though the symbols were in different locations).
Prosocial Effects of Observational Learning
§ Prosocial behavior refers to
actions which benefit others,
contribute value to groups, and
follow moral codes and social
norms.
§ Parents try to teach this behavior
through lectures, but it may be
taught best through modeling…
especially if kids can see the
benefits of the behavior to oneself
or others.
Antisocial Effects of Observational Learning
§ What happens when we learn from
models who demonstrate
antisocial behavior, actions that
are harmful to individuals and
society?
§ Children who witness violence in
their homes, but are not physically
harmed themselves, may hate
violence but still may become
violent more often than the
average child.
§ Perhaps this is a result of “the Bobo
doll effect”? Under stress, we do
what has been modeled for us.
Media Models of Violence
Do we learn antisocial
behavior such as violence
from indirect observations of
others in the media?
Research shows that viewing media violence leads to
increased aggression (fights) and reduced prosocial behavior
(such as helping an injured person).
This violence-viewing effect might be explained by imitation,
and also by desensitization toward pain in others.
Summary
§ Classical conditioning: Ivan Pavlov’s salivating dogs
§ New triggers for automatic responses
§ Operant conditioning: B.F. Skinner’s boxes and his pecking pigeons
§ Consequences influencing chosen behaviors
§ Biological components: constraints, neurons
§ Observational learning: Albert Bandura’s Bobo dolls, mirroring, prosocial
and antisocial modeling
Sources
§ Myers, D. G. (2013). Psychology, 10th Edition. New York: Worth Publishers.
§ https://open.lib.umn.edu/intropsyc/part/chapter-7-learning/

More Related Content

PDF
PSY101 Week 2 Learning
PDF
PSY101 Week 7 motivation
PDF
PSY101 Week 1 Critical Thinking
PDF
PSY101 Week 11 Social Psychology
PDF
PSY101 Week 9 learned helplessness
PPTX
Unit 2
PPTX
Behaviourist model of abnormality AS
PPT
Learned helplessness &_control
PSY101 Week 2 Learning
PSY101 Week 7 motivation
PSY101 Week 1 Critical Thinking
PSY101 Week 11 Social Psychology
PSY101 Week 9 learned helplessness
Unit 2
Behaviourist model of abnormality AS
Learned helplessness &_control

What's hot (20)

PDF
Operant Conditioning Part 1
PPTX
Attitude Week 1
PPT
Attitude revision
PPT
PEShare.co.uk Shared Resource
PPTX
Attitude Week 2
PPTX
Psychology Learning
PPT
Behavior Support Planning
PPT
Self directed behaviour change
PPTX
PPT
Psychology unit 2 (learning approach
PPTX
Q3 L02 Attitude Formation and Measurement
PPTX
Attitude: Component and Why Study Attiudes
PPTX
B.f. skinner
PPTX
Learning Theories of Operant Conditioning
PDF
L&B wk 4
PPT
Chapter 1 powerpoint
PPTX
B.f.skinner
PPTX
Q3L01 - Attitude: definition and components
PPTX
Introduction to learning
PPTX
Theories of Attitude Change
Operant Conditioning Part 1
Attitude Week 1
Attitude revision
PEShare.co.uk Shared Resource
Attitude Week 2
Psychology Learning
Behavior Support Planning
Self directed behaviour change
Psychology unit 2 (learning approach
Q3 L02 Attitude Formation and Measurement
Attitude: Component and Why Study Attiudes
B.f. skinner
Learning Theories of Operant Conditioning
L&B wk 4
Chapter 1 powerpoint
B.f.skinner
Q3L01 - Attitude: definition and components
Introduction to learning
Theories of Attitude Change
Ad

Similar to Week 3 learning cont. (20)

PPT
Behavior Einsteins, Positive Behavior Support
PPT
Pbs 101
PPTX
Learning
PPT
AP Chapter 6 Learning.ppt:Classical Conditioning, Operant Condition, Social C...
PPTX
Behaviorism - behavioral theory of learning Suhair ababneh & Amani Alomari
PPT
Learning.ppt
PPT
Learning_yourself_Module2_week2_Perdev_12
PDF
OB Chap 2.pdf
PDF
Industrial psychology Unit 2
PPT
Learning
PPTX
Supporting self regulation skills [recovered]
PPTX
Behaviour modification techniques
PPT
Bbl2 handouts 2 - copy
PPTX
Cognitive and Affective Processes
PPTX
Learning and organizational reward system
PPTX
Motivation and Behaviorism
PDF
Decoding Behavior_ MaxLearn’s Guide to Skinner’s Operant Conditioning _ MaxLe...
PPT
Professional Development Final Presentation2
PPTX
Behavioral Perspectives
PPTX
ch 4 learning (final term).pptxPsychology
Behavior Einsteins, Positive Behavior Support
Pbs 101
Learning
AP Chapter 6 Learning.ppt:Classical Conditioning, Operant Condition, Social C...
Behaviorism - behavioral theory of learning Suhair ababneh & Amani Alomari
Learning.ppt
Learning_yourself_Module2_week2_Perdev_12
OB Chap 2.pdf
Industrial psychology Unit 2
Learning
Supporting self regulation skills [recovered]
Behaviour modification techniques
Bbl2 handouts 2 - copy
Cognitive and Affective Processes
Learning and organizational reward system
Motivation and Behaviorism
Decoding Behavior_ MaxLearn’s Guide to Skinner’s Operant Conditioning _ MaxLe...
Professional Development Final Presentation2
Behavioral Perspectives
ch 4 learning (final term).pptxPsychology
Ad

More from Dr. Russell Rodrigo (20)

PDF
Week 4 Presenting Grammar.pdf
PDF
Week 3 Presenting Vocabulary and Giving Instructions
PDF
Intercultural Communication_DrRodrigo.pdf
PDF
Week 3.2 Ethical Decision Making Process & Ethical Dilemma.pdf
PDF
Week 9 Marketing.pdf
PDF
Week 8 Workplace Discrimination.pdf
PDF
Week 8 Work Ethics.pdf
PDF
Week 8 Recruiting, Motivating & Keeping Quality Employees.pdf
PDF
Week 6 Managing Businesses for Success.pdf
PDF
Week 5 Forms of Business and Writing a Business Plan.pdf
PDF
Week 4 Business in a Global Environment.pdf
PDF
Week 3 Business Ethics and Social Responsibility.pdf
PDF
Week 2 Kantian & Virtue Ethics (1).pdf
PDF
Week 1 The Business Foundation.pdf
PDF
Week 1 Ethical Traditions.pdf
PDF
Week 7 The Adverb Clauses.pdf
PDF
Week 5 Object Relative Clauses.pdf
PDF
Week 4 Subject Relative Clauses.pdf
PPTX
ENG366 Week 2 Narratives_Updated.pptx
PDF
Week 1.2 ENG366 Introduction to the course.pdf
Week 4 Presenting Grammar.pdf
Week 3 Presenting Vocabulary and Giving Instructions
Intercultural Communication_DrRodrigo.pdf
Week 3.2 Ethical Decision Making Process & Ethical Dilemma.pdf
Week 9 Marketing.pdf
Week 8 Workplace Discrimination.pdf
Week 8 Work Ethics.pdf
Week 8 Recruiting, Motivating & Keeping Quality Employees.pdf
Week 6 Managing Businesses for Success.pdf
Week 5 Forms of Business and Writing a Business Plan.pdf
Week 4 Business in a Global Environment.pdf
Week 3 Business Ethics and Social Responsibility.pdf
Week 2 Kantian & Virtue Ethics (1).pdf
Week 1 The Business Foundation.pdf
Week 1 Ethical Traditions.pdf
Week 7 The Adverb Clauses.pdf
Week 5 Object Relative Clauses.pdf
Week 4 Subject Relative Clauses.pdf
ENG366 Week 2 Narratives_Updated.pptx
Week 1.2 ENG366 Introduction to the course.pdf

Recently uploaded (20)

PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
O7-L3 Supply Chain Operations - ICLT Program
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
Pharma ospi slides which help in ospi learning
PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PPTX
GDM (1) (1).pptx small presentation for students
PPTX
Presentation on HIE in infants and its manifestations
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PDF
RMMM.pdf make it easy to upload and study
PDF
Computing-Curriculum for Schools in Ghana
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
Anesthesia in Laparoscopic Surgery in India
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Chinmaya Tiranga quiz Grand Finale.pdf
STATICS OF THE RIGID BODIES Hibbelers.pdf
O7-L3 Supply Chain Operations - ICLT Program
Final Presentation General Medicine 03-08-2024.pptx
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
Supply Chain Operations Speaking Notes -ICLT Program
Pharmacology of Heart Failure /Pharmacotherapy of CHF
O5-L3 Freight Transport Ops (International) V1.pdf
Pharma ospi slides which help in ospi learning
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
Module 4: Burden of Disease Tutorial Slides S2 2025
GDM (1) (1).pptx small presentation for students
Presentation on HIE in infants and its manifestations
Final Presentation General Medicine 03-08-2024.pptx
2.FourierTransform-ShortQuestionswithAnswers.pdf
RMMM.pdf make it easy to upload and study
Computing-Curriculum for Schools in Ghana
VCE English Exam - Section C Student Revision Booklet
Anesthesia in Laparoscopic Surgery in India

Week 3 learning cont.

  • 1. PSY101 WEEK 3 LEARNING CONT. DR. RUSSELL RODRIGO
  • 2. Outline Operant Conditioning: Reinforcement of Learning Biological and cognitive components of learning Observational learning
  • 3. How often should we reinforce? § Do we need to give a reward every single time? Or is that even best? § B.F. Skinner experimented with the effects of giving reinforcements in different patterns or “schedules” to determine what worked best to establish and maintain a target behavior. § In continuous reinforcement (giving a reward after the target every single time), the subject acquires the desired behavior quickly. § In partial/intermittent reinforcement (giving rewards part of the time), the target behavior takes longer to be acquired/established but persists longer without reward.
  • 4. § Fixed interval schedule: reward every hour § Variable interval schedule: reward after a changing/random amount of time passes We may schedule our reinforcements based on an interval of time that has gone by. § Fixed ratio schedule: reward every five targeted behaviors § Variable ratio schedule: reward after a randomly chosen instance of the target behavior We may plan for a certain ratio of rewards per number of instances of the desired behavior. Different Schedules of Partial/Intermittent Reinforcement
  • 5. Which Schedule of Reinforcement is This? Ratio or Interval? Fixed or Variable? 1. Rat gets food every third time it presses the lever 2. Getting paid weekly no matter how much work is done 3. Getting paid for every ten boxes you make 4. Hitting a jackpot sometimes on the slot machine 5. Winning sometimes on the lottery you play once a day 6. Checking cell phone all day; sometimes getting a text 7. Buy eight pizzas, get the next one free 8. Fundraiser averages one donation for every eight houses visited 9. Kid has tantrum, parents sometimes give in 10. Repeatedly checking mail until paycheck arrives
  • 6. Rapid responding near time for reinforcement Fixed interval Rapid responding near time for reinforcement Fixed interval Results of the different schedules of reinforcement Which reinforcements produce more “responding” (more target behavior)? § Fixed interval: slow, unsustained responding If I’m only paid for my Saturday work, I’m not going to work as hard on the other days. § Variable interval: slow, consistent responding If I never know which day my lucky lottery number will pay off, I better play it every day. Steady responding Variable interval
  • 7. Reinforcers Effectiveness of the ratio schedules of Reinforcement § Fixed ratio: high rate of responding Buy two drinks, get one free? I’ll buy a lot of them! § Variable ratio: high, consistent responding, even if reinforcement stops (resists extinction) If the slot machine sometimes pays, I’ll pull the lever as many times as possible because it may pay this time! Variable ratio Fixed ratio
  • 8. Operant Effect: Punishment Punishments have the opposite effects of reinforcement. These consequences make the target behavior less likely to occur in the future. + Positive Punishment You ADD something unpleasant/aversiv e (ex: spank the child) - Negative Punishment You TAKE AWAY something pleasant/ desired (ex: no TV time, no attention)--MINUS is the “negative” here àPositive does not mean “good” or “desirable” and negative does not mean “bad” or “undesirable.”
  • 9. When is punishment effective? u Punishment works best in natural settings when we encounter punishing consequences from actions such as reaching into a fire; in that case, operant conditioning helps us to avoid dangers. u Punishment is effective when we try to artificially create punishing consequences for other’s choices; these work best when consequences happen as they do in nature. uàSeverity of punishments is not as helpful as making the punishments immediate and certain.
  • 10. Applying operant conditioning to parenting Problems with Physical Punishment § Punished behaviors may restart when the punishment is over; learning is not lasting. § Instead of learning behaviors, the child may learn to discriminate among situations, and avoid those in which punishment might occur. § Instead of behaviors, the child might learn an attitude of fear or hatred, which can interfere with learning. This can generalize to a fear/hatred of all adults or many settings. § Physical punishment models aggression and control as a method of dealing with problems.
  • 11. Don’t think about the beach Don’t think about the waves, the sand, the towels and sunscreen, the sailboats and surfboards. Don’t think about the beach. Are you obeying the instruction? Would you obey this instruction more if you were punished for thinking about the beach?
  • 12. Problem: Punishing focuses on what NOT to do, which does not guide people to a desired behavior. § Even if undesirable behaviors do stop, another problem behavior may emerge that serves the same purpose, especially if no replacement behaviors are taught and reinforced. Lesson: In order to teach desired behavior, reinforce what’s right more often than punishing what’s wrong.
  • 13. More effective forms of operant conditioning The Power of Rephrasing § Positive punishment: “You’re playing video games instead of practicing the piano, so I am justified in YELLING at you.” § Negative punishment: “You’re avoiding practicing, so I’m turning off your game.” § Negative reinforcement: “I will stop staring at you and bugging you as soon as I see that you are practicing.” § Positive reinforcement: “After you practice, we’ll play a game!”
  • 14. Summary: Types of Consequences Adding stimuli Subtract stimuli Outcome Positive + Reinforcement (You get candy) Negative – Reinforcement (I stop yelling) Strengthens target behavior (You do chores) Positive + Punishment (You get spanked) Negative – Punishment (No cell phone) Reduces target behavior (cursing) = uses desirable stimuli = uses unpleasant stimuli
  • 15. B.F. Skinner’s Legacy § The way to modify behavior is through consequences. § Behavior is influenced only by external feedback, not by thoughts and feelings. § We should intentionally create consequences to shape the behavior of others. § Humanity improves through conscious reinforcement of positive behavior and the punishment of bad behavior. § This leaves out the value of instruction and modeling. § Adult humans have the ability to use thinking to make choices and plans § Natural consequences are more justifiable than manipulation of others. § Humanity improves through free choice guided by wisdom, conscience, and responsibility. B.F. Skinner’s View Critique
  • 16. School: long before tablet computers, B.F. Skinner proposed machines that would reinforce students for correct responses, allowing students to improve at different rates and work on different learning goals. Sports: athletes improve most in the shaping approach in which they are reinforced for performance that comes closer and closer to the target skill (e.g., hitting pitches that are progressively faster). Work: some companies make pay a function of performance or company profit rather than seniority; they target more specific behaviors to reinforce. Applications of Operant Conditioning
  • 17. More Operant Conditioning Applications Parenting 1. Rewarding small improvements toward desired behaviors works better than expecting complete success, and also works better than punishing problem behaviors. 2. Giving in to temper tantrums stops them in the short run but increases them in the long run. Self-Improvement Reward yourself for steps you take toward your goals. As you establish good habits, then make your rewards more infrequent (intermittent).
  • 18. Contrasting Types of Conditioning Organism associates events. Classical Conditioning Operant Conditioning Basic Idea Associating events/stimuli with each other Associating chosen behaviors with resulting events Response Involuntary, automatic reactions such as salivating Voluntary actions “operating” on our environment Acquisition NS linked to US by repeatedly presenting NS before US Behavior is associated with punishment or reinforcement Extinction CR decreases when CS is repeatedly presented alone Target behavior decreases when reinforcement stops Spontaneous Recovery Extinguished CR starts again after a rest period (no CS) Extinguished response starts again after a rest (no reward) Generalization When CR is triggered by stimuli similar to the CS Response behavior similar to the reinforced behavior. Discrimination Distinguishing between a CS and NS not linked to U.S. Distinguishing what will get reinforced and what will not
  • 19. If the organism is learning associations between events that it does not control, it is... If the organism is learning associations between its behavior and the resulting events, it is... operant conditioning classical conditioning Operant vs. Classical Conditioning
  • 20. Role of Biology in Conditioning Classical Conditioning § John Garcia and others found it was easier to learn associations that make sense for survival. § Food aversions can be acquired even if the UR (nausea) does NOT immediately follow the NS. When acquiring food aversions during pregnancy or illness, the body associates nausea with whatever food was eaten. § Males in one study were more likely to see a pictured woman as attractive if the picture had a red border. § Quail can have a sexual response linked to a fake quail more readily and strongly than to a red light.
  • 21. Role of Biology in Conditioning Operant Conditioning § Can a monkey be trained to peck with its nose? No, but a pigeon can. § Can a pigeon be trained to dive underwater? No, but a dolphin can. § Operant conditioning encounters biological tendencies and limits that are difficult to override. § What can we most easily train a dog to do based on natural tendencies? § detecting scents? § climbing and balancing? § putting on clothes?
  • 22. Cognitive Processes
    In classical conditioning § When the dog salivates at the bell, it may be due to cognition (learning to predict, even expect, the food). § Conditioned responses can alter attitudes, even when we know the change is caused by conditioning. § However, knowing that our reactions are caused by conditioning gives us the option of mentally breaking the association, e.g. deciding that nausea associated with a food aversion was actually caused by an illness. § Higher-order conditioning involves some cognition; the name of a food may trigger salivation.
    In operant conditioning § In fixed-interval reinforcement, animals do more target behaviors/responses around the time that the reward is more likely, as if expecting the reward. § Expectation as a cognitive skill is even more evident in the ability of humans to respond to delayed reinforcers such as a paycheck. § Higher-order conditioning can be enabled with cognition; e.g., seeing something such as money as a reward because of its indirect value. § Humans can set behavioral goals for self and others, and plan their own reinforcers.
  • 23. Latent Learning § Rats appear to form cognitive maps. They can learn a maze just by wandering, with no cheese to reinforce their learning. § Evidence of these maps is revealed once the cheese is placed somewhere in the maze. After only a few trials, these rats quickly catch up in maze-solving to rats who were rewarded with cheese all along. § Latent learning refers to skills or knowledge gained from experience, but not apparent in behavior until rewards are given.
  • 24. Learning, Rewards, and Motivation § Intrinsic motivation refers to the desire to perform a behavior well for its own sake. The reward is internalized as a feeling of satisfaction. § Extrinsic motivation refers to doing a behavior to receive rewards from others. § Intrinsic motivation can sometimes be reduced by external rewards, and continuous reinforcement can keep it from developing. § One principle for maintaining behavior is to use as few rewards as possible, and fade the rewards over time. What might happen if we begin to reward a behavior someone was already doing and enjoying?
  • 26. Learning by Observation § Can we, like the rats exploring the maze with no reward, learn new behaviors and skills without a direct experience of conditioning? § Yes, and one of the ways we do so is by observational learning: watching what happens when other people do a behavior and learning from their experience. § Skills required: mirroring, being able to picture ourselves doing the same action, and cognition, noticing consequences and associations. Observational Learning Processes Modeling § The behavior of others serves as a model, an example of how to respond to a situation; we may try this model regardless of reinforcement. Vicarious Conditioning § Vicarious: experienced indirectly, through others § Vicarious reinforcement and punishment means our choices are affected as we see others get consequences for their behaviors.
  • 27. Albert Bandura’s Bobo Doll Experiment (1961) § Kids saw adults punching an inflated doll while narrating their aggressive behaviors such as “kick him.” § These kids were then put in a toy-deprived situation… and acted out the same behaviors they had seen.
  • 28. Mirroring in the Brain § When we watch others doing or feeling something, neurons fire in patterns that would fire if we were doing the action or having the feeling ourselves. § These neurons are referred to as mirror neurons; they fire both when we perform an action and when we observe someone else performing it.
  • 29. From Mirroring to Imitation § Humans are prone to spontaneous imitation of both behaviors and emotions (“emotional contagion”). § This includes even overimitating, that is, copying adult behaviors that have no function and no reward. § Children with autism are less likely to cognitively “mirror,” and less likely to follow someone else’s gaze the way a neurotypical toddler does.
  • 30. Mirroring Plus Vicarious Reinforcement § Mirroring enables observational learning; we cognitively practice a behavior just by watching it. § When mirroring is combined with vicarious reinforcement, imitation becomes even more likely. § Monkey A saw Monkey B getting a banana after pressing four symbols. Monkey A then pressed the same four symbols (even though the symbols were in different locations).
  • 31. Prosocial Effects of Observational Learning § Prosocial behavior refers to actions which benefit others, contribute value to groups, and follow moral codes and social norms. § Parents try to teach this behavior through lectures, but it may be taught best through modeling… especially if kids can see the benefits of the behavior to oneself or others.
  • 32. Antisocial Effects of Observational Learning § What happens when we learn from models who demonstrate antisocial behavior, actions that are harmful to individuals and society? § Children who witness violence in their homes, but are not physically harmed themselves, may hate violence but still may become violent more often than the average child. § Perhaps this is a result of “the Bobo doll effect”? Under stress, we do what has been modeled for us.
  • 33. Media Models of Violence Do we learn antisocial behavior such as violence from indirect observations of others in the media? Research shows that viewing media violence leads to increased aggression (fights) and reduced prosocial behavior (such as helping an injured person). This violence-viewing effect might be explained by imitation, and also by desensitization toward pain in others.
  • 34. Summary § Classical conditioning: Ivan Pavlov’s salivating dogs § New triggers for automatic responses § Operant conditioning: B.F. Skinner’s boxes and his pecking pigeons § Consequences influencing chosen behaviors § Biological components: constraints, neurons § Observational learning: Albert Bandura’s Bobo dolls, mirroring, prosocial and antisocial modeling
  • 35. Sources § Myers, D. G. (2013). Psychology, 10th Edition. New York: Worth Publishers. § https://open.lib.umn.edu/intropsyc/part/chapter-7-learning/