SlideShare a Scribd company logo
Lecture Overview
Solving the prisoner’s dilemma
Instrumental rationality
Morality & norms
Repeated games
Three ways to solve the prisoner’s dilemma
Sequential games
Backward induction
Subgame perfect Nash equilibrium
Common knowledge of rationality
Mixed strategies
Game theory: underlying assumptions
Remember:
Homo economicus: instrumental rationality and preferences
Common knowledge of rationality and consistent alignment of
believes: given the same information individuals arrive at the
same decisions
Individuals know the rules of the game which are exogenously
given and independent of individuals’ choices
We will look at these one by one, analysing alternative
assumptions.
We will use the prisoner’s dilemma as example.
Why?
Coordination game with conflict
Arguably it describes many social situations, e.g. the free rider
problem:
Voting
Trade union affiliation
Wage cuts to increase profit
Domestic work
Prisoner’s dilemma
The homo economicus maximises his/her utility.
In a prisoner’s dilemma the dominant strategy is to confess
(defect).
Fallacy of compositions: what is individually rational is neither
Pareto optimal not socially rational.
But do people really defect?
Kant’s categorical imperative: not the outcome but the act is
crucial (morality)
Altruism: blood donation
Social norms: forest people hunting in the Congo (Turnbull
1963)
Instrumental rationality
Gauthier: it is instrumentally rational to cooperate rather than to
defect
Assume there are two sorts of maximisers in the economy:
straight maximisers (SM) and constrained maximisers (CM);
SMs defect, CMs cooperate with other CMs:
E(return from CM) = p*(-1)+(1-p)*(-3)
E(return from SM) = -3
For any p>0 the CM
strategy is better than
the SM one.
Instrumental rationality
Tit-for-tat
Unsurprisingly (maybe), in a repeated Prisoner’s dilemma the
best strategy is not to defect but to adopt a tit-for-tat strategy.
In the 1980s, Robert Axelrod invited professional game
theorists to enter strategies into a tournament of a repeated
game (200 times).
The winning strategy was tit-for-tat entered by Anatol
Rapaport:
Start off with cooperation
If opponent defects punish him/her by defecting
If opponent comes back to cooperation ‘forgive’ them and go
back to cooperation
Overall, forgiving and cooperative strategies did better.
Repeated games & reputation
A tit-for-tat strategy can only be played in repeated games.
The folk theorem states that in an infinitely repeated game (or
given uncertainty to the end of the game) any strategy with a
feasible payoff can be an equilibrium.
This is important for social interaction: the prisoner’s dilemma
can be overcome without (!) external authority.
Players enforce compliance (cooperate rather than defect)
through punishment.
The loss of future returns deters players from defecting.
The surprising thing about Axelrod’s tournament was that the
tit-for-tat strategy won in a finite (and defined) repeated game.
Solution
s to the prisoner’s dilemma
Authority (state, mafia etc.) imposes cooperation by changing
the payoff matrix (e.g. binding contracting).
Individuals are motivated by morality/norms (but: if some
individuals in society are motivated by morality it becomes
rational – according to Gauthier – to cooperate)
In repeated games with uncertainty about the end of the game
cooperation can be policed by the players through punishment
(e.g. defecting in future round). This insight is backed by the
Folk Theorem.
Remember this game?
This is the matrix form of a simultaneous game.
Are there any equilibria?
Yes, player A has a dominant strategy: ‘Top’.
The equilibrium in dominant strategy is (Top, Right).
This is simultaneously a Nash equilibrium.
Dynamic games & backward induction
LeftRightTop10, 41, 5Bottom9, 90, 3
Now imagine the game is played sequentially: A moves first.
To solve this game, we can use backward induction.
A: What will B do once I’ve played Top/Bottom?
Dynamic games & backward induction
10, 4
0, 3
9, 9
1, 5
Dynamic games & backward induction
Let’s play a game to demonstrate backward induction:
The race to 20
Your aims is to get to 20 first!
You can either pick up one pen or two pens.
Whoever gets the last pen wins!
Who will win?
Whoever gets to 2 first (the first player if she/he is aware of
backward induction).
From 2 you can get to 5, 8, 11, 14, 17, 20.
Dynamic games & backward induction
Backward induction helps us to identify ‘subgame perfect Nash
equilibria’.
A subgame is a segment of a dynamic game, if:
The subgame starts from a node
It branches out to the successor node
It must end at the pay-offs of the last node
The initial node must be a singleton (i.e. it is clear what the
previous move was).
While (Top, Right) was an equilibrium in dominant strategies, it
is not a subgame perfect Nash equilibrium (SPNE).
For backward induction we ask ‘what will player B do it A
plays strategy x, y, z.
But if it is not rational for player A to play bottom, why should
B’s reaction to a non-rational strategy be rational?
This is a game of imperfect information.
Common knowledge of rationality is not assumed. Therefore,
you can build a reputation for being rational/irrational, play
cooperative/defect etc.
Common knowledge of rationality
R
R
R
C
C
C
110, 101
101
102
4
1
3
100
102
99
1
0
0
2
A reputation as ‘irrational’ individual might increase your pay-
off.
But your opponent might know that. He/she might suspect that
you’re just playing irrational.
If a player does not have perfect information about her
opponents strategy choices she will play a game of mixed
strategies (rather than pure strategy).
Assume C plays left with probability c and right with
probability (1-c). R plays top with probability r and
bottom with probability (1-r).
Common knowledge of rationality
For R pay-offs are:
For C:
Row will be happy with his assigned probabilities if a small
change in r does not change his pay-off:
Same is true for Column.
Mixed strategies
Mixed strategies
These are best response curves.
The intersection shows a Nash equilibrium.
There are three ways how the prisoner’s dilemma can be
resolved:
enforcement by authority,
non-instrumental rationality and
the folk theorem.
In dynamic games we use backward induction to find subgame
perfect Nash equilibria.
If we drop common knowledge of rationality pay-off
change/might increase (reputation).
Given incomplete information we can play mixed strategies.
Summary
Varian (2006): chapters 24, 28 and 29.
Hargreaves Heap & Varoufakis (1995): Game theory: A critical
introduction, chapters 3, 5 and 6.
Required readings
Lecture OverviewSolving the prisoner’s dilemmaInstrumental r.docx

More Related Content

PDF
Ssrn a brief inrtoduction to the basic of game theory
PDF
Introduction to Game Theory
PPT
Dr Stahl Game Theory Jan 27
PPTX
Econ
PDF
A brief introduction to the basics of game theory
PDF
A brief introduction to the basics of game theory
PDF
Game theory intro_and_questions_2009[1]
PPT
Introduction to Game Theory for engineering.ppt
Ssrn a brief inrtoduction to the basic of game theory
Introduction to Game Theory
Dr Stahl Game Theory Jan 27
Econ
A brief introduction to the basics of game theory
A brief introduction to the basics of game theory
Game theory intro_and_questions_2009[1]
Introduction to Game Theory for engineering.ppt

Similar to Lecture OverviewSolving the prisoner’s dilemmaInstrumental r.docx (20)

DOCX
Game Theory
PPT
gt_2007
PPT
Game theory
PDF
SN- Lecture 7
PPT
Ch 5 Game theory.ppt
DOCX
A brief introduction to game theory prisoners dilemma and nash equilibrum
PPTX
ch15%20-%20GAME%20THEORY%20MODELS.pptxll
PPT
ch15.ppt
PPT
Ch15
PPTX
Topic 3- Cooperation and Collective Action
PPTX
Game Theory powerpoint for Micro economics
PPT
Materi Kuliah Ekonomi Mikro dengan judul GAME THEORY MODELS
PPT
Oligopoly game theory
PPT
p1701-3-1_John_Willett_rev1xxxxxxxxx.ppt
PPTX
Mid term
PPTX
Study of strategic decision making in game
PPTX
Strategies Without Frontiers
PDF
A Brief Introduction to the Basics of Game Theory
Game Theory
gt_2007
Game theory
SN- Lecture 7
Ch 5 Game theory.ppt
A brief introduction to game theory prisoners dilemma and nash equilibrum
ch15%20-%20GAME%20THEORY%20MODELS.pptxll
ch15.ppt
Ch15
Topic 3- Cooperation and Collective Action
Game Theory powerpoint for Micro economics
Materi Kuliah Ekonomi Mikro dengan judul GAME THEORY MODELS
Oligopoly game theory
p1701-3-1_John_Willett_rev1xxxxxxxxx.ppt
Mid term
Study of strategic decision making in game
Strategies Without Frontiers
A Brief Introduction to the Basics of Game Theory

More from SHIVA101531 (20)

DOCX
Answer the following questions in a minimum of 1-2 paragraphs ea.docx
DOCX
Answer the following questions using scholarly sources as references.docx
DOCX
Answer the following questions about this case studyClient .docx
DOCX
Answer the following questions using art vocabulary and ideas from L.docx
DOCX
Answer the following questions in a total of 3 pages (900 words). My.docx
DOCX
Answer the following questions No single word responses (at lea.docx
DOCX
Answer the following questions based on the ethnography Dancing Skel.docx
DOCX
Answer the following questions to the best of your ability1) De.docx
DOCX
Answer the following questionDo you think it is necessary to .docx
DOCX
Answer the following question. Use facts and examples to support.docx
DOCX
Answer the bottom questions  in apa format and decent answer no shor.docx
DOCX
Answer the following below using the EXCEL attachment. chapter 5.docx
DOCX
Answer the following prompts about A Germanic People Create a Code .docx
DOCX
Answer the following discussion board question below minumun 25.docx
DOCX
Answer the following questions about IT Project Management. What.docx
DOCX
Answer the following in at least 100 words minimum each1.Of.docx
DOCX
Answer the following questions(at least 200 words) and responses 2 p.docx
DOCX
Answer the following questions in a Word document and upload it by M.docx
DOCX
Answer the following questions in complete sentences. Each answer sh.docx
DOCX
ANSWER THE DISCUSSION QUESTION 250 WORDS MINDiscussion Q.docx
Answer the following questions in a minimum of 1-2 paragraphs ea.docx
Answer the following questions using scholarly sources as references.docx
Answer the following questions about this case studyClient .docx
Answer the following questions using art vocabulary and ideas from L.docx
Answer the following questions in a total of 3 pages (900 words). My.docx
Answer the following questions No single word responses (at lea.docx
Answer the following questions based on the ethnography Dancing Skel.docx
Answer the following questions to the best of your ability1) De.docx
Answer the following questionDo you think it is necessary to .docx
Answer the following question. Use facts and examples to support.docx
Answer the bottom questions  in apa format and decent answer no shor.docx
Answer the following below using the EXCEL attachment. chapter 5.docx
Answer the following prompts about A Germanic People Create a Code .docx
Answer the following discussion board question below minumun 25.docx
Answer the following questions about IT Project Management. What.docx
Answer the following in at least 100 words minimum each1.Of.docx
Answer the following questions(at least 200 words) and responses 2 p.docx
Answer the following questions in a Word document and upload it by M.docx
Answer the following questions in complete sentences. Each answer sh.docx
ANSWER THE DISCUSSION QUESTION 250 WORDS MINDiscussion Q.docx

Recently uploaded (20)

PDF
advance database management system book.pdf
PDF
What if we spent less time fighting change, and more time building what’s rig...
PDF
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 1)
PDF
HVAC Specification 2024 according to central public works department
PDF
Empowerment Technology for Senior High School Guide
PDF
My India Quiz Book_20210205121199924.pdf
PPTX
A powerpoint presentation on the Revised K-10 Science Shaping Paper
PDF
Complications of Minimal Access-Surgery.pdf
PPTX
B.Sc. DS Unit 2 Software Engineering.pptx
PDF
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
PDF
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
PDF
Hazard Identification & Risk Assessment .pdf
PDF
CISA (Certified Information Systems Auditor) Domain-Wise Summary.pdf
PDF
AI-driven educational solutions for real-life interventions in the Philippine...
PDF
LDMMIA Reiki Yoga Finals Review Spring Summer
PDF
IGGE1 Understanding the Self1234567891011
PPTX
20th Century Theater, Methods, History.pptx
PDF
Paper A Mock Exam 9_ Attempt review.pdf.
advance database management system book.pdf
What if we spent less time fighting change, and more time building what’s rig...
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 1)
HVAC Specification 2024 according to central public works department
Empowerment Technology for Senior High School Guide
My India Quiz Book_20210205121199924.pdf
A powerpoint presentation on the Revised K-10 Science Shaping Paper
Complications of Minimal Access-Surgery.pdf
B.Sc. DS Unit 2 Software Engineering.pptx
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
Chinmaya Tiranga quiz Grand Finale.pdf
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
MBA _Common_ 2nd year Syllabus _2021-22_.pdf
Hazard Identification & Risk Assessment .pdf
CISA (Certified Information Systems Auditor) Domain-Wise Summary.pdf
AI-driven educational solutions for real-life interventions in the Philippine...
LDMMIA Reiki Yoga Finals Review Spring Summer
IGGE1 Understanding the Self1234567891011
20th Century Theater, Methods, History.pptx
Paper A Mock Exam 9_ Attempt review.pdf.

Lecture OverviewSolving the prisoner’s dilemmaInstrumental r.docx

  • 1. Lecture Overview Solving the prisoner’s dilemma Instrumental rationality Morality & norms Repeated games Three ways to solve the prisoner’s dilemma Sequential games Backward induction Subgame perfect Nash equilibrium Common knowledge of rationality Mixed strategies Game theory: underlying assumptions Remember: Homo economicus: instrumental rationality and preferences Common knowledge of rationality and consistent alignment of believes: given the same information individuals arrive at the same decisions Individuals know the rules of the game which are exogenously given and independent of individuals’ choices We will look at these one by one, analysing alternative assumptions.
  • 2. We will use the prisoner’s dilemma as example. Why? Coordination game with conflict Arguably it describes many social situations, e.g. the free rider problem: Voting Trade union affiliation Wage cuts to increase profit Domestic work Prisoner’s dilemma The homo economicus maximises his/her utility. In a prisoner’s dilemma the dominant strategy is to confess (defect). Fallacy of compositions: what is individually rational is neither Pareto optimal not socially rational. But do people really defect? Kant’s categorical imperative: not the outcome but the act is crucial (morality) Altruism: blood donation Social norms: forest people hunting in the Congo (Turnbull 1963) Instrumental rationality
  • 3. Gauthier: it is instrumentally rational to cooperate rather than to defect Assume there are two sorts of maximisers in the economy: straight maximisers (SM) and constrained maximisers (CM); SMs defect, CMs cooperate with other CMs: E(return from CM) = p*(-1)+(1-p)*(-3) E(return from SM) = -3 For any p>0 the CM strategy is better than the SM one. Instrumental rationality Tit-for-tat Unsurprisingly (maybe), in a repeated Prisoner’s dilemma the best strategy is not to defect but to adopt a tit-for-tat strategy. In the 1980s, Robert Axelrod invited professional game theorists to enter strategies into a tournament of a repeated
  • 4. game (200 times). The winning strategy was tit-for-tat entered by Anatol Rapaport: Start off with cooperation If opponent defects punish him/her by defecting If opponent comes back to cooperation ‘forgive’ them and go back to cooperation Overall, forgiving and cooperative strategies did better. Repeated games & reputation A tit-for-tat strategy can only be played in repeated games. The folk theorem states that in an infinitely repeated game (or given uncertainty to the end of the game) any strategy with a feasible payoff can be an equilibrium. This is important for social interaction: the prisoner’s dilemma can be overcome without (!) external authority. Players enforce compliance (cooperate rather than defect) through punishment. The loss of future returns deters players from defecting. The surprising thing about Axelrod’s tournament was that the tit-for-tat strategy won in a finite (and defined) repeated game.
  • 5. Solution s to the prisoner’s dilemma Authority (state, mafia etc.) imposes cooperation by changing the payoff matrix (e.g. binding contracting). Individuals are motivated by morality/norms (but: if some individuals in society are motivated by morality it becomes rational – according to Gauthier – to cooperate) In repeated games with uncertainty about the end of the game cooperation can be policed by the players through punishment (e.g. defecting in future round). This insight is backed by the Folk Theorem. Remember this game? This is the matrix form of a simultaneous game. Are there any equilibria? Yes, player A has a dominant strategy: ‘Top’.
  • 6. The equilibrium in dominant strategy is (Top, Right). This is simultaneously a Nash equilibrium. Dynamic games & backward induction LeftRightTop10, 41, 5Bottom9, 90, 3 Now imagine the game is played sequentially: A moves first. To solve this game, we can use backward induction. A: What will B do once I’ve played Top/Bottom? Dynamic games & backward induction
  • 7. 10, 4 0, 3 9, 9 1, 5 Dynamic games & backward induction Let’s play a game to demonstrate backward induction: The race to 20 Your aims is to get to 20 first! You can either pick up one pen or two pens. Whoever gets the last pen wins! Who will win? Whoever gets to 2 first (the first player if she/he is aware of backward induction).
  • 8. From 2 you can get to 5, 8, 11, 14, 17, 20. Dynamic games & backward induction Backward induction helps us to identify ‘subgame perfect Nash equilibria’. A subgame is a segment of a dynamic game, if: The subgame starts from a node It branches out to the successor node It must end at the pay-offs of the last node The initial node must be a singleton (i.e. it is clear what the previous move was). While (Top, Right) was an equilibrium in dominant strategies, it is not a subgame perfect Nash equilibrium (SPNE). For backward induction we ask ‘what will player B do it A plays strategy x, y, z. But if it is not rational for player A to play bottom, why should B’s reaction to a non-rational strategy be rational?
  • 9. This is a game of imperfect information. Common knowledge of rationality is not assumed. Therefore, you can build a reputation for being rational/irrational, play cooperative/defect etc. Common knowledge of rationality R R R C C C 110, 101 101 102
  • 10. 4 1 3 100 102 99 1 0 0 2 A reputation as ‘irrational’ individual might increase your pay- off. But your opponent might know that. He/she might suspect that you’re just playing irrational. If a player does not have perfect information about her opponents strategy choices she will play a game of mixed strategies (rather than pure strategy). Assume C plays left with probability c and right with probability (1-c). R plays top with probability r and bottom with probability (1-r). Common knowledge of rationality
  • 11. For R pay-offs are: For C: Row will be happy with his assigned probabilities if a small change in r does not change his pay-off: Same is true for Column. Mixed strategies
  • 12. Mixed strategies These are best response curves. The intersection shows a Nash equilibrium. There are three ways how the prisoner’s dilemma can be resolved: enforcement by authority, non-instrumental rationality and the folk theorem. In dynamic games we use backward induction to find subgame perfect Nash equilibria.
  • 13. If we drop common knowledge of rationality pay-off change/might increase (reputation). Given incomplete information we can play mixed strategies. Summary Varian (2006): chapters 24, 28 and 29. Hargreaves Heap & Varoufakis (1995): Game theory: A critical introduction, chapters 3, 5 and 6. Required readings