SlideShare a Scribd company logo
Resolution of the Stochastic Strategy
Spatial Prisoner’s Dilemma by Means of
Particle Swarm Optimization
Zhang J, Zhang C, Chu T, Perc M
PLoS ONE 6(7): e21787. doi:10.1371/journal.pone.0021787 (2011)
Presented by: Stephen Daedalus E. Separa
Outline
● abstract
● prisoner’s dilemma
● particle swarm optimization
● methods
● results
● summary
Abstract
We study the evolution of cooperation among selfish individuals in the stochastic strategy spatial
prisoner’s dilemma game. We equip players with the particle swarm optimization technique, and find that it
may lead to highly cooperative states even if the temptations to defect are strong. The concept of
particle swarm optimization was originally introduced within a simple model of social dynamics that can
describe the formation of a swarm, i.e., analogous to a swarm of bees searching for a food source.
Essentially, particle swarm optimization foresees changes in the velocity profile of each player, such that the
best locations are targeted and eventually occupied. In our case, each player keeps track of the highest payoff
attained within a local topological neighborhood and its individual highest payoff. Thus, players make use of
their own memory that keeps score of the most profitable strategy in previous actions, as well as use
of the knowledge gained by the swarm as a whole, to find the best available strategy for themselves
and the society. Following extensive simulations of this setup, we find a significant increase in the level of
cooperation for a wide range of parameters, and also a full resolution of the prisoner’s dilemma. We
also demonstrate extreme efficiency of the optimization algorithm when dealing with environments that
strongly favor the proliferation of defection, which in turn suggests that swarming could be an important
phenomenon by means of which cooperation can be sustained even under highly unfavorable
conditions. We thus present an alternative way of understanding the evolution of cooperative behavior
and its ubiquitous presence in nature, and we hope that this study will be inspirational for future efforts
aimed in this direction.
Prisoner’s dilemma
Cooperate Defect
Cooperate R, R S, T
Defect T, S P, P
● Cooperate (stay silent), Defect (betray other prisoner)
● Payoff: R - reward, T - temptation, S - “sucker’s”, P - punishment
● T > R > P > S
● R > P mutual cooperation is superior to mutual defection
● T > R and P > S defection is the dominant strategy for both
● mutual defection is the only strong Nash equilibrium
● Developed by Merrill Flood and Melvin Dresher at RAND Corporation, later
formalized by Albert W. Tucker (mathematician)
Prisoner's dilemma, https://en.wikipedia.org/wiki/Prisoner’s_dilemma (last visited Aug. 23, 2015)
Particle swarm optimization algorithm
velocity update rule
global best current best
● Nature-inspired optimization algorithm based on swarming behavior
● schools of fishes, herding in quadrupeds, flocking in birds, ant and bee colonies
● forming swarms for: migration, foraging, evading preys
● algorithm has deterministic and stochastic components
● each particle fully aware of the other particle’s positions and swarm’s global best
Kennedy J, Eberhart R (1995) Particle swarm optimization. IEEE International Conference on Neural Networks, Piscataway, volume 4. pp 1942–1948
Xin-She Yang, Introduction to Mathematical Optimization: From Linear Programming to Metaheuristics, Cambridge International Science Press: Cambridge, UK (2008)
Particle swarm optimization, https://en.wikipedia.org/wiki/Particle_swarm_optimization (last visited Aug. 23, 2015)
Methods
payoff matrix for
pairwise interaction
(cooperation C or
defection D), between
it’s 4-nearest
neighbors
velocity
update
strategy
update
average level
of cooperation
Average level of cooperation in dependence on b for
different values of ω
● imitating best performing player
in the swarm (ω → 0) beneficial
in low b environments
● imitating personal success (ω →
1) better for evolution of
cooperation in high b
environments
● each data point is an average of
the final outcome of 100
simulations
Distribution of strategies in the whole population, as
obtained for different combinations of b and ω
● ω = 0.01, game is fully
dominated by two strategies, W
= 0 (full defection) and W = 1
(full cooperation)
● ω = 0.99, full spectrum of
strategies are available
● vertical axis: probability that
strategy W is present in the
population
Characteristic spatial distributions of strategies, as
obtained for different combinations of b and ω
● for low values of ω, clustering of
cooperators are observed, game
is dominated by two strategies W
= 0 and W = 1.
● for high values of ω, clustering of
cooperators less pronounced,
more strategies W are observed
● survival of cooperators at high b
Characteristic spatial distributions of velocities, as
obtained for different combinations of b and ω
● at ω = 0.99, population becomes
a “swarm”, with velocities ~ 0,
regardless of b. stationary state
has been reached by means of
adaptive locally inspired slow
strategy changes
● at ω = 0.01, only a few clusters
can be considered as “swarms”.
many pursue highest payoff but
are unable to attain it. swarming
is an important factor promoting
cooperation at high b
Summary
● impact of particle swarm optimization on the evolution of cooperation in stochastic strategy spatial prisoner’s
dilemma game
● strategy update guided by individual memory and swarm knowledge
● cooperative behavior can prevail
● resolution of prisoner’s dilemma game to prosocial behavior
● most profitable swarm strategy at moderate b lead to full dominance of cooperation
● best individual strategy at high b lead to survival of cooperative behavior
● spatial distributions of strategies and velocities

More Related Content

PDF
Examining Coexistence between Splachnaceae Mosses with Individual-Based Model...
PDF
Particle Swarm Optimization Approach for Estimation of Energy Demand of Turkey
PDF
Foundations of Multi-Agent Systems
PPTX
Particle swarm optimization
PDF
The Impact of Allee Effect on a Predator-Prey Model with Holling Type II Func...
PDF
Insights into the local and non-local interaction of two species on the impac...
PDF
Insights into the local and non-local interaction of two species on the impac...
PPTX
An Updated Survey on Niching Methods and Their Applications
Examining Coexistence between Splachnaceae Mosses with Individual-Based Model...
Particle Swarm Optimization Approach for Estimation of Energy Demand of Turkey
Foundations of Multi-Agent Systems
Particle swarm optimization
The Impact of Allee Effect on a Predator-Prey Model with Holling Type II Func...
Insights into the local and non-local interaction of two species on the impac...
Insights into the local and non-local interaction of two species on the impac...
An Updated Survey on Niching Methods and Their Applications

Similar to Resolution of the stochastic strategy spatial prisoner’s dilemma by means of particle swarm optimization (20)

PPTX
Particle swarm optimization
PDF
Qualitative Analysis of Prey Predator System With Immigrant Prey
PDF
Comparison Between PSO and HPSO In Image Steganography
PPTX
soft computing BTU MCA 3rd SEM unit 1 .pptx
PPTX
Neutral theory 2019
PDF
Two-Stage Eagle Strategy with Differential Evolution
PPTX
Multi-Domain Diversity Preservation to Mitigate Particle Stagnation and Enab...
PDF
Pso kota baru parahyangan 2017
PDF
As pi re2015_abstracts
PDF
Evolution of Coordination and Communication in Groups of Embodied Agents
PPT
Swarm intelligence pso and aco
PDF
Signal & Image Processing: An International Journal (SIPIJ)
PDF
Efficient Method to find Nearest Neighbours in Flocking Behaviours
PDF
Phd Defence talk
PDF
Methods for High Dimensional Interactions
PDF
Evolution-based Reaction Path Following
PDF
Evolutionary-driven Optimization in Computational Chemistry
PDF
Master of Science Thesis Defense - Souma (FIU)
PDF
PDF
BC3 Seminars: "Species interaction networks, global change and restoration" D...
Particle swarm optimization
Qualitative Analysis of Prey Predator System With Immigrant Prey
Comparison Between PSO and HPSO In Image Steganography
soft computing BTU MCA 3rd SEM unit 1 .pptx
Neutral theory 2019
Two-Stage Eagle Strategy with Differential Evolution
Multi-Domain Diversity Preservation to Mitigate Particle Stagnation and Enab...
Pso kota baru parahyangan 2017
As pi re2015_abstracts
Evolution of Coordination and Communication in Groups of Embodied Agents
Swarm intelligence pso and aco
Signal & Image Processing: An International Journal (SIPIJ)
Efficient Method to find Nearest Neighbours in Flocking Behaviours
Phd Defence talk
Methods for High Dimensional Interactions
Evolution-based Reaction Path Following
Evolutionary-driven Optimization in Computational Chemistry
Master of Science Thesis Defense - Souma (FIU)
BC3 Seminars: "Species interaction networks, global change and restoration" D...
Ad

Recently uploaded (20)

PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
PDF
An interstellar mission to test astrophysical black holes
PPTX
famous lake in india and its disturibution and importance
PDF
HPLC-PPT.docx high performance liquid chromatography
PPTX
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
PDF
Phytochemical Investigation of Miliusa longipes.pdf
PDF
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
PDF
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
PDF
AlphaEarth Foundations and the Satellite Embedding dataset
PPTX
The KM-GBF monitoring framework – status & key messages.pptx
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PPT
protein biochemistry.ppt for university classes
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PPTX
INTRODUCTION TO EVS | Concept of sustainability
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PDF
Sciences of Europe No 170 (2025)
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PPTX
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
An interstellar mission to test astrophysical black holes
famous lake in india and its disturibution and importance
HPLC-PPT.docx high performance liquid chromatography
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
Phytochemical Investigation of Miliusa longipes.pdf
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
AlphaEarth Foundations and the Satellite Embedding dataset
The KM-GBF monitoring framework – status & key messages.pptx
Classification Systems_TAXONOMY_SCIENCE8.pptx
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
protein biochemistry.ppt for university classes
Taita Taveta Laboratory Technician Workshop Presentation.pptx
INTRODUCTION TO EVS | Concept of sustainability
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
Sciences of Europe No 170 (2025)
ECG_Course_Presentation د.محمد صقران ppt
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
Ad

Resolution of the stochastic strategy spatial prisoner’s dilemma by means of particle swarm optimization

  • 1. Resolution of the Stochastic Strategy Spatial Prisoner’s Dilemma by Means of Particle Swarm Optimization Zhang J, Zhang C, Chu T, Perc M PLoS ONE 6(7): e21787. doi:10.1371/journal.pone.0021787 (2011) Presented by: Stephen Daedalus E. Separa
  • 2. Outline ● abstract ● prisoner’s dilemma ● particle swarm optimization ● methods ● results ● summary
  • 3. Abstract We study the evolution of cooperation among selfish individuals in the stochastic strategy spatial prisoner’s dilemma game. We equip players with the particle swarm optimization technique, and find that it may lead to highly cooperative states even if the temptations to defect are strong. The concept of particle swarm optimization was originally introduced within a simple model of social dynamics that can describe the formation of a swarm, i.e., analogous to a swarm of bees searching for a food source. Essentially, particle swarm optimization foresees changes in the velocity profile of each player, such that the best locations are targeted and eventually occupied. In our case, each player keeps track of the highest payoff attained within a local topological neighborhood and its individual highest payoff. Thus, players make use of their own memory that keeps score of the most profitable strategy in previous actions, as well as use of the knowledge gained by the swarm as a whole, to find the best available strategy for themselves and the society. Following extensive simulations of this setup, we find a significant increase in the level of cooperation for a wide range of parameters, and also a full resolution of the prisoner’s dilemma. We also demonstrate extreme efficiency of the optimization algorithm when dealing with environments that strongly favor the proliferation of defection, which in turn suggests that swarming could be an important phenomenon by means of which cooperation can be sustained even under highly unfavorable conditions. We thus present an alternative way of understanding the evolution of cooperative behavior and its ubiquitous presence in nature, and we hope that this study will be inspirational for future efforts aimed in this direction.
  • 4. Prisoner’s dilemma Cooperate Defect Cooperate R, R S, T Defect T, S P, P ● Cooperate (stay silent), Defect (betray other prisoner) ● Payoff: R - reward, T - temptation, S - “sucker’s”, P - punishment ● T > R > P > S ● R > P mutual cooperation is superior to mutual defection ● T > R and P > S defection is the dominant strategy for both ● mutual defection is the only strong Nash equilibrium ● Developed by Merrill Flood and Melvin Dresher at RAND Corporation, later formalized by Albert W. Tucker (mathematician) Prisoner's dilemma, https://en.wikipedia.org/wiki/Prisoner’s_dilemma (last visited Aug. 23, 2015)
  • 5. Particle swarm optimization algorithm velocity update rule global best current best ● Nature-inspired optimization algorithm based on swarming behavior ● schools of fishes, herding in quadrupeds, flocking in birds, ant and bee colonies ● forming swarms for: migration, foraging, evading preys ● algorithm has deterministic and stochastic components ● each particle fully aware of the other particle’s positions and swarm’s global best Kennedy J, Eberhart R (1995) Particle swarm optimization. IEEE International Conference on Neural Networks, Piscataway, volume 4. pp 1942–1948 Xin-She Yang, Introduction to Mathematical Optimization: From Linear Programming to Metaheuristics, Cambridge International Science Press: Cambridge, UK (2008) Particle swarm optimization, https://en.wikipedia.org/wiki/Particle_swarm_optimization (last visited Aug. 23, 2015)
  • 6. Methods payoff matrix for pairwise interaction (cooperation C or defection D), between it’s 4-nearest neighbors velocity update strategy update average level of cooperation
  • 7. Average level of cooperation in dependence on b for different values of ω ● imitating best performing player in the swarm (ω → 0) beneficial in low b environments ● imitating personal success (ω → 1) better for evolution of cooperation in high b environments ● each data point is an average of the final outcome of 100 simulations
  • 8. Distribution of strategies in the whole population, as obtained for different combinations of b and ω ● ω = 0.01, game is fully dominated by two strategies, W = 0 (full defection) and W = 1 (full cooperation) ● ω = 0.99, full spectrum of strategies are available ● vertical axis: probability that strategy W is present in the population
  • 9. Characteristic spatial distributions of strategies, as obtained for different combinations of b and ω ● for low values of ω, clustering of cooperators are observed, game is dominated by two strategies W = 0 and W = 1. ● for high values of ω, clustering of cooperators less pronounced, more strategies W are observed ● survival of cooperators at high b
  • 10. Characteristic spatial distributions of velocities, as obtained for different combinations of b and ω ● at ω = 0.99, population becomes a “swarm”, with velocities ~ 0, regardless of b. stationary state has been reached by means of adaptive locally inspired slow strategy changes ● at ω = 0.01, only a few clusters can be considered as “swarms”. many pursue highest payoff but are unable to attain it. swarming is an important factor promoting cooperation at high b
  • 11. Summary ● impact of particle swarm optimization on the evolution of cooperation in stochastic strategy spatial prisoner’s dilemma game ● strategy update guided by individual memory and swarm knowledge ● cooperative behavior can prevail ● resolution of prisoner’s dilemma game to prosocial behavior ● most profitable swarm strategy at moderate b lead to full dominance of cooperation ● best individual strategy at high b lead to survival of cooperative behavior ● spatial distributions of strategies and velocities