SlideShare a Scribd company logo
Sparse Binary Zero-Sum Games 
[ACML 2014] 
David Auger1 Jialin Liu2 Sylvie Ruette3 David L. St-Pierre4 
Olivier Teytaud2 
1AlCAAP, Laboratoire PRiSM, Universite de Versailles Saint Quentin-en-Yvelines, France 
2TAO, INRIA-CNRS-LRI, Universite Paris-Sud, France 
3Laboratoire de Mathematiques, CNRS, Universite Paris-Sud, France 
4Universite du Quebec a Trois-Rivieres, Canada 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 1 / 26
Thanks to reviewers for very fruitful comments. 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 2 / 26
Introduction 
Two-person zero-sum game MKK 
Nash Equilibrium ! O(K2) with   3 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 3 / 26
Introduction 
Two-person zero-sum game MKK 
Nash Equilibrium ! O(K2) with   3 
If the Nash is sparse ! k  k submatrix 
! O(k3kK log K) with probability 1   (provable) 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 3 / 26
Zero-sum matrix games 
Game de
ned by matrix M 
I choose (privately) i 
Simultaneously, you choose j 
I earn Mi ;j 
You earn Mi ;j 
So this is zero-sum. 
Or you earn 1  Mi ;j (so this is 1-sum, equivalent). 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 4 / 26
Ok, I earn Mi ;j , you earn Mi ;j 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 5 / 26
Ok, I earn Mi ;j , you earn Mi ;j 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 5 / 26
Nash Equilibrium 
Nash Equilibrium (NE) 
Zero-sum matrix game M 
My strategy = probability distrib. on rows = x 
Your strategy = probability distrib. on cols = y 
Expected reward = xTMy 
There exists x; y such that 8x; y, 
xTMy  xTMy  xTMy: 
(x; y) is a Nash Equilibrium (no unicity). 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 6 / 26
Ok, I earn Mi ;j , you earn Mi ;j 
Nash: Ok I play i with probability x 
i 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 7 / 26
Ok, I earn Mi ;j , you earn Mi ;j 
Nash: Ok I play i with probability x 
i 
How to 
compute x*? 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 7 / 26
Solving Nash 
Solution 1: Linear Programming (LP) 
1 M   M + C so that it is positive (without loss of generality) 
2 LP:
nd 0  u minimizing 
P 
i 
ui such that (MT )  u  1 
P 
3 x = u= 
i 
ui 
=) classical, provably exact, polynomial time 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 8 / 26
Solving Nash 
Solution 2: Approximate Nash Equilibrium 
Approximate -NE 
(x; y) such that 
xTMy    xTMy  xTMy + : 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 9 / 26
Solution 1: LP (comp. expensive) 
Solution 2: Approximate Nash Equilibrium 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 10 / 26
Solution 1: LP (comp. expensive) 
Solution 2: Approximate Nash Equilibrium 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 10 / 26
Computing approximate Nash Equilibrium 
Assuming the matrix is of size K  K ... 
LP (see reduction from Nash to linear programming in 
[Von Stengel (2002)]): O(K2) with 3    4 
[Grigoriadis and Khachiyan(1995)]: 
-Nash with expected time O(K log(K) 
2 ), i.e. less than the size of the 
matrix! 
Parallel : O( log2(K) 
2 ) if using K 
log(K) processors 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 11 / 26
Computing approximate Nash Equilibrium 
Assuming the matrix is of size K  K ... 
LP (see reduction from Nash to linear programming in 
[Von Stengel (2002)]): O(K2) with 3    4 
[Grigoriadis and Khachiyan(1995)]: 
-Nash with expected time O(K log(K) 
2 ), i.e. less than the size of the 
matrix! 
Parallel : O( log2(K) 
2 ) if using K 
log(K) processors 
Other algorithms: similar complexity, approximate solution +
xed 
time with probability 1   
EXP3 ([Auer et al.(1995)]) 
Inf ([Audibert and Bubeck(2009)]) 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 11 / 26
Other tools 1: Hadamard determinant 
Hadamard determinant bound 
([Hadamard(1893)], [Brenner and Cummings(1972)]) 
Given matrix Mkk with coecients in f1; 0; 1g, then M has 
determinant at most k 
k 
2 , i.e. 
j detMj  k 
k 
2 : 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 12 / 26
Other tools 2: Linear programming 
Solve 
min ax 
Mx  c 
x 2 Rd 
If there is a
nite optimum, then there is a
nite optimum x such 
that, for some E with jEj = d, 
8i 2 E, Mi x = ci 
the Mi for i in E are linear independent 
(=) i.e. d lin. indep. constraints are active) 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 13 / 26
Why is this relevant ? 
Nash = solution of linear programming problem 
x: Nash Equilibrium of MKK 
Let us assume that x is unique and has at most k non-zero 
components (sparsity) 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 14 / 26
Why is this relevant ? 
Nash = solution of linear programming problem 
x: Nash Equilibrium of MKK 
Let us assume that x is unique and has at most k non-zero 
components (sparsity) 
) x = also NE of a k  k submatrix: Mk 
) x = solution of LP in dimension k 
) x = solution of k lin. eq. with coecients in f1; 0; 1g 
) x = inv-matrix  vector 
) x = obtained by cofactors / det matrix 
x k 
) has denominator at most k 
2 
0k 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 14 / 26
How to realise ? 
Under assumption that the Nash is sparse 
x is rational with small denominator 
So let us compute an -Nash (sublinear time!) 
And let us compute its closest approximation with small 
denominator (Hadamard) 
variants for -Nash =) exact Nash 
Rounding: switch to closest approximation 
Truncation: remove small components and work on the remaining 
submatrix (exact solving) 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 15 / 26
Evil in the details 
jjy  yjj1   does not imply V(y)  V(y) + ; 
indeed V(y)  V(y) + jjyyjj1 
k 
k 
2 
Results : (if Grigoriadis) 
For a K  K matrix with Nash k-sparse 
Exact solution in time O(poly (k) + (K log K)k3k ) with 
truncation-algorithm 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 16 / 26
Experimental results: two card games 
Previous results: ingaming of Urban Rivals 
New results: metagaming of Pokemon 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 17 / 26
Ingaming results (Urban Rivals) 
Previous work: [Flory and Teytaud(2011)], implementation of 
Truncated-EXP3, without proof 
Urban Rivals AI 
= Monte Carlo Tree Search 
([Coulom (2006)]), 
using zero-sum matrix games 
as a key component 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 18 / 26
Ingaming results (Urban Rivals) 
Previous work: [Flory and Teytaud(2011)], implementation of 
Truncated-EXP3, without proof 
Results don't look impressive ( 56%), but the game is highly 
randomized =) Reaching 55% is far from being negligible 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 19 / 26
New experiments 
Test on Pokemon Deck choice (metagaming) 
Based on EXP3+truncation 
Various versions of EXP3 (6= parameters) 
Code available https://www.lri.fr/~teytaud/games.html 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 20 / 26
New experiments 
With a poorly tuned EXP3 : truncation brings a huge improvement 
0 
0.9 
0.85 
0.8 
0.75 
0.7 
0.65 
0.6 
0.55 
0.5 
10 
1 
10 
2 
10 
3 
10 
4 
10 
0.45 
TEXP3 vs EXP3 
0 
0.9 
0.85 
0.8 
0.75 
0.7 
0.65 
0.6 
0.55 
0.5 
10 
1 
10 
2 
10 
3 
10 
4 
10 
0.45 
TEXP3 vs Uniform 
EXP3 vs Uniform 
Figure: Performance in terms of budget T with a poorly tuned EXP3 for the 
game of Pokeman using 2 cards. 
Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 21 / 26

More Related Content

PPTX
8 queens problem using back tracking
DOC
Unit 4 jwfiles
PDF
DS-MLR: Scaling Multinomial Logistic Regression via Hybrid Parallelism
PPTX
Topology Matters in Communication
PPTX
Backtracking
PDF
Tensor Train decomposition in machine learning
PDF
QMC Error SAMSI Tutorial Aug 2017
PDF
Tensorizing Neural Network
8 queens problem using back tracking
Unit 4 jwfiles
DS-MLR: Scaling Multinomial Logistic Regression via Hybrid Parallelism
Topology Matters in Communication
Backtracking
Tensor Train decomposition in machine learning
QMC Error SAMSI Tutorial Aug 2017
Tensorizing Neural Network

What's hot (18)

PDF
2012 mdsp pr13 support vector machine
PDF
Convolutional neural network backpropagation derivation
PDF
Cnn backpropagation derivation
PDF
Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...
PDF
2012 mdsp pr12 k means mixture of gaussian
PDF
Anti-differentiating approximation algorithms: A case study with min-cuts, sp...
PDF
Statistical inference of generative network models - Tiago P. Peixoto
PDF
Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...
PDF
Chapter 06 boolean algebra 2o-p
PDF
Gwt sdm public
PDF
Chapter 06 boolean algebra
PDF
Chapter 06 boolean algebra 3o-p
PDF
2012 mdsp pr09 pca lda
PDF
2012 mdsp pr10 ica
PDF
2012 mdsp pr11 ica part 2 face recognition
PDF
Big data matrix factorizations and Overlapping community detection in graphs
PDF
Hierarchical matrix techniques for maximum likelihood covariance estimation
PDF
Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...
2012 mdsp pr13 support vector machine
Convolutional neural network backpropagation derivation
Cnn backpropagation derivation
Quasistatic Fracture using Nonliner-Nonlocal Elastostatics with an Analytic T...
2012 mdsp pr12 k means mixture of gaussian
Anti-differentiating approximation algorithms: A case study with min-cuts, sp...
Statistical inference of generative network models - Tiago P. Peixoto
Animashree Anandkumar, Electrical Engineering and CS Dept, UC Irvine at MLcon...
Chapter 06 boolean algebra 2o-p
Gwt sdm public
Chapter 06 boolean algebra
Chapter 06 boolean algebra 3o-p
2012 mdsp pr09 pca lda
2012 mdsp pr10 ica
2012 mdsp pr11 ica part 2 face recognition
Big data matrix factorizations and Overlapping community detection in graphs
Hierarchical matrix techniques for maximum likelihood covariance estimation
Slides: On the Chi Square and Higher-Order Chi Distances for Approximating f-...
Ad

Viewers also liked (6)

PPT
Game theory
PDF
Lesson 35: Game Theory and Linear Programming
ZIP
PPTX
Game theory
PPT
Linear Programming 1
Game theory
Lesson 35: Game Theory and Linear Programming
Game theory
Linear Programming 1
Ad

Similar to Sparse Binary Zero Sum Games (ACML2014) (20)

PPT
lect1207
PPT
Juegos minimax AlfaBeta
PDF
Design and Implementation of Parallel and Randomized Approximation Algorithms
PPTX
Unit 3 ap gt
PDF
present_merged
PDF
Matrices & Determinants
PPTX
Adversarial search
PPTX
Adversarial search
PPTX
9SearchAdversarial (1).pptx
PPTX
foundations of AI:module 3,csp,minimax algorithm
PDF
Adversarial search
PPTX
Adversarial search with Game Playing
PPT
Unit_I_Introduction(Part_III).ppt
PPTX
Game playing
PPT
05 adversarial
PPTX
PPTX
Artificial intelligence dic_SLIDE_3.pptx
PPTX
AI_unit3.pptx
PDF
Numerical Solution of Linear algebraic Equation
PPTX
MINI-MAX ALGORITHM.pptx
lect1207
Juegos minimax AlfaBeta
Design and Implementation of Parallel and Randomized Approximation Algorithms
Unit 3 ap gt
present_merged
Matrices & Determinants
Adversarial search
Adversarial search
9SearchAdversarial (1).pptx
foundations of AI:module 3,csp,minimax algorithm
Adversarial search
Adversarial search with Game Playing
Unit_I_Introduction(Part_III).ppt
Game playing
05 adversarial
Artificial intelligence dic_SLIDE_3.pptx
AI_unit3.pptx
Numerical Solution of Linear algebraic Equation
MINI-MAX ALGORITHM.pptx

More from Jialin LIU (7)

PDF
Slides banditrmhc
PDF
GVGAI Single-Player Learning Competition at IEEE CIG17
PDF
My PhD defence
PDF
Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)
PDF
A Mathematically Derived Number of Resamplings for Noisy Optimization (GECCO2...
PDF
Meta online learning: experiments on a unit commitment problem (ESANN2014)
PDF
Noisy Optimization Convergence Rates (GECCO2013)
Slides banditrmhc
GVGAI Single-Player Learning Competition at IEEE CIG17
My PhD defence
Algorithm Portfolios for Noisy Optimization: Compare Solvers Early (LION8)
A Mathematically Derived Number of Resamplings for Noisy Optimization (GECCO2...
Meta online learning: experiments on a unit commitment problem (ESANN2014)
Noisy Optimization Convergence Rates (GECCO2013)

Recently uploaded (20)

PPT
First Aid Training Presentation Slides.ppt
PDF
Swiggy’s Playbook: UX, Logistics & Monetization
PPTX
Role and Responsibilities of Bangladesh Coast Guard Base, Mongla Challenges
PPTX
BIOLOGY TISSUE PPT CLASS 9 PROJECT PUBLIC
PPTX
water for all cao bang - a charity project
PPTX
Tablets And Capsule Preformulation Of Paracetamol
PPTX
Effective_Handling_Information_Presentation.pptx
PPTX
chapter8-180915055454bycuufucdghrwtrt.pptx
PPTX
Self management and self evaluation presentation
PPTX
Hydrogel Based delivery Cancer Treatment
PPTX
Non-Verbal-Communication .mh.pdf_110245_compressed.pptx
DOCX
"Project Management: Ultimate Guide to Tools, Techniques, and Strategies (2025)"
PPTX
Introduction to Effective Communication.pptx
PPTX
ART-APP-REPORT-FINctrwxsg f fuy L-na.pptx
PPTX
Human Mind & its character Characteristics
PDF
Nykaa-Strategy-Case-Fixing-Retention-UX-and-D2C-Engagement (1).pdf
PPTX
PHIL.-ASTRONOMY-AND-NAVIGATION of ..pptx
PPTX
nose tajweed for the arabic alphabets for the responsive
DOCX
ENGLISH PROJECT FOR BINOD BIHARI MAHTO KOYLANCHAL UNIVERSITY
PDF
Parts of Speech Prepositions Presentation in Colorful Cute Style_20250724_230...
First Aid Training Presentation Slides.ppt
Swiggy’s Playbook: UX, Logistics & Monetization
Role and Responsibilities of Bangladesh Coast Guard Base, Mongla Challenges
BIOLOGY TISSUE PPT CLASS 9 PROJECT PUBLIC
water for all cao bang - a charity project
Tablets And Capsule Preformulation Of Paracetamol
Effective_Handling_Information_Presentation.pptx
chapter8-180915055454bycuufucdghrwtrt.pptx
Self management and self evaluation presentation
Hydrogel Based delivery Cancer Treatment
Non-Verbal-Communication .mh.pdf_110245_compressed.pptx
"Project Management: Ultimate Guide to Tools, Techniques, and Strategies (2025)"
Introduction to Effective Communication.pptx
ART-APP-REPORT-FINctrwxsg f fuy L-na.pptx
Human Mind & its character Characteristics
Nykaa-Strategy-Case-Fixing-Retention-UX-and-D2C-Engagement (1).pdf
PHIL.-ASTRONOMY-AND-NAVIGATION of ..pptx
nose tajweed for the arabic alphabets for the responsive
ENGLISH PROJECT FOR BINOD BIHARI MAHTO KOYLANCHAL UNIVERSITY
Parts of Speech Prepositions Presentation in Colorful Cute Style_20250724_230...

Sparse Binary Zero Sum Games (ACML2014)

  • 1. Sparse Binary Zero-Sum Games [ACML 2014] David Auger1 Jialin Liu2 Sylvie Ruette3 David L. St-Pierre4 Olivier Teytaud2 1AlCAAP, Laboratoire PRiSM, Universite de Versailles Saint Quentin-en-Yvelines, France 2TAO, INRIA-CNRS-LRI, Universite Paris-Sud, France 3Laboratoire de Mathematiques, CNRS, Universite Paris-Sud, France 4Universite du Quebec a Trois-Rivieres, Canada Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 1 / 26
  • 2. Thanks to reviewers for very fruitful comments. Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 2 / 26
  • 3. Introduction Two-person zero-sum game MKK Nash Equilibrium ! O(K2) with 3 Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 3 / 26
  • 4. Introduction Two-person zero-sum game MKK Nash Equilibrium ! O(K2) with 3 If the Nash is sparse ! k k submatrix ! O(k3kK log K) with probability 1 (provable) Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 3 / 26
  • 6. ned by matrix M I choose (privately) i Simultaneously, you choose j I earn Mi ;j You earn Mi ;j So this is zero-sum. Or you earn 1 Mi ;j (so this is 1-sum, equivalent). Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 4 / 26
  • 7. Ok, I earn Mi ;j , you earn Mi ;j Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 5 / 26
  • 8. Ok, I earn Mi ;j , you earn Mi ;j Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 5 / 26
  • 9. Nash Equilibrium Nash Equilibrium (NE) Zero-sum matrix game M My strategy = probability distrib. on rows = x Your strategy = probability distrib. on cols = y Expected reward = xTMy There exists x; y such that 8x; y, xTMy xTMy xTMy: (x; y) is a Nash Equilibrium (no unicity). Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 6 / 26
  • 10. Ok, I earn Mi ;j , you earn Mi ;j Nash: Ok I play i with probability x i Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 7 / 26
  • 11. Ok, I earn Mi ;j , you earn Mi ;j Nash: Ok I play i with probability x i How to compute x*? Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 7 / 26
  • 12. Solving Nash Solution 1: Linear Programming (LP) 1 M M + C so that it is positive (without loss of generality) 2 LP:
  • 13. nd 0 u minimizing P i ui such that (MT ) u 1 P 3 x = u= i ui =) classical, provably exact, polynomial time Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 8 / 26
  • 14. Solving Nash Solution 2: Approximate Nash Equilibrium Approximate -NE (x; y) such that xTMy xTMy xTMy + : Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 9 / 26
  • 15. Solution 1: LP (comp. expensive) Solution 2: Approximate Nash Equilibrium Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 10 / 26
  • 16. Solution 1: LP (comp. expensive) Solution 2: Approximate Nash Equilibrium Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 10 / 26
  • 17. Computing approximate Nash Equilibrium Assuming the matrix is of size K K ... LP (see reduction from Nash to linear programming in [Von Stengel (2002)]): O(K2) with 3 4 [Grigoriadis and Khachiyan(1995)]: -Nash with expected time O(K log(K) 2 ), i.e. less than the size of the matrix! Parallel : O( log2(K) 2 ) if using K log(K) processors Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 11 / 26
  • 18. Computing approximate Nash Equilibrium Assuming the matrix is of size K K ... LP (see reduction from Nash to linear programming in [Von Stengel (2002)]): O(K2) with 3 4 [Grigoriadis and Khachiyan(1995)]: -Nash with expected time O(K log(K) 2 ), i.e. less than the size of the matrix! Parallel : O( log2(K) 2 ) if using K log(K) processors Other algorithms: similar complexity, approximate solution +
  • 19. xed time with probability 1 EXP3 ([Auer et al.(1995)]) Inf ([Audibert and Bubeck(2009)]) Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 11 / 26
  • 20. Other tools 1: Hadamard determinant Hadamard determinant bound ([Hadamard(1893)], [Brenner and Cummings(1972)]) Given matrix Mkk with coecients in f1; 0; 1g, then M has determinant at most k k 2 , i.e. j detMj k k 2 : Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 12 / 26
  • 21. Other tools 2: Linear programming Solve min ax Mx c x 2 Rd If there is a
  • 22. nite optimum, then there is a
  • 23. nite optimum x such that, for some E with jEj = d, 8i 2 E, Mi x = ci the Mi for i in E are linear independent (=) i.e. d lin. indep. constraints are active) Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 13 / 26
  • 24. Why is this relevant ? Nash = solution of linear programming problem x: Nash Equilibrium of MKK Let us assume that x is unique and has at most k non-zero components (sparsity) Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 14 / 26
  • 25. Why is this relevant ? Nash = solution of linear programming problem x: Nash Equilibrium of MKK Let us assume that x is unique and has at most k non-zero components (sparsity) ) x = also NE of a k k submatrix: Mk ) x = solution of LP in dimension k ) x = solution of k lin. eq. with coecients in f1; 0; 1g ) x = inv-matrix vector ) x = obtained by cofactors / det matrix x k ) has denominator at most k 2 0k Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 14 / 26
  • 26. How to realise ? Under assumption that the Nash is sparse x is rational with small denominator So let us compute an -Nash (sublinear time!) And let us compute its closest approximation with small denominator (Hadamard) variants for -Nash =) exact Nash Rounding: switch to closest approximation Truncation: remove small components and work on the remaining submatrix (exact solving) Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 15 / 26
  • 27. Evil in the details jjy yjj1 does not imply V(y) V(y) + ; indeed V(y) V(y) + jjyyjj1 k k 2 Results : (if Grigoriadis) For a K K matrix with Nash k-sparse Exact solution in time O(poly (k) + (K log K)k3k ) with truncation-algorithm Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 16 / 26
  • 28. Experimental results: two card games Previous results: ingaming of Urban Rivals New results: metagaming of Pokemon Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 17 / 26
  • 29. Ingaming results (Urban Rivals) Previous work: [Flory and Teytaud(2011)], implementation of Truncated-EXP3, without proof Urban Rivals AI = Monte Carlo Tree Search ([Coulom (2006)]), using zero-sum matrix games as a key component Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 18 / 26
  • 30. Ingaming results (Urban Rivals) Previous work: [Flory and Teytaud(2011)], implementation of Truncated-EXP3, without proof Results don't look impressive ( 56%), but the game is highly randomized =) Reaching 55% is far from being negligible Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 19 / 26
  • 31. New experiments Test on Pokemon Deck choice (metagaming) Based on EXP3+truncation Various versions of EXP3 (6= parameters) Code available https://www.lri.fr/~teytaud/games.html Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 20 / 26
  • 32. New experiments With a poorly tuned EXP3 : truncation brings a huge improvement 0 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55 0.5 10 1 10 2 10 3 10 4 10 0.45 TEXP3 vs EXP3 0 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55 0.5 10 1 10 2 10 3 10 4 10 0.45 TEXP3 vs Uniform EXP3 vs Uniform Figure: Performance in terms of budget T with a poorly tuned EXP3 for the game of Pokeman using 2 cards. Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 21 / 26
  • 33. New experiments With a well-tuned EXP3, truncation brings a signi
  • 34. cant improvement 0 0.58 0.57 0.56 0.55 0.54 0.53 0.52 0.51 10 1 10 2 10 3 10 4 10 0.5 TEXP3 vs EXP3 0 0.95 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55 0.5 10 1 10 2 10 3 10 4 10 0.45 TEXP3 vs Uniform EXP3 vs Uniform Figure: Performance in terms of budget T with a well-tuned EXP3 for the game of Pokeman using 2 cards. Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 22 / 26
  • 35. Conclusions further work Proved small improvement, experimentally big improvement. Improving the bound ? We don't know k (sparsity level). Adaptive algorithms ? Proved only with unique Nash (x; y). Necessary ? Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 23 / 26
  • 36. Jean-Yeves Audibert and Sebastien Bubeck. Minimax policies for adversarial and stochastic bandits. In 22th annual conference on learning theory, 2009. Peter Auer, Nicolo Cesa-Bianchi, Yoav Freund, and Robert E. Schapire. Gambling in a rigged casino: the adversarial multi-armed bandit problem. In Proceedings of the 36th Annual Symposium on Foundations of Computer Science. IEEE Computer Society Press, 1995. Remi Coulom (2006). Ecient selectivity and backup operators in Monte-Carlo tree search. In Computers and games, 2006. Joel Brenner and Larry Cummings. The Hadamard maximum determinant problem. In Amer. Math. Monthly, 1972. Sebastien Flory and Olivier Teytaud. Upper con
  • 37. dence trees with short term partial information. In Procedings of EvoGames, 2011. Michael D. Grigoriadis and Leonid G. Khachiyan. A sublinear-time randomized approximation algorithm for matrix games. In Operations Research Letters, 1995. Jacques Hadamard. Resolution d'une question relative aux determinants. In Bull. Sci. Math., 1893. Bernhard Von Stengel. Computing equilibria for two-person games. In Handbook of game theory with economic applications, 2002. Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 24 / 26
  • 38. Thank you for your attention ! David Auger David L. St-Pierre Sylvie Ruette Olivier Teytaud Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 25 / 26
  • 39. [ACML 2014] Sparse Binary Zero-Sum Games D. Auger J. Liu S. Ruette D. L. St-Pierre O. Teytaud Jialin LIU (INRIA-TAO) Sparse Binary Zero-Sum Games 26 / 26