SlideShare a Scribd company logo
Anticipation Mappings for  Learning Classifier Systems Larry Bull, Pier Luca Lanzi, Toby O’hara University of the West of England, Bristol, UK Politecnico di Milano, Italy Illinois Genetic Algorithms Laboratory,  University of Illinois at Urbana Champaign, USA CEC 2007, September 27th, 2007, Singapore TexPoint fonts used in EMF.  Read the TexPoint manual before you delete this box.:  A A A A A A
A Brief Look at Classifier Systems If  condition C  holds in  state S , then  action A  will produce  a  payoff p , this prediction has an  accuracy F
Anticipatory Classifier Systems (ACS) Modify the structure of classifiers and the systems to learn the effect of actions Short history Rick L. Riolo. 1990  “Lookahead Planning and Latent Learning in a Classifier System.” Wolfgang Stolzmann. 1998 “Anticipatory Classier Systems”, GP98 Martin V. Butz (2002),  “Anticipatory Classifier Systems”, Springer-Verlag If  condition C  holds in  state S t , then  action A  will  produce an effect resulting in  state S t+1
Need Anticipations? Compute them! Basic Idea Not a system designed for anticipations Enrich a known system to learn anticipations Anticipatory prediction comes for free Maybe less powerful than ACS,  but can be applied to any LCS If  condition C  holds in  state S , then  action A  will produce  a  payoff p , with an  accuracy F If  condition C  holds in  state S , then  action A  will produce  a  payoff p , with an  accuracy F ,   and effect is an f (s t , w) First tried by Larry Bull and Toby O’hara (2000) using neural networks to compute an f (s t ,w)
Learning Anticipatory Functions s t s t+1 a t s t  a t   ->  s t+1
(our way to) Anticipations Classifiers has five parameters: The condition and the action The prediction, the error and the fitness The parameter vector  w  for anticipation If  condition C  holds in  state S , then  action A  will produce  a  payoff p , with an  accuracy F ,   and effect is an f (s t , w)
Learning to Anticipate the Effect  The next state s t+1  is used to update w of classifiers Classifier error is updated according to a is the action computed by the classifier  ε f   (x t ,y t ,a)   is the  error function  Several error functions, we used the simplest one:  0 if action is correct (a = y t ),  1000 otherwise Classifier fitness is updated as in XCS
Anticipatory Prediction Predicting the next state Classifiers matching s t  that advocate a t For each action a in [M], the  classification accuracy  C(x t ,a) is computed as, Learning to predict the next state Use current state s t , a t , and s t+1  to train the vector w
What Function? Sigmoid Action is simply a constant: Update is performed with a gradient descent: a f xw
What Function? Neural Networks When the problem involves many action we can either use an array of simple action function use a powerful action function  Neural Network n inputs h hidden nodes As many outputs as the action encoding needs Update is performed using online backpropagation
Woods
Simple Sequential Problems (1) (2) (3)
Anticipation Accuracy (1)  accuracy MSE
Anticipation Accuracy (2)  accuracy MSE
Anticipation Accuracy (3)  accuracy MSE
Alias
Sequential Problems with Aliasing (1) (2) Extension of anticipatory Classifier Systems for problem with noise was developed by Martin Butz, Dave G. Goldberg, and Wolfgang Stolzmann (2000)
Anticipation Accuracy (1)  accuracy MSE
Anticipation Accuracy (2)  accuracy MSE
We presented a very simple approach  to anticipatory behavior  Compute the anticipation of the next state based on the previous state and the action performed Very simple, but provide accurate predictions,  while requiring smaller populations Simpler than ACSs, probably less powerful  Even simple perceptron can be powerful enough Generalizes to real-valued and/or noisy domains
Any Question? Thank you!
Woods1 The Woods1 model is  very sparse : there are only 128X17 state-action pairs Woods1 is, in practice, more simple than binary sum Both perceptron and NN does not exploit the problem sparseness as SVM
Maze 5 Maze5 is still a sparse problem but slightly more complex: it has 288X37 state-action pairs Maze5 is, in practice, slight more difficult than binary sum Also in this case SVM exploits very effectively the problem sparseness

More Related Content

PDF
Support Vector Machines for Computing Action Mappings in Learning Classifier ...
PDF
PR12-225 Discovering Physical Concepts With Neural Networks
PPTX
Deep Learning A-Z™: Artificial Neural Networks (ANN) - Backpropagation
PPTX
The matrix
PPTX
Neural net and back propagation
PPTX
Back propagation network
PDF
Climaware anticipation2015 slides
PPTX
Implemetation du system des classeurs de type xcsfca
Support Vector Machines for Computing Action Mappings in Learning Classifier ...
PR12-225 Discovering Physical Concepts With Neural Networks
Deep Learning A-Z™: Artificial Neural Networks (ANN) - Backpropagation
The matrix
Neural net and back propagation
Back propagation network
Climaware anticipation2015 slides
Implemetation du system des classeurs de type xcsfca

Viewers also liked (19)

PPTX
PDF
Final slide show tammy nichols2
PPTX
Algorithme DPLL
PPTX
Intelligence artificielle et décision médicale
PDF
Toward a Natural Genetic / Evolutionary Algorithm for Multiobjective Optimiza...
PPTX
Research Project - E-learning website for Computer Algorithms
PPTX
E.B.E.C 2014 - Team Greece - Day 2 Case Study
PPT
PPT
Mahindra & Mahindra Ltd
PPTX
E.B.E.C 2014 - Team Greece - Day 3 Case Study
PDF
Terminaux individuels (BYOD)
PPT
Inferential statistics
PPTX
Ruled based expert system
PPT
Introduction to Rule-based Applications
PPTX
Anticipation and Mitigation of Geohazards for Subsea Power Cables (and Pipeli...
PDF
Rule Based Architecture System
PPT
Sri Lanka Case Study Presentation
PPT
Rule Based System
PPTX
Indian culture
Final slide show tammy nichols2
Algorithme DPLL
Intelligence artificielle et décision médicale
Toward a Natural Genetic / Evolutionary Algorithm for Multiobjective Optimiza...
Research Project - E-learning website for Computer Algorithms
E.B.E.C 2014 - Team Greece - Day 2 Case Study
Mahindra & Mahindra Ltd
E.B.E.C 2014 - Team Greece - Day 3 Case Study
Terminaux individuels (BYOD)
Inferential statistics
Ruled based expert system
Introduction to Rule-based Applications
Anticipation and Mitigation of Geohazards for Subsea Power Cables (and Pipeli...
Rule Based Architecture System
Sri Lanka Case Study Presentation
Rule Based System
Indian culture
Ad

Similar to Anticipation Mappings for Learning Classifier Systems (20)

PDF
Computed Prediction: So far, so good. What now?
PPTX
GECCO 2014 - Learning Classifier System Tutorial
PDF
How AlphaGo Works
PPTX
When Models Meet Data: From ancient science to todays Artificial Intelligence...
PDF
Ai & ml
PDF
13ClassifierPerformance.pdf
PDF
Deep Reinforcement Learning Through Policy Optimization, John Schulman, OpenAI
PDF
Uncertainty Awareness in Integrating Machine Learning and Game Theory
PPT
Machine learning
PPT
Free Ebooks Download ! Edhole.com
DOCX
Machine Learning Printable For Studying Exam
PDF
SafeguardAI and Surprise Based Learning -- Protect your AI solutions from Uni...
PPT
AML_030607.ppt
PPTX
MachineLearning-v0.1
PPTX
Reinforcement learning
PPT
Introduction
PDF
Deep learning MindMap
PPTX
GECCO-2014 Learning Classifier Systems: A Gentle Introduction
PPT
Introduction to Machine Learning Aristotelis Tsirigos
PDF
Anomaly Detection through Reinforcement Learning
Computed Prediction: So far, so good. What now?
GECCO 2014 - Learning Classifier System Tutorial
How AlphaGo Works
When Models Meet Data: From ancient science to todays Artificial Intelligence...
Ai & ml
13ClassifierPerformance.pdf
Deep Reinforcement Learning Through Policy Optimization, John Schulman, OpenAI
Uncertainty Awareness in Integrating Machine Learning and Game Theory
Machine learning
Free Ebooks Download ! Edhole.com
Machine Learning Printable For Studying Exam
SafeguardAI and Surprise Based Learning -- Protect your AI solutions from Uni...
AML_030607.ppt
MachineLearning-v0.1
Reinforcement learning
Introduction
Deep learning MindMap
GECCO-2014 Learning Classifier Systems: A Gentle Introduction
Introduction to Machine Learning Aristotelis Tsirigos
Anomaly Detection through Reinforcement Learning
Ad

More from Pier Luca Lanzi (20)

PDF
11 Settembre 2021 - Giocare con i Videogiochi
PDF
Breve Viaggio al Centro dei Videogiochi
PDF
Global Game Jam 19 @ POLIMI - Morning Welcome
PPTX
Data Driven Game Design @ Campus Party 2018
PDF
GGJ18 al Politecnico di Milano - Presentazione che precede la presentazione d...
PDF
GGJ18 al Politecnico di Milano - Presentazione di apertura
PDF
Presentation for UNITECH event - January 8, 2018
PDF
DMTM Lecture 20 Data preparation
PDF
DMTM Lecture 19 Data exploration
PDF
DMTM Lecture 18 Graph mining
PDF
DMTM Lecture 17 Text mining
PDF
DMTM Lecture 16 Association rules
PDF
DMTM Lecture 15 Clustering evaluation
PDF
DMTM Lecture 14 Density based clustering
PDF
DMTM Lecture 13 Representative based clustering
PDF
DMTM Lecture 12 Hierarchical clustering
PDF
DMTM Lecture 11 Clustering
PDF
DMTM Lecture 10 Classification ensembles
PDF
DMTM Lecture 09 Other classificationmethods
PDF
DMTM Lecture 08 Classification rules
11 Settembre 2021 - Giocare con i Videogiochi
Breve Viaggio al Centro dei Videogiochi
Global Game Jam 19 @ POLIMI - Morning Welcome
Data Driven Game Design @ Campus Party 2018
GGJ18 al Politecnico di Milano - Presentazione che precede la presentazione d...
GGJ18 al Politecnico di Milano - Presentazione di apertura
Presentation for UNITECH event - January 8, 2018
DMTM Lecture 20 Data preparation
DMTM Lecture 19 Data exploration
DMTM Lecture 18 Graph mining
DMTM Lecture 17 Text mining
DMTM Lecture 16 Association rules
DMTM Lecture 15 Clustering evaluation
DMTM Lecture 14 Density based clustering
DMTM Lecture 13 Representative based clustering
DMTM Lecture 12 Hierarchical clustering
DMTM Lecture 11 Clustering
DMTM Lecture 10 Classification ensembles
DMTM Lecture 09 Other classificationmethods
DMTM Lecture 08 Classification rules

Recently uploaded (20)

PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
cuic standard and advanced reporting.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Approach and Philosophy of On baking technology
PDF
Empathic Computing: Creating Shared Understanding
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
KodekX | Application Modernization Development
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
sap open course for s4hana steps from ECC to s4
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
Programs and apps: productivity, graphics, security and other tools
cuic standard and advanced reporting.pdf
Network Security Unit 5.pdf for BCA BBA.
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Chapter 3 Spatial Domain Image Processing.pdf
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Per capita expenditure prediction using model stacking based on satellite ima...
Mobile App Security Testing_ A Comprehensive Guide.pdf
Approach and Philosophy of On baking technology
Empathic Computing: Creating Shared Understanding
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
KodekX | Application Modernization Development
MIND Revenue Release Quarter 2 2025 Press Release
sap open course for s4hana steps from ECC to s4
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Understanding_Digital_Forensics_Presentation.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation

Anticipation Mappings for Learning Classifier Systems

  • 1. Anticipation Mappings for Learning Classifier Systems Larry Bull, Pier Luca Lanzi, Toby O’hara University of the West of England, Bristol, UK Politecnico di Milano, Italy Illinois Genetic Algorithms Laboratory, University of Illinois at Urbana Champaign, USA CEC 2007, September 27th, 2007, Singapore TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.: A A A A A A
  • 2. A Brief Look at Classifier Systems If condition C holds in state S , then action A will produce a payoff p , this prediction has an accuracy F
  • 3. Anticipatory Classifier Systems (ACS) Modify the structure of classifiers and the systems to learn the effect of actions Short history Rick L. Riolo. 1990 “Lookahead Planning and Latent Learning in a Classifier System.” Wolfgang Stolzmann. 1998 “Anticipatory Classier Systems”, GP98 Martin V. Butz (2002), “Anticipatory Classifier Systems”, Springer-Verlag If condition C holds in state S t , then action A will produce an effect resulting in state S t+1
  • 4. Need Anticipations? Compute them! Basic Idea Not a system designed for anticipations Enrich a known system to learn anticipations Anticipatory prediction comes for free Maybe less powerful than ACS, but can be applied to any LCS If condition C holds in state S , then action A will produce a payoff p , with an accuracy F If condition C holds in state S , then action A will produce a payoff p , with an accuracy F , and effect is an f (s t , w) First tried by Larry Bull and Toby O’hara (2000) using neural networks to compute an f (s t ,w)
  • 5. Learning Anticipatory Functions s t s t+1 a t s t a t -> s t+1
  • 6. (our way to) Anticipations Classifiers has five parameters: The condition and the action The prediction, the error and the fitness The parameter vector w for anticipation If condition C holds in state S , then action A will produce a payoff p , with an accuracy F , and effect is an f (s t , w)
  • 7. Learning to Anticipate the Effect The next state s t+1 is used to update w of classifiers Classifier error is updated according to a is the action computed by the classifier ε f (x t ,y t ,a) is the error function Several error functions, we used the simplest one: 0 if action is correct (a = y t ), 1000 otherwise Classifier fitness is updated as in XCS
  • 8. Anticipatory Prediction Predicting the next state Classifiers matching s t that advocate a t For each action a in [M], the classification accuracy C(x t ,a) is computed as, Learning to predict the next state Use current state s t , a t , and s t+1 to train the vector w
  • 9. What Function? Sigmoid Action is simply a constant: Update is performed with a gradient descent: a f xw
  • 10. What Function? Neural Networks When the problem involves many action we can either use an array of simple action function use a powerful action function Neural Network n inputs h hidden nodes As many outputs as the action encoding needs Update is performed using online backpropagation
  • 11. Woods
  • 16. Alias
  • 17. Sequential Problems with Aliasing (1) (2) Extension of anticipatory Classifier Systems for problem with noise was developed by Martin Butz, Dave G. Goldberg, and Wolfgang Stolzmann (2000)
  • 20. We presented a very simple approach to anticipatory behavior Compute the anticipation of the next state based on the previous state and the action performed Very simple, but provide accurate predictions, while requiring smaller populations Simpler than ACSs, probably less powerful Even simple perceptron can be powerful enough Generalizes to real-valued and/or noisy domains
  • 22. Woods1 The Woods1 model is very sparse : there are only 128X17 state-action pairs Woods1 is, in practice, more simple than binary sum Both perceptron and NN does not exploit the problem sparseness as SVM
  • 23. Maze 5 Maze5 is still a sparse problem but slightly more complex: it has 288X37 state-action pairs Maze5 is, in practice, slight more difficult than binary sum Also in this case SVM exploits very effectively the problem sparseness