SlideShare a Scribd company logo
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
Learning Classifier Systems
A Gentle Introduction
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
Outline
• Introduction
 Why? When?
 What areas? What Applications?
• Learning Classifier Systems
 What Learning?
 Classifiers?
 How do they work?
 What decisions?
 General principles
 Better classifiers
 Theory?
• Survey of applications
2
why?
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
4
a real system
with an unknown
underlying
dynamics
Why? What was the goal?
if C1 :buy 30
if C2 :sell -2
…
evolved rules provide
a plausible, human
readable, model of
the unknown system
apply a classifier system online
to generate a behavior
matched the real system
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
To state, in concrete technical form, a model of
a complete mind and its several aspects
5
• A cognitive system interacting
with an environment
• Binary detectors and effectors
• Knowledge = set of classifiers
• Condition-action rules that
recognize a situation and
propose an action
• Payoff reservoir for
the system’s needs
• Payoff distributed through
an epochal algorithm
• Internal memory as
message list
• Genetic search of classifiers
Holland’s Vision, Cognitive System One
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
6Holland’s Learning Classifier Systems
• Explicit representation of the incoming reward
• Good classifiers are the
ones that predict
high rewards
• Credit Assignment using
Bucket Brigade
• Rule Discovery through
a genetic algorithm
applied to the entire
rule base (on the whole
solution)
• Description was vast
It did not work right off!
Very limited success
• David E. Goldberg: Computer-aided gas pipeline operation using genetic
algorithms and rule learning, PhD thesis. University of Michigan. Ann Arbor,
MI.
Rule Discovery Component
Perceptions
Detectors
Reward Action
Effectors
Match Set
Classifiers
matching
the current
sensory inputs
Population
Classifiers
representing
the current
knowledge
Evaluation of
the actions in
the match set
Credit Assignment
Component
1 2
3
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
7Learning System LS-1 &
Pittsburgh Classifier Systems
Holland models learning as ongoing adaptation process
De Jong instead views learning as optimization
Genetic algorithms applied to a population of rule sets
1. t := 0
2. Initialize the population P(t)
3. Evaluate the rules sets in P(t)
4. While the termination condition is not satisfied
5. Begin
6. Select the rule sets in P(t) and generate Ps(t)
7. Recombine and mutate the rule sets in Ps(t)
8. P(t+1) := Ps(t)
9. t := t+1
10. Evaluate the rules sets in P(t)
11. End No apportionment of credit
Offline evaluation of rule sets
Pittsburgh
Classifier
System
when?
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
1970s
1980s
1990s
2000s
XCS is born, first results on classification
& robotics applications, but interest fades wa
Genetic algorithms and CS-1
Research flourishes, success is limited
Evolving rules as optimization
Research follows Holland’s vision
Success is still limited
Classifier systems finally work
Focus on classification (UCS)
Large development of models
Facetwise theory and applications
Genomic and epidemiological applications
Reinforcement
Learning
&
Machine
Learning
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
10Stewart W. Wilson &
The XCS Classifier System
1.Simplify the model
2.Go for accurate predictions
not high payoffs
3.Apply the genetic algorithm
to subproblems not to
the whole problem
4.Focus on classifier systems as
reinforcement learning
with rule-based generalization
5.Use reinforcement learning (Q-learning) to distribute reward
• Wilson, S.W.: Classifier Fitness Based on Accuracy. Evolutionary
Computation 3(2), 149-175 (1995).
Most developed and studied model so far
for what?
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
Classification
(label prediction)
Regression
(numerical
prediction)
Sequential
Decision
Making
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
13
Computational
Models of Cognition
Complex
Adaptive
Systems
Classification
& Data mining
Autonomous
Robotics
Others
Traffic controllers
Target recognition
Fighter maneuvering
…
learning classifier systems
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
15
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
16
• The goal is to maximize the amount of reward received
• How much future reward when at is performed in st?
• What is the expected payoff for st and at?
• Need to compute a value function, Q(st,at) payoff
Learning Classifier Systems as
Reinforcement Learning Methods
Environment
Agent
st atrt+1
st+1
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
How does reinforcement
learning work?
Define the inputs, the actions,
and how the reward is determined
Define the expected payoff
Compute a value function Q(st,at) mapping
state-action pairs into expected payoffs
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
18
• At the beginning, is initialized with random values
• At time t,
• Parameters,
 Discount factor 
 The learning rate 
 The action selection strategy
How does reinforcement learning work?
Then, Q-learning is an option.
incoming rewardnew estimate
previous value
new estimate
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
How does reinforcement
learning work?
Reinforcement learning assumes
that Q(st,at) is represented as a table
But the real world is complex,
the number of possible inputs can be huge!
We cannot afford an exact Q(st,at)
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
20The Mountain Car Example
GOAL
Task: drive an underpowered
car up a steep mountain road
st = position,
velocity
rt = 0 when goal is
reached, -1 otherwise
Value Function
Q(st,at)
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
21What are the issues?
•Exact representation infeasible
•Approximation mandatory
•The function is unknown,
it is learnt online from experience
Learning an unknown payoff function
while also trying to approximate it
Approximator works on intermediate estimates
While also providing information for the learning
Convergence is not guaranteed
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
Whats does this have to do with
Learning Classifier Systems?
They solve reinforcement learning problems
Represent the payoff function Q(st, at) as
a population of rules, the classifiers
Classifiers are evolved while
Q(st, at) is learned online
classifiers?
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
24
payoff
surface for A
What is a classifier?
IF condition C is true for input s
THEN the payoff of action A is p
s
payoff
l u
p
Condition
C(s)=l≤s≤u
General conditions covering
large portions of the problem
space
Accurate approximations
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
25What types of solutions?
how do they work?
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
• Two key components
• A genetic algorithm works on problem space
decomposition (condition-action)
• Supervised or reinforcement learning is used
for learning local prediction models
Problem Space
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
28How do learning classifier systems work?
The main performance cycle
statest
Environment
Agent
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
29How do learning classifier systems work?
The main performance cycle
statest
Environment
Agent
Population[P]
Rulesdescribing
thecurrent solution
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
30How do learning classifier systems work?
The main performance cycle
statest
Matching
Environment
Agent
Rulesdescribing
thecurrent solution
Population[P]
Ruleswhose
conditionmatchst
MatchSet [M]
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
31How do learning classifier systems work?
The main performance cycle
statest
Matching
Environment
Agent
Rulesdescribing
thecurrent solution
Population[P]
Ruleswhose
conditionmatchst
MatchSet [M]
Action
Evaluation
PredictionArray
Thevalueof each
actionin[M]
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
How do learning classifier systems work?
The main performance cycle
32
statest
Matching
Environment
Agent
Rulesdescribing
thecurrent solution
Population[P]
Ruleswhose
conditionmatchst
MatchSet [M]
Action
Evaluation
PredictionArray
Thevalueof each
actionin[M]
Action
Selection
ActionSet [A]
Rulesin[M] withthe
selectedaction.
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
33How do learning classifier systems work?
The main performance cycle
statest
Matching
Rulesdescribing
thecurrent solution
Population[P]
Ruleswhose
conditionmatchst
MatchSet [M]
Action
Evaluation
PredictionArray
Thevalueof each
actionin[M]
Action
Selection
ActionSet [A]
Rulesin[M] withthe
selectedaction.
actionat
Environment
Agent
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
34How do learning classifier systems work?
The main performance cycle
statest
Matching
Environment
Agent
Rulesdescribing
thecurrent solution
Population[P]
Ruleswhose
conditionmatchst
MatchSet [M]
Action
Evaluation
PredictionArray
Thevalueof each
actionin[M]
Action
Selection
ActionSet [A]
Rulesin[M] withthe
selectedaction.
actionat
The classifiers predict an expected payoff
The incoming reward is used to update
the rules which helped in getting the reward
Any reinforcement learning algorithm can
be used to estimate the classifier prediction
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
35How do learning classifier systems work?
The main performance cycle
statest
Matching
Rulesdescribing
thecurrent solution
Population[P]
Ruleswhose
conditionmatchst
MatchSet [M]
Action
Evaluation
PredictionArray
Thevalueof each
actionin[M]
Action
Selection
ActionSet [A]
Rulesin[M] withthe
selectedaction.
actionatrewardrt
ActionSet at t-1[A]-1
Rulesin[M] withthe
selectedaction.
Reinforcement
Learning
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
36How do learning classifier systems work?
The reinforcement component
• For each classifier C in [A]-1 the prediction p is updated as follows:
P  r +  maxaA PredictionArray(a)
p  p + (P- p)
• Compare this with Q-learning
A rule “corresponds” to Q-table
p “corresponds” to the value Q(s,a)
P “corresponds” to “r+maxaQ(st,a)”
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
Where do classifiers come from?
In principle, any search method may be used
Evolutionary computation is nice because
it is representation “independent”
A genetic algorithm select, recombines,
mutate existing classifiers to search for better ones
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
What are the good classifiers?
What is the classifier fitness?
The goal is to approximate a target value function
with as few classifiers as possible
We wish to have an accurate approximation
One possible approach is to define fitness
as a function of the classifier prediction accuracy
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
What about generalization?
The genetic algorithm can take care of this
General classifiers apply more often,
thus they are reproduced more
But since fitness is based on classifiers accuracy
only accurate classifiers are likely to be reproduced
The genetic algorithm evolves
maximally general maximally accurate classifiers
what decisions?
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
41How to apply learning classifier systems
• Determine the inputs, the actions,
and how reward is distributed
• Determine what is the expected payoff
that must be maximized
• Decide an action selection strategy
• Set up the parameter 
Environment
Learning Classifier System
st rt at
• Select a representation for conditions,
the recombination and the mutation operators
• Select a reinforcement learning algorithm
• Setup the parameters, mainly the population size,
the parameters for the genetic algorithm, etc.
• Parameter 
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
42Things can be extremely simple!
For instance in supervised classification
Environment
Learning Classifier System
example class
1 if the class is correct
0 if the class is not correct
• Select a representation for conditions and
the recombination and mutation operators
• Setup the parameters, mainly the population size,
the parameters for the genetic algorithm, etc.
general principles
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
An Example… 44
A1 A2 A3 A4 A5 A6 CLASS
1 1 1 1 3 1 1
1 1 1 1 3 2 1
1 1 1 3 2 1 1
1 1 1 3 3 2 1
1 1 2 3 1 2 1
1 2 1 1 1 2 1
1 2 1 1 2 1 0
1 2 1 1 3 1 0
1 2 1 1 4 2 0
1 2 1 2 1 1 1
1 2 1 2 3 1 0
… … … … … … …
Six Attributes
Severalcases
A hidden
concept…
What is the concept?
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
Traditional Approach?
• Classification  Trees  C4.5? ID3? CHAID? …
• Classification  Rules  CN2? C4.5rules? …
• Prediction  Trees  CART? …
45
Task
Representation
Algorithm
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
I Need to Classify, I Want Trees, What
Algorithm? ID3? C4.5? CHAID?
46
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
I Need to Classify, I Want Rules!
What Algorithm?
• OneRule
if A5 = 3 then 0 (11/19)
if A5 = 2 then 0 (11/20)
if A5 = 4 then 0 (11/23)
if A5 = 1 then 1 (29/0)
correct: 91 out of 124 training
examples.
• Rule Learner
if A5 = 4 and A1 = 1 then 0 (1/13)
if A5 = 1 then 1 (29/0)
if A4 = 2 and A5 = 2 then 0 (1/6)
if A1 = 1 and A2 = 2 then 0 (0/10)
else 0 (27/29)
correct: 87 out of 116 training examples.
47
 FOIL
is_0(A1,A2,A3,A4,A5,A6) :- a1  a2 and a5  1
Different task, different solution representation?
Completely different algorithm!
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
Thou shalt have no other model
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
Genetics-Based
Generalization
Accurate Estimates
About Classifiers
(Powerful RL)
Classifier
Representation
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
50Learning Classifier Systems:
One Principle Many Representations
Learning Classifier System
Genetic
Search
Estimates
RL & MLKnowledge
Representation
Conditions &
Prediction
Ternary Conditions
0, 1, #
Symbolic
Conditions
Attribute-Value
Conditions
Ternary rules
0, 1, #
if a5<2 or
a1=a2, class=1
if A1=1and A2=11
if A1=2 andA2=21
Ternary Conditions
0, 1, #
Attribute-Value
Conditions
Symbolic
Conditions
Same framework!
Just plug-in your favorite representation
better classifiers
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
52
payoff
landscape of A
What is computed prediction?
Replace the prediction p by
a parametrized function p(s,w)
s
payoff
l u
p(s,w)=w0+sw1
Condition
C(s)=l≤s≤u
Which Representation?
Which type of
approximation?
Stewart W. Wilson: Classifiers that approximate functions. Natural Computing
1(2-3): 211-234 (2002)
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
53Same example with computed prediction
No need to change the framework
Just plug-in your favorite estimator
Linear, Polynomial, NNs, SVMs, tile-coding
Lanzi, PL, Extending XCSF beyond linear approximation. GECCO, 2005
Loiacono D. and Lanzi P.L.. Evolving neural networks for classifier prediction with XSCF. ECAI, 2006
Lanzi and Loiacono XCSF with neural prediction. IEEE CEC, 2006.
Loiacono, D., Marelli A., and Lanzi PL. Support vector regression for classifier prediction. GECCO, 2007.
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
What do we want?
Fast learning
Learn something as soon as possible!
Accurate solutions
As the learning proceeds
the solution accuracy should improve
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
Is there another approach?
payoff
landscape
s
payoff
l u
p(s,w)=w0
p(s,w)=w1s+w0p(s,w)=NN(s,w)
Initially, constant
prediction may be good
As learn proceeds,
the solution should improve…
… as much as possible.
55
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
Ensemble Classifiers 56
None of prediction models is the “best”
Let evolution search for the “most suited”
NN
Almost as fast as using best model
Model is adapted effectively in each subspace
any theory?
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
Learning Classifier Systems
Representation, Reinforcement Learning,
& Genetics-based Search
Unified theory is impractical
Develop facetwise models
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
59Facetwise Models for a Theory of Evolution
and Learning
• Prof. David E. Goldberg
University of Illinois at Urbana Champaign
• Facetwise approach for the analysis
and the design of genetic algorithms
• In learning classifier systems
 Separate learning from evolution
 Simplify the problem by focusing
only on relevant aspect
 Derive facetwise models
• Applied to model several aspects of evolution
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
S([A]) = S([P])/(2-S([P]))
Since the genetic algorithm is applied to the action set,
there is a generalization pressure regulated by this equation.
N>-log(1-θ)/p
Given the probability θ to maintain all the subsolutions
with occurrence probability p, then the population size N …
O(L 2o+a)
Time to converge for a problem of L bits order o
and with a problem classes
Martin V. Butz, Tim Kovacs, Pier Luca Lanzi, Stewart W. Wilson: Toward a theory of generalization and
learning in XCS. IEEE Trans. Evolutionary Computation 8(1): 28-46 (2004)
Martin V. Butz, Kumara Sastry, David E. Goldberg: Strong, Stable, and Reliable Fitness Pressure in XCS
due to Tournament Selection. Genetic Programming and Evolvable Machines 6(1): 53-77 (2005)
Martin V. Butz, David E. Goldberg, Pier Luca Lanzi: Bounding Learning Time in XCS. GECCO (2) 2004:
739-750
advanced topics…
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
What the Advanced Topics?
• Models Tailored for Applications
Anticipatory Behavior (ACS)
Data Mining, classification and prediction (e.g., UCS)
Epidemiology (e.g., EpiCS, EpiXCS)
ExSTraCS (Bioinformatics)
• Improved representations of conditions (GP, GEP, …)
• Improved representations of actions (GP, Code
Fragments)
• Improved genetic search (EDAs, ECGA, BOA, ….)
• Improved estimators
• Scalability
Matching
Distributed models
62
what applications?
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
64
Computational
Models of Cognition
Complex
Adaptive
Systems
Classification
& Data mining
Autonomous
Robotics
Others
Traffic controllers
Target recognition
Fighter maneuvering
…
modeling cognition
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
66What Applications?
Computational Models of Cognition
• Learning classifier system model
certain aspects of cognition
 Human language learning
 Perceptual category learning
 Affect theory
 Anticipatory and latent learning
• Learning classifier systems provide good
models for animals in experiments
in which the subjects must learn internal
models to perform as well as they do
• Martin V. Butz, University of Würzburg,
Department of Cognitive Psychology III
Cognitive Bodyspaces:
Learning and Behavior (COBOSLAB)
• Wolfgang Stolzmann, Daimler Chrysler
• Rick R. Riolo, University of Michigan,
Center for the Study of Complex Systems
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
67References
• Butz, M.V.: Anticipatory Learning Classifier Systems, Genetic
Algorithms and Evolutionary Computation, vol. 4.
Springer-Verlag (2000)
• Riolo, R.L.: Lookahead Planning and
Latent Learning in a Classifier System.
In: J.A. Meyer, S.W. Wilson (eds.)
From Animals to Animats 1.
Proceedings of the First International
Conferenceon Simulation of Adaptive
Behavior (SAB90), pp. 316{326.
A Bradford Book. MIT Press (1990)
• Stolzmann, W. and Butz, M.V. and Hoffman, J. and Goldberg, D.E.:
First Cognitive Capabilities in the Anticipatory Classifier System. In:
From Animals to Animats: Proceedings of the Sixth International
Conference on Simulation of Adaptive Behavior. MIT Press (2000)
computational economics
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
69What Applications?
Computational Economics
• To models one single agent acting in
the market (BW Arthur, JH Holland, B LeBaron)
• To model many interactive agents each one
controlled by its own classifier system.
• Modeling the behavior of agents trading
risk free bonds and risky assets
• Different trader types modeled by supplying
different input information sets
to a group of homogenous agents
• Later extended to a multi-LCS architecture
applied to portfolio optimization
• Technology startup company
founded in March 2005
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
70References
• Sor Ying (Byron) Wong, Sonia Schulenburg: Portfolio allocation using
XCS experts in technical analysis, market conditions and options
market. GECCO (Companion) 2007: 2965-2972
• Sonia Schulenburg, Peter Ross: An Adaptive Agent Based Economic
Model. Learning Classifier Systems 1999: 263-282
• BW Arthur, J.H. Holland, B. LeBaron, R. Palmer, and P. Tayler: "Asset
Pricing Under Endogenous Expectations in an Artificial Stock Market,“
in The Economy as an Evolving Complex System II. Edited (with S.
Durlauf and D. Lane), Addison-Wesley, 1997.
• BW Arthur, R. Palmer, J. Holland, B. LeBaron, and P. Taylor:
"Artificial Economic Life: a Simple Model of a Stockmarket,“ Physica
D, 75, 264-274, 1994
data analysis
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
72What Applications?
Classification and Data Mining
• Bull, L. (ed) Applications of Learning
Classifier Systems. Springer. (2004)
• Bull, L., Bernado Mansilla, E. & Holmes, J.
(eds) Learning Classifier Systems in
Data Mining. Springer. (2008)
• Nowadays, by far the most important
application domain for LCSs
• Many models GA-Miner, REGAL, GALE GAssist
• Performance comparable to state of the art machine learning
Human Competitive Results 2007
X Llorà, R Reddy, B Matesic, R Bhargava: Towards Better than
Human Capability in Diagnosing Prostate Cancer Using Infrared
Spectroscopic Imaging
hyper heuristics
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
74What Applications?
Hyper-Heuristics
• Ross P., Marin-Blazquez J., Schulenburg S.,
and Hart E., Learning a Procedure that can
Solve Hard Bin-packing Problems: A New
GA-Based Approach to Hyper-Heuristics.
In Proceedings of GECCO 2003
• Bin-packing and timetabling problems
• Pick a set of non-evolutionary heuristics
• Use classifier system to learn a solution process
not a solution
• The classifier system learns a sequence of heuristics which should be
applied to gradually transform the problem from its initial state to its
final solved state.
medical data
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
76What Applications?
Epidemiologic Surveillance
• John H. Holmes
Center for Clinical Epidemiology & Biostatistics
Department of Biostatistics & Epidemiology
University of Pennsylvania - School of Medicine
• Epidemiologic surveillance data
need adaptivity to abrupt changes
• Readable rules are attractive
• Performance similar to state
of the art machine learning
• But several important
feature-outcome relationships
missed by other methods were discovered
• Similar results were reported by
Stewart Wilson for breast cancer data
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
77References
• John H. Holmes, Jennifer A. Sager: Rule Discovery in Epidemiologic
Surveillance Data Using EpiXCS: An Evolutionary Computation
Approach. AIME 2005: 444-452
• John H. Holmes, Dennis R. Durbin, Flaura K. Winston: A New
Bootstrapping Method to Improve Classification Performance in
Learning Classifier Systems. PPSN 2000: 745-754
• John H. Holmes, Dennis R. Durbin, Flaura K. Winston: The learning
classifier system: an evolutionary computation approach to knowledge
discovery in epidemiologic surveillance. Artificial Intelligence in
Medicine 19(1): 53-74 (2000)
autonomous robotics
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
79What Applications?
Autonomous Robotics
• In the 1990's, a major testbed
for learning classifier systems.
• Marco Dorigo and Marco Colombetti:
Robot Shaping An Experiment in Behavior
Engineering, 1997
• They introduced the concept of
robot shaping defined as the
incremental training of an autonomous agent.
• Behavior engineering methodology named BAT:
Behavior Analysis and Training.
• University of West England
applied several learning classifier system
models to several robotics problems
artificial ecosystems
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
81What Applications?
Modeling Artificial Ecosystems
• Jon McCormack, Monash University
• Eden: an interactive, self-generating, artificial ecosystem.
• World populated by collections of evolving virtual creatures
• Creatures move about
the environment,
 Make and listen to sounds,
 Foraging for food,
 Encountering predators,
 Mating with each other.
• Creatures evolve to
fit their landscape
• Eden has four seasons per year (15mins)
• Simple physics for rocks, biomass and sonic animals.
Jon McCormack
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
82Eden: An Evolutionary Sonic Ecosystem
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
83References
• McCormack, J. Impossible Nature The Art of Jon McCormack
Published by the Australian Centre for the Moving Image
ISBN 1 920805 08 7, ISBN 1 920805 09 5 (DVD)
• J. McCormack: New Challenges
for Evolutionary Music and Art,
ACM SIGEVOlution Newsletter,
Vol. 1(1), April 2006, pp. 5-11.
• McCormack, J. 2005, 'On the
Evolution of Sonic Ecosystems'
in Adamatzky, et al. (eds),
Artificial Life Models in Software,
Springer, Berlin.
• McCormack, J. 2003, 'Evolving Sonic Ecosystems', Kybernetes,
32(1/2), pp. 184-202.
chemical & neuronal
networks
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
85What Applications?
Chemical and Neuronal Networks
• L. Bull, A. Budd, C. Stone, I. Uroukov,
B. De Lacy Costello and A. Adamatzky
University of the West of England
• Behaviour of non-linear media
controlled automatically through
evolutionary learning
• Unconventional computing
realised by such an approach.
• Learning classifier systems
 Control a light-sensitive sub-excitable
Belousov-Zhabotinski reaction
 Control the electrical stimulation of
cultured neuronal networks
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
86What Applications?
Chemical and Neuronal Networks
• To control a light-sensitive sub-excitable BZ reaction, pulses of wave
fragments are injected into the checkerboard grid resulting in rich spatio-
temporal behaviour
• Learning classifier system can direct the fragments to an arbitrary position
through control of the light intensity within each cell
• Learning Classifier Systems control the electrical stimulation of cultured
neuronal networks such that they display elementary learning, respond to a
given input signal in a pre-specified way
• Results indicate that the learned stimulation protocols identify seemingly
fundamental properties of in vitro neuronal networks
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
87References
• Larry Bull, Adam Budd, Christopher Stone, Ivan Uroukov, Ben De
Lacy Costello and Andrew Adamatzky: Towards Unconventional
Computing through Simulated Evolution: Learning Classifier System
Control of Non-Linear Media
Artificial Life (to appear)
• Budd, A., Stone, C., Masere, J., Adamatzky, A., DeLacyCostello, B.,
Bull, L.: Towards machine learning control of chemical computers. In:
A. Adamatzky, C. Teuscher (eds.) From Utopian to Genuine
Unconventional Computers, pp. 17-36. Luniver Press
• Bull, L., Uroukov, I.S.: Initial results from the use of learning classier
systems to control n vitro neuronal networks. In: Lipson [189], pp.
369-376
conclusions
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
89Conclusions
• Cognitive Modeling
• Complex Adaptive Systems
• Machine Learning
• Reinforcement Learning
• Metaheuristics
• …
 Many blocks to plug-in
 Several representations
 Several RL algorithms
 Several evolutionary methods
 …
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
Additional Information
• WWW
 http://gbml.org
 http://www.illigal.org
 UWE Learning Classifier Systems Group
http://www.cems.uwe.ac.uk/lcsg/
 A Learning Classifier Systems Bibliography
http://www.cs.bris.ac.uk/~kovacs/lcs/search.html
• Mailing lists
 lcs-and-gbml group @ Yahoo
• Proceedings of the International Workshop on Learning Classifier
Systems (Lanzi, Stolzmann, & Wilson, 2000, 2001, 2002; Kovacs,
Llorà, & Takadama, 2003-2005; Bacardit, Bernadó-Mansilla, Butz,
Kovacs, Llorà, Takadama, IWLCS2007)
• IWLCS here! (too bad if you did not come)
90
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
Books
• Bull, L. (Ed.). Applications of learning classifier systems. Berlin Heidelberg: Springer-
Verlag.
• Butz, M. V. (2002). Anticipatory learning classifier systems. Kluwer Academic
Publishers, Boston, MA.
• Butz, M. V. (2006). Rule-based evolutionary online learning systems: A principled
approach to LCS analysis and design. Studies in Fuzziness and Soft Computing Series,
Springer Verlag, Berlin Heidelberg, Germany.
• Bull, L. & Kovacs, T. (Eds.) (2005). Foundations of learning classifier systems. Berlin
Heidelberg: Springer-Verlag.
• Lanzi, P. L., Stolzmann, W., & Wilson, S. W. (Eds.) (2000). Learning classifier systems:
From foundations to applications (LNAI 1813). Berlin Heidelberg: Springer-Verlag.
• Drugowitsch, J., (2008) Design and Analysis of Learning Classifier Systems: A
Probabilistic Approach, Springer-Verlag
• Goldberg, D. E. (1989). Genetic algorithms in search, optimization & machine learning.
Addison-Wesley.
• Holland, J.H. (1975). Adaptation in natural and artificial systems. University of Michigan
Press.
• Kovacs, T. (2004). Strength of accuracy: Credit assignment in learning classifier
systems. Berlin Heidelberg: Springer-Verlag.
91
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
Software
• http://www.illigal.org
Martin Butz’s ACS & XCS in C and Java
Pier Luca Lanzi’s C++ XCSLib (XCS and XCSF)
• http://medal.cs.umsl.edu/files/XCSFJava1.1.zip
Martin Butz’s XCSF in Java
• http://www.ryanurbanowicz.com
Ryan’s Urbanowicz’s ExSTraCS
• Educational LCS (eLCS)
http://sourceforge.net/projects/educationallcs/
 Includes a basic guide for each implementation, which
progressively adds major components of a Michigan-Style LCS
algorithm.
 Code intended to be paired with the first LCS introductory textbook
written by Will Browne.
92
Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC
Thank you!
Questions?

More Related Content

PPTX
Adversarial search
PPTX
Genetic programming
PDF
Syntax directed translation
PDF
Recurrent Neural Networks, LSTM and GRU
PPTX
Ensemble learning
PPT
Soft Computing-173101
PDF
02 Machine Learning - Introduction probability
Adversarial search
Genetic programming
Syntax directed translation
Recurrent Neural Networks, LSTM and GRU
Ensemble learning
Soft Computing-173101
02 Machine Learning - Introduction probability

What's hot (20)

PPT
GRAPH COLORING AND ITS APPLICATIONS
PDF
AI local search
PDF
Kevin Knight, Elaine Rich, B. Nair - Artificial Intelligence (2010, Tata McGr...
PPTX
Recognition-of-tokens
PPTX
04 Multi-layer Feedforward Networks
PPTX
Reasoning in AI
PPT
UNIT-1-PPTS-DAA.ppt
PDF
COMPILER DESIGN- Syntax Directed Translation
PPTX
Control Strategies in AI
PPTX
Uninformed search /Blind search in AI
PPTX
5.2 primitive recursive functions
PDF
Assignment#03
PPT
Chap4
PDF
Daa notes 3
PPTX
Distribution transparency and Distributed transaction
PPTX
Constraint Satisfaction Problem (CSP) : Cryptarithmetic, Graph Coloring, 4- Q...
PPT
Symbol Table, Error Handler & Code Generation
PPTX
Genetic algorithms in Data Mining
PPTX
Introduction to Transformer Model
PPTX
Syntax Analysis in Compiler Design
GRAPH COLORING AND ITS APPLICATIONS
AI local search
Kevin Knight, Elaine Rich, B. Nair - Artificial Intelligence (2010, Tata McGr...
Recognition-of-tokens
04 Multi-layer Feedforward Networks
Reasoning in AI
UNIT-1-PPTS-DAA.ppt
COMPILER DESIGN- Syntax Directed Translation
Control Strategies in AI
Uninformed search /Blind search in AI
5.2 primitive recursive functions
Assignment#03
Chap4
Daa notes 3
Distribution transparency and Distributed transaction
Constraint Satisfaction Problem (CSP) : Cryptarithmetic, Graph Coloring, 4- Q...
Symbol Table, Error Handler & Code Generation
Genetic algorithms in Data Mining
Introduction to Transformer Model
Syntax Analysis in Compiler Design
Ad

Viewers also liked (20)

PPTX
GECCO 2014 - Learning Classifier System Tutorial
PPT
Videogame Design and Programming: Conferenza d'Ateneo 18 Maggio 2011
PPT
Fitness Inheritance in Evolutionary and
PDF
Lecture 02 Machine Learning For Data Mining
PDF
Evolving Rules to Solve Problems: The Learning Classifier Systems Way
PDF
Lecture 04 Association Rules Basics
PDF
DMTM 2015 - 06 Introduction to Clustering
PDF
DMTM 2015 - 03 Data Representation
PDF
Machine Learning and Data Mining: 02 Machine Learning
PDF
DMTM 2015 - 01 Course Introduction
PDF
DMTM 2015 - 02 Data Mining
PDF
Course Organization
PDF
DMTM 2015 - 04 Data Exploration
PDF
Idea Generation and Conceptualization
PDF
DMTM 2015 - 15 Classification Ensembles
PDF
DMTM 2015 - 10 Introduction to Classification
PDF
DMTM 2015 - 08 Representative-Based Clustering
PDF
Video game Design for Dummies - Pisa 10 Dicembre 2014
PDF
DMTM 2015 - 16 Data Preparation
PDF
DMTM 2015 - 05 Association Rules
GECCO 2014 - Learning Classifier System Tutorial
Videogame Design and Programming: Conferenza d'Ateneo 18 Maggio 2011
Fitness Inheritance in Evolutionary and
Lecture 02 Machine Learning For Data Mining
Evolving Rules to Solve Problems: The Learning Classifier Systems Way
Lecture 04 Association Rules Basics
DMTM 2015 - 06 Introduction to Clustering
DMTM 2015 - 03 Data Representation
Machine Learning and Data Mining: 02 Machine Learning
DMTM 2015 - 01 Course Introduction
DMTM 2015 - 02 Data Mining
Course Organization
DMTM 2015 - 04 Data Exploration
Idea Generation and Conceptualization
DMTM 2015 - 15 Classification Ensembles
DMTM 2015 - 10 Introduction to Classification
DMTM 2015 - 08 Representative-Based Clustering
Video game Design for Dummies - Pisa 10 Dicembre 2014
DMTM 2015 - 16 Data Preparation
DMTM 2015 - 05 Association Rules
Ad

Similar to GECCO-2014 Learning Classifier Systems: A Gentle Introduction (20)

PDF
IRJET- Tracking and Predicting Student Performance using Machine Learning
PPT
Contextual information elicitation in travel recommender systems
PPT
Contextual Information Elicitation in Travel Recommender Systems
PDF
DMTM Lecture 04 Classification
PDF
[SAC 2015] Improve General Contextual SLIM Recommendation Algorithms By Facto...
PDF
Reinforcement Learning on Mine Sweeper
PDF
Research on Recommender Systems: Beyond Ratings and Lists
PDF
Assessment Model for Opportunistic Routing
PDF
Performance Evaluation for Classifiers tutorial
PPTX
PDF
A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...
PDF
[WI 2017] Context Suggestion: Empirical Evaluations vs User Studies
PDF
Data Visualization and Learning Analytics with xAPI
PDF
Replicable Evaluation of Recommender Systems
PDF
Predicting Engagement in Video Lectures
PDF
Ascertaining Impacts of Open Educational Practices in Shifting Mindsets
PDF
De carlo rizk 2010 icelw
PPTX
Unlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven Curiosity
PPTX
Efficiency Based Comparison of Project Delivery Methods
PDF
LAK21 Data Driven Redesign of Tutoring Systems (Yun Huang)
IRJET- Tracking and Predicting Student Performance using Machine Learning
Contextual information elicitation in travel recommender systems
Contextual Information Elicitation in Travel Recommender Systems
DMTM Lecture 04 Classification
[SAC 2015] Improve General Contextual SLIM Recommendation Algorithms By Facto...
Reinforcement Learning on Mine Sweeper
Research on Recommender Systems: Beyond Ratings and Lists
Assessment Model for Opportunistic Routing
Performance Evaluation for Classifiers tutorial
A Connectionist Approach to Dynamic Resource Management for Virtualised Netwo...
[WI 2017] Context Suggestion: Empirical Evaluations vs User Studies
Data Visualization and Learning Analytics with xAPI
Replicable Evaluation of Recommender Systems
Predicting Engagement in Video Lectures
Ascertaining Impacts of Open Educational Practices in Shifting Mindsets
De carlo rizk 2010 icelw
Unlocking Exploration: Self-Motivated Agents Thrive on Memory-Driven Curiosity
Efficiency Based Comparison of Project Delivery Methods
LAK21 Data Driven Redesign of Tutoring Systems (Yun Huang)

More from Pier Luca Lanzi (20)

PDF
11 Settembre 2021 - Giocare con i Videogiochi
PDF
Breve Viaggio al Centro dei Videogiochi
PDF
Global Game Jam 19 @ POLIMI - Morning Welcome
PPTX
Data Driven Game Design @ Campus Party 2018
PDF
GGJ18 al Politecnico di Milano - Presentazione che precede la presentazione d...
PDF
GGJ18 al Politecnico di Milano - Presentazione di apertura
PDF
Presentation for UNITECH event - January 8, 2018
PDF
DMTM Lecture 20 Data preparation
PDF
DMTM Lecture 19 Data exploration
PDF
DMTM Lecture 18 Graph mining
PDF
DMTM Lecture 17 Text mining
PDF
DMTM Lecture 16 Association rules
PDF
DMTM Lecture 15 Clustering evaluation
PDF
DMTM Lecture 14 Density based clustering
PDF
DMTM Lecture 13 Representative based clustering
PDF
DMTM Lecture 12 Hierarchical clustering
PDF
DMTM Lecture 11 Clustering
PDF
DMTM Lecture 10 Classification ensembles
PDF
DMTM Lecture 09 Other classificationmethods
PDF
DMTM Lecture 08 Classification rules
11 Settembre 2021 - Giocare con i Videogiochi
Breve Viaggio al Centro dei Videogiochi
Global Game Jam 19 @ POLIMI - Morning Welcome
Data Driven Game Design @ Campus Party 2018
GGJ18 al Politecnico di Milano - Presentazione che precede la presentazione d...
GGJ18 al Politecnico di Milano - Presentazione di apertura
Presentation for UNITECH event - January 8, 2018
DMTM Lecture 20 Data preparation
DMTM Lecture 19 Data exploration
DMTM Lecture 18 Graph mining
DMTM Lecture 17 Text mining
DMTM Lecture 16 Association rules
DMTM Lecture 15 Clustering evaluation
DMTM Lecture 14 Density based clustering
DMTM Lecture 13 Representative based clustering
DMTM Lecture 12 Hierarchical clustering
DMTM Lecture 11 Clustering
DMTM Lecture 10 Classification ensembles
DMTM Lecture 09 Other classificationmethods
DMTM Lecture 08 Classification rules

Recently uploaded (20)

PPTX
2. Earth - The Living Planet Module 2ELS
PPTX
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
PDF
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
PDF
Phytochemical Investigation of Miliusa longipes.pdf
PPTX
INTRODUCTION TO EVS | Concept of sustainability
PDF
Biophysics 2.pdffffffffffffffffffffffffff
DOCX
Viruses (History, structure and composition, classification, Bacteriophage Re...
PPTX
neck nodes and dissection types and lymph nodes levels
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PDF
The scientific heritage No 166 (166) (2025)
PDF
bbec55_b34400a7914c42429908233dbd381773.pdf
PPTX
BIOMOLECULES PPT........................
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PPTX
Derivatives of integument scales, beaks, horns,.pptx
PDF
Placing the Near-Earth Object Impact Probability in Context
PDF
HPLC-PPT.docx high performance liquid chromatography
PPTX
Cell Membrane: Structure, Composition & Functions
PPTX
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
PPTX
microscope-Lecturecjchchchchcuvuvhc.pptx
PPTX
famous lake in india and its disturibution and importance
2. Earth - The Living Planet Module 2ELS
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
Phytochemical Investigation of Miliusa longipes.pdf
INTRODUCTION TO EVS | Concept of sustainability
Biophysics 2.pdffffffffffffffffffffffffff
Viruses (History, structure and composition, classification, Bacteriophage Re...
neck nodes and dissection types and lymph nodes levels
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
The scientific heritage No 166 (166) (2025)
bbec55_b34400a7914c42429908233dbd381773.pdf
BIOMOLECULES PPT........................
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
Derivatives of integument scales, beaks, horns,.pptx
Placing the Near-Earth Object Impact Probability in Context
HPLC-PPT.docx high performance liquid chromatography
Cell Membrane: Structure, Composition & Functions
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
microscope-Lecturecjchchchchcuvuvhc.pptx
famous lake in india and its disturibution and importance

GECCO-2014 Learning Classifier Systems: A Gentle Introduction

  • 1. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC Learning Classifier Systems A Gentle Introduction
  • 2. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC Outline • Introduction  Why? When?  What areas? What Applications? • Learning Classifier Systems  What Learning?  Classifiers?  How do they work?  What decisions?  General principles  Better classifiers  Theory? • Survey of applications 2
  • 4. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 4 a real system with an unknown underlying dynamics Why? What was the goal? if C1 :buy 30 if C2 :sell -2 … evolved rules provide a plausible, human readable, model of the unknown system apply a classifier system online to generate a behavior matched the real system
  • 5. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC To state, in concrete technical form, a model of a complete mind and its several aspects 5 • A cognitive system interacting with an environment • Binary detectors and effectors • Knowledge = set of classifiers • Condition-action rules that recognize a situation and propose an action • Payoff reservoir for the system’s needs • Payoff distributed through an epochal algorithm • Internal memory as message list • Genetic search of classifiers Holland’s Vision, Cognitive System One
  • 6. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 6Holland’s Learning Classifier Systems • Explicit representation of the incoming reward • Good classifiers are the ones that predict high rewards • Credit Assignment using Bucket Brigade • Rule Discovery through a genetic algorithm applied to the entire rule base (on the whole solution) • Description was vast It did not work right off! Very limited success • David E. Goldberg: Computer-aided gas pipeline operation using genetic algorithms and rule learning, PhD thesis. University of Michigan. Ann Arbor, MI. Rule Discovery Component Perceptions Detectors Reward Action Effectors Match Set Classifiers matching the current sensory inputs Population Classifiers representing the current knowledge Evaluation of the actions in the match set Credit Assignment Component 1 2 3
  • 7. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 7Learning System LS-1 & Pittsburgh Classifier Systems Holland models learning as ongoing adaptation process De Jong instead views learning as optimization Genetic algorithms applied to a population of rule sets 1. t := 0 2. Initialize the population P(t) 3. Evaluate the rules sets in P(t) 4. While the termination condition is not satisfied 5. Begin 6. Select the rule sets in P(t) and generate Ps(t) 7. Recombine and mutate the rule sets in Ps(t) 8. P(t+1) := Ps(t) 9. t := t+1 10. Evaluate the rules sets in P(t) 11. End No apportionment of credit Offline evaluation of rule sets Pittsburgh Classifier System
  • 9. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 1970s 1980s 1990s 2000s XCS is born, first results on classification & robotics applications, but interest fades wa Genetic algorithms and CS-1 Research flourishes, success is limited Evolving rules as optimization Research follows Holland’s vision Success is still limited Classifier systems finally work Focus on classification (UCS) Large development of models Facetwise theory and applications Genomic and epidemiological applications Reinforcement Learning & Machine Learning
  • 10. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 10Stewart W. Wilson & The XCS Classifier System 1.Simplify the model 2.Go for accurate predictions not high payoffs 3.Apply the genetic algorithm to subproblems not to the whole problem 4.Focus on classifier systems as reinforcement learning with rule-based generalization 5.Use reinforcement learning (Q-learning) to distribute reward • Wilson, S.W.: Classifier Fitness Based on Accuracy. Evolutionary Computation 3(2), 149-175 (1995). Most developed and studied model so far
  • 12. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC Classification (label prediction) Regression (numerical prediction) Sequential Decision Making
  • 13. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 13 Computational Models of Cognition Complex Adaptive Systems Classification & Data mining Autonomous Robotics Others Traffic controllers Target recognition Fighter maneuvering …
  • 15. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 15
  • 16. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 16 • The goal is to maximize the amount of reward received • How much future reward when at is performed in st? • What is the expected payoff for st and at? • Need to compute a value function, Q(st,at) payoff Learning Classifier Systems as Reinforcement Learning Methods Environment Agent st atrt+1 st+1
  • 17. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC How does reinforcement learning work? Define the inputs, the actions, and how the reward is determined Define the expected payoff Compute a value function Q(st,at) mapping state-action pairs into expected payoffs
  • 18. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 18 • At the beginning, is initialized with random values • At time t, • Parameters,  Discount factor   The learning rate   The action selection strategy How does reinforcement learning work? Then, Q-learning is an option. incoming rewardnew estimate previous value new estimate
  • 19. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC How does reinforcement learning work? Reinforcement learning assumes that Q(st,at) is represented as a table But the real world is complex, the number of possible inputs can be huge! We cannot afford an exact Q(st,at)
  • 20. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 20The Mountain Car Example GOAL Task: drive an underpowered car up a steep mountain road st = position, velocity rt = 0 when goal is reached, -1 otherwise Value Function Q(st,at)
  • 21. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 21What are the issues? •Exact representation infeasible •Approximation mandatory •The function is unknown, it is learnt online from experience Learning an unknown payoff function while also trying to approximate it Approximator works on intermediate estimates While also providing information for the learning Convergence is not guaranteed
  • 22. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC Whats does this have to do with Learning Classifier Systems? They solve reinforcement learning problems Represent the payoff function Q(st, at) as a population of rules, the classifiers Classifiers are evolved while Q(st, at) is learned online
  • 24. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 24 payoff surface for A What is a classifier? IF condition C is true for input s THEN the payoff of action A is p s payoff l u p Condition C(s)=l≤s≤u General conditions covering large portions of the problem space Accurate approximations
  • 25. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 25What types of solutions?
  • 26. how do they work?
  • 27. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC • Two key components • A genetic algorithm works on problem space decomposition (condition-action) • Supervised or reinforcement learning is used for learning local prediction models Problem Space
  • 28. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 28How do learning classifier systems work? The main performance cycle statest Environment Agent
  • 29. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 29How do learning classifier systems work? The main performance cycle statest Environment Agent Population[P] Rulesdescribing thecurrent solution
  • 30. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 30How do learning classifier systems work? The main performance cycle statest Matching Environment Agent Rulesdescribing thecurrent solution Population[P] Ruleswhose conditionmatchst MatchSet [M]
  • 31. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 31How do learning classifier systems work? The main performance cycle statest Matching Environment Agent Rulesdescribing thecurrent solution Population[P] Ruleswhose conditionmatchst MatchSet [M] Action Evaluation PredictionArray Thevalueof each actionin[M]
  • 32. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC How do learning classifier systems work? The main performance cycle 32 statest Matching Environment Agent Rulesdescribing thecurrent solution Population[P] Ruleswhose conditionmatchst MatchSet [M] Action Evaluation PredictionArray Thevalueof each actionin[M] Action Selection ActionSet [A] Rulesin[M] withthe selectedaction.
  • 33. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 33How do learning classifier systems work? The main performance cycle statest Matching Rulesdescribing thecurrent solution Population[P] Ruleswhose conditionmatchst MatchSet [M] Action Evaluation PredictionArray Thevalueof each actionin[M] Action Selection ActionSet [A] Rulesin[M] withthe selectedaction. actionat Environment Agent
  • 34. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 34How do learning classifier systems work? The main performance cycle statest Matching Environment Agent Rulesdescribing thecurrent solution Population[P] Ruleswhose conditionmatchst MatchSet [M] Action Evaluation PredictionArray Thevalueof each actionin[M] Action Selection ActionSet [A] Rulesin[M] withthe selectedaction. actionat The classifiers predict an expected payoff The incoming reward is used to update the rules which helped in getting the reward Any reinforcement learning algorithm can be used to estimate the classifier prediction
  • 35. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 35How do learning classifier systems work? The main performance cycle statest Matching Rulesdescribing thecurrent solution Population[P] Ruleswhose conditionmatchst MatchSet [M] Action Evaluation PredictionArray Thevalueof each actionin[M] Action Selection ActionSet [A] Rulesin[M] withthe selectedaction. actionatrewardrt ActionSet at t-1[A]-1 Rulesin[M] withthe selectedaction. Reinforcement Learning
  • 36. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 36How do learning classifier systems work? The reinforcement component • For each classifier C in [A]-1 the prediction p is updated as follows: P  r +  maxaA PredictionArray(a) p  p + (P- p) • Compare this with Q-learning A rule “corresponds” to Q-table p “corresponds” to the value Q(s,a) P “corresponds” to “r+maxaQ(st,a)”
  • 37. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC Where do classifiers come from? In principle, any search method may be used Evolutionary computation is nice because it is representation “independent” A genetic algorithm select, recombines, mutate existing classifiers to search for better ones
  • 38. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC What are the good classifiers? What is the classifier fitness? The goal is to approximate a target value function with as few classifiers as possible We wish to have an accurate approximation One possible approach is to define fitness as a function of the classifier prediction accuracy
  • 39. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC What about generalization? The genetic algorithm can take care of this General classifiers apply more often, thus they are reproduced more But since fitness is based on classifiers accuracy only accurate classifiers are likely to be reproduced The genetic algorithm evolves maximally general maximally accurate classifiers
  • 41. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 41How to apply learning classifier systems • Determine the inputs, the actions, and how reward is distributed • Determine what is the expected payoff that must be maximized • Decide an action selection strategy • Set up the parameter  Environment Learning Classifier System st rt at • Select a representation for conditions, the recombination and the mutation operators • Select a reinforcement learning algorithm • Setup the parameters, mainly the population size, the parameters for the genetic algorithm, etc. • Parameter 
  • 42. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 42Things can be extremely simple! For instance in supervised classification Environment Learning Classifier System example class 1 if the class is correct 0 if the class is not correct • Select a representation for conditions and the recombination and mutation operators • Setup the parameters, mainly the population size, the parameters for the genetic algorithm, etc.
  • 44. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC An Example… 44 A1 A2 A3 A4 A5 A6 CLASS 1 1 1 1 3 1 1 1 1 1 1 3 2 1 1 1 1 3 2 1 1 1 1 1 3 3 2 1 1 1 2 3 1 2 1 1 2 1 1 1 2 1 1 2 1 1 2 1 0 1 2 1 1 3 1 0 1 2 1 1 4 2 0 1 2 1 2 1 1 1 1 2 1 2 3 1 0 … … … … … … … Six Attributes Severalcases A hidden concept… What is the concept?
  • 45. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC Traditional Approach? • Classification  Trees  C4.5? ID3? CHAID? … • Classification  Rules  CN2? C4.5rules? … • Prediction  Trees  CART? … 45 Task Representation Algorithm
  • 46. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC I Need to Classify, I Want Trees, What Algorithm? ID3? C4.5? CHAID? 46
  • 47. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC I Need to Classify, I Want Rules! What Algorithm? • OneRule if A5 = 3 then 0 (11/19) if A5 = 2 then 0 (11/20) if A5 = 4 then 0 (11/23) if A5 = 1 then 1 (29/0) correct: 91 out of 124 training examples. • Rule Learner if A5 = 4 and A1 = 1 then 0 (1/13) if A5 = 1 then 1 (29/0) if A4 = 2 and A5 = 2 then 0 (1/6) if A1 = 1 and A2 = 2 then 0 (0/10) else 0 (27/29) correct: 87 out of 116 training examples. 47  FOIL is_0(A1,A2,A3,A4,A5,A6) :- a1  a2 and a5  1 Different task, different solution representation? Completely different algorithm!
  • 48. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC Thou shalt have no other model
  • 49. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC Genetics-Based Generalization Accurate Estimates About Classifiers (Powerful RL) Classifier Representation
  • 50. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 50Learning Classifier Systems: One Principle Many Representations Learning Classifier System Genetic Search Estimates RL & MLKnowledge Representation Conditions & Prediction Ternary Conditions 0, 1, # Symbolic Conditions Attribute-Value Conditions Ternary rules 0, 1, # if a5<2 or a1=a2, class=1 if A1=1and A2=11 if A1=2 andA2=21 Ternary Conditions 0, 1, # Attribute-Value Conditions Symbolic Conditions Same framework! Just plug-in your favorite representation
  • 52. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 52 payoff landscape of A What is computed prediction? Replace the prediction p by a parametrized function p(s,w) s payoff l u p(s,w)=w0+sw1 Condition C(s)=l≤s≤u Which Representation? Which type of approximation? Stewart W. Wilson: Classifiers that approximate functions. Natural Computing 1(2-3): 211-234 (2002)
  • 53. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 53Same example with computed prediction No need to change the framework Just plug-in your favorite estimator Linear, Polynomial, NNs, SVMs, tile-coding Lanzi, PL, Extending XCSF beyond linear approximation. GECCO, 2005 Loiacono D. and Lanzi P.L.. Evolving neural networks for classifier prediction with XSCF. ECAI, 2006 Lanzi and Loiacono XCSF with neural prediction. IEEE CEC, 2006. Loiacono, D., Marelli A., and Lanzi PL. Support vector regression for classifier prediction. GECCO, 2007.
  • 54. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC What do we want? Fast learning Learn something as soon as possible! Accurate solutions As the learning proceeds the solution accuracy should improve
  • 55. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC Is there another approach? payoff landscape s payoff l u p(s,w)=w0 p(s,w)=w1s+w0p(s,w)=NN(s,w) Initially, constant prediction may be good As learn proceeds, the solution should improve… … as much as possible. 55
  • 56. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC Ensemble Classifiers 56 None of prediction models is the “best” Let evolution search for the “most suited” NN Almost as fast as using best model Model is adapted effectively in each subspace
  • 58. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC Learning Classifier Systems Representation, Reinforcement Learning, & Genetics-based Search Unified theory is impractical Develop facetwise models
  • 59. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 59Facetwise Models for a Theory of Evolution and Learning • Prof. David E. Goldberg University of Illinois at Urbana Champaign • Facetwise approach for the analysis and the design of genetic algorithms • In learning classifier systems  Separate learning from evolution  Simplify the problem by focusing only on relevant aspect  Derive facetwise models • Applied to model several aspects of evolution
  • 60. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC S([A]) = S([P])/(2-S([P])) Since the genetic algorithm is applied to the action set, there is a generalization pressure regulated by this equation. N>-log(1-θ)/p Given the probability θ to maintain all the subsolutions with occurrence probability p, then the population size N … O(L 2o+a) Time to converge for a problem of L bits order o and with a problem classes Martin V. Butz, Tim Kovacs, Pier Luca Lanzi, Stewart W. Wilson: Toward a theory of generalization and learning in XCS. IEEE Trans. Evolutionary Computation 8(1): 28-46 (2004) Martin V. Butz, Kumara Sastry, David E. Goldberg: Strong, Stable, and Reliable Fitness Pressure in XCS due to Tournament Selection. Genetic Programming and Evolvable Machines 6(1): 53-77 (2005) Martin V. Butz, David E. Goldberg, Pier Luca Lanzi: Bounding Learning Time in XCS. GECCO (2) 2004: 739-750
  • 62. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC What the Advanced Topics? • Models Tailored for Applications Anticipatory Behavior (ACS) Data Mining, classification and prediction (e.g., UCS) Epidemiology (e.g., EpiCS, EpiXCS) ExSTraCS (Bioinformatics) • Improved representations of conditions (GP, GEP, …) • Improved representations of actions (GP, Code Fragments) • Improved genetic search (EDAs, ECGA, BOA, ….) • Improved estimators • Scalability Matching Distributed models 62
  • 64. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 64 Computational Models of Cognition Complex Adaptive Systems Classification & Data mining Autonomous Robotics Others Traffic controllers Target recognition Fighter maneuvering …
  • 66. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 66What Applications? Computational Models of Cognition • Learning classifier system model certain aspects of cognition  Human language learning  Perceptual category learning  Affect theory  Anticipatory and latent learning • Learning classifier systems provide good models for animals in experiments in which the subjects must learn internal models to perform as well as they do • Martin V. Butz, University of Würzburg, Department of Cognitive Psychology III Cognitive Bodyspaces: Learning and Behavior (COBOSLAB) • Wolfgang Stolzmann, Daimler Chrysler • Rick R. Riolo, University of Michigan, Center for the Study of Complex Systems
  • 67. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 67References • Butz, M.V.: Anticipatory Learning Classifier Systems, Genetic Algorithms and Evolutionary Computation, vol. 4. Springer-Verlag (2000) • Riolo, R.L.: Lookahead Planning and Latent Learning in a Classifier System. In: J.A. Meyer, S.W. Wilson (eds.) From Animals to Animats 1. Proceedings of the First International Conferenceon Simulation of Adaptive Behavior (SAB90), pp. 316{326. A Bradford Book. MIT Press (1990) • Stolzmann, W. and Butz, M.V. and Hoffman, J. and Goldberg, D.E.: First Cognitive Capabilities in the Anticipatory Classifier System. In: From Animals to Animats: Proceedings of the Sixth International Conference on Simulation of Adaptive Behavior. MIT Press (2000)
  • 69. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 69What Applications? Computational Economics • To models one single agent acting in the market (BW Arthur, JH Holland, B LeBaron) • To model many interactive agents each one controlled by its own classifier system. • Modeling the behavior of agents trading risk free bonds and risky assets • Different trader types modeled by supplying different input information sets to a group of homogenous agents • Later extended to a multi-LCS architecture applied to portfolio optimization • Technology startup company founded in March 2005
  • 70. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 70References • Sor Ying (Byron) Wong, Sonia Schulenburg: Portfolio allocation using XCS experts in technical analysis, market conditions and options market. GECCO (Companion) 2007: 2965-2972 • Sonia Schulenburg, Peter Ross: An Adaptive Agent Based Economic Model. Learning Classifier Systems 1999: 263-282 • BW Arthur, J.H. Holland, B. LeBaron, R. Palmer, and P. Tayler: "Asset Pricing Under Endogenous Expectations in an Artificial Stock Market,“ in The Economy as an Evolving Complex System II. Edited (with S. Durlauf and D. Lane), Addison-Wesley, 1997. • BW Arthur, R. Palmer, J. Holland, B. LeBaron, and P. Taylor: "Artificial Economic Life: a Simple Model of a Stockmarket,“ Physica D, 75, 264-274, 1994
  • 72. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 72What Applications? Classification and Data Mining • Bull, L. (ed) Applications of Learning Classifier Systems. Springer. (2004) • Bull, L., Bernado Mansilla, E. & Holmes, J. (eds) Learning Classifier Systems in Data Mining. Springer. (2008) • Nowadays, by far the most important application domain for LCSs • Many models GA-Miner, REGAL, GALE GAssist • Performance comparable to state of the art machine learning Human Competitive Results 2007 X Llorà, R Reddy, B Matesic, R Bhargava: Towards Better than Human Capability in Diagnosing Prostate Cancer Using Infrared Spectroscopic Imaging
  • 74. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 74What Applications? Hyper-Heuristics • Ross P., Marin-Blazquez J., Schulenburg S., and Hart E., Learning a Procedure that can Solve Hard Bin-packing Problems: A New GA-Based Approach to Hyper-Heuristics. In Proceedings of GECCO 2003 • Bin-packing and timetabling problems • Pick a set of non-evolutionary heuristics • Use classifier system to learn a solution process not a solution • The classifier system learns a sequence of heuristics which should be applied to gradually transform the problem from its initial state to its final solved state.
  • 76. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 76What Applications? Epidemiologic Surveillance • John H. Holmes Center for Clinical Epidemiology & Biostatistics Department of Biostatistics & Epidemiology University of Pennsylvania - School of Medicine • Epidemiologic surveillance data need adaptivity to abrupt changes • Readable rules are attractive • Performance similar to state of the art machine learning • But several important feature-outcome relationships missed by other methods were discovered • Similar results were reported by Stewart Wilson for breast cancer data
  • 77. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 77References • John H. Holmes, Jennifer A. Sager: Rule Discovery in Epidemiologic Surveillance Data Using EpiXCS: An Evolutionary Computation Approach. AIME 2005: 444-452 • John H. Holmes, Dennis R. Durbin, Flaura K. Winston: A New Bootstrapping Method to Improve Classification Performance in Learning Classifier Systems. PPSN 2000: 745-754 • John H. Holmes, Dennis R. Durbin, Flaura K. Winston: The learning classifier system: an evolutionary computation approach to knowledge discovery in epidemiologic surveillance. Artificial Intelligence in Medicine 19(1): 53-74 (2000)
  • 79. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 79What Applications? Autonomous Robotics • In the 1990's, a major testbed for learning classifier systems. • Marco Dorigo and Marco Colombetti: Robot Shaping An Experiment in Behavior Engineering, 1997 • They introduced the concept of robot shaping defined as the incremental training of an autonomous agent. • Behavior engineering methodology named BAT: Behavior Analysis and Training. • University of West England applied several learning classifier system models to several robotics problems
  • 81. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 81What Applications? Modeling Artificial Ecosystems • Jon McCormack, Monash University • Eden: an interactive, self-generating, artificial ecosystem. • World populated by collections of evolving virtual creatures • Creatures move about the environment,  Make and listen to sounds,  Foraging for food,  Encountering predators,  Mating with each other. • Creatures evolve to fit their landscape • Eden has four seasons per year (15mins) • Simple physics for rocks, biomass and sonic animals. Jon McCormack
  • 82. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 82Eden: An Evolutionary Sonic Ecosystem
  • 83. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 83References • McCormack, J. Impossible Nature The Art of Jon McCormack Published by the Australian Centre for the Moving Image ISBN 1 920805 08 7, ISBN 1 920805 09 5 (DVD) • J. McCormack: New Challenges for Evolutionary Music and Art, ACM SIGEVOlution Newsletter, Vol. 1(1), April 2006, pp. 5-11. • McCormack, J. 2005, 'On the Evolution of Sonic Ecosystems' in Adamatzky, et al. (eds), Artificial Life Models in Software, Springer, Berlin. • McCormack, J. 2003, 'Evolving Sonic Ecosystems', Kybernetes, 32(1/2), pp. 184-202.
  • 85. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 85What Applications? Chemical and Neuronal Networks • L. Bull, A. Budd, C. Stone, I. Uroukov, B. De Lacy Costello and A. Adamatzky University of the West of England • Behaviour of non-linear media controlled automatically through evolutionary learning • Unconventional computing realised by such an approach. • Learning classifier systems  Control a light-sensitive sub-excitable Belousov-Zhabotinski reaction  Control the electrical stimulation of cultured neuronal networks
  • 86. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 86What Applications? Chemical and Neuronal Networks • To control a light-sensitive sub-excitable BZ reaction, pulses of wave fragments are injected into the checkerboard grid resulting in rich spatio- temporal behaviour • Learning classifier system can direct the fragments to an arbitrary position through control of the light intensity within each cell • Learning Classifier Systems control the electrical stimulation of cultured neuronal networks such that they display elementary learning, respond to a given input signal in a pre-specified way • Results indicate that the learned stimulation protocols identify seemingly fundamental properties of in vitro neuronal networks
  • 87. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 87References • Larry Bull, Adam Budd, Christopher Stone, Ivan Uroukov, Ben De Lacy Costello and Andrew Adamatzky: Towards Unconventional Computing through Simulated Evolution: Learning Classifier System Control of Non-Linear Media Artificial Life (to appear) • Budd, A., Stone, C., Masere, J., Adamatzky, A., DeLacyCostello, B., Bull, L.: Towards machine learning control of chemical computers. In: A. Adamatzky, C. Teuscher (eds.) From Utopian to Genuine Unconventional Computers, pp. 17-36. Luniver Press • Bull, L., Uroukov, I.S.: Initial results from the use of learning classier systems to control n vitro neuronal networks. In: Lipson [189], pp. 369-376
  • 89. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC 89Conclusions • Cognitive Modeling • Complex Adaptive Systems • Machine Learning • Reinforcement Learning • Metaheuristics • …  Many blocks to plug-in  Several representations  Several RL algorithms  Several evolutionary methods  …
  • 90. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC Additional Information • WWW  http://gbml.org  http://www.illigal.org  UWE Learning Classifier Systems Group http://www.cems.uwe.ac.uk/lcsg/  A Learning Classifier Systems Bibliography http://www.cs.bris.ac.uk/~kovacs/lcs/search.html • Mailing lists  lcs-and-gbml group @ Yahoo • Proceedings of the International Workshop on Learning Classifier Systems (Lanzi, Stolzmann, & Wilson, 2000, 2001, 2002; Kovacs, Llorà, & Takadama, 2003-2005; Bacardit, Bernadó-Mansilla, Butz, Kovacs, Llorà, Takadama, IWLCS2007) • IWLCS here! (too bad if you did not come) 90
  • 91. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC Books • Bull, L. (Ed.). Applications of learning classifier systems. Berlin Heidelberg: Springer- Verlag. • Butz, M. V. (2002). Anticipatory learning classifier systems. Kluwer Academic Publishers, Boston, MA. • Butz, M. V. (2006). Rule-based evolutionary online learning systems: A principled approach to LCS analysis and design. Studies in Fuzziness and Soft Computing Series, Springer Verlag, Berlin Heidelberg, Germany. • Bull, L. & Kovacs, T. (Eds.) (2005). Foundations of learning classifier systems. Berlin Heidelberg: Springer-Verlag. • Lanzi, P. L., Stolzmann, W., & Wilson, S. W. (Eds.) (2000). Learning classifier systems: From foundations to applications (LNAI 1813). Berlin Heidelberg: Springer-Verlag. • Drugowitsch, J., (2008) Design and Analysis of Learning Classifier Systems: A Probabilistic Approach, Springer-Verlag • Goldberg, D. E. (1989). Genetic algorithms in search, optimization & machine learning. Addison-Wesley. • Holland, J.H. (1975). Adaptation in natural and artificial systems. University of Michigan Press. • Kovacs, T. (2004). Strength of accuracy: Credit assignment in learning classifier systems. Berlin Heidelberg: Springer-Verlag. 91
  • 92. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC Software • http://www.illigal.org Martin Butz’s ACS & XCS in C and Java Pier Luca Lanzi’s C++ XCSLib (XCS and XCSF) • http://medal.cs.umsl.edu/files/XCSFJava1.1.zip Martin Butz’s XCSF in Java • http://www.ryanurbanowicz.com Ryan’s Urbanowicz’s ExSTraCS • Educational LCS (eLCS) http://sourceforge.net/projects/educationallcs/  Includes a basic guide for each implementation, which progressively adds major components of a Michigan-Style LCS algorithm.  Code intended to be paired with the first LCS introductory textbook written by Will Browne. 92
  • 93. Pier Luca Lanzi - GECCO-2014, July 12-16, 2014 Vancouver BC Thank you! Questions?