Random Artificial Incorporation of Noise
    in a Learning Classifier System
             Environment

   Ryan J. Urbanowicz, Nicholas A. Sinnott-
      Armstrong, and Jason H. Moore
             Dartmouth Medical School




             GECCO Dublin, Ireland - 2011
Genetic Epidemiology
             • Association Study (Case/Control)
             • Single Nucleotide Polymorphism (SNP)
             • Allele & Genotype



Subject #1             Subject #2           Subject #3
-- AGGTCA --            -- AGGTCA --        -- AGCTCA --
-- AGGTCA --            -- AGCTCA --        -- AGCTCA --

             Two alleles (G and C)
             Three genotypes (GG, GC, CC)
             Encode genotypes (0, 1, 2)
“One SNP at a time approach”




“Complex systems approach”
Main Effects
Disease   Disease   Disease
                                                AA(.25)   Aa(.5)   aa(.25)
 SNP1       SNP2     SNP3             SNP X       0         0        1




Epistasis                                                 SNP 1
                                                AA(.25)   Aa(.5)   aa(.25)
          Disease
                                      BB(.25)      0        1         0            0.5

                              SNP 2   Bb(.5)       1        0         1            0.5
                                      bb(.25       0        1         0            0.5
                                                  0.5      0.5       0.5

                                                                               Marginal
                                                                             Penetrance(s)

 SNP1       SNP2      SNP3
Genetic Heterogeneity

                             Sample population




                        G3     G4    G5                    G7      G8
G1   G2
                                                   G6

          • Evidence of GH in…
             –   Autism               –   Tuberous sclerosis
             –   Schizophrenia        –   Cystic Fibrosis
             –   Breast Cancer        –   Asthma
             –   Alzheimer disease    –   And many, many others…
Learning Classifier Systems
       4
                               2                  Population [P]
           Covering
                                   Classifiern = Condition : Action :: Parameter(s)                   Detectors

                                3
                                                                 Action 5                                1
     10                        Match Set [M]                     Selection                  9

           Genetic                  Classifierm                                                     Environment
           Algorithm                              Prediction Array


                                6
                                                                                 Action Performed
                               Action Set [A]                                                         Effectors
                                    Classifiera                                           7

                                    8                                        Reward
              [A]t-1                         Learning Strategy
           Classifiert-1                          Credit Assignment

                                                                                          • Autonomous Robotics
              Discovery Component                                                         • Complex Adaptive Systems
              Performance Component                                                       • Function Approximation
              Reinforcement Component                                                     • Classification
                                                                                          • Data Mining
Urbanowicz 2009 LCS: A Complete Introduction, Review, and Roadmap. Journal of Artificial Evolution and Applications
Random Artificial Incorporation of Noise in a Learning Classifier System Environment.
Effective Generalization
• Effective Generalization – Maximizing rule generality while
  preserving accuracy. (Testing Acc. = Training Acc.)

• Examples of LCS Generalization                         10###00### - 1
    –   Generalization Hypothesis (Wilson 1995)          02##1##### - 0
    –   Action Set GA & Subsumption (Wilson 1998)
    –   Hierarchical Selection operator (Bacardit/Garrell 2002)
    –   Windowing (Bacardit et. al. 2004)
    –   Minimum Description Length (Bacardit/Garrell 2007)
    –   Ensemble LCS (Gao et. al. 2005)

• Noisy Problem domains –
    – Over-fitting become a particularly important problem
    – Classification Noise - < 100% testing accuracy possible
    – Attribute Noise – attributes which contribute nothing to testing accuracy.
Hypothesis:
• Given: a noisy problem, LCS’s with accuracy-based fitness will tend
  to over-fit (learn structure idiosyncratic to the training dataset).

• Consider: datasets with a small sample size would be particularly
  susceptible to this (online learning repeatedly considers same
  samples).

• If: we probabilistically incorporate variable noise into the incoming
  training instances, than every epoch of learning, the Michigan LCS is
  exposed to a randomly permuted version of the original dataset.
  Leads to an artificially inflated sample size.

• Hypothesis: The incorporation of low levels of random
  classification noise will discourage over-fitting, and promote
  effective generalization
RAIN
           Random Artificial Incorporation of Noise




 Environment


0210110220 -- 1                 0210120220 -- 1


Population [P]




                        Match Set [M]        Correct Set [C]
Temporal Models
• Pm = Maximum Permutation Prob.
•Pc = Current Permutation Prob.
•Im = Maximum Iteration
•Ic = Current Iteration


•Uniform


•Linear


•Inverse Linear


•Gaussian
Power Estimation
                                                                       SNP 2
                                                 Model 1         2       1        0
1 234...                                                     2   0       1        0




                                                     SNP 1
                                                             1   1       0        1
1   0   #   #   #   0   0   #   #   #   -   1
                                                             0   0       1        0
0   2   #   #   1   #   #   #   #   #   -   0
#   #   #   1   0   2   1   #   #   #   -   0
#   #   1   0   #   #   #   #   #   #   -   0                          SNP 4
0   1   #   #   #   #   #   #   2   #   -   1    Model 2         2       1        0
#   #   0   0   #   #   #   0   0   1   -   1
                                                             2   0       0        0




                                                     SNP 3
1   2   0   0   1   #   #   #   #   2   -   1
1   1   1   #   #   #   #   #   #   #   -   0                1   0       0        0

2   #   2   #   #   1   #   0   #   #   -   0                0   0       0        1
#   1   #   #   #   #   #   #   #   #   -   1
#   #   1   1   #   #   #   2   #   #   -   0

5556889899
                                                   Power at the CV level:
                                                             > 50% of the 10 CV runs
Targeted RAIN
• Idea: Strategically and automatically avoid destructively adding noise to
  attributes more likely to be important to classification.

• Probabilistically targets attributes which are more frequently generalized
  (rather than specified)

• Pc = Pm

• Two Implementations (Weight lists generated differently)
    – Targeted Generality (TG)
    – Targeted Fitness Weighted Generality (TFWG)

• Noise Generation (same for both implementations)
    –   First epoch – no noise
    –   Weight list recalculated at the end of each epoch.
    –   Subtract minimum weight in list from all values in list.
    –   Determine number of attributes to be permuted (Random < Pm)
    –   Choose attribute - Roulette wheel selection
Experimental Evaluation
•   UCS –
     – Iterations = [50000,100000, 200000, 500000]
     – Micro Pop. Size = 1600
     – Other parameters are default
     – Track: Training Acc. Testing Acc. Generality, Macro Pop. Size, Run Time
       Power to find both or a single underlying model
     – Pm = 0.001, 0.01, 0.05, 0.1

•   Each Dataset
     –   Main effect free
     –   2X two-locus epistatic interation
     –   20 Attributes
     –   Balanced
     –   Minor allele frequencies = 0.2                    G1 G2                 G3 G4
     –   Heritability = 0.2
     –   Mix Ratio = 50:50
     –   Sample Sizes [200, 400, 800, 1600]
     –   20 Replicates

•   80 Simulated Datasets + (10 Fold CV)  800 runs of UCS
Random Artificial Incorporation of Noise in a Learning Classifier System Environment.
Random Artificial Incorporation of Noise in a Learning Classifier System Environment.
Conclusions & Future Work
•   Incorporation of RAIN with equal attribute probability is ineffective.

•   Targeted RAIN was able to reduce over-fitting (sig. decrease in training accuracy
    without reducing testing accuracy.

•   Improvements in power (not significant) suggest that RAIN may improve UCS’s
    ability to identify predictive attributes.




•   Try RAIN on datasets with much larger numbers of attributes

•   Consider the combination of targeted RAIN with temporal models

•   Explore a larger range of Pm values

•   Implement RAIN with an adaptive Pm
Acknowledgements


         Jason Moore &
  Nicholas A. Sinnott-Armstrong

            Funding Support
  NIH: AI59694, LM009012, LM010098
  William H. Neukom 1964 Institute for
Computational Science at Dartmouth College
Random Artificial Incorporation of Noise in a Learning Classifier System Environment.
Quaternary Rule Representation

         [ #, 0, 1, 2]
Random Artificial Incorporation of Noise in a Learning Classifier System Environment.
Random Artificial Incorporation of Noise in a Learning Classifier System Environment.
Random Artificial Incorporation of Noise in a Learning Classifier System Environment.
Random Artificial Incorporation of Noise in a Learning Classifier System Environment.

More Related Content

PDF
Defect, defect, defect: PROMISE 2012 Keynote
PDF
Scientific and Grid Workflow Management (SGS09)
PDF
Substructrual surrogates for learning decomposable classification problems: i...
PDF
Modeling XCS in class imbalances: Population sizing and parameter settings
PDF
Hinf6210 Project Classification Of Breast Cancer Dataset
PDF
Welsh_BioHDF_BOSC2009
PPTX
Decision making in management for large medical equipment
PPTX
Decision making in management for large medical equipment
Defect, defect, defect: PROMISE 2012 Keynote
Scientific and Grid Workflow Management (SGS09)
Substructrual surrogates for learning decomposable classification problems: i...
Modeling XCS in class imbalances: Population sizing and parameter settings
Hinf6210 Project Classification Of Breast Cancer Dataset
Welsh_BioHDF_BOSC2009
Decision making in management for large medical equipment
Decision making in management for large medical equipment

Similar to Random Artificial Incorporation of Noise in a Learning Classifier System Environment. (20)

PDF
Abraham q3 2008
PDF
Nanometer Testing: Challenges and Solutions
PDF
Anvita Audio Classification Presentation
PDF
Protein Structure Prediction using Coarse Grain Force Fields
PPT
Qualifications And Experience Presentation
PDF
monsanto 08-23-05a
PDF
Poster Presentation on "Artifact Characterization and Removal for In-Vivo Neu...
PDF
Integrated Yield Management Solutions
PDF
Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...
PPTX
Ph d colloquium 2
PPT
Creating a SNP calling pipeline
PDF
2007 03-16 modeling and static analysis of complex biological systems dsr
PDF
Next-generation sequencing course, part 1: technologies
PDF
"Genome-Wide Annotation Prediction with SVD Truncation based on ROC Analysis"...
PDF
An Overview to Protein bioinformatics
PDF
GeneArt® services - Gene synthesis through protein production
PDF
General pipeline of transcriptomics analysis
PDF
Leveraging The Open Provenance Model as a Multi-Tier Model for Global Climate...
PPTX
Principal components analysis(pca)fulledited
PDF
ISMB2014読み会 イントロ + Deep learning of the tissue-regulated splicing code
Abraham q3 2008
Nanometer Testing: Challenges and Solutions
Anvita Audio Classification Presentation
Protein Structure Prediction using Coarse Grain Force Fields
Qualifications And Experience Presentation
monsanto 08-23-05a
Poster Presentation on "Artifact Characterization and Removal for In-Vivo Neu...
Integrated Yield Management Solutions
Basic Aspects of Microarray Technology and Data Analysis (UEB-UAT Bioinformat...
Ph d colloquium 2
Creating a SNP calling pipeline
2007 03-16 modeling and static analysis of complex biological systems dsr
Next-generation sequencing course, part 1: technologies
"Genome-Wide Annotation Prediction with SVD Truncation based on ROC Analysis"...
An Overview to Protein bioinformatics
GeneArt® services - Gene synthesis through protein production
General pipeline of transcriptomics analysis
Leveraging The Open Provenance Model as a Multi-Tier Model for Global Climate...
Principal components analysis(pca)fulledited
ISMB2014読み会 イントロ + Deep learning of the tissue-regulated splicing code
Ad

More from Daniele Loiacono (20)

PDF
GPUs for GEC Competition @ GECCO-2013
PDF
EvoRobocode Competition @ GECCO-2013
PDF
2013 Simulated Car Racing @ GECCO-2013
PDF
2012 Simulated Car Racing Championship @ CIG-2012
PDF
2012 Simulated Car Racing Championship @ GECCO-2012
PDF
2012 Simulated Car Racing Championship @ Evo*-2012
PDF
Computational Intelligence in Games Tutorial @GECCO2012
PPTX
XCSF with Local Deletion: Preventing Detrimental Forgetting
PDF
Testing learning classifier systems
PPTX
One Step Fits All
PDF
Introducing LCS to Digital Design Verification
PPTX
A temporal classifier system using spiking neural networks
PDF
Confusion Matrices for Improving Performance of Feature Pattern Classifier Sy...
PDF
Automatically Defined Functions for Learning Classifier Systems
PDF
Voting Based Learning Classifier System for Multi-Label Classification
PDF
2011 Simulated Car Racing Championship @ GECCO-2011
PDF
2010 Simulated Car Racing Championship @ CIG-2010
PDF
2010 Simulated Car Racing Championship @ GECCO-2010
PDF
2010 Simulated Car Racing Championship @ WCCI-2010
PDF
Car Setup Optimization Competition @ EvoStar 2010
GPUs for GEC Competition @ GECCO-2013
EvoRobocode Competition @ GECCO-2013
2013 Simulated Car Racing @ GECCO-2013
2012 Simulated Car Racing Championship @ CIG-2012
2012 Simulated Car Racing Championship @ GECCO-2012
2012 Simulated Car Racing Championship @ Evo*-2012
Computational Intelligence in Games Tutorial @GECCO2012
XCSF with Local Deletion: Preventing Detrimental Forgetting
Testing learning classifier systems
One Step Fits All
Introducing LCS to Digital Design Verification
A temporal classifier system using spiking neural networks
Confusion Matrices for Improving Performance of Feature Pattern Classifier Sy...
Automatically Defined Functions for Learning Classifier Systems
Voting Based Learning Classifier System for Multi-Label Classification
2011 Simulated Car Racing Championship @ GECCO-2011
2010 Simulated Car Racing Championship @ CIG-2010
2010 Simulated Car Racing Championship @ GECCO-2010
2010 Simulated Car Racing Championship @ WCCI-2010
Car Setup Optimization Competition @ EvoStar 2010
Ad

Recently uploaded (20)

PPTX
Wheat allergies and Disease in gastroenterology
PPTX
HYPERSENSITIVITY REACTIONS - Pathophysiology Notes for Second Year Pharm D St...
PPTX
Post Op complications in general surgery
PPTX
Neonate anatomy and physiology presentation
PDF
Comparison of Swim-Up and Microfluidic Sperm Sorting.pdf
PPT
Dermatology for member of royalcollege.ppt
PDF
OSCE SERIES ( Questions & Answers ) - Set 5.pdf
PPTX
y4d nutrition and diet in pregnancy and postpartum
PPTX
Neoplasia III.pptxjhghgjhfj fjfhgfgdfdfsrbvhv
PPTX
IMAGING EQUIPMENiiiiìiiiiiTpptxeiuueueur
PDF
Transcultural that can help you someday.
PPTX
NRP and care of Newborn.pptx- APPT presentation about neonatal resuscitation ...
PDF
SEMEN PREPARATION TECHNIGUES FOR INTRAUTERINE INSEMINATION.pdf
PPTX
Electrolyte Disturbance in Paediatric - Nitthi.pptx
PPT
neurology Member of Royal College of Physicians (MRCP).ppt
PDF
AGE(Acute Gastroenteritis)pdf. Specific.
PPTX
Effects of lipid metabolism 22 asfelagi.pptx
PDF
The Digestive System Science Educational Presentation in Dark Orange, Blue, a...
PDF
OSCE SERIES - Set 7 ( Questions & Answers ).pdf
PPTX
CARDIOVASCULAR AND RENAL DRUGS.pptx for health study
Wheat allergies and Disease in gastroenterology
HYPERSENSITIVITY REACTIONS - Pathophysiology Notes for Second Year Pharm D St...
Post Op complications in general surgery
Neonate anatomy and physiology presentation
Comparison of Swim-Up and Microfluidic Sperm Sorting.pdf
Dermatology for member of royalcollege.ppt
OSCE SERIES ( Questions & Answers ) - Set 5.pdf
y4d nutrition and diet in pregnancy and postpartum
Neoplasia III.pptxjhghgjhfj fjfhgfgdfdfsrbvhv
IMAGING EQUIPMENiiiiìiiiiiTpptxeiuueueur
Transcultural that can help you someday.
NRP and care of Newborn.pptx- APPT presentation about neonatal resuscitation ...
SEMEN PREPARATION TECHNIGUES FOR INTRAUTERINE INSEMINATION.pdf
Electrolyte Disturbance in Paediatric - Nitthi.pptx
neurology Member of Royal College of Physicians (MRCP).ppt
AGE(Acute Gastroenteritis)pdf. Specific.
Effects of lipid metabolism 22 asfelagi.pptx
The Digestive System Science Educational Presentation in Dark Orange, Blue, a...
OSCE SERIES - Set 7 ( Questions & Answers ).pdf
CARDIOVASCULAR AND RENAL DRUGS.pptx for health study

Random Artificial Incorporation of Noise in a Learning Classifier System Environment.

  • 1. Random Artificial Incorporation of Noise in a Learning Classifier System Environment Ryan J. Urbanowicz, Nicholas A. Sinnott- Armstrong, and Jason H. Moore Dartmouth Medical School GECCO Dublin, Ireland - 2011
  • 2. Genetic Epidemiology • Association Study (Case/Control) • Single Nucleotide Polymorphism (SNP) • Allele & Genotype Subject #1 Subject #2 Subject #3 -- AGGTCA -- -- AGGTCA -- -- AGCTCA -- -- AGGTCA -- -- AGCTCA -- -- AGCTCA -- Two alleles (G and C) Three genotypes (GG, GC, CC) Encode genotypes (0, 1, 2)
  • 3. “One SNP at a time approach” “Complex systems approach”
  • 4. Main Effects Disease Disease Disease AA(.25) Aa(.5) aa(.25) SNP1 SNP2 SNP3 SNP X 0 0 1 Epistasis SNP 1 AA(.25) Aa(.5) aa(.25) Disease BB(.25) 0 1 0 0.5 SNP 2 Bb(.5) 1 0 1 0.5 bb(.25 0 1 0 0.5 0.5 0.5 0.5 Marginal Penetrance(s) SNP1 SNP2 SNP3
  • 5. Genetic Heterogeneity Sample population G3 G4 G5 G7 G8 G1 G2 G6 • Evidence of GH in… – Autism – Tuberous sclerosis – Schizophrenia – Cystic Fibrosis – Breast Cancer – Asthma – Alzheimer disease – And many, many others…
  • 6. Learning Classifier Systems 4 2 Population [P] Covering Classifiern = Condition : Action :: Parameter(s) Detectors 3 Action 5 1 10 Match Set [M] Selection 9 Genetic Classifierm Environment Algorithm Prediction Array 6 Action Performed Action Set [A] Effectors Classifiera 7 8 Reward [A]t-1 Learning Strategy Classifiert-1 Credit Assignment • Autonomous Robotics Discovery Component • Complex Adaptive Systems Performance Component • Function Approximation Reinforcement Component • Classification • Data Mining Urbanowicz 2009 LCS: A Complete Introduction, Review, and Roadmap. Journal of Artificial Evolution and Applications
  • 8. Effective Generalization • Effective Generalization – Maximizing rule generality while preserving accuracy. (Testing Acc. = Training Acc.) • Examples of LCS Generalization 10###00### - 1 – Generalization Hypothesis (Wilson 1995) 02##1##### - 0 – Action Set GA & Subsumption (Wilson 1998) – Hierarchical Selection operator (Bacardit/Garrell 2002) – Windowing (Bacardit et. al. 2004) – Minimum Description Length (Bacardit/Garrell 2007) – Ensemble LCS (Gao et. al. 2005) • Noisy Problem domains – – Over-fitting become a particularly important problem – Classification Noise - < 100% testing accuracy possible – Attribute Noise – attributes which contribute nothing to testing accuracy.
  • 9. Hypothesis: • Given: a noisy problem, LCS’s with accuracy-based fitness will tend to over-fit (learn structure idiosyncratic to the training dataset). • Consider: datasets with a small sample size would be particularly susceptible to this (online learning repeatedly considers same samples). • If: we probabilistically incorporate variable noise into the incoming training instances, than every epoch of learning, the Michigan LCS is exposed to a randomly permuted version of the original dataset. Leads to an artificially inflated sample size. • Hypothesis: The incorporation of low levels of random classification noise will discourage over-fitting, and promote effective generalization
  • 10. RAIN Random Artificial Incorporation of Noise Environment 0210110220 -- 1 0210120220 -- 1 Population [P] Match Set [M] Correct Set [C]
  • 11. Temporal Models • Pm = Maximum Permutation Prob. •Pc = Current Permutation Prob. •Im = Maximum Iteration •Ic = Current Iteration •Uniform •Linear •Inverse Linear •Gaussian
  • 12. Power Estimation SNP 2 Model 1 2 1 0 1 234... 2 0 1 0 SNP 1 1 1 0 1 1 0 # # # 0 0 # # # - 1 0 0 1 0 0 2 # # 1 # # # # # - 0 # # # 1 0 2 1 # # # - 0 # # 1 0 # # # # # # - 0 SNP 4 0 1 # # # # # # 2 # - 1 Model 2 2 1 0 # # 0 0 # # # 0 0 1 - 1 2 0 0 0 SNP 3 1 2 0 0 1 # # # # 2 - 1 1 1 1 # # # # # # # - 0 1 0 0 0 2 # 2 # # 1 # 0 # # - 0 0 0 0 1 # 1 # # # # # # # # - 1 # # 1 1 # # # 2 # # - 0 5556889899 Power at the CV level: > 50% of the 10 CV runs
  • 13. Targeted RAIN • Idea: Strategically and automatically avoid destructively adding noise to attributes more likely to be important to classification. • Probabilistically targets attributes which are more frequently generalized (rather than specified) • Pc = Pm • Two Implementations (Weight lists generated differently) – Targeted Generality (TG) – Targeted Fitness Weighted Generality (TFWG) • Noise Generation (same for both implementations) – First epoch – no noise – Weight list recalculated at the end of each epoch. – Subtract minimum weight in list from all values in list. – Determine number of attributes to be permuted (Random < Pm) – Choose attribute - Roulette wheel selection
  • 14. Experimental Evaluation • UCS – – Iterations = [50000,100000, 200000, 500000] – Micro Pop. Size = 1600 – Other parameters are default – Track: Training Acc. Testing Acc. Generality, Macro Pop. Size, Run Time Power to find both or a single underlying model – Pm = 0.001, 0.01, 0.05, 0.1 • Each Dataset – Main effect free – 2X two-locus epistatic interation – 20 Attributes – Balanced – Minor allele frequencies = 0.2 G1 G2 G3 G4 – Heritability = 0.2 – Mix Ratio = 50:50 – Sample Sizes [200, 400, 800, 1600] – 20 Replicates • 80 Simulated Datasets + (10 Fold CV)  800 runs of UCS
  • 17. Conclusions & Future Work • Incorporation of RAIN with equal attribute probability is ineffective. • Targeted RAIN was able to reduce over-fitting (sig. decrease in training accuracy without reducing testing accuracy. • Improvements in power (not significant) suggest that RAIN may improve UCS’s ability to identify predictive attributes. • Try RAIN on datasets with much larger numbers of attributes • Consider the combination of targeted RAIN with temporal models • Explore a larger range of Pm values • Implement RAIN with an adaptive Pm
  • 18. Acknowledgements Jason Moore & Nicholas A. Sinnott-Armstrong Funding Support NIH: AI59694, LM009012, LM010098 William H. Neukom 1964 Institute for Computational Science at Dartmouth College