SlideShare a Scribd company logo
Analysing a Complex Agent-Based Model 
Using Data-Mining Techniques 
Claire Little, Bruce Edmonds 
Centre for Policy Modelling 
Manchester Metropolitan University 
Ed Fieldhouse, Laurence Lessard-Phillips 
Institute for Social Change 
University of Manchester
An “Inconvenient Truth” 
• That the universe is not arranged for our benefit 
(as researchers studying it) 
• in other words, that assumptions such as the following 
are likely to be wrong: 
– Our planet is the centre of the universe 
– Risky events follow a normal distribution 
– Humans act as if they followed a simple algorithm 
– Society can be understood using simple, universal prinicples 
– etc. etc. 
• In particular, the assumption that there will always (if 
you look hard enough) be models that are: 
– (a) simple enough for us to understand and 
– (b) adequate to what we want to model 
• …is wrong 
Complexity and Context-Dependency, Bruce Edmonds, ECCS, Lisbon, Sept 2010. slide-2
The Alternative 
• Thus consider the alternative, more realistic, 
situation where one is facing some phenomena 
where any model that is adequate (w.r.t. our goals) 
will be too complex for us to completely understand 
• Instead of indulging in wishful thinking this paper 
looks at ways forward under complexity 
• In other words, if we have a simulation model that is 
too complex to completely understand, how can we 
obtain some useful understanding of its 
properties… 
• …and hence use it to leverage some 
understanding/control over the target phenomena 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 3
The Broad Idea 
1. Make relatively complex simulations based 
on available evidence (a “KIDS” approach) 
2. Analyse this simulation in a number of 
ways, including… 
3. Data mining output data in a more holistic 
manner over a broad “space” of settings 
4. Look for patterns in the data that suggest 
(maybe context-dependent) hypotheses 
5. These are candidates for simpler (but 
maybe partial) models of the simulation 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 4
The Model 
• To explore the complex 
mix of factors, 
structures and 
processes that affect 
whether people vote 
• An agent-based model, 
with demographics and 
dynamic social 
networks 
• Was formulated using a 
mixture of qualitative, 
survey data and others’ 
expert opinion/results 
Class 
Age 
Activities 
Ethnicity 
Etc. 
Level-of-Political-Interest 
A Household 
Discuss-politics-with person-23 blue expert=false 
neighbour-network year=10 month=3 
Lots-family-discussions year=10 month=2 
Etc. 
Memory 
An Agent’s Memory of Events 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 5
Overall Structure of Model 
Underlying data about 
population composition 
Demographics of people in 
households 
Social network formation and 
maintenance (homophily) 
Influence via social networks 
• Political discussions 
Voting Behaviour 
Input 
Output
Technique 
• Instead of initiating ‘thin’ analyses of the 
simulation behaviour (e.g. 1 or 2D parameter 
sweeps/correlation models against a few key 
output measures) 
• To sample a multi-dimensional space of 
settings and cluster on a multi-dimensional 
space of output indicators (in this case 9 
parameter x 13 output measures) 
• Look at the patterns between clusters for 
indications as to hypotheses of behaviour 
• Then test these with targeted simulation 
experiments 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 7
More Holistic but Less Detailed 
1. Many Runs, Randomly 
Sampling Parameter Values 
Multi-Dimensional 
Space of Parameter 
Settings 
2. Analyse data set of result 
measures using data mining 
3. Look for patterns that you 
might then check in a more 
systematic manner 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 8
Parameter ranges 
3862 independent runs with parameters sampled 
from the following, uniform distributions: 
• density: [0.65, 0.95] 
• drop-activity-prob: [0.05, 0.15] 
• drop-friend-prob: [0, 0.01] 
• emmigration-rate: [0 ,0.03] 
• immigration-rate: [0, 0.02] 
• int-immigration-rate: [0, 0.02] 
• majority-prop: [0.55, 1] 
• prob-move-near: [0, 1] 
• prob-partner: [0.01, 0.03] 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 9
For each of these runs… 
• Measure many different indicators of the outputs 
(say at the end of the simulation) including: 
– Pop.size – population size 
– Av.age – average age 
– Av.adfriends – average number of friends (adults only) 
– Prop.maj – proportion of the majority population 
– Prop.adult – proportion that is adult 
– Prop.1stgen – proportion that are 1st generation immigrant 
– av.clust – average proportion of friends who are friends 
– av.sim.hh – average similarity within households 
– av.sim.fr – average similarity between friends 
– ncvs.ac – number of conversations over activity links 
– ncvs.sc – number of conversations over “school” links 
– Prop. Adults with highest level of political interest 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 10
Dendrogram of hierarchical 
clustering of simulations 
	 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 11
A heatmap of the hierarchical 
clustering 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 12
The within group sum of squares against the 
number of clusters for 10 randomly initialised 
runs using k-means 
	 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 13
Clustergram of PCA-weighted mean of k-mean 
clusters vs. number of clusters 
	 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 14
Centroid plot against the first two discriminant 
functions showing the 3 clusters 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 15
Details of the centroids of the 3 k-means 
clusters 
Attribute Cluster 1 
(543 
records) 
Cluster 2 
(1333 
records) 
Cluster 3 
( 1986 
records) 
Pop.size 100 557 1750 
Av.age 76 58 55 
Av.adfriends 0.73 1.36 1.82 
Prop.maj 74% 67% 65% 
Prop.adult 99% 94% 93.5% 
Prop.1stgen 8% 13% 14% 
av.clust 0.97 0.84 0.70 
av.sim.hh 2.45 3.53 3.74 
av.sim.fr 2.82 3.70 3.33 
Rate ncvs.ac 1.3% 1.3% 0.0% 
Rate ncvs.sc 0.45% 0.20% 0.13% 
Prop. Adults 
0.97% 1.6% 1.7% 
involved 
Within cluster 
sum of squares 
6748.243 11288.460 7407.591 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 16
the 3 clusters against the parameters: emigration 
rate, immigration rate, internal immigration rate 
	 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 17
Pop Size Av. Age 
	 
Multi-Dimensional Scatter Graphs 
Av Sim Hh Av Sim Fr 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 18
Average proportion of similar friends 
against time for different immigration 
rates 
0.95	 
0.9	 
0.85	 
0.8	 
0.75	 
0.7	 
0	 6	 12	18	24	30	36	42	48	54	60	66	72	78	84	90	96	 
0	 
0.0025	 
0.005	 
0.0075	 
0.01	 
0.0125	 
0.015	 
0.0175	 
0.02	 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 19
Average link density against time for 
different initial majority proportions 
1	 
0.95	 
0.9	 
0.85	 
0.8	 
0.75	 
0.7	 
0.65	 
0.6	 
0.55	 
0.5	 
0	 
5	 
10	 
15	 
20	 
25	 
30	 
35	 
40	 
45	 
50	 
55	 
60	 
65	 
70	 
75	 
80	 
85	 
90	 
95	 
100	 
mp=0.55	 
mp=0.6	 
mp=0.65	 
mp=0.7	 
mp=0.75	 
mp=0.8	 
mp=0.85	 
mp=0.9	 
mp=0.95	 
mp=1	 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 20
Conclusions 
• The particular results and insights in this 
model are not as important as the overall 
approach which… 
• …tries to get a more complex and holistic 
idea of the properties of a complex model 
• …which then might suggest simple 
hypotheses/models 
• …and thus “stage” abstraction a bit more 
gradually and carefully, being more aware 
of what is being abstracted away 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 21
Postscript: Emerging Principles 
• That evidence should not be ignored 
without a very, VERY good reason 
• That abstraction should be staged in 
gradual steps rather than “heroic” leaps 
• Be clear and explicit about your goals 
• Separate exploratory from analytic stages 
• Recognise that it is easy to fool ourselves 
and impose (wrong or limited) assumptions 
• Utilise any and all techniques that are 
applicable, but recognising their limitations 
Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 22
The End 
Claire Little: http:// 
Bruce Edmonds: http://bruce.edmonds.name 
Centre for Policy Modelling: http://cfpm.org 
Ed Fieldhouse: http:// 
Laurence Lessard-Phillips: http:// 
Institute for Social Change: http:// 
The SCID Project: http://www.scid-project.org 
These slides will be at: http://slideshare.com/BruceEdmonds 
The simulation will ‘soon’ be at: http://openabm.org as “The Voter Model”

More Related Content

PPTX
Anomaly Detection
PPT
Sampling.ppt
PDF
Statistical modeling in San Francisco Crime Prediction
PDF
[RIIT 2017] Identifying Grey Sheep Users By The Distribution of User Similari...
PPTX
DoWhy Python library for causal inference: An End-to-End tool
PDF
巨量與開放資料之創新機會與關鍵挑戰-曾新穆
PPTX
Techniques Machine Learning
PPTX
How can we rely upon Social Network Measures? Agent-base modelling as the nex...
Anomaly Detection
Sampling.ppt
Statistical modeling in San Francisco Crime Prediction
[RIIT 2017] Identifying Grey Sheep Users By The Distribution of User Similari...
DoWhy Python library for causal inference: An End-to-End tool
巨量與開放資料之創新機會與關鍵挑戰-曾新穆
Techniques Machine Learning
How can we rely upon Social Network Measures? Agent-base modelling as the nex...

Viewers also liked (16)

PPTX
Towards Institutional System Farming
PPTX
Modelling and Knowledge
PPTX
The Sociality of Context
PPT
The Modelling of Context-Dependent Causal Processes A Recasting of Robert Ros...
PPTX
Winter is coming! – how to survive the coming critical storm and demonstrate ...
PPTX
Using Agent-Based Simulation to integrate micro/qualitative evidence, macro-q...
PPTX
Be ea-talk-final
PPTX
Computing the Sociology of Survival – how to use simulations to understand c...
PPTX
Policy Making using Modelling in a Complex world
PPTX
A Model of Making
PDF
Risk-aware policy evaluation using agent-based simulation
PPTX
Staged Models for Interdisciplinary Research
PDF
Simulating Superdiversity
PPTX
Social complexity and coupled Socio-Ecological Systems
PPTX
A Model of Social and Cognitive Coherence
PPTX
Culture trumps ethnicity! – Intra-generational cultural evolution and ethnoce...
Towards Institutional System Farming
Modelling and Knowledge
The Sociality of Context
The Modelling of Context-Dependent Causal Processes A Recasting of Robert Ros...
Winter is coming! – how to survive the coming critical storm and demonstrate ...
Using Agent-Based Simulation to integrate micro/qualitative evidence, macro-q...
Be ea-talk-final
Computing the Sociology of Survival – how to use simulations to understand c...
Policy Making using Modelling in a Complex world
A Model of Making
Risk-aware policy evaluation using agent-based simulation
Staged Models for Interdisciplinary Research
Simulating Superdiversity
Social complexity and coupled Socio-Ecological Systems
A Model of Social and Cognitive Coherence
Culture trumps ethnicity! – Intra-generational cultural evolution and ethnoce...
Ad

Similar to Analysing a Complex Agent-Based Model Using Data-Mining Techniques (20)

PPT
An Introduction to Agent-Based Modelling
PPTX
Gors intro abm
PPTX
"A 30min Introduction to Agent-Based Modelling" for GORS
PPTX
Using Data Integration Models for Understanding Complex Social Systems
PPT
Agent-Based Modelling: Social Science Meets Computer Science?
PDF
Multi agent paradigm for cognitive parameter based feature similarity for soc...
PDF
Multi agent paradigm for cognitive parameter based feature similarity for soc...
PPTX
Introduction to Agent-based Modelling
PDF
Survey of the Euro Currency Fluctuation by Using Data Mining
PPTX
Big Data 101 - An introduction
PPTX
Big Data Real Time Training in Chennai
PPTX
Big data and data mining
PPT
Dwdm ppt for the btech student contain basis
PPT
A (Very Short) Guide to ABM (and Qualitative Data)
PDF
data mining
PDF
Lec 1 computational modeling - Introduction.pdf
PPT
Using Social Science Data in ABM: Opportunities and Challenges
PPT
Introduction of abm
PPT
Distributed Datamining and Agent System,security
PPTX
Skillwise Big data
An Introduction to Agent-Based Modelling
Gors intro abm
"A 30min Introduction to Agent-Based Modelling" for GORS
Using Data Integration Models for Understanding Complex Social Systems
Agent-Based Modelling: Social Science Meets Computer Science?
Multi agent paradigm for cognitive parameter based feature similarity for soc...
Multi agent paradigm for cognitive parameter based feature similarity for soc...
Introduction to Agent-based Modelling
Survey of the Euro Currency Fluctuation by Using Data Mining
Big Data 101 - An introduction
Big Data Real Time Training in Chennai
Big data and data mining
Dwdm ppt for the btech student contain basis
A (Very Short) Guide to ABM (and Qualitative Data)
data mining
Lec 1 computational modeling - Introduction.pdf
Using Social Science Data in ABM: Opportunities and Challenges
Introduction of abm
Distributed Datamining and Agent System,security
Skillwise Big data
Ad

More from Bruce Edmonds (20)

PPTX
Staging Model Abstraction – an example about political participation
PPTX
Modelling Pitfalls - extra resources
PPTX
Modelling Pitfalls - introduction and some cases
PPTX
The evolution of empirical ABMs
PPTX
Mixing fat data, simulation and policy - what could possibly go wrong?
PPTX
Social Context
PPTX
Using agent-based simulation for socio-ecological uncertainty analysis
PPTX
Finding out what could go wrong before it does – Modelling Risk and Uncertainty
PPTX
How social simulation could help social science deal with context
PPTX
Agent-based modelling, laboratory experiments, and observation in the wild
PPTX
Culture trumps ethnicity! – Intra-generational cultural evolution and ethnoce...
PDF
Mixing ABM and policy...what could possibly go wrong?
PPTX
Different Modelling Purposes - an 'anit-theoretical' approach
PPTX
Socio-Ecological Simulation - a risk-assessment approach
PPTX
A Simple Model of Group Commoning
PPTX
6 Modelling Purposes
PPTX
Are Mixed-Methods Just a Fudge? The Dangers and Prospects for Integrating Qu...
PPTX
The Post-Truth Drift in Social Simulation
PPTX
Drilling down below opinions: how co-evolving beliefs and social structure mi...
PPTX
Model Purpose and Complexity
Staging Model Abstraction – an example about political participation
Modelling Pitfalls - extra resources
Modelling Pitfalls - introduction and some cases
The evolution of empirical ABMs
Mixing fat data, simulation and policy - what could possibly go wrong?
Social Context
Using agent-based simulation for socio-ecological uncertainty analysis
Finding out what could go wrong before it does – Modelling Risk and Uncertainty
How social simulation could help social science deal with context
Agent-based modelling, laboratory experiments, and observation in the wild
Culture trumps ethnicity! – Intra-generational cultural evolution and ethnoce...
Mixing ABM and policy...what could possibly go wrong?
Different Modelling Purposes - an 'anit-theoretical' approach
Socio-Ecological Simulation - a risk-assessment approach
A Simple Model of Group Commoning
6 Modelling Purposes
Are Mixed-Methods Just a Fudge? The Dangers and Prospects for Integrating Qu...
The Post-Truth Drift in Social Simulation
Drilling down below opinions: how co-evolving beliefs and social structure mi...
Model Purpose and Complexity

Recently uploaded (20)

PPTX
Cell Structure & Organelles in detailed.
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PPTX
202450812 BayCHI UCSC-SV 20250812 v17.pptx
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
Complications of Minimal Access Surgery at WLH
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
Anesthesia in Laparoscopic Surgery in India
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
Lesson notes of climatology university.
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
RMMM.pdf make it easy to upload and study
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
Classroom Observation Tools for Teachers
PPTX
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
PDF
Weekly quiz Compilation Jan -July 25.pdf
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Cell Structure & Organelles in detailed.
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
Microbial disease of the cardiovascular and lymphatic systems
2.FourierTransform-ShortQuestionswithAnswers.pdf
202450812 BayCHI UCSC-SV 20250812 v17.pptx
Abdominal Access Techniques with Prof. Dr. R K Mishra
Complications of Minimal Access Surgery at WLH
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Anesthesia in Laparoscopic Surgery in India
Final Presentation General Medicine 03-08-2024.pptx
Lesson notes of climatology university.
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
VCE English Exam - Section C Student Revision Booklet
RMMM.pdf make it easy to upload and study
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Classroom Observation Tools for Teachers
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
Weekly quiz Compilation Jan -July 25.pdf
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...

Analysing a Complex Agent-Based Model Using Data-Mining Techniques

  • 1. Analysing a Complex Agent-Based Model Using Data-Mining Techniques Claire Little, Bruce Edmonds Centre for Policy Modelling Manchester Metropolitan University Ed Fieldhouse, Laurence Lessard-Phillips Institute for Social Change University of Manchester
  • 2. An “Inconvenient Truth” • That the universe is not arranged for our benefit (as researchers studying it) • in other words, that assumptions such as the following are likely to be wrong: – Our planet is the centre of the universe – Risky events follow a normal distribution – Humans act as if they followed a simple algorithm – Society can be understood using simple, universal prinicples – etc. etc. • In particular, the assumption that there will always (if you look hard enough) be models that are: – (a) simple enough for us to understand and – (b) adequate to what we want to model • …is wrong Complexity and Context-Dependency, Bruce Edmonds, ECCS, Lisbon, Sept 2010. slide-2
  • 3. The Alternative • Thus consider the alternative, more realistic, situation where one is facing some phenomena where any model that is adequate (w.r.t. our goals) will be too complex for us to completely understand • Instead of indulging in wishful thinking this paper looks at ways forward under complexity • In other words, if we have a simulation model that is too complex to completely understand, how can we obtain some useful understanding of its properties… • …and hence use it to leverage some understanding/control over the target phenomena Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 3
  • 4. The Broad Idea 1. Make relatively complex simulations based on available evidence (a “KIDS” approach) 2. Analyse this simulation in a number of ways, including… 3. Data mining output data in a more holistic manner over a broad “space” of settings 4. Look for patterns in the data that suggest (maybe context-dependent) hypotheses 5. These are candidates for simpler (but maybe partial) models of the simulation Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 4
  • 5. The Model • To explore the complex mix of factors, structures and processes that affect whether people vote • An agent-based model, with demographics and dynamic social networks • Was formulated using a mixture of qualitative, survey data and others’ expert opinion/results Class Age Activities Ethnicity Etc. Level-of-Political-Interest A Household Discuss-politics-with person-23 blue expert=false neighbour-network year=10 month=3 Lots-family-discussions year=10 month=2 Etc. Memory An Agent’s Memory of Events Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 5
  • 6. Overall Structure of Model Underlying data about population composition Demographics of people in households Social network formation and maintenance (homophily) Influence via social networks • Political discussions Voting Behaviour Input Output
  • 7. Technique • Instead of initiating ‘thin’ analyses of the simulation behaviour (e.g. 1 or 2D parameter sweeps/correlation models against a few key output measures) • To sample a multi-dimensional space of settings and cluster on a multi-dimensional space of output indicators (in this case 9 parameter x 13 output measures) • Look at the patterns between clusters for indications as to hypotheses of behaviour • Then test these with targeted simulation experiments Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 7
  • 8. More Holistic but Less Detailed 1. Many Runs, Randomly Sampling Parameter Values Multi-Dimensional Space of Parameter Settings 2. Analyse data set of result measures using data mining 3. Look for patterns that you might then check in a more systematic manner Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 8
  • 9. Parameter ranges 3862 independent runs with parameters sampled from the following, uniform distributions: • density: [0.65, 0.95] • drop-activity-prob: [0.05, 0.15] • drop-friend-prob: [0, 0.01] • emmigration-rate: [0 ,0.03] • immigration-rate: [0, 0.02] • int-immigration-rate: [0, 0.02] • majority-prop: [0.55, 1] • prob-move-near: [0, 1] • prob-partner: [0.01, 0.03] Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 9
  • 10. For each of these runs… • Measure many different indicators of the outputs (say at the end of the simulation) including: – Pop.size – population size – Av.age – average age – Av.adfriends – average number of friends (adults only) – Prop.maj – proportion of the majority population – Prop.adult – proportion that is adult – Prop.1stgen – proportion that are 1st generation immigrant – av.clust – average proportion of friends who are friends – av.sim.hh – average similarity within households – av.sim.fr – average similarity between friends – ncvs.ac – number of conversations over activity links – ncvs.sc – number of conversations over “school” links – Prop. Adults with highest level of political interest Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 10
  • 11. Dendrogram of hierarchical clustering of simulations Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 11
  • 12. A heatmap of the hierarchical clustering Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 12
  • 13. The within group sum of squares against the number of clusters for 10 randomly initialised runs using k-means Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 13
  • 14. Clustergram of PCA-weighted mean of k-mean clusters vs. number of clusters Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 14
  • 15. Centroid plot against the first two discriminant functions showing the 3 clusters Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 15
  • 16. Details of the centroids of the 3 k-means clusters Attribute Cluster 1 (543 records) Cluster 2 (1333 records) Cluster 3 ( 1986 records) Pop.size 100 557 1750 Av.age 76 58 55 Av.adfriends 0.73 1.36 1.82 Prop.maj 74% 67% 65% Prop.adult 99% 94% 93.5% Prop.1stgen 8% 13% 14% av.clust 0.97 0.84 0.70 av.sim.hh 2.45 3.53 3.74 av.sim.fr 2.82 3.70 3.33 Rate ncvs.ac 1.3% 1.3% 0.0% Rate ncvs.sc 0.45% 0.20% 0.13% Prop. Adults 0.97% 1.6% 1.7% involved Within cluster sum of squares 6748.243 11288.460 7407.591 Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 16
  • 17. the 3 clusters against the parameters: emigration rate, immigration rate, internal immigration rate Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 17
  • 18. Pop Size Av. Age Multi-Dimensional Scatter Graphs Av Sim Hh Av Sim Fr Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 18
  • 19. Average proportion of similar friends against time for different immigration rates 0.95 0.9 0.85 0.8 0.75 0.7 0 6 12 18 24 30 36 42 48 54 60 66 72 78 84 90 96 0 0.0025 0.005 0.0075 0.01 0.0125 0.015 0.0175 0.02 Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 19
  • 20. Average link density against time for different initial majority proportions 1 0.95 0.9 0.85 0.8 0.75 0.7 0.65 0.6 0.55 0.5 0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95 100 mp=0.55 mp=0.6 mp=0.65 mp=0.7 mp=0.75 mp=0.8 mp=0.85 mp=0.9 mp=0.95 mp=1 Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 20
  • 21. Conclusions • The particular results and insights in this model are not as important as the overall approach which… • …tries to get a more complex and holistic idea of the properties of a complex model • …which then might suggest simple hypotheses/models • …and thus “stage” abstraction a bit more gradually and carefully, being more aware of what is being abstracted away Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 21
  • 22. Postscript: Emerging Principles • That evidence should not be ignored without a very, VERY good reason • That abstraction should be staged in gradual steps rather than “heroic” leaps • Be clear and explicit about your goals • Separate exploratory from analytic stages • Recognise that it is easy to fool ourselves and impose (wrong or limited) assumptions • Utilise any and all techniques that are applicable, but recognising their limitations Analysing a Complex Agent-Based Model Using Data-Mining Techniques. Little & Edmonds, SSC 2014, Barcelona, 22
  • 23. The End Claire Little: http:// Bruce Edmonds: http://bruce.edmonds.name Centre for Policy Modelling: http://cfpm.org Ed Fieldhouse: http:// Laurence Lessard-Phillips: http:// Institute for Social Change: http:// The SCID Project: http://www.scid-project.org These slides will be at: http://slideshare.com/BruceEdmonds The simulation will ‘soon’ be at: http://openabm.org as “The Voter Model”