SlideShare a Scribd company logo
Predicting Spatiotemporal Risk of
Yellow Fever Using a Machine
Learning Approach
RajReni B. Kaul, MichelleV. Evans,
Courtney Murdock, and John M. Drake
bit.ly/YF-Kaul
Yellow fever (YF)
Endemic to 44
countries in Africa
and Latin America
Estimated 60,000
deaths annually
2017 spike in Brazilian YF cases
Year
ConfirmedCases
780
data stops
Jan 2018
Reported byTABNET
2017 spike in Brazilian YF cases
Year
ConfirmedCases
780
data stops
Jan 2018
Reported byTABNET
Previous years had fewer cases
Year
ConfirmedCases
Reported byTABNET
2017 spike on the range edge
Many of the cases
were outside the
historical range
Hg. leucocelaenus
Hg. janthinomys
Sa. chloropterus
Ae. aegypti
Ae. albopictus
Sylvatic cycle dominates YF transmission
Hg. leucocelaenus
Hg. janthinomys
Sa. chloropterus
Ae. aegypti
Ae. albopictus
Sylvatic cycle dominates YF transmission
Drivers of spillover
Adapted from Gortazar et al 2014
Connecting drivers and
data for spillover
NHP and vector
range maps
human density
Connecting drivers
and data
Rainfall
Temperature
NDVI
Connecting drivers
and data
Fire density
NHP and
agriculture land
overlap
Connecting drivers and data for spillover
Fire density
NHP and agriculture land
overlap
NHP and vector range maps
human density
Rainfall
Temperature
NDVI
Connecting drivers and data for spillover
Fire density
NHP and agriculture land
overlap
NHP and vector range maps
human density
Rainfall
Temperature
NDVI
Data aggregation: municipality
Human Population
Density
Rainfall
NDVI
YF case(s) reported
Temperature
Data aggregation: monthly*
Human Population
Density
Rainfall
Temperature
NDVI
month 1
month 2
month 3
Dataset units: municipality-months
spillovers were very rare ~0.01%
106 spillover events between
Jan 2001 – Dec 2013
month 1
month 2
month 3
BAGGING:
BOOTSTRAP
AGGREGATING
Machine Learning
Method
Idealized data set
Covariates
Response
Idealized data set
Covariates
Response
Single observation
with positive
response value
Idealized data set
Covariates
Response
Single observation
with negative
response value
Single observation
with positive
response value
Subsample = bootstrapping
Covariates
Response
Subsample dataset by selecting cases at random
Sparse
dataset
Subsample
dataset
Subsample = bootstrapping
Covariates
Response
Sparse
dataset
Subsample
dataset
0.01% positive
observations
10% positive
observations
Subsample many times
Model1 Model2
Modeln
Final product: Aggregated Model
Model1 Model2
Modeln
Final product: Aggregated Model
Advantages of Bagging
Aggregating limits variance-bias trade off
Model1 Model2
Modeln
Final product: Aggregated Model
Advantages of Bagging
Aggregating limits variance-bias trade off Increasing number of positive
observations in subsample reduces over
fitting
IMED 2018: Predicting spatiotemporal risk of yellow fever using a machine learning approach.
Model methods
Training
data
subsample
logistic
regression
bagged
logistic
regression
average of 500
models
30% data
withheld
Exploring model predictions
Jan 2001
Exploring model predictions
Jan 2001
Feb 2001
Exploring model predictions
Jan 2001
Feb 2001
Mar 2001
Exploring model predictions
Jan 2001
Feb 2001
Mar 2001
Month
MeanIntensity
Mean spillover intensity
by month
Exploring model predictions
Jan 2001
Feb 2001
Mar 2001
Variance in spillover
intensity by municipality
YF spillover
intensity
AUC 0.81
YF spillover intensity
Month
MeanIntensity
YF spillover intensity
Month
MeanIntensity
Variance in
spillover intensity
YF spillover intensity
Month
MeanIntensity
Natural break in data led to regional models
NHP species richness
No.Municipalities
High Richness
Low Richness
High
Low
Spillover intensity by region
Low 86
Spillover events
High 20
Total 106
AUC 0.79
AUC 0.88
High
Low
Mean intensity by region
Month
MeanIntensity
High
Low
Mean intensity by region
Variance in
spillover intensity
Comparing national and regional models
MeanIntensity
Month
Variance in
intensity
VARIABLE IMPORTANCE
BY PERMUTATIONMachine Learning
Method
Idealized data set
Covariates
Response
Idealized data set
Response
Covariates
C1 C2
C3 C4
Covariates are linked to a observation
Response
Covariates
C1 C2
C3 C4
Randomize a single covariate
Response
Covariates
C1 C2
C3 C4
Randomize a single covariate
Original data set
Model
Model accuracy predicting
the training data
Randomize a single covariate
Original data set
C1 permutated data set
Model
Model accuracy predicting
the training data
Model
Model accuracy predicting
the training data
Randomize a single covariate
Original data set
C1 permutated data set
Model
Model accuracy predicting
the training data
Model
Model accuracy predicting
the training data
Performance decline
due to permutated covariate
IMED 2018: Predicting spatiotemporal risk of yellow fever using a machine learning approach.
Assessing drivers High
Low
RankedVariableImportance
National
Model
Assessing drivers High
Low
RankedVariableImportance
National
Model
Low NHP High NHP
Policy implication:
Same interventions can be
used for whole country
Assessing drivers High
Low
RankedVariableImportance
National
Model
Low NHP High NHP
Policy implication:
Different interventions
are needed across the
country
High
Low
High
Low
RankedVariableImportance
National
High
Low
High
Low
RankedVariableImportance
National Low NHP
RankedVariableImportance
National Low NHP High NHP
RankedVariableImportance
National Low NHP High NHP
RankedVariableImportance
National Low NHP High NHP
RankedVariableImportance
National Low NHP High NHP
Yellow fever in Brazil
Model
drivers of spillover
are regional
requiring regional
interventions
Yellow fever in Brazil
Model
drivers of spillover
are regional
At times coastal cities
are at higher risk than
the Amazon River basin
Yellow fever in Brazil
Model
drivers of spillover
are regional
At times coastal cities
are at higher risk than
the Amazon River basin
Month
MeanIntensity
Acknowledgements
Eric Marty
JP Schmidt
Spencer Hall
Annakate Schatz
@renikaul
renikaul/YF_Brazil
reni@uga.edu
IMED 2018: Predicting spatiotemporal risk of yellow fever using a machine learning approach.
IMED 2018: Predicting spatiotemporal risk of yellow fever using a machine learning approach.
IMED 2018: Predicting spatiotemporal risk of yellow fever using a machine learning approach.
Assessing drivers High
Low
RankedVariableImportance
National
Model
Low NHP
Low 5350
Municipalities
High 210
Total 5560

More Related Content

PPTX
Franz cobb seltmann 2015 spnhc current state of arthropod biodiversity data
PPTX
Growing Threat of Vector Borne Diseases
PDF
Mapping the risk of Rift Valley fever in Uganda
PPTX
The Cost of Delays in FMD Detection
PPTX
Bee genera phenology 2016
PPT
Ga Arboviral Surveillance
PPTX
Land use, biodiversity changes and the risk of zoonotic diseases: Findings fr...
PPTX
Phenology of Common Bee Species. MidAtantic States, USA
Franz cobb seltmann 2015 spnhc current state of arthropod biodiversity data
Growing Threat of Vector Borne Diseases
Mapping the risk of Rift Valley fever in Uganda
The Cost of Delays in FMD Detection
Bee genera phenology 2016
Ga Arboviral Surveillance
Land use, biodiversity changes and the risk of zoonotic diseases: Findings fr...
Phenology of Common Bee Species. MidAtantic States, USA

What's hot (20)

PPT
PDF
Disease frequency of selected bacterial zoonoses in small ruminants in Tana R...
PDF
BOSC 2016 - The Open-Source Outbreak
PDF
Declaring a TB outbreak over with genomics
PPT
One Health Hackathon 25/10/2020 - Cyril Caminade
PPS
thesis.pps
PPTX
Speco aeet meeting coimbra_2015_poster_ javier morente lopez
PDF
Perspectives of predictive epidemiology and early warning systems for Rift Va...
PDF
Modeling the Ebola Outbreak in West Africa, August 19th 2014 update
PPS
Dengue Fever and Blood Transfusion
PPTX
Dr. David R. Smith - What We know (and don’t know) About Pneumonia in Beef Ca...
PDF
An electronic syndromic surveillance system for early detection and control o...
PPTX
ImmunoScience Introduction
PPTX
Precision agriculture for SAT; Near future or unrealistic effort?
PDF
Modeling the Ebola Outbreak in West Africa, September 5th 2014 update
PDF
SMBE Satellite Meeting on Pathogen Evolution and Transmission
PPTX
Dr. Andres Perez - PRRS Epidemiology: Best Principles of Control at a Regiona...
PDF
Population genetics of infectious diseases
PDF
Genomic surveillance of the Rift Valley fever: From sequencing to Lineage ass...
PDF
Ecohealth 2014 gianni lo iacono presentation on integrative modelling
Disease frequency of selected bacterial zoonoses in small ruminants in Tana R...
BOSC 2016 - The Open-Source Outbreak
Declaring a TB outbreak over with genomics
One Health Hackathon 25/10/2020 - Cyril Caminade
thesis.pps
Speco aeet meeting coimbra_2015_poster_ javier morente lopez
Perspectives of predictive epidemiology and early warning systems for Rift Va...
Modeling the Ebola Outbreak in West Africa, August 19th 2014 update
Dengue Fever and Blood Transfusion
Dr. David R. Smith - What We know (and don’t know) About Pneumonia in Beef Ca...
An electronic syndromic surveillance system for early detection and control o...
ImmunoScience Introduction
Precision agriculture for SAT; Near future or unrealistic effort?
Modeling the Ebola Outbreak in West Africa, September 5th 2014 update
SMBE Satellite Meeting on Pathogen Evolution and Transmission
Dr. Andres Perez - PRRS Epidemiology: Best Principles of Control at a Regiona...
Population genetics of infectious diseases
Genomic surveillance of the Rift Valley fever: From sequencing to Lineage ass...
Ecohealth 2014 gianni lo iacono presentation on integrative modelling
Ad

Similar to IMED 2018: Predicting spatiotemporal risk of yellow fever using a machine learning approach. (20)

PDF
Comparative study of decision tree algorithm and naive bayes classifier for s...
PPT
Biosurveillance 2.0
PDF
The Use of AI in Predicting Disease Outbreaks (www.kiu.ac.ug)
PDF
MACHINE LEARNING TECHNIQUES IN THE PREDICTION OF INFECTIOUS DISEASE SPREAD.pp...
PDF
Health Risk Prediction Using Support Vector Machine with Gray Wolf Optimizati...
PPTX
D3T2 mapping disease transmission risk
PPTX
Machine Learning Final presentation
PPTX
Mapping Disease Transmission Risk
DOCX
Prevent COVID-19 using ML
PPT
Biosurveillance 2.0: Lecture at Emory University
PDF
Prediction analysis on the pre and post COVID outbreak assessment using machi...
PDF
76 s201910
PDF
Mortality prediction of COVID-19 patients using supervised machine learning
PPTX
DISummit - Denguehack winners The Juniors
PPT
Biosurveillance: Machine Learning And Disease Surveillance by Kass-Hout Di Tada
PPTX
AI in veterinary medicine Enhancing Medical/Veterinary Teaching with AI Techn...
PPTX
Introduction to SaTScan, seasonality and time series analysis
PDF
Artificial Intelligence in Predicting Epudemic Outbreaks (www.kiu.ac.ug)
PDF
Symptom-Based Prediction of COVID-19 using Machine Learning Models with SMOTE...
PDF
Covid 19 Prediction in India using Machine Learning
Comparative study of decision tree algorithm and naive bayes classifier for s...
Biosurveillance 2.0
The Use of AI in Predicting Disease Outbreaks (www.kiu.ac.ug)
MACHINE LEARNING TECHNIQUES IN THE PREDICTION OF INFECTIOUS DISEASE SPREAD.pp...
Health Risk Prediction Using Support Vector Machine with Gray Wolf Optimizati...
D3T2 mapping disease transmission risk
Machine Learning Final presentation
Mapping Disease Transmission Risk
Prevent COVID-19 using ML
Biosurveillance 2.0: Lecture at Emory University
Prediction analysis on the pre and post COVID outbreak assessment using machi...
76 s201910
Mortality prediction of COVID-19 patients using supervised machine learning
DISummit - Denguehack winners The Juniors
Biosurveillance: Machine Learning And Disease Surveillance by Kass-Hout Di Tada
AI in veterinary medicine Enhancing Medical/Veterinary Teaching with AI Techn...
Introduction to SaTScan, seasonality and time series analysis
Artificial Intelligence in Predicting Epudemic Outbreaks (www.kiu.ac.ug)
Symptom-Based Prediction of COVID-19 using Machine Learning Models with SMOTE...
Covid 19 Prediction in India using Machine Learning
Ad

More from Louisa Diggs (20)

PDF
Workshop: Quantifying Error in Training Data for Mapping and Monitoring the E...
PDF
Using Active Learning to Quantify how Training Data Errors Impact Classificat...
PDF
Machine Learning for Better Maps
PPTX
Generating Training Data from Noisy Measrements
PPTX
Cropped Field Boundaries, Food Systems, & Fire
PDF
Challenges to Large Scale Mapping: Can Data Geometry Help?
PPTX
A Random Walk of Issues Related to Training Data and Land Cover Mapping
PPTX
Assessing Land Cover Change using Uncertain Data
PPTX
Informal Settlements and Cadastral Mapping
PPTX
Sources of Map Error in Public Health Activities and Operations Research
PDF
Measuring the impact of label noise on semantic segmentation using rastervision
PPTX
Mapping Smallholder Yields Using Micro-Satellite Data
PPTX
Crowdsourcing Land Cover and Land Use Data: Experiences from IIASA
PDF
IMED 2018: The use of remote sensing, geostatistical and machine learning met...
PPT
IMED 2018: Predicting the environmental suitability of podoconiosis in Ethiopia
PDF
IMED 2018: Landcover/habitat
PDF
IMED 2018: Modeled Population Estimates from Satellite Imagery and Microcensu...
PDF
IMED 2018: An intro to Remote Sensing and Machine Learning
PDF
IMED 2018: Mapping Monkeypox risk in the Congo Basin using Remote Sensing and...
PDF
IMED 2018: Innovations and Challenges in the Use of Open-source Remote Sensin...
Workshop: Quantifying Error in Training Data for Mapping and Monitoring the E...
Using Active Learning to Quantify how Training Data Errors Impact Classificat...
Machine Learning for Better Maps
Generating Training Data from Noisy Measrements
Cropped Field Boundaries, Food Systems, & Fire
Challenges to Large Scale Mapping: Can Data Geometry Help?
A Random Walk of Issues Related to Training Data and Land Cover Mapping
Assessing Land Cover Change using Uncertain Data
Informal Settlements and Cadastral Mapping
Sources of Map Error in Public Health Activities and Operations Research
Measuring the impact of label noise on semantic segmentation using rastervision
Mapping Smallholder Yields Using Micro-Satellite Data
Crowdsourcing Land Cover and Land Use Data: Experiences from IIASA
IMED 2018: The use of remote sensing, geostatistical and machine learning met...
IMED 2018: Predicting the environmental suitability of podoconiosis in Ethiopia
IMED 2018: Landcover/habitat
IMED 2018: Modeled Population Estimates from Satellite Imagery and Microcensu...
IMED 2018: An intro to Remote Sensing and Machine Learning
IMED 2018: Mapping Monkeypox risk in the Congo Basin using Remote Sensing and...
IMED 2018: Innovations and Challenges in the Use of Open-source Remote Sensin...

Recently uploaded (20)

PPTX
ca esophagus molecula biology detailaed molecular biology of tumors of esophagus
PPTX
SKIN Anatomy and physiology and associated diseases
PPTX
ACID BASE management, base deficit correction
PPT
Breast Cancer management for medicsl student.ppt
PPT
OPIOID ANALGESICS AND THEIR IMPLICATIONS
PPTX
History and examination of abdomen, & pelvis .pptx
PPTX
DENTAL CARIES FOR DENTISTRY STUDENT.pptx
PDF
Human Health And Disease hggyutgghg .pdf
PPTX
neonatal infection(7392992y282939y5.pptx
PPTX
Chapter-1-The-Human-Body-Orientation-Edited-55-slides.pptx
PPTX
Fundamentals of human energy transfer .pptx
PPTX
Note on Abortion.pptx for the student note
PPTX
Uterus anatomy embryology, and clinical aspects
PPTX
CME 2 Acute Chest Pain preentation for education
PPTX
Gastroschisis- Clinical Overview 18112311
PPTX
CEREBROVASCULAR DISORDER.POWERPOINT PRESENTATIONx
PPTX
15.MENINGITIS AND ENCEPHALITIS-elias.pptx
PDF
Neuro ED Bet Sexologist in Patna Bihar India Dr. Sunil Dubey
PDF
Khadir.pdf Acacia catechu drug Ayurvedic medicine
PDF
CT Anatomy for Radiotherapy.pdf eryuioooop
ca esophagus molecula biology detailaed molecular biology of tumors of esophagus
SKIN Anatomy and physiology and associated diseases
ACID BASE management, base deficit correction
Breast Cancer management for medicsl student.ppt
OPIOID ANALGESICS AND THEIR IMPLICATIONS
History and examination of abdomen, & pelvis .pptx
DENTAL CARIES FOR DENTISTRY STUDENT.pptx
Human Health And Disease hggyutgghg .pdf
neonatal infection(7392992y282939y5.pptx
Chapter-1-The-Human-Body-Orientation-Edited-55-slides.pptx
Fundamentals of human energy transfer .pptx
Note on Abortion.pptx for the student note
Uterus anatomy embryology, and clinical aspects
CME 2 Acute Chest Pain preentation for education
Gastroschisis- Clinical Overview 18112311
CEREBROVASCULAR DISORDER.POWERPOINT PRESENTATIONx
15.MENINGITIS AND ENCEPHALITIS-elias.pptx
Neuro ED Bet Sexologist in Patna Bihar India Dr. Sunil Dubey
Khadir.pdf Acacia catechu drug Ayurvedic medicine
CT Anatomy for Radiotherapy.pdf eryuioooop

IMED 2018: Predicting spatiotemporal risk of yellow fever using a machine learning approach.