SlideShare a Scribd company logo
Using active learning to quantify how training
data errors impact classification accuracy over
smallholder-dominated agricultural systems
Stephanie Debats, Lei Song, Su Ye, Sitian Xiong, Kaixi Zhang,
Tammy Woodard, Ron Eastman, Ryan Avery, Kelly Caylor,
Dennis McRitchie, Lyndon Estes
Clark University|Clark Labs
University of California Santa Barbara
AWS Cloud Credits for Research Program
IIASA
Stephanie Debats Ryan Avery Su YeLei Song Sitian Xiong
Problem 1: High spatial variability
Problem 2: High temporal variability
Bing Base Map
PlanetScope Analytic
Problem 3: Interpretation errors in training data
High spatial & temporal resolution imagery
Active Learning
01
00
11
Train
Predict
Select
Re-label
Label
Debats et al, 2017
Study Region
Using Active Learning to Quantify how Training Data Errors Impact Classification Accuracy over Smallholder-Dominated Agricultural Systems
prob 

(%)
prob 

(%)
Growing Season
Dry Season
Labelling component:
Crowdsourcing Platform
Using Active Learning to Quantify how Training Data Errors Impact Classification Accuracy over Smallholder-Dominated Agricultural Systems
Using Active Learning to Quantify how Training Data Errors Impact Classification Accuracy over Smallholder-Dominated Agricultural Systems
Using Active Learning to Quantify how Training Data Errors Impact Classification Accuracy over Smallholder-Dominated Agricultural Systems
True positive (TP) False positive (FP) False negative (FN) True negative (TN)
score = in_accuracy * β0 +
out_accuracy * β1 +
fragmentation * β2 +
edge_accuracy * β3 +
categorical_accuracy * β4
Accuracy assessment and consensus labelling
Probability
Bayesian Model Averaging
Label collection
! " # = % ! &' # !("|#, &')
,
'-.
Bayesian Model Averaging
Heat map
! " # = % ! &' # !("|#, &')
,
'-.
Consensus Label
Probability
Debats et al (2016)
A generalized computer vision approach to mapping crop fields in
heterogeneous agricultural landscapes
Remote Sensing Environment 179
Machine Learning component
1. On the fly feature extraction
2. Spark ML RandomForest
GeoTrellis/
GeoPySpark
Does Training Data Error Impact Classification Performance?
Using Active Learning to Quantify how Training Data Errors Impact Classification Accuracy over Smallholder-Dominated Agricultural Systems
Next Steps
1. Errors in image atmospheric corrections
2. Increase feature space for classifier
3. Improve label quality
4. Quantify gap between worker and ground
Worker map
Ground truth(y)
Where lies the truth?
8
Circle Bias, many
false positive
identified because
of overreliance on
circular features
https://github.com/ecoh
ydro/CropMask_RCNN
Probability
score above
.7 deemed a
center pivot
Tested on
never before
seen
512x512 tiles
11
Some center
pivots are
missed
because of
date mismatch
between
imagery and
labels of the
reference
dataset
BAYESIAN MODEL AVERAGING:
! " # = %
&'(
)
! *& # !("|#, *&)
": the ground truth, which will be either ‘field’ or ‘no field’
#: the given data of crowdsourcing opinions for labeling this pixel
(e.g., # = {#mapper_1 = field , #mapper_/= no field, …} )
*&: the Mappers considered
(1) 012234&’s opinion: how much probability to
be "
(2) Weight (or evidence): is the probability that we weigh
012234&’s opinion based on their mapping history
combining crowdsourcing labels from their mapping history
MAPPER OPINION
In our mapping project, mappers are allowed to only label a crispy category for polygons (either ‘field’ or ‘no
field’). So ! " #, %& = 0 )* 1
(1) !(" = -./01|#& = -./01, %&) = 1
(2) !(" = 4) -./01|#& = -./01, %&) = 0
(3) !(" = 4) -./01|#& = 4) -./01, %&) = 1
(4) !(" = -./01|#& = 4) -./01, %&) = 0
WEIGHT
Weight: ! "# $ ∝ ! $ "# !("#)
(1) !("#): ‘mapper priors’, is our prior belief for mapper '. We can use average score
(combining geometric and thematic accuracy) to represent our belief
(()*) ∝ (∑,-.
/
01234,) /7
(2) ! $ "# : ‘mapper likelihood’, ! $ "# ∝ exp(-
.
8
9:;#) [1][2]
BIC(Bayesian Information Criterion) = ln ? ∗ A − 2 ln D $ ̂F, "
‘BIC simply reduces to maximum likelihood when the number of parameters is equal
for the models of interest’ [3] , so 9:; ≈ −2 ln D $ IF, " . After adjustment,
( J )* ∝ K J ̂F, )* (Maximum mapper likelihood)
(? is the sample number, A
is the parameter number to
be estimated (our case has
only one, i.e., L), ML is the
label that maximizes the
likelihood function)
WEIGHT (CONTI.)
Weight: ! "# $ ∝ ! $ "# !("#)
Mapper likelihood: ' ( )* ∝ + ( ,-, )* (Maximum Mapper likelihood)
(1) !(- = 01234| ,-, "#) = ! $ = 01234 - = 01234, "# = (∑8
9 :;<
:;<=>?<
) /A
(2) !(- = BC 01234| ,-, "#) = ! $ = BC 01234 - = BC 01234, "# = (∑8
9 :?<
:?<=>;<
) /A
D $ ̂-, " can be computed as:
* Maximum mapper likelihood is actually average producer’s accuracy of the mapper
SUMMARY
! " # = ∑&'(
)
! *& # !("|#, *&)
weight = score ∗ producer′s accuracy ∝ P M8 D
P("|D, M8) = 0 ;< 1
Labeling:
If ! " = >?@AB # > ! " = D; >?@AB # (or ! " = >?@AB # > 0.5), we give a consensus label
as field; otherwise, we give a label as no field
The posterior probability of the pixel label " given the data of mappers’ opinions (#):
(*& is the mapper ?)
→ ! " # =
∑FGH
I
JK&LMNF∗ O(P|Q,RF)
∑FGH
I
JK&LMNF
, where

More Related Content

DOCX
MS Excel functions
PPT
Where2.0Now - Finding the heat in Thematic Maps
ODP
Making Beautiful Maps: Oct 5 2011
PDF
Cons30 S Mar 4 09
PDF
Cons30 S Data Analysis Line Plots
PDF
RDataMining slides-regression-classification
PDF
Dem 7263 fall 2015 Spatial GLM's
PDF
ABC short course: final chapters
MS Excel functions
Where2.0Now - Finding the heat in Thematic Maps
Making Beautiful Maps: Oct 5 2011
Cons30 S Mar 4 09
Cons30 S Data Analysis Line Plots
RDataMining slides-regression-classification
Dem 7263 fall 2015 Spatial GLM's
ABC short course: final chapters

Similar to Using Active Learning to Quantify how Training Data Errors Impact Classification Accuracy over Smallholder-Dominated Agricultural Systems (20)

PDF
Iowa_Report_2
PDF
07 dimensionality reduction
PDF
Random forest algorithm for regression a beginner's guide
PDF
T. Lucas Makinen x Imperial SBI Workshop
PDF
2013.11.14 Big Data Workshop Bruno Voisin
PDF
Curvefitting
PDF
A note on estimation of population mean in sample survey using auxiliary info...
PPTX
Scaling up data science applications
PDF
FinalReportFoxMelle
PDF
Survey Demo
PDF
Optimization of sample configurations for spatial trend estimation
PPTX
An Interactive Introduction To R (Programming Language For Statistics)
PPTX
Query Optimization By Example on Database Apps
PPTX
software engineering modules iii & iv.pptx
PDF
Course Project for Coursera Practical Machine Learning
PDF
Regression and Classification with R
PPTX
2. data types, variables and operators
PDF
Principal Components Analysis, Calculation and Visualization
PDF
Outrageous Ideas for Graph Databases
PDF
Prob-Dist-Toll-Forecast-Uncertainty
Iowa_Report_2
07 dimensionality reduction
Random forest algorithm for regression a beginner's guide
T. Lucas Makinen x Imperial SBI Workshop
2013.11.14 Big Data Workshop Bruno Voisin
Curvefitting
A note on estimation of population mean in sample survey using auxiliary info...
Scaling up data science applications
FinalReportFoxMelle
Survey Demo
Optimization of sample configurations for spatial trend estimation
An Interactive Introduction To R (Programming Language For Statistics)
Query Optimization By Example on Database Apps
software engineering modules iii & iv.pptx
Course Project for Coursera Practical Machine Learning
Regression and Classification with R
2. data types, variables and operators
Principal Components Analysis, Calculation and Visualization
Outrageous Ideas for Graph Databases
Prob-Dist-Toll-Forecast-Uncertainty
Ad

More from Louisa Diggs (20)

PDF
Workshop: Quantifying Error in Training Data for Mapping and Monitoring the E...
PDF
Machine Learning for Better Maps
PPTX
Generating Training Data from Noisy Measrements
PPTX
Cropped Field Boundaries, Food Systems, & Fire
PDF
Challenges to Large Scale Mapping: Can Data Geometry Help?
PPTX
A Random Walk of Issues Related to Training Data and Land Cover Mapping
PPTX
Assessing Land Cover Change using Uncertain Data
PPTX
Informal Settlements and Cadastral Mapping
PPTX
Sources of Map Error in Public Health Activities and Operations Research
PDF
Measuring the impact of label noise on semantic segmentation using rastervision
PPTX
Mapping Smallholder Yields Using Micro-Satellite Data
PPTX
Crowdsourcing Land Cover and Land Use Data: Experiences from IIASA
PDF
IMED 2018: The use of remote sensing, geostatistical and machine learning met...
PPT
IMED 2018: Predicting the environmental suitability of podoconiosis in Ethiopia
PDF
IMED 2018: Landcover/habitat
PDF
IMED 2018: Modeled Population Estimates from Satellite Imagery and Microcensu...
PDF
IMED 2018: An intro to Remote Sensing and Machine Learning
PDF
IMED 2018: Mapping Monkeypox risk in the Congo Basin using Remote Sensing and...
PDF
IMED 2018: Predicting spatiotemporal risk of yellow fever using a machine lea...
PDF
IMED 2018: Innovations and Challenges in the Use of Open-source Remote Sensin...
Workshop: Quantifying Error in Training Data for Mapping and Monitoring the E...
Machine Learning for Better Maps
Generating Training Data from Noisy Measrements
Cropped Field Boundaries, Food Systems, & Fire
Challenges to Large Scale Mapping: Can Data Geometry Help?
A Random Walk of Issues Related to Training Data and Land Cover Mapping
Assessing Land Cover Change using Uncertain Data
Informal Settlements and Cadastral Mapping
Sources of Map Error in Public Health Activities and Operations Research
Measuring the impact of label noise on semantic segmentation using rastervision
Mapping Smallholder Yields Using Micro-Satellite Data
Crowdsourcing Land Cover and Land Use Data: Experiences from IIASA
IMED 2018: The use of remote sensing, geostatistical and machine learning met...
IMED 2018: Predicting the environmental suitability of podoconiosis in Ethiopia
IMED 2018: Landcover/habitat
IMED 2018: Modeled Population Estimates from Satellite Imagery and Microcensu...
IMED 2018: An intro to Remote Sensing and Machine Learning
IMED 2018: Mapping Monkeypox risk in the Congo Basin using Remote Sensing and...
IMED 2018: Predicting spatiotemporal risk of yellow fever using a machine lea...
IMED 2018: Innovations and Challenges in the Use of Open-source Remote Sensin...
Ad

Recently uploaded (20)

PDF
cuic standard and advanced reporting.pdf
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
PDF
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
PDF
GDG Cloud Iasi [PUBLIC] Florian Blaga - Unveiling the Evolution of Cybersecur...
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
KodekX | Application Modernization Development
PDF
Advanced Soft Computing BINUS July 2025.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Modernizing your data center with Dell and AMD
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....
cuic standard and advanced reporting.pdf
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
[발표본] 너의 과제는 클라우드에 있어_KTDS_김동현_20250524.pdf
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
GDG Cloud Iasi [PUBLIC] Florian Blaga - Unveiling the Evolution of Cybersecur...
Reach Out and Touch Someone: Haptics and Empathic Computing
KodekX | Application Modernization Development
Advanced Soft Computing BINUS July 2025.pdf
NewMind AI Weekly Chronicles - August'25 Week I
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Modernizing your data center with Dell and AMD
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Unlocking AI with Model Context Protocol (MCP)
Spectral efficient network and resource selection model in 5G networks
Network Security Unit 5.pdf for BCA BBA.
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Diabetes mellitus diagnosis method based random forest with bat algorithm
The AUB Centre for AI in Media Proposal.docx
Chapter 3 Spatial Domain Image Processing.pdf
breach-and-attack-simulation-cybersecurity-india-chennai-defenderrabbit-2025....

Using Active Learning to Quantify how Training Data Errors Impact Classification Accuracy over Smallholder-Dominated Agricultural Systems

  • 1. Using active learning to quantify how training data errors impact classification accuracy over smallholder-dominated agricultural systems Stephanie Debats, Lei Song, Su Ye, Sitian Xiong, Kaixi Zhang, Tammy Woodard, Ron Eastman, Ryan Avery, Kelly Caylor, Dennis McRitchie, Lyndon Estes Clark University|Clark Labs University of California Santa Barbara
  • 2. AWS Cloud Credits for Research Program IIASA
  • 3. Stephanie Debats Ryan Avery Su YeLei Song Sitian Xiong
  • 4. Problem 1: High spatial variability
  • 5. Problem 2: High temporal variability Bing Base Map PlanetScope Analytic
  • 6. Problem 3: Interpretation errors in training data
  • 7. High spatial & temporal resolution imagery
  • 19. True positive (TP) False positive (FP) False negative (FN) True negative (TN)
  • 20. score = in_accuracy * β0 + out_accuracy * β1 + fragmentation * β2 + edge_accuracy * β3 + categorical_accuracy * β4
  • 21. Accuracy assessment and consensus labelling Probability
  • 22. Bayesian Model Averaging Label collection ! " # = % ! &' # !("|#, &') , '-.
  • 23. Bayesian Model Averaging Heat map ! " # = % ! &' # !("|#, &') , '-.
  • 26. Debats et al (2016) A generalized computer vision approach to mapping crop fields in heterogeneous agricultural landscapes Remote Sensing Environment 179 Machine Learning component 1. On the fly feature extraction 2. Spark ML RandomForest GeoTrellis/ GeoPySpark
  • 27. Does Training Data Error Impact Classification Performance?
  • 29. Next Steps 1. Errors in image atmospheric corrections 2. Increase feature space for classifier 3. Improve label quality 4. Quantify gap between worker and ground
  • 31. 8 Circle Bias, many false positive identified because of overreliance on circular features https://github.com/ecoh ydro/CropMask_RCNN
  • 32. Probability score above .7 deemed a center pivot Tested on never before seen 512x512 tiles 11 Some center pivots are missed because of date mismatch between imagery and labels of the reference dataset
  • 33. BAYESIAN MODEL AVERAGING: ! " # = % &'( ) ! *& # !("|#, *&) ": the ground truth, which will be either ‘field’ or ‘no field’ #: the given data of crowdsourcing opinions for labeling this pixel (e.g., # = {#mapper_1 = field , #mapper_/= no field, …} ) *&: the Mappers considered (1) 012234&’s opinion: how much probability to be " (2) Weight (or evidence): is the probability that we weigh 012234&’s opinion based on their mapping history combining crowdsourcing labels from their mapping history
  • 34. MAPPER OPINION In our mapping project, mappers are allowed to only label a crispy category for polygons (either ‘field’ or ‘no field’). So ! " #, %& = 0 )* 1 (1) !(" = -./01|#& = -./01, %&) = 1 (2) !(" = 4) -./01|#& = -./01, %&) = 0 (3) !(" = 4) -./01|#& = 4) -./01, %&) = 1 (4) !(" = -./01|#& = 4) -./01, %&) = 0
  • 35. WEIGHT Weight: ! "# $ ∝ ! $ "# !("#) (1) !("#): ‘mapper priors’, is our prior belief for mapper '. We can use average score (combining geometric and thematic accuracy) to represent our belief (()*) ∝ (∑,-. / 01234,) /7 (2) ! $ "# : ‘mapper likelihood’, ! $ "# ∝ exp(- . 8 9:;#) [1][2] BIC(Bayesian Information Criterion) = ln ? ∗ A − 2 ln D $ ̂F, " ‘BIC simply reduces to maximum likelihood when the number of parameters is equal for the models of interest’ [3] , so 9:; ≈ −2 ln D $ IF, " . After adjustment, ( J )* ∝ K J ̂F, )* (Maximum mapper likelihood) (? is the sample number, A is the parameter number to be estimated (our case has only one, i.e., L), ML is the label that maximizes the likelihood function)
  • 36. WEIGHT (CONTI.) Weight: ! "# $ ∝ ! $ "# !("#) Mapper likelihood: ' ( )* ∝ + ( ,-, )* (Maximum Mapper likelihood) (1) !(- = 01234| ,-, "#) = ! $ = 01234 - = 01234, "# = (∑8 9 :;< :;<=>?< ) /A (2) !(- = BC 01234| ,-, "#) = ! $ = BC 01234 - = BC 01234, "# = (∑8 9 :?< :?<=>;< ) /A D $ ̂-, " can be computed as: * Maximum mapper likelihood is actually average producer’s accuracy of the mapper
  • 37. SUMMARY ! " # = ∑&'( ) ! *& # !("|#, *&) weight = score ∗ producer′s accuracy ∝ P M8 D P("|D, M8) = 0 ;< 1 Labeling: If ! " = >?@AB # > ! " = D; >?@AB # (or ! " = >?@AB # > 0.5), we give a consensus label as field; otherwise, we give a label as no field The posterior probability of the pixel label " given the data of mappers’ opinions (#): (*& is the mapper ?) → ! " # = ∑FGH I JK&LMNF∗ O(P|Q,RF) ∑FGH I JK&LMNF , where