Complex Data Preparation and Preprocessing for Predicting Forest Pests with GeoAI
Presenters
The Peak of Data Integration 2023
Dr. Christopher Britsch
Consultant
con terra GmbH
Agenda
1. Introduction
2. Project Background
3. Data Preparation and Preprocessing
4. Conclusion
Introduction
GeoAI - Definition
GeoAI is a machine learning technology that enables the capture and
analysis of complex patterns and structures in (geospatial) data.
Setup of a "typical" GeoAI Project
Data Preprocessing
AI-Methods
Integration and Operation
Visualization and Application
Project Background
“We are the first generation to feel the impact
of climate change and the last generation that
can do something about it”.
Barack Obama
Former President of the USA
Project - KINoPro
Künstliche Intelligenz zur Nonnenfalter-Prognose
(Artificial Intelligence for Black Arches Forecasting)
• Research project of TU Dresden and con terra GmbH
• Data from the state forestries of Brandenburg and Saxony
• Climate change influence
• Trees struggling with dry and hot weather
• Forest pests adapt faster than plants
→ Irregular population growth/appearance
• Forestry personnel need to be managed more efficiently
• New prediction models are necessary
Approach - Overview
Monitoring of Black Arches
GeoAI Model
Influencing Factors
Data Research
Web Application
Data Preparation and Preprocessing
Validation and survey for additional parameters
Data
● Around 6k data points
● Right-skewed distribution
● Data from 70 weeks per point
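To illustrate what "right-skewed" means for count data like this, here is a minimal sketch. The counts are made up, and the log transform is a common way of handling skewed counts, not something the slides state was used in KINoPro.

```python
import math

def sample_skewness(values):
    """Moment-based skewness g1 = m3 / m2**1.5; a positive value means right-skewed."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m3 = sum((v - mean) ** 3 for v in values) / n
    return m3 / m2 ** 1.5

# Hypothetical moth counts: many small catches, a few outbreak-level ones.
counts = [0, 1, 1, 2, 2, 3, 5, 8, 40, 120]

# log(1 + c) is defined for zero counts and compresses the long right tail.
log_counts = [math.log1p(c) for c in counts]

skew_raw = sample_skewness(counts)
skew_log = sample_skewness(log_counts)
print(f"skewness raw: {skew_raw:.2f}, after log1p: {skew_log:.2f}")
```

For this sample the skewness stays positive after the transform but is clearly smaller than for the raw counts.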
Parameters
• Target Variable: Moth Count per Year
• Additional Information:
• Land Cover Classification
• Altitude
• Slope
• Trap Alignment
Air Temperature
Rainfall
Humidity
Wind Speed
Depth of Frost
Sum of Last Year's Black Arches
Soil Moisture
Soil Temperature
Output
4 weeks prior to moth activities
June 15th
August 16th
Data Preparation and Preprocessing
Challenges in the Data
● Data Formats
○ Excel files, Access-DBs, Esri ASCII Grids, GeoTIFFs, netCDF, etc.
● Projections
○ Different projections, some of them highly custom
● Spatial Correlation/Corrections
○ Traps can slightly change position
○ Positions are, however, critical to connect previous with current year
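One way to picture the trap-matching problem is a nearest-neighbour match within a distance tolerance. This is only a sketch under assumptions: the trap IDs, the coordinates, and the 50 m tolerance are hypothetical, and the slides do not say how the project actually linked positions across years.

```python
import math

def match_traps(previous, current, max_shift_m=50.0):
    """Greedy nearest-neighbour matching of trap positions between two years.

    previous/current: dict of trap_id -> (x, y) in metres, same projected CRS.
    Returns dict of current_id -> previous_id for traps within max_shift_m.
    """
    candidates = []
    for cur_id, (cx, cy) in current.items():
        for prev_id, (px, py) in previous.items():
            distance = math.hypot(cx - px, cy - py)
            if distance <= max_shift_m:
                candidates.append((distance, cur_id, prev_id))

    # Closest pairs are assigned first; each trap is used at most once.
    candidates.sort()
    matches, used_previous = {}, set()
    for _, cur_id, prev_id in candidates:
        if cur_id in matches or prev_id in used_previous:
            continue
        matches[cur_id] = prev_id
        used_previous.add(prev_id)
    return matches

# Hypothetical positions: two traps moved by a few metres, one trap is new.
previous = {"A-2021": (1000.0, 2000.0), "B-2021": (1500.0, 2500.0)}
current = {
    "A-2022": (1012.0, 1995.0),
    "B-2022": (1498.0, 2503.0),
    "C-2022": (9000.0, 9000.0),
}
matches = match_traps(previous, current)
print(matches)
```

Traps without a partner inside the tolerance (here "C-2022") stay unmatched, so they have no previous-year count to connect to.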
Challenges in the Data
● Pest Monitoring Strategies
○ Different states – different strategies
→ data has to be standardized
● Weather Data
○ Changes in temporal resolution, terminology, and accuracy
→ data has to be standardized
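Standardizing weather data could look roughly like the sketch below: source-specific field names are mapped to one vocabulary, and all readings are aggregated to a common weekly resolution. The alias table, the field names, and the choice of ISO weeks are assumptions for illustration, not the project's actual schema.

```python
from collections import defaultdict
from datetime import date

# Hypothetical source-specific field names mapped to one shared vocabulary.
ALIASES = {
    "TT": "air_temperature",
    "temp_air": "air_temperature",
    "RR": "rainfall",
    "precip": "rainfall",
}
# Variables that are totalled per week; all others are averaged.
SUMMED = {"rainfall"}

def to_weekly(records):
    """records: iterable of (date, source_field_name, value).

    Returns {(iso_year, iso_week, standard_name): weekly value}.
    """
    buckets = defaultdict(list)
    for day, field, value in records:
        name = ALIASES.get(field, field)
        iso_year, iso_week, _ = day.isocalendar()
        buckets[(iso_year, iso_week, name)].append(value)
    return {
        key: sum(vals) if key[2] in SUMMED else sum(vals) / len(vals)
        for key, vals in buckets.items()
    }

# Made-up readings from two sources that name the same variables differently.
records = [
    (date(2023, 6, 12), "TT", 18.0),
    (date(2023, 6, 13), "temp_air", 20.0),
    (date(2023, 6, 12), "RR", 2.0),
    (date(2023, 6, 14), "precip", 3.5),
    (date(2023, 6, 19), "TT", 25.0),
]
weekly = to_weekly(records)
print(weekly)
```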
Challenges in the Data
● Data Quality Assurance
○ Filtering of nonsense data
○ Interpolating missing data
● Performance
○ Large raster files are reduced in size by clipping
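The two quality-assurance steps can be sketched as follows: readings outside a plausible range are discarded, then gaps are filled by linear interpolation. The plausibility bounds and the sample values are assumptions, and the slides do not specify which interpolation method was used.

```python
def clean_series(values, low, high):
    """Drop implausible readings, then linearly interpolate interior gaps.

    values: list of floats or None (missing), equally spaced in time.
    Leading and trailing gaps stay None because they have only one neighbour.
    """
    # Step 1: treat anything outside [low, high] as missing.
    cleaned = [v if v is not None and low <= v <= high else None for v in values]

    # Step 2: fill each gap between two known readings on a straight line.
    known = [i for i, v in enumerate(cleaned) if v is not None]
    for left, right in zip(known, known[1:]):
        span = right - left
        for i in range(left + 1, right):
            weight = (i - left) / span
            cleaned[i] = cleaned[left] + weight * (cleaned[right] - cleaned[left])
    return cleaned

# Hypothetical weekly air temperatures in °C; -999.0 is a sensor error code.
temperatures = [12.0, None, 16.0, -999.0, 20.0, None]
cleaned = clean_series(temperatures, low=-40.0, high=50.0)
print(cleaned)  # [12.0, 14.0, 16.0, 18.0, 20.0, None]
```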
Challenges in the Data
● Most AI algorithms expect vectors of numerical data as input
● The input for the KINoPro model is composed of features from various data sources and with different resolutions
● Data must be transformed to a consistent format and resolution
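A minimal sketch of what "vectors of numerical data" means here: per-trap attributes and weekly weather are flattened into one fixed-order list of floats. The land cover classes, the dictionary keys, and the feature order are hypothetical, and the actual KINoPro feature layout is not given in the slides.

```python
# Hypothetical land cover classes; a categorical value becomes a one-hot block.
LAND_COVER_CLASSES = ["coniferous", "deciduous", "mixed"]

def one_hot(value, classes):
    """Encode a category as 0/1 floats, one position per known class."""
    return [1.0 if value == c else 0.0 for c in classes]

def build_feature_vector(trap, weekly_weather):
    """Flatten one trap's static attributes and weekly weather into floats.

    trap: dict with 'land_cover', 'altitude_m', 'slope_deg', 'last_year_count'.
    weekly_weather: list of per-week dicts, already at weekly resolution.
    """
    vector = one_hot(trap["land_cover"], LAND_COVER_CLASSES)
    vector += [
        float(trap["altitude_m"]),
        float(trap["slope_deg"]),
        float(trap["last_year_count"]),
    ]
    for week in weekly_weather:
        vector += [float(week["air_temperature"]), float(week["rainfall"])]
    return vector

# Made-up example trap with two weeks of weather.
trap = {"land_cover": "coniferous", "altitude_m": 85, "slope_deg": 2.5, "last_year_count": 14}
weather = [
    {"air_temperature": 15.5, "rainfall": 3.0},
    {"air_temperature": 17.0, "rainfall": 0.0},
]
features = build_feature_vector(trap, weather)
print(features)
```

Every trap must supply the same number of weeks so that all vectors have the same length.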
Data Preprocessing with FME
● FME supports a wide range of input data formats
● No need for…
○ …multiple scripts
○ …a variety of different libraries
○ …researching best solutions for each and every format
Data Preprocessing with FME
GeoTIFFs
ASCII Grids
netCDF
Gridded Binary
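For comparison, a hand-written reader for just one of these formats might look like the sketch below. It is not how FME works internally. It is a simplified reader for the Esri ASCII Grid layout (assuming the xllcorner/yllcorner header variant and ignoring NODATA handling), followed by a bounding-box clip like the one mentioned under performance. The sample grid is made up.

```python
def parse_ascii_grid(text):
    """Parse a simplified Esri ASCII Grid: 'key value' header lines, then rows."""
    lines = text.strip().splitlines()
    header, i = {}, 0
    while i < len(lines):
        parts = lines[i].split()
        # Header lines have exactly two tokens and start with a letter.
        if len(parts) != 2 or not parts[0][0].isalpha():
            break
        header[parts[0].lower()] = float(parts[1])
        i += 1
    rows = [[float(v) for v in line.split()] for line in lines[i:]]
    return header, rows

def clip_grid(header, rows, xmin, ymin, xmax, ymax):
    """Keep only the cells whose centre lies inside the bounding box."""
    cell = header["cellsize"]
    x0, y0 = header["xllcorner"], header["yllcorner"]
    nrows = len(rows)
    clipped = []
    for r, row in enumerate(rows):
        # Row 0 is the northernmost row in this format.
        y_centre = y0 + (nrows - 1 - r + 0.5) * cell
        if not (ymin <= y_centre <= ymax):
            continue
        clipped.append(
            [v for c, v in enumerate(row) if xmin <= x0 + (c + 0.5) * cell <= xmax]
        )
    return clipped

SAMPLE = """ncols 4
nrows 3
xllcorner 0.0
yllcorner 0.0
cellsize 10.0
NODATA_value -9999
1 2 3 4
5 6 7 8
9 10 11 12
"""
header, rows = parse_ascii_grid(SAMPLE)
subset = clip_grid(header, rows, xmin=10.0, ymin=0.0, xmax=30.0, ymax=20.0)
print(subset)  # [[6.0, 7.0], [10.0, 11.0]]
```

GeoTIFF, netCDF, and Gridded Binary would each need their own reader or library.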
Conclusion
Data Preprocessing with FME
● All relevant transformations could be performed
● Workspace is easy to read and understand
● Every step is reproducible and, if necessary, easy to adjust to new parameters
● All in one place, everything connected
Thank You!
c.britsch@conterra.de
