SlideShare a Scribd company logo
Connecting Data
and Intelligence:
The Role of FME in Machine Learning
The Peak of Data
and AI 2025
2025
The
Peak
of
Data
and
AI
Thiago Brigagão
Director
Danilo de
Lima
Director
1. Solutial
2. From Paper to Place
3. Object Identification
4. Results
Agenda
2025
The
Peak
of
Data
and
AI
Solutial – Soluções e Análise de Dados
We are located in São José dos Campos - SP, and has over 18 years
of market experience. We are the official representatives of the
FME Platform in Brazil and South America.
Complete range of services, including specialized consulting,
project development, and customized training. We hold all the
necessary certifications, provided by Safe Software, to ensure
excellence in each of these activities.
2025
The
Peak
of
Data
and
AI
Market-leading solutions
2025
The
Peak
of
Data
and
AI
Segments
Agriculture &
Forest
Energy GIS & IT Government
Mining Water Oil & Gas
Telco
From Paper to Place:
Extracting Land Intelligence from Old Records
2025
The
Peak
of
Data
and
AI
Imagine…
You need to extract key insights from an 80-
page document — written 20 years ago and
available only as a scanned image.
2025
The
Peak
of
Data
and
AI
Sounds Challenging?
2025
The
Peak
of
Data
and
AI
For many organizations,
this is part of daily life — they need to
make important decisions using old and
hard-to-find information.
2025
The
Peak
of
Data
and
AI
Introduction
Every day, organizations in different areas
need to read land documents to find
important information like the size of the
property, who owns it now, and other legal
or land details.
2025
The
Peak
of
Data
and
AI
Doing this work normally needs a special
team that only reads and checks the
documents.
These documents are usually very
long.
2025
The
Peak
of
Data
and
AI
This means the process is manual, takes
a lot of time and effort, and costs a lot of
money.
2025
The
Peak
of
Data
and
AI
Technical Challenges
1. IMAGE-BASED PDFS
Most documents are scanned images, not text-based files.
2. COMPLEX LAYOUTS
Text is scattered, mixed with notary stamps and visual elements.
3. INCONSISTENT QUALITY
Document clarity and resolution vary significantly.
4. TEXT EXTRACTION
How to accurately read and extract content from image-based documents?
5. TEXT OPTIMIZATION
How to clean and improve the quality of extracted text, especially from old or noisy documents?
6. INFORMATION STRUCTURING
How to automatically generate summaries and extract key data like property owners and land area?
2025
The
Peak
of
Data
and
AI
1. IMAGE-BASED PDFS
Most documents are scanned images, not text-based files.
2. COMPLEX LAYOUTS
Text is scattered, mixed with notary stamps and visual
elements.
2
Extracting the text
Uses Tesseract OCR to identify text in the
processed image.
4
Sending to ChatGPT to
analyze the text
Text into parts and sent to ChatGPT
for step-by-step analysis, enabling
more accurate extraction
z
Applies statistics to recover the average
of the boxes with text.
Applies dissolve to merge the boxes that
are touching.
Recovers the areas of higher text clustering,
excluding other areas that are not of
interest.
Crops the image from the PDF based on the
identified text areas.
Raster processing, noise removal, and
extraction of letter bounding boxes.
Raster processing and text
area identification
PDF Reading (RASTER)
1
Prepares the questions for assembling the
request to ChatGPT.
Separates the fragmented texts and
prepares them for sending to ChatGPT.
Word count to fragment the text into
pieces.
Preparing the text for the OpenAI API
request - ChatGPT
3
5
Sends the response via email to the
requesting user.
Processing the response and sending it via
e-mail to the requester.
Prepares the response and the HTML
template to send the reply via email.
2025
The
Peak
of
Data
and
AI
Key Point
It’s important to highlight that, although the Transformer
OpenAIChatGPTConnector exists, we choose to work directly
with Python code to fully customize the solution according to
the specific needs of the project.
Another key point is the use of the Tiktoken library, which
allows us to track the number of tokens used during both the
input and output phases of the interaction
2025
The
Peak
of
Data
and
AI
2025
The
Peak
of
Data
and
AI
Turning lost information into
intelligent decisions is the
true power of automation
combined with artificial
intelligence
— ChatGPT, your co-pilot in data innovation
Object Identification:
Extraction of objects and Data Quality
In development
2025
The
Peak
of
Data
and
AI
One of the largest Energy Distributors in South America.
Total number of customers: around 8.8 million.
Population served: approximately 18 million people.
Coverage area: 785 municipalities in Brazil.
Transmission: Operates more than 10,000 km of transmission lines.
2025
The
Peak
of
Data
and
AI
Our eletrical client has a manual process for
identifying objects, wich is very expensive.
2025
The
Peak
of
Data
and
AI
Sounds Challenging?
2025
The
Peak
of
Data
and
AI
Drone images over the transmission line are generated, and it is necessary to manually inspect these
images.
2025
The
Peak
of
Data
and
AI
Technical Challenges
1. RECEIVING THESE DAILY IMAGES
2. OBJECT IDENTIFICATIONS (VISUALLY)
Transformers — to step down or step up electrical voltage.
Fuse cutouts — for circuit protection and disconnection.
Reclosers (automatic circuit reclosers) — to automatically restore power after minor faults.
Insulator bolts / pin or suspension insulators — to hold the wires without energy loss.
Conductor cables — for medium or low voltage.
Crossarms and hardware — for mechanical support.
Smart meters — in advanced urban areas.
Telecommunication boxes — for internet, telephone, and fiber optic cables.
Public lighting (light arms and luminaires) — installed on the same poles.
Grounding systems — for safety against electrical discharges.
Sensors and cameras — in smart grid networks, for real-time monitoring.
2025
The
Peak
of
Data
and
AI
Technical Challenges
3. REPORT GENERATION (MANUAL)
Fill in an excel template file
4. GENERATE SHAPE FILE
Containing the information linked to the utility poles
5. ASSET BASE
Correction of the asset base – Data Base
6. USER NOTIFICATION
Notify user by email of results
2025
The
Peak
of
Data
and
AI
2025
The
Peak
of
Data
and
AI
Solution
Data Integration/ETL + GeoAI
Picterra
2025
The
Peak
of
Data
and
AI
Picterra
From Earth observation imagery to actionable insights with AI
Founded in 2016 in Switzerland
• +100 corporate clients globally
• + 30.000 Machine Learning Models
2025
The
Peak
of
Data
and
AI
Products
Detect objects, patterns, and
changes faster than ever by
managing the entire geospatial ML
pipeline with our cloud-native
platform.
Analysis of plots related to
sustainability compliance, such as
EUDR and carbon storage estimates.
It can be integrated with various
software and platforms, mainly
because it uses open standards
(REST, JSON, GeoJSON, shapefile),
which allows for great flexibility.
GEOAI Tracer
API
2025
The
Peak
of
Data
and
AI
Integration Picterra and FME
Drone Images Build Detectors ML FME Output
Perform object detection with PicterraConnector within FME:
• List folders, rasters, detectors and vector layers.
• Upload images and detection áreas.
• Perform detection.
• Download results.
2025
The
Peak
of
Data
and
AI
Solution
Extract Images Identify Objects
Receving the
results
Email
3
4
5
1 2
Update Excel
file template
2025
The
Peak
of
Data
and
AI
Thank
You
Danilo de Lima & Thiago Brigagão
Solutial Soluções e Análise de Dados
danilo@solutial.com.br
thago@solutial.com.br

More Related Content

PDF
How to design ai functions to the cloud native infra
PDF
Cognitive IoT Whitepaper_Dec 2015
PDF
Transformacion del Negocio Financiero por medio de Tecnologias Cloud
PDF
Dell AI and HPC University Roadshow
PDF
Cubitic: Predictive Analytics
PDF
33977_IoT_in_HighTech_11_03_14
PDF
Wireless Global Congress: 2020 is not that far away
PDF
InfoRepos Academy Introduction v1.1 - IIOT Experiential Learning Program
How to design ai functions to the cloud native infra
Cognitive IoT Whitepaper_Dec 2015
Transformacion del Negocio Financiero por medio de Tecnologias Cloud
Dell AI and HPC University Roadshow
Cubitic: Predictive Analytics
33977_IoT_in_HighTech_11_03_14
Wireless Global Congress: 2020 is not that far away
InfoRepos Academy Introduction v1.1 - IIOT Experiential Learning Program

Similar to Connecting Data and Intelligence: The Role of FME in Machine Learning (20)

PDF
IoT by Silver Touch Tech Lab
PDF
T Bytes Hybrid cloud infrastructure
PDF
NUS-ISS Learning Day 2018- Harnessing the power of cloud solutions in urban a...
PDF
08_-_Masamichi_Tanaka_-_Bigdata_and_AI_in_IOT.pdf
PDF
Cloud Computing and Edge Computing(CTO Kieun Park) - Edge Computing Seminar
PPTX
Ypo 20190131 v1
PDF
Dell AI Telecom Webinar
PDF
How to fail in the IoT business
PDF
Future of Big Data
PDF
internet of things : 2021 perspective
PDF
Vertex Perspectives | AI-optimized Chipsets | Part I
PDF
Vertex perspectives ai optimized chipsets (part i)
PPTX
SAP Leonardo
PPTX
Driving Network and Marketing Investments at O2 by Focusing on Improving the ...
PDF
CCC-Internet of Things Foundation
PPTX
Big Data, Trends,opportunities and some case studies( Mahmoud Khosravi)
PDF
The boom in Xaas and the knowledge graph
PDF
Wikibon 2018 Predictions
PDF
Building a reliable and scalable IoT platform with MongoDB and HiveMQ
PDF
Top 8 AI Trends and Predictions to Watch out for in 2022 ARTiBA.pdf
IoT by Silver Touch Tech Lab
T Bytes Hybrid cloud infrastructure
NUS-ISS Learning Day 2018- Harnessing the power of cloud solutions in urban a...
08_-_Masamichi_Tanaka_-_Bigdata_and_AI_in_IOT.pdf
Cloud Computing and Edge Computing(CTO Kieun Park) - Edge Computing Seminar
Ypo 20190131 v1
Dell AI Telecom Webinar
How to fail in the IoT business
Future of Big Data
internet of things : 2021 perspective
Vertex Perspectives | AI-optimized Chipsets | Part I
Vertex perspectives ai optimized chipsets (part i)
SAP Leonardo
Driving Network and Marketing Investments at O2 by Focusing on Improving the ...
CCC-Internet of Things Foundation
Big Data, Trends,opportunities and some case studies( Mahmoud Khosravi)
The boom in Xaas and the knowledge graph
Wikibon 2018 Predictions
Building a reliable and scalable IoT platform with MongoDB and HiveMQ
Top 8 AI Trends and Predictions to Watch out for in 2022 ARTiBA.pdf
Ad

More from Safe Software (20)

PDF
Getting Started with Data Integration: FME Form 101
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Automating ArcGIS Content Discovery with FME: A Real World Use Case
PDF
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
PDF
Infrastructure planning and resilience - Keith Hastings.pptx.pdf
PDF
Notification System for Construction Logistics Application
PDF
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
PDF
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
PDF
Transforming Utility Networks: Large-scale Data Migrations with FME
PDF
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
PDF
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
PDF
FME in Overdrive - Peak of Data & AI 2025
PDF
Powering GIS with FME and VertiGIS - Peak of Data & AI 2025
PDF
Pipeline Industry IoT - Real Time Data Monitoring
PDF
FME in Overdrive: Unleashing the Power of Parallel Processing
PDF
Fiber to the People! By Deutsche Telekom
PDF
Governing Geospatial Data at Scale: Optimizing ArcGIS Online with FME in Envi...
PDF
Enhancing Environmental Monitoring with Real-Time Data Integration: Leveragin...
PDF
Introducing and Operating FME Flow for Kubernetes in a Large Enterprise: Expe...
PDF
5 Things to Consider When Deploying AI in Your Enterprise
Getting Started with Data Integration: FME Form 101
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Automating ArcGIS Content Discovery with FME: A Real World Use Case
Peak of Data & AI Encore - Real-Time Insights & Scalable Editing with ArcGIS
Infrastructure planning and resilience - Keith Hastings.pptx.pdf
Notification System for Construction Logistics Application
Building Real-Time Digital Twins with IBM Maximo & ArcGIS Indoors
Using FME to Develop Self-Service CAD Applications for a Major UK Police Force
Transforming Utility Networks: Large-scale Data Migrations with FME
Peak of Data & AI Encore AI-Enhanced Workflows for the Real World
Automating Feature Enrichment and Station Creation in Natural Gas Utility Net...
FME in Overdrive - Peak of Data & AI 2025
Powering GIS with FME and VertiGIS - Peak of Data & AI 2025
Pipeline Industry IoT - Real Time Data Monitoring
FME in Overdrive: Unleashing the Power of Parallel Processing
Fiber to the People! By Deutsche Telekom
Governing Geospatial Data at Scale: Optimizing ArcGIS Online with FME in Envi...
Enhancing Environmental Monitoring with Real-Time Data Integration: Leveragin...
Introducing and Operating FME Flow for Kubernetes in a Large Enterprise: Expe...
5 Things to Consider When Deploying AI in Your Enterprise
Ad

Recently uploaded (20)

DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Empathic Computing: Creating Shared Understanding
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPT
Teaching material agriculture food technology
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Electronic commerce courselecture one. Pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Encapsulation theory and applications.pdf
The AUB Centre for AI in Media Proposal.docx
NewMind AI Weekly Chronicles - August'25 Week I
Understanding_Digital_Forensics_Presentation.pptx
Empathic Computing: Creating Shared Understanding
MIND Revenue Release Quarter 2 2025 Press Release
Teaching material agriculture food technology
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
The Rise and Fall of 3GPP – Time for a Sabbatical?
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Programs and apps: productivity, graphics, security and other tools
MYSQL Presentation for SQL database connectivity
Advanced methodologies resolving dimensionality complications for autism neur...
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Chapter 3 Spatial Domain Image Processing.pdf
Electronic commerce courselecture one. Pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Encapsulation theory and applications.pdf

Connecting Data and Intelligence: The Role of FME in Machine Learning