SlideShare a Scribd company logo
PreparingYour Rich GML Data for deegree
-
the ETL Challenge
Just van den Broecke
deegree Community Space 2012, Bonn
November 13, 2012
www.justobjects.nl
About Me
Independent Open Source Geospatial Professional
Trailblazer OSGeo Dutch Local Chapter
Just van den Broecke
just@justobjects.nl
www.justobjects.nl
Stetl: Preparing Rich GML Data for deegree - The ETL Challenge
Stetl: Preparing Rich GML Data for deegree - The ETL Challenge
THE DUTCH KADASTER
GETS INSPIRED WITH
deegree day - nov 16, 2010
THE DUTCH KADASTER
GETS INSPIRED WITH
deegree Community Space - nov 13, 2012
Stetl: Preparing Rich GML Data for deegree - The ETL Challenge
50+ DATASETS
WMS/WFS/WCS/ATOM
METADATA
+
NL INSPIRE ACCESS POINT
PDOK	
  –	
  Open	
  Source	
  &	
  Open	
  Standards
Applications
OGC/ISO/INSPIRE Web Services
Storage
Conversion
Services
sETLFME?
OSGeo - Bolsena - 2010
BOLSENA
2012
ALLES VORBEI ?
BOLSENA
2012
BOLSENA
2012
PreparingYour Rich GML Data for deegree
-
the ETL Challenge
Just van den Broecke
deegree Community Space 2012, Bonn
November 13, 2012
www.justobjects.nl
We have a
Problem
The Rich GML
Problem
Rich GML = Complex Mess
INSPIRE
Dutch National DSs
AFIS-ALKIS-ATKIS
.
.
Stetl: Preparing Rich GML Data for deegree - The ETL Challenge
The Streetname!
Complex
Model
Transformations
Millions
of
Objects
10s of Millions
of
<Elements>
Multiple
Transformation
Steps
Solution is
Spatial ETL
A.K.A.
ThankYou
for your
Attention!
But what about...
....FOSS ?
... Stetl?
FOSS ETL - High Level
FOSS ETL - Lower Level
But Each Powerful by Itself
ogr2ogr
FOSS ETL - DIY ? (No!)
FOSS ETL - How to Combine?
=+ + ?
ogr2ogr
FOSS ETL - Add Python to Equation
=+ + ?( )
ogr2ogr
=+ +
Stetl
( )
ogr2ogr
Stetl
=
Simple
Streaming
Spatial
Speedy
ETL
Process Chain
Input Filter Output
gml
Filter
Stetl concepts
Speed: Streaming
Input Filter Output
gml
Stetl concepts
Speed: Going Native
Input Filter Output
gml
ogr2ogr sETLsETL
Native C Libs/Progs
Calls
Stetl concepts
Example: GML to PostGIS
Reader
XML
Splitter
ogr2ogr
gml
Stetl concepts
Example: INSPIRE Model Transform
ogr2ogr XSLT Writer
gml
Stetl concepts
Example: deegree Store
ogr2ogr XSLT
deegree
Writer
Stetl concepts
Process Chain - How?
Input Filters Output
Stetl concepts
Example: XML to Shape
The Source
Example: XML to Shape
First: XSLT Transform to GML
Example: XML to Shape
XML
Input
XSLT ogr2ogr
Example: XML to Shape
The SETL Chain Config File
Process
Chain
Reader
XSLT
ogr2ogr
Example Components
Input Filters Output
Stetl concepts
XMLFile XSLT GMLFile
ogr2gml GMLSplitter gml2ogr
LineStream XMLValidator WFS-T
deegree* FeatureExtractor deegree*
YourInput YourFilter YourOutput
Data Structures
Stetl concepts
✴Components exchange Packet
✴Packet contains data
✴Data format:
xml_line_stream
etree_doc
etree_feature_array
xml_doc_as_string
any
deegree Integration
Stetl concepts
✴Input
DeegreeBlobstoreInput
✴Output
DeegreeBlobstoreInput
DeegreeFSLoaderOutput
WFSTOutput
Cases
✴INSPIRE Download Services
publish to deegree store (WFS)
GML files (for Atom Feed)
✴National GML Datasets
GML to PostGIS
Case: Dutch Addresses
Source
<GML>
sETL sETL
deegree
WFS
sETL
INSPIRE
<GML>
Atom
Feed
Other
INSPIRE
Addresses
Dutch
Addresses+
Buildings
Dutch
Geocoder
deegree
blobstore
ThankYou !
stetl.org
www.inspire-foss.org

More Related Content

PDF
NLExtract - Unlocking Dutch Open Geo-datasets - High Level Introduction
PDF
XSLT for INSPIRE Transformation by Dutch Kadaster in ESDIN
PDF
Who Pulls the Strings?
PDF
Symfony2 and MongoDB
PDF
Just's Career Highlights - Version 2
PDF
Designing & Developing mLearning using HTML5 #mlearncon
PDF
Dataiku - google cloud platform roadshow - october 2013
PDF
The Heron Mapping Client
NLExtract - Unlocking Dutch Open Geo-datasets - High Level Introduction
XSLT for INSPIRE Transformation by Dutch Kadaster in ESDIN
Who Pulls the Strings?
Symfony2 and MongoDB
Just's Career Highlights - Version 2
Designing & Developing mLearning using HTML5 #mlearncon
Dataiku - google cloud platform roadshow - october 2013
The Heron Mapping Client

Similar to Stetl: Preparing Rich GML Data for deegree - The ETL Challenge (7)

PDF
Hippo gettogether april 2012 faceted navigation a tale of daemons
ODP
Milton Webdav Presentation for Linagora
PDF
Inglês curriculo marcos
PDF
Curriculo Marcos in English
PDF
D.I.S.Cloud
DOCX
Cv of mahmoud mohamed doudah dwh developer-
PPTX
Conference Creative Wallonia, November 16th 2012
Hippo gettogether april 2012 faceted navigation a tale of daemons
Milton Webdav Presentation for Linagora
Inglês curriculo marcos
Curriculo Marcos in English
D.I.S.Cloud
Cv of mahmoud mohamed doudah dwh developer-
Conference Creative Wallonia, November 16th 2012
Ad

More from Just van den Broecke (20)

PDF
Just's Career Highlights - Version 1
PDF
Open Sensor Networks
PDF
Open Sensor Networks with LoRa TTN and SensorThings API
PDF
Sensor SDI in PDOK with Smart Emission Platform
PDF
osgeonl-opening-foss4gnl-2018
PDF
Stetl-engine-nlextract-smartem
PDF
OSGeo.nl-NewYearsParty-2018-Opening
PDF
Opening OSGeo.nl Day 2017
PDF
Smart Emission Data Platform
PPT
De Levenscyclus van Open Geodata met Open Source Tools
PDF
NLExtract Project - OGT Award Pitch GeoBuzz 2016
PDF
Smart Emission - Citizens measuring Air Quality - Overview
PDF
Geospatial ETL with Stetl - GeoPython 2016
PDF
Smart Emission - Data - Viewers - Standards
PDF
NLExtract voor BAG - overview
PDF
3D Breakthrough Meeting - 3D Standards progress
PDF
Wandelen met GPS en De Evolutie van Navigatie
PDF
OSGeo.nl - Year 2014 Highlights
PDF
Nederland Ontsloten! OSGeo.nl Dag 2014
PDF
Big Data - Introduction and Research Topics - for Dutch Kadaster
Just's Career Highlights - Version 1
Open Sensor Networks
Open Sensor Networks with LoRa TTN and SensorThings API
Sensor SDI in PDOK with Smart Emission Platform
osgeonl-opening-foss4gnl-2018
Stetl-engine-nlextract-smartem
OSGeo.nl-NewYearsParty-2018-Opening
Opening OSGeo.nl Day 2017
Smart Emission Data Platform
De Levenscyclus van Open Geodata met Open Source Tools
NLExtract Project - OGT Award Pitch GeoBuzz 2016
Smart Emission - Citizens measuring Air Quality - Overview
Geospatial ETL with Stetl - GeoPython 2016
Smart Emission - Data - Viewers - Standards
NLExtract voor BAG - overview
3D Breakthrough Meeting - 3D Standards progress
Wandelen met GPS en De Evolutie van Navigatie
OSGeo.nl - Year 2014 Highlights
Nederland Ontsloten! OSGeo.nl Dag 2014
Big Data - Introduction and Research Topics - for Dutch Kadaster
Ad

Recently uploaded (20)

PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
PPTX
The various Industrial Revolutions .pptx
PDF
1 - Historical Antecedents, Social Consideration.pdf
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PPT
Geologic Time for studying geology for geologist
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
CloudStack 4.21: First Look Webinar slides
PDF
Architecture types and enterprise applications.pdf
PDF
Getting Started with Data Integration: FME Form 101
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PDF
August Patch Tuesday
PPT
What is a Computer? Input Devices /output devices
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PDF
Hybrid model detection and classification of lung cancer
PPTX
Chapter 5: Probability Theory and Statistics
PPTX
Modernising the Digital Integration Hub
DP Operators-handbook-extract for the Mautical Institute
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
The various Industrial Revolutions .pptx
1 - Historical Antecedents, Social Consideration.pdf
Group 1 Presentation -Planning and Decision Making .pptx
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
Geologic Time for studying geology for geologist
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
CloudStack 4.21: First Look Webinar slides
Architecture types and enterprise applications.pdf
Getting Started with Data Integration: FME Form 101
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
NewMind AI Weekly Chronicles – August ’25 Week III
August Patch Tuesday
What is a Computer? Input Devices /output devices
Final SEM Unit 1 for mit wpu at pune .pptx
O2C Customer Invoices to Receipt V15A.pptx
Hybrid model detection and classification of lung cancer
Chapter 5: Probability Theory and Statistics
Modernising the Digital Integration Hub

Stetl: Preparing Rich GML Data for deegree - The ETL Challenge

Editor's Notes