SlideShare a Scribd company logo
GEOlab, Como Campus
A GRASS-based automated procedure
to compare OpenStreetMap and
authoritative road network datasets
Politecnico di Milano, Como Campus, DICA, via Valleggio 11, 22100 Como (Italy)
Maria Antonia Brovelli, Marco Minghini, Monia Elisa Molinari
2
✔ Increasing popularity of OpenStreetMap (OSM) as today's most notable
Volunteered Geographic Information (VGI) project on the Internet
✔ Increasing concern on VGI (and OSM) data quality:
Motivation of the work – VGI & OSM quality
GEOlab, Politecnico di Milano – Como Campus
➔ spatial accuracy
➔ temporal accuracy
➔ semantic accuracy
➔ up-to-dateness
➔ consistency
➔ fitness-for-use & fitness-for-purpose
➔ ...
✔ Increasing availability of open data from NMAs and CSC that can be used
as a source of comparison for VGI (and OSM) data:
➔ comparing two spatial datasets against each other is a challenging
geocomputation problem!
3
✔ Literature provides plenty of works assessing or comparing OSM quality
against that of authoritative datasets:
Motivation of the work – OSM comparisons
➔ strongly focused on road network
➔ mainly investigating OSM positional accuracy
➔ OSM compared to data from NMA (UK Ordnance Survey, French
NMA, USGS TNM/TIGER, etc.) and CSC (Navteq, TeleAtlas, etc.)
➔ semi- or fully-automated
➔ results from poor to very good
✔ Comparison techniques are very strong and fit for purpose, but mostly
application and dataset specific:
➔ hard to replicate
➔ difficult to extend to other dataset comparisons
GEOlab, Politecnico di Milano – Como Campus
4
✔ Novel methodology to compare OSM and authoritative road datasets:
Our methodology
➔ fully automated
➔ focused on spatial accuracy and completeness
➔ flexible, i.e. not developed for a specific dataset
➔ built with FOSS4G (Free and Open Source Software for Geospatial)
✗ made of required and optional operations
✗ users can define the value of the parameters involved to adapt the
procedure to their specific authoritative datasets
✗ users are supposed to be familiar with the authoritative dataset used
as reference
✗ reusable and extensible in case of need
GEOlab, Politecnico di Milano – Como Campus
5
➔ 1. Preliminary comparison of the datasets and computation of
global statistics
➔ 2. Geometric preprocessing of the OSM dataset to extract a subset
which is fully comparable with the IGN dataset
➔ 3. Evaluation of OSM spatial accuracy using a grid-based approach
✔ Currently developed as 3 GRASS GIS modules:
Our methodology – Overview
➔ written in Python
➔ available with a Graphical User Interface (GUI)
✔ Comparison between OSM and reference (IGN) road network datasets
composed of 3 consecutive steps:
GEOlab, Politecnico di Milano – Como Campus
6
✔ Import and select the OSM and IGN datasets [required]
Step 1: Preliminary comparison of the datasets
GEOlab, Politecnico di Milano – Como Campus
7
✔ Import and select the OSM and IGN datasets [required]
Step 1: Preliminary comparison of the datasets
GEOlab, Politecnico di Milano – Como Campus
data © IGN and © OpenStreetMap contributors
8
✔ Import and select the OSM and IGN datasets [required]
✔ If the extent of the OSM and/or IGN datasets is larger than the one of
interest, import a vector layer to be used as clipping mask [optional]
Step 1: Preliminary comparison of the datasets
GEOlab, Politecnico di Milano – Como Campus
9
✔ Apply a set of buffers of user-specified width around both the IGN and
OSM datasets, to compute the length and the length percentage of the
OSM and IGN datasets included in the buffer [required]
Step 1: Preliminary comparison of the datasets
GEOlab, Politecnico di Milano – Como Campus
10
✔ Compute also the total length of OSM and IGN datasets and their length
difference, both in map units and percentage [required]
Step 1: Preliminary comparison of the datasets
➔ output values are returned in a text file
GEOlab, Politecnico di Milano – Como Campus
11
✔ Compute also the total length of OSM and IGN datasets and their length
difference, both in map units and percentage [required]
Step 1: Preliminary comparison of the datasets
➔ output values are returned in a text file
GEOlab, Politecnico di Milano – Como Campus
12
✔ Compute also the total length of OSM and IGN datasets and their length
difference, both in map units and percentage [required]
Step 1: Preliminary comparison of the datasets
➔ output values are returned in a text file
GEOlab, Politecnico di Milano – Como Campus
✗ ≅450 km more in OSM than IGN dataset!
13
✔ Compute also the total length of OSM and IGN datasets and their length
difference, both in map units and percentage [required]
Step 1: Preliminary comparison of the datasets
➔ output values are returned in a text file
GEOlab, Politecnico di Milano – Como Campus
✗ ≅450 km more in OSM than IGN dataset!
➔ more footways and pedestrian routes mapped in OSM
Boulevard des Invalides Gare de l'Est
IGN
OSM
data © IGN and © OpenStreetMap contributors
14
✔ Compute also the total length of OSM and IGN datasets and their length
difference, both in map units and percentage [required]
Step 1: Preliminary comparison of the datasets
➔ output values are returned in a text file
GEOlab, Politecnico di Milano – Como Campus
✗ ≅450 km more in OSM than IGN dataset!
➔ cycleways and carriageways mapped as separate highways in OSM
IGN
OSM
Boulevard Jules Ferry data © IGN and © OpenStreetMap contributors
Boulevard Henri IV
15
✔ Outputs from Step 1 can be used to perform further analysis:
Step 1: Preliminary comparison of the datasets
➔ sensitivity analysis on the buffer width
GEOlab, Politecnico di Milano – Como Campus
16
✔ Outputs from Step 1 can be used to perform further analysis:
Step 1: Preliminary comparison of the datasets
➔ sensitivity analysis on the buffer width
GEOlab, Politecnico di Milano – Como Campus
17
✔ Cleaning of OSM dataset to make it comparable with IGN dataset
Step 2: preprocessing of the OSM dataset
GEOlab, Politecnico di Milano – Como Campus
18
✔ Cleaning of OSM dataset to make it comparable with IGN dataset
Step 2: preprocessing of the OSM dataset
➔ computationally intensive – work area divided in 4 sub-areas
GEOlab, Politecnico di Milano – Como Campus
1
4
2
3
data © IGN and © OpenStreetMap contributors
19
✔ Generalize the IGN dataset with the Douglas-Peucker algorithm [optional]
Step 2: preprocessing of the OSM dataset
➔ users have to enter the threshold for the algorithm
GEOlab, Politecnico di Milano – Como Campus
20
✔ Generalize the IGN dataset with the Douglas-Peucker algorithm [optional]
Step 2: preprocessing of the OSM dataset
✔ Split the line features of the datasets into segments [required]
➔ users have to enter the threshold for the algorithm
GEOlab, Politecnico di Milano – Como Campus
21
✔ Compute a measure of degree for the nodes of IGN dataset [required]
Step 2: preprocessing of the OSM dataset
➔ identify the terminal nodes (degree = 1)
GEOlab, Politecnico di Milano – Como Campus
data © IGN
22
✔ Apply a buffer of user-specified width around the IGN dataset [required]
Step 2: preprocessing of the OSM dataset
➔ suitable buffer width derived from Step 1
➔ delete all the OSM roads falling outside the buffer
GEOlab, Politecnico di Milano – Como Campus
23
✔ Apply a buffer of user-specified width around the IGN dataset [required]
Step 2: preprocessing of the OSM dataset
➔ suitable buffer width derived from Step 1
➔ delete all the OSM roads falling outside the buffer
➔ buffer is applied without cap around the terminal nodes
GEOlab, Politecnico di Milano – Como Campus
24
✔ Apply a buffer of user-specified width around the IGN dataset [required]
Step 2: preprocessing of the OSM dataset
➔ suitable buffer width derived from Step 1
➔ delete all the OSM roads falling outside the buffer
➔ buffer is applied without cap around the terminal nodes
GEOlab, Politecnico di Milano – Como Campus
25
✔ Apply a buffer of user-specified width around the IGN dataset [required]
Step 2: preprocessing of the OSM dataset
➔ suitable buffer width derived from Step 1
➔ delete all the OSM roads falling outside the buffer
➔ buffer is applied without cap around the terminal nodes
GEOlab, Politecnico di Milano – Como Campus
26
Step 2: preprocessing of the OSM dataset
✔ Further clean the OSM dataset [required]:
GEOlab, Politecnico di Milano – Como Campus
27
Step 2: preprocessing of the OSM dataset
➔ apply a buffer of user-specified width around each IGN segment
✔ Further clean the OSM dataset [required]:
GEOlab, Politecnico di Milano – Como Campus
28
Step 2: preprocessing of the OSM dataset
➔ compute the angular coefficient of each IGN segment and all the
OSM segments included in the buffer around it
✔ Further clean the OSM dataset [required]:
GEOlab, Politecnico di Milano – Como Campus
29
Step 2: preprocessing of the OSM dataset
➔ compare the difference between IGN and OSM angular coefficients
with a user-specified threshold
✔ Further clean the OSM dataset [required]:
GEOlab, Politecnico di Milano – Como Campus
30
Step 2: preprocessing of the OSM dataset
✔ Further clean the OSM dataset [required]:
GEOlab, Politecnico di Milano – Como Campus
31
✔ Outputs from Step 2 are saved and can be used for further analysis:
Step 2: preprocessing of the OSM dataset
➔ sensitivity analysis on the parameters involved
GEOlab, Politecnico di Milano – Como Campus
data © IGN and © OpenStreetMap contributors
32
✔ Outputs from Step 2 are saved and can be used for further analysis:
Step 2: preprocessing of the OSM dataset
➔ sensitivity analysis on the parameters involved
GEOlab, Politecnico di Milano – Como Campus
33
✔ Outputs from Step 2 are saved and can be used for further analysis:
Step 2: preprocessing of the OSM dataset
GEOlab, Politecnico di Milano – Como Campus
➔ Area 2: generalization threshold = 0.5 m, buffer = 11 m
34
✔ Outputs from Step 2 are saved and can be used for further analysis:
Step 2: preprocessing of the OSM dataset
➔ Area 2: generalization threshold = 0.5 m, buffer = 11 m
GEOlab, Politecnico di Milano – Como Campus
✗ preprocessed OSM has 50 km less than original OSM≅
✗ preprocessed OSM has still 50 km more than IGN≅
35
✔ Outputs from Step 2 are saved and can be used for further analysis:
Step 2: preprocessing of the OSM dataset
GEOlab, Politecnico di Milano – Como Campus
36
✔ Use a grid to take into account OSM heterogeneous nature [optional]:
Step 3: grid-based evaluation of OSM accuracy
➔ import a vector layer to be used as grid
GEOlab, Politecnico di Milano – Como Campus
37
✔ Use a grid to take into account OSM heterogeneous nature [optional]:
Step 3: grid-based evaluation of OSM accuracy
➔ import a vector layer to be used as grid
➔ manually create a grid
GEOlab, Politecnico di Milano – Como Campus
38
✔ For each grid cell, find the OSM maximum deviation from IGN [optional]:
Step 3: grid-based evaluation of OSM accuracy
➔ enter an upper bound value for the deviation, and the percentage
of OSM road length to be considered (to take into account outliers)
GEOlab, Politecnico di Milano – Como Campus
39
✔ For each grid cell, find the OSM maximum deviation from IGN [optional]:
Step 3: grid-based evaluation of OSM accuracy
GEOlab, Politecnico di Milano – Como Campus
➔ Area 2: generalization threshold = 0.5 m, buffer = 11 m
5 - 6 m
6 - 7 m
7 - 8 m
8 - 9 m
9 - 10 m
10 - 11 m
40
✔ For each grid cell, find the OSM maximum deviation from IGN [optional]:
Step 3: grid-based evaluation of OSM accuracy
GEOlab, Politecnico di Milano – Como Campus
➔ Area 2: generalization threshold = 0.5 m, buffer = 11 m
➔ worst results are mainly due to:
✗ presence of 2 or more OSM roads for a single IGN road
✗ inherent complexity of the road network
IGN
OSM
data © IGN and © OpenStreetMap contributors
41
✔ For each grid cell, evaluate OSM accuracy against one or more threshold
values of OSM deviation from IGN [optional]:
Step 3: grid-based evaluation of OSM accuracy
➔ users have to enter one or more thresholds for deviation
GEOlab, Politecnico di Milano – Como Campus
42
✔ For each grid cell, evaluate OSM accuracy against one or more threshold
values of OSM deviation from IGN [optional]:
Step 3: grid-based evaluation of OSM accuracy
➔ length percentage of OSM roads included in the threshold buffer
➔ Area 2: threshold buffer = 6 m
GEOlab, Politecnico di Milano – Como Campus
85 - 90%
90 - 95%
95 - 100%
43
✔ For each grid cell, evaluate OSM accuracy against one or more threshold
values of OSM deviation from IGN [optional]:
Step 3: grid-based evaluation of OSM accuracy
➔ length percentage of OSM roads included in the threshold buffer
➔ Area 2: threshold buffer = 8 m
GEOlab, Politecnico di Milano – Como Campus
85 - 90%
90 - 95%
95 - 100%
44
✔ For each grid cell, evaluate OSM accuracy against one or more threshold
values of OSM deviation from IGN [optional]:
Step 3: grid-based evaluation of OSM accuracy
➔ length percentage of OSM roads included in the threshold buffer
➔ Area 2: threshold buffer = 10 m
GEOlab, Politecnico di Milano – Como Campus
85 - 90%
90 - 95%
95 - 100%
45
✔ Sensitivity analysis on the Douglas-Peucker generalization threshold:
Step 3: grid-based evaluation of OSM accuracy
GEOlab, Politecnico di Milano – Como Campus
0 m 0.5 m 1 m
6 m
Area 1 10 10 10
Area 2 14 14 14
Area 3 7 7 7
Area 4 8 8 8
8 m
Area 1 22 22 22
Area 2 26 26 26
Area 3 37 37 37
Area 4 26 26 26
10 m
Area 1 28 28 28
Area 2 28 28 28
Area 3 44 44 44
Area 4 30 30 30
➔ number of grid cells where the percentage of OSM length satisfying
the given accuracy is > 95%
46
✔ Sensitivity analysis on the Douglas-Peucker generalization threshold:
Step 3: grid-based evaluation of OSM accuracy
GEOlab, Politecnico di Milano – Como Campus
0 m 0.5 m 1 m
6 m
Area 1 10 10 10
Area 2 14 14 14
Area 3 7 7 7
Area 4 8 8 8
8 m
Area 1 22 22 22
Area 2 26 26 26
Area 3 37 37 37
Area 4 26 26 26
10 m
Area 1 28 28 28
Area 2 28 28 28
Area 3 44 44 44
Area 4 30 30 30
➔ number of grid cells where the percentage of OSM length satisfying
the given accuracy is > 95%
➔ no difference at all!
47
Step 3: grid-based evaluation of OSM accuracy
GEOlab, Politecnico di Milano – Como Campus
media dev. st. min(abs) max(abs)
6 m -0.010 0.114 0.001 0.510
8 m 0.003 0.080 0.001 0.276
10 m 0.013 0.059 0.001 0.208
➔ statistics on the differences between the percentages of OSM length
satisfying each given accuracy, for the generalization thresholds of 0
m and 1 m (Area 1)
✔ Sensitivity analysis on the Douglas-Peucker generalization threshold:
48
➔ generalization (within the nominal accuracy of the dataset) does not
influence accuracy evaluation results – and allows to save much time!
Step 3: grid-based evaluation of OSM accuracy
GEOlab, Politecnico di Milano – Como Campus
➔ statistics on the differences between the percentages of OSM length
satisfying each given accuracy, for the generalization thresholds of 0
m and 1 m (Area 1)
✔ Sensitivity analysis on the Douglas-Peucker generalization threshold:
media dev. st. min(abs) max(abs)
6 m -0.010 0.114 0.001 0.510
8 m 0.003 0.080 0.001 0.276
10 m 0.013 0.059 0.001 0.208
49
✔ Work in progress, currently available just for Step 1
Transposition of the procedure as a WPS
GEOlab, Politecnico di Milano – Como Campus
➔ available at http://131.175.143.84/WPS
50
✔ User instructions on how to use the tool
Transposition of the procedure as a WPS
GEOlab, Politecnico di Milano – Como Campus
51
✔ Geocoding service to move the map to a specified location
Transposition of the procedure as a WPS
GEOlab, Politecnico di Milano – Como Campus
52
✔ Upload of IGN road network dataset
Transposition of the procedure as a WPS
GEOlab, Politecnico di Milano – Como Campus
53
✔ Visualization of IGN road network dataset
Transposition of the procedure as a WPS
GEOlab, Politecnico di Milano – Como Campus
data © IGN
54
✔ Upload of OSM road network dataset
Transposition of the procedure as a WPS
GEOlab, Politecnico di Milano – Como Campus
data © IGN
55
✔ Visualization of OSM road network dataset
Transposition of the procedure as a WPS
GEOlab, Politecnico di Milano – Como Campus
data © IGN
56
✔ Definition of layers and buffer value for comparison, a PDF is generated
Transposition of the procedure as a WPS
GEOlab, Politecnico di Milano – Como Campus
data © IGN
57
✔ Retrieval of OSM road network dataset from the current map view
Transposition of the procedure as a WPS
GEOlab, Politecnico di Milano – Como Campus
58
✔ Retrieval of OSM road network dataset from the current map view
Transposition of the procedure as a WPS
GEOlab, Politecnico di Milano – Como Campus
59
Transposition of the procedure as a WPS
GEOlab, Politecnico di Milano – Como Campus
✔ Retrieval of OSM road network dataset from a rectangle drawn on the
map or from the bounding box of an uploaded layer
60
✔ Methodology for comparing OSM and authoritative road datasets:
Conclusions
✔ Future work:
➔ understand the influence of parameters through sensitivity analysis
➔ reduce computational time (especially for Step 2)
➔ increase usability through a WPS implementation (also for Steps 2 & 3)
➔ extend the procedure to also compare attributes
➔ test on different datasets (any test dataset is welcome)!
➔ (hopefully) fills a gap in literature
➔ not tied to a specific reference dataset
➔ generic, flexible and adaptable to any reference dataset
➔ users have a key role in driving the procedure
➔ parameter values should reflect the characteristic of the reference
dataset involved (e.g. nominal scale and accuracy)
GEOlab, Politecnico di Milano – Como Campus
61
Links & publications
GEOlab, Politecnico di Milano – Como Campus
✔ Related publications:
➔ Brovelli M. A., Minghini M., Molinari M. and Mooney P (in press).
Towards an automated comparison of OpenStreetMap with
authoritative road datasets. Transactions in GIS.
➔ Antunes F., Fonte C. C., Brovelli M. A., Minghini M., Molinari M. and
Mooney P. (2015) Assessing OSM Road Positional Quality with
Authoritative Data. Proceedings of the VIII Conferência Nacional de
Cartografia e Geodesia, Lisbon (Portugal), October 29-30, 2015.
➔ Brovelli M. A., Minghini M., Molinari M. and Mooney P. (2015) A
FOSS4G-based procedure to compare OpenStreetMap and
authoritative road network datasets. Geomatics Workbooks 12,
235-238, ISSN 1591-092X.
✔ Links:
➔ source code:https://github.com/MoniaMolinari/OSM-roads-comparison
➔ WPS client: http://131.175.143.84/WPS
62
Contacts
Thanks for your attention!
Politecnico di Milano
GEOlab – Polo Territoriale di Como
Via Valleggio 11, 22100 Como (Italy)
marco.minghini@polimi.it
@MarcoMinghini
Marco Minghini
GEOlab, Politecnico di Milano – Como Campus

More Related Content

PDF
A GRASS-based procedure to compare OSM and IGN Paris road network datasets
PDF
Qualità dei dati OpenStreetMap: sperimentazioni sulla città di Milano e risul...
PDF
FOSS4G in Europe; Italy and the Politecnico de Milano
PDF
A new module in ADMS-Roads for elevated roads
PDF
Roadside barriers - Accounting for the effect of vehicle induced momentum and...
PDF
Uses of vehicle emissions remote sensing data for emission factor development
PDF
Modelling the Notre-Dame de Paris fire
PDF
Light vehicle dynamics and NOx emissions on the motorway network
A GRASS-based procedure to compare OSM and IGN Paris road network datasets
Qualità dei dati OpenStreetMap: sperimentazioni sulla città di Milano e risul...
FOSS4G in Europe; Italy and the Politecnico de Milano
A new module in ADMS-Roads for elevated roads
Roadside barriers - Accounting for the effect of vehicle induced momentum and...
Uses of vehicle emissions remote sensing data for emission factor development
Modelling the Notre-Dame de Paris fire
Light vehicle dynamics and NOx emissions on the motorway network

Similar to A GRASS-based automated procedure to compare OpenStreetMap and authoritative road network datasets (20)

PDF
Lesson2 esa summer_school_brovelli
PDF
3D GeoInfo Conference
PDF
SFScon16 - Gianluca Antonacci: "The CLEAN-ROADS project case study"
PDF
Lesson3 esa summer_school_brovelli
PDF
NextGEOSS: The Next Generation European Data Hub and Cloud Platform for Earth...
PDF
Systematic Approch to a GIS
PPTX
Placing Images with Refined Language Models and Similarity Search with PCA-re...
PDF
Analysis of Educational Robotics activities using a machine learning approach
PDF
How to Visualize the ​Spatio-Temporal Data Using CesiumJS​
PDF
Accès ouvert aux données météorologiques d’Environnement Canada
PPT
Francisco J. Doblas-Big Data y cambio climático
PDF
MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...
PDF
Frank van diggelen keynote, android gnss measurements update
PDF
An IMU-based Turn Prediction System NOMS 2018
PDF
Learning global pooling operators in deep neural networks for image retrieval...
PDF
Mapathon polimi 2015-11-19
PDF
EPANET in gvSIG
PDF
Real time traffic management - challenges and solutions
Lesson2 esa summer_school_brovelli
3D GeoInfo Conference
SFScon16 - Gianluca Antonacci: "The CLEAN-ROADS project case study"
Lesson3 esa summer_school_brovelli
NextGEOSS: The Next Generation European Data Hub and Cloud Platform for Earth...
Systematic Approch to a GIS
Placing Images with Refined Language Models and Similarity Search with PCA-re...
Analysis of Educational Robotics activities using a machine learning approach
How to Visualize the ​Spatio-Temporal Data Using CesiumJS​
Accès ouvert aux données météorologiques d’Environnement Canada
Francisco J. Doblas-Big Data y cambio climático
MediaEval 2016 - Placing Images with Refined Language Models and Similarity S...
Frank van diggelen keynote, android gnss measurements update
An IMU-based Turn Prediction System NOMS 2018
Learning global pooling operators in deep neural networks for image retrieval...
Mapathon polimi 2015-11-19
EPANET in gvSIG
Real time traffic management - challenges and solutions
Ad

More from Marco Minghini (7)

PDF
Preliminary Assessment of the Global Urban Footprint and the Global Human Set...
PDF
OpenStreetMap a servizio delle emergenze
PDF
MIGRATE – MIGRation pATterns in Europe
PDF
MIGRATE: MIGRation pATterns in Europe
PDF
Building Land Use/Land Cover maps from OpenStreetMap
PDF
How to organize and run your own OSM humanitarian mapathon
PDF
The paths of Regina: Web Mapping to promote slow tourism
Preliminary Assessment of the Global Urban Footprint and the Global Human Set...
OpenStreetMap a servizio delle emergenze
MIGRATE – MIGRation pATterns in Europe
MIGRATE: MIGRation pATterns in Europe
Building Land Use/Land Cover maps from OpenStreetMap
How to organize and run your own OSM humanitarian mapathon
The paths of Regina: Web Mapping to promote slow tourism
Ad

Recently uploaded (20)

PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPTX
Construction Project Organization Group 2.pptx
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PPTX
Internet of Things (IOT) - A guide to understanding
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PDF
composite construction of structures.pdf
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
additive manufacturing of ss316l using mig welding
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PPTX
Lecture Notes Electrical Wiring System Components
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PDF
PPT on Performance Review to get promotions
DOCX
573137875-Attendance-Management-System-original
PPTX
Artificial Intelligence
PPTX
OOP with Java - Java Introduction (Basics)
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Construction Project Organization Group 2.pptx
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
Internet of Things (IOT) - A guide to understanding
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
UNIT-1 - COAL BASED THERMAL POWER PLANTS
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
composite construction of structures.pdf
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
additive manufacturing of ss316l using mig welding
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
Lecture Notes Electrical Wiring System Components
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Operating System & Kernel Study Guide-1 - converted.pdf
PPT on Performance Review to get promotions
573137875-Attendance-Management-System-original
Artificial Intelligence
OOP with Java - Java Introduction (Basics)
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk

A GRASS-based automated procedure to compare OpenStreetMap and authoritative road network datasets

  • 1. GEOlab, Como Campus A GRASS-based automated procedure to compare OpenStreetMap and authoritative road network datasets Politecnico di Milano, Como Campus, DICA, via Valleggio 11, 22100 Como (Italy) Maria Antonia Brovelli, Marco Minghini, Monia Elisa Molinari
  • 2. 2 ✔ Increasing popularity of OpenStreetMap (OSM) as today's most notable Volunteered Geographic Information (VGI) project on the Internet ✔ Increasing concern on VGI (and OSM) data quality: Motivation of the work – VGI & OSM quality GEOlab, Politecnico di Milano – Como Campus ➔ spatial accuracy ➔ temporal accuracy ➔ semantic accuracy ➔ up-to-dateness ➔ consistency ➔ fitness-for-use & fitness-for-purpose ➔ ... ✔ Increasing availability of open data from NMAs and CSC that can be used as a source of comparison for VGI (and OSM) data: ➔ comparing two spatial datasets against each other is a challenging geocomputation problem!
  • 3. 3 ✔ Literature provides plenty of works assessing or comparing OSM quality against that of authoritative datasets: Motivation of the work – OSM comparisons ➔ strongly focused on road network ➔ mainly investigating OSM positional accuracy ➔ OSM compared to data from NMA (UK Ordnance Survey, French NMA, USGS TNM/TIGER, etc.) and CSC (Navteq, TeleAtlas, etc.) ➔ semi- or fully-automated ➔ results from poor to very good ✔ Comparison techniques are very strong and fit for purpose, but mostly application and dataset specific: ➔ hard to replicate ➔ difficult to extend to other dataset comparisons GEOlab, Politecnico di Milano – Como Campus
  • 4. 4 ✔ Novel methodology to compare OSM and authoritative road datasets: Our methodology ➔ fully automated ➔ focused on spatial accuracy and completeness ➔ flexible, i.e. not developed for a specific dataset ➔ built with FOSS4G (Free and Open Source Software for Geospatial) ✗ made of required and optional operations ✗ users can define the value of the parameters involved to adapt the procedure to their specific authoritative datasets ✗ users are supposed to be familiar with the authoritative dataset used as reference ✗ reusable and extensible in case of need GEOlab, Politecnico di Milano – Como Campus
  • 5. 5 ➔ 1. Preliminary comparison of the datasets and computation of global statistics ➔ 2. Geometric preprocessing of the OSM dataset to extract a subset which is fully comparable with the IGN dataset ➔ 3. Evaluation of OSM spatial accuracy using a grid-based approach ✔ Currently developed as 3 GRASS GIS modules: Our methodology – Overview ➔ written in Python ➔ available with a Graphical User Interface (GUI) ✔ Comparison between OSM and reference (IGN) road network datasets composed of 3 consecutive steps: GEOlab, Politecnico di Milano – Como Campus
  • 6. 6 ✔ Import and select the OSM and IGN datasets [required] Step 1: Preliminary comparison of the datasets GEOlab, Politecnico di Milano – Como Campus
  • 7. 7 ✔ Import and select the OSM and IGN datasets [required] Step 1: Preliminary comparison of the datasets GEOlab, Politecnico di Milano – Como Campus data © IGN and © OpenStreetMap contributors
  • 8. 8 ✔ Import and select the OSM and IGN datasets [required] ✔ If the extent of the OSM and/or IGN datasets is larger than the one of interest, import a vector layer to be used as clipping mask [optional] Step 1: Preliminary comparison of the datasets GEOlab, Politecnico di Milano – Como Campus
  • 9. 9 ✔ Apply a set of buffers of user-specified width around both the IGN and OSM datasets, to compute the length and the length percentage of the OSM and IGN datasets included in the buffer [required] Step 1: Preliminary comparison of the datasets GEOlab, Politecnico di Milano – Como Campus
  • 10. 10 ✔ Compute also the total length of OSM and IGN datasets and their length difference, both in map units and percentage [required] Step 1: Preliminary comparison of the datasets ➔ output values are returned in a text file GEOlab, Politecnico di Milano – Como Campus
  • 11. 11 ✔ Compute also the total length of OSM and IGN datasets and their length difference, both in map units and percentage [required] Step 1: Preliminary comparison of the datasets ➔ output values are returned in a text file GEOlab, Politecnico di Milano – Como Campus
  • 12. 12 ✔ Compute also the total length of OSM and IGN datasets and their length difference, both in map units and percentage [required] Step 1: Preliminary comparison of the datasets ➔ output values are returned in a text file GEOlab, Politecnico di Milano – Como Campus ✗ ≅450 km more in OSM than IGN dataset!
  • 13. 13 ✔ Compute also the total length of OSM and IGN datasets and their length difference, both in map units and percentage [required] Step 1: Preliminary comparison of the datasets ➔ output values are returned in a text file GEOlab, Politecnico di Milano – Como Campus ✗ ≅450 km more in OSM than IGN dataset! ➔ more footways and pedestrian routes mapped in OSM Boulevard des Invalides Gare de l'Est IGN OSM data © IGN and © OpenStreetMap contributors
  • 14. 14 ✔ Compute also the total length of OSM and IGN datasets and their length difference, both in map units and percentage [required] Step 1: Preliminary comparison of the datasets ➔ output values are returned in a text file GEOlab, Politecnico di Milano – Como Campus ✗ ≅450 km more in OSM than IGN dataset! ➔ cycleways and carriageways mapped as separate highways in OSM IGN OSM Boulevard Jules Ferry data © IGN and © OpenStreetMap contributors Boulevard Henri IV
  • 15. 15 ✔ Outputs from Step 1 can be used to perform further analysis: Step 1: Preliminary comparison of the datasets ➔ sensitivity analysis on the buffer width GEOlab, Politecnico di Milano – Como Campus
  • 16. 16 ✔ Outputs from Step 1 can be used to perform further analysis: Step 1: Preliminary comparison of the datasets ➔ sensitivity analysis on the buffer width GEOlab, Politecnico di Milano – Como Campus
  • 17. 17 ✔ Cleaning of OSM dataset to make it comparable with IGN dataset Step 2: preprocessing of the OSM dataset GEOlab, Politecnico di Milano – Como Campus
  • 18. 18 ✔ Cleaning of OSM dataset to make it comparable with IGN dataset Step 2: preprocessing of the OSM dataset ➔ computationally intensive – work area divided in 4 sub-areas GEOlab, Politecnico di Milano – Como Campus 1 4 2 3 data © IGN and © OpenStreetMap contributors
  • 19. 19 ✔ Generalize the IGN dataset with the Douglas-Peucker algorithm [optional] Step 2: preprocessing of the OSM dataset ➔ users have to enter the threshold for the algorithm GEOlab, Politecnico di Milano – Como Campus
  • 20. 20 ✔ Generalize the IGN dataset with the Douglas-Peucker algorithm [optional] Step 2: preprocessing of the OSM dataset ✔ Split the line features of the datasets into segments [required] ➔ users have to enter the threshold for the algorithm GEOlab, Politecnico di Milano – Como Campus
  • 21. 21 ✔ Compute a measure of degree for the nodes of IGN dataset [required] Step 2: preprocessing of the OSM dataset ➔ identify the terminal nodes (degree = 1) GEOlab, Politecnico di Milano – Como Campus data © IGN
  • 22. 22 ✔ Apply a buffer of user-specified width around the IGN dataset [required] Step 2: preprocessing of the OSM dataset ➔ suitable buffer width derived from Step 1 ➔ delete all the OSM roads falling outside the buffer GEOlab, Politecnico di Milano – Como Campus
  • 23. 23 ✔ Apply a buffer of user-specified width around the IGN dataset [required] Step 2: preprocessing of the OSM dataset ➔ suitable buffer width derived from Step 1 ➔ delete all the OSM roads falling outside the buffer ➔ buffer is applied without cap around the terminal nodes GEOlab, Politecnico di Milano – Como Campus
  • 24. 24 ✔ Apply a buffer of user-specified width around the IGN dataset [required] Step 2: preprocessing of the OSM dataset ➔ suitable buffer width derived from Step 1 ➔ delete all the OSM roads falling outside the buffer ➔ buffer is applied without cap around the terminal nodes GEOlab, Politecnico di Milano – Como Campus
  • 25. 25 ✔ Apply a buffer of user-specified width around the IGN dataset [required] Step 2: preprocessing of the OSM dataset ➔ suitable buffer width derived from Step 1 ➔ delete all the OSM roads falling outside the buffer ➔ buffer is applied without cap around the terminal nodes GEOlab, Politecnico di Milano – Como Campus
  • 26. 26 Step 2: preprocessing of the OSM dataset ✔ Further clean the OSM dataset [required]: GEOlab, Politecnico di Milano – Como Campus
  • 27. 27 Step 2: preprocessing of the OSM dataset ➔ apply a buffer of user-specified width around each IGN segment ✔ Further clean the OSM dataset [required]: GEOlab, Politecnico di Milano – Como Campus
  • 28. 28 Step 2: preprocessing of the OSM dataset ➔ compute the angular coefficient of each IGN segment and all the OSM segments included in the buffer around it ✔ Further clean the OSM dataset [required]: GEOlab, Politecnico di Milano – Como Campus
  • 29. 29 Step 2: preprocessing of the OSM dataset ➔ compare the difference between IGN and OSM angular coefficients with a user-specified threshold ✔ Further clean the OSM dataset [required]: GEOlab, Politecnico di Milano – Como Campus
  • 30. 30 Step 2: preprocessing of the OSM dataset ✔ Further clean the OSM dataset [required]: GEOlab, Politecnico di Milano – Como Campus
  • 31. 31 ✔ Outputs from Step 2 are saved and can be used for further analysis: Step 2: preprocessing of the OSM dataset ➔ sensitivity analysis on the parameters involved GEOlab, Politecnico di Milano – Como Campus data © IGN and © OpenStreetMap contributors
  • 32. 32 ✔ Outputs from Step 2 are saved and can be used for further analysis: Step 2: preprocessing of the OSM dataset ➔ sensitivity analysis on the parameters involved GEOlab, Politecnico di Milano – Como Campus
  • 33. 33 ✔ Outputs from Step 2 are saved and can be used for further analysis: Step 2: preprocessing of the OSM dataset GEOlab, Politecnico di Milano – Como Campus ➔ Area 2: generalization threshold = 0.5 m, buffer = 11 m
  • 34. 34 ✔ Outputs from Step 2 are saved and can be used for further analysis: Step 2: preprocessing of the OSM dataset ➔ Area 2: generalization threshold = 0.5 m, buffer = 11 m GEOlab, Politecnico di Milano – Como Campus ✗ preprocessed OSM has 50 km less than original OSM≅ ✗ preprocessed OSM has still 50 km more than IGN≅
  • 35. 35 ✔ Outputs from Step 2 are saved and can be used for further analysis: Step 2: preprocessing of the OSM dataset GEOlab, Politecnico di Milano – Como Campus
  • 36. 36 ✔ Use a grid to take into account OSM heterogeneous nature [optional]: Step 3: grid-based evaluation of OSM accuracy ➔ import a vector layer to be used as grid GEOlab, Politecnico di Milano – Como Campus
  • 37. 37 ✔ Use a grid to take into account OSM heterogeneous nature [optional]: Step 3: grid-based evaluation of OSM accuracy ➔ import a vector layer to be used as grid ➔ manually create a grid GEOlab, Politecnico di Milano – Como Campus
  • 38. 38 ✔ For each grid cell, find the OSM maximum deviation from IGN [optional]: Step 3: grid-based evaluation of OSM accuracy ➔ enter an upper bound value for the deviation, and the percentage of OSM road length to be considered (to take into account outliers) GEOlab, Politecnico di Milano – Como Campus
  • 39. 39 ✔ For each grid cell, find the OSM maximum deviation from IGN [optional]: Step 3: grid-based evaluation of OSM accuracy GEOlab, Politecnico di Milano – Como Campus ➔ Area 2: generalization threshold = 0.5 m, buffer = 11 m 5 - 6 m 6 - 7 m 7 - 8 m 8 - 9 m 9 - 10 m 10 - 11 m
  • 40. 40 ✔ For each grid cell, find the OSM maximum deviation from IGN [optional]: Step 3: grid-based evaluation of OSM accuracy GEOlab, Politecnico di Milano – Como Campus ➔ Area 2: generalization threshold = 0.5 m, buffer = 11 m ➔ worst results are mainly due to: ✗ presence of 2 or more OSM roads for a single IGN road ✗ inherent complexity of the road network IGN OSM data © IGN and © OpenStreetMap contributors
  • 41. 41 ✔ For each grid cell, evaluate OSM accuracy against one or more threshold values of OSM deviation from IGN [optional]: Step 3: grid-based evaluation of OSM accuracy ➔ users have to enter one or more thresholds for deviation GEOlab, Politecnico di Milano – Como Campus
  • 42. 42 ✔ For each grid cell, evaluate OSM accuracy against one or more threshold values of OSM deviation from IGN [optional]: Step 3: grid-based evaluation of OSM accuracy ➔ length percentage of OSM roads included in the threshold buffer ➔ Area 2: threshold buffer = 6 m GEOlab, Politecnico di Milano – Como Campus 85 - 90% 90 - 95% 95 - 100%
  • 43. 43 ✔ For each grid cell, evaluate OSM accuracy against one or more threshold values of OSM deviation from IGN [optional]: Step 3: grid-based evaluation of OSM accuracy ➔ length percentage of OSM roads included in the threshold buffer ➔ Area 2: threshold buffer = 8 m GEOlab, Politecnico di Milano – Como Campus 85 - 90% 90 - 95% 95 - 100%
  • 44. 44 ✔ For each grid cell, evaluate OSM accuracy against one or more threshold values of OSM deviation from IGN [optional]: Step 3: grid-based evaluation of OSM accuracy ➔ length percentage of OSM roads included in the threshold buffer ➔ Area 2: threshold buffer = 10 m GEOlab, Politecnico di Milano – Como Campus 85 - 90% 90 - 95% 95 - 100%
  • 45. 45 ✔ Sensitivity analysis on the Douglas-Peucker generalization threshold: Step 3: grid-based evaluation of OSM accuracy GEOlab, Politecnico di Milano – Como Campus 0 m 0.5 m 1 m 6 m Area 1 10 10 10 Area 2 14 14 14 Area 3 7 7 7 Area 4 8 8 8 8 m Area 1 22 22 22 Area 2 26 26 26 Area 3 37 37 37 Area 4 26 26 26 10 m Area 1 28 28 28 Area 2 28 28 28 Area 3 44 44 44 Area 4 30 30 30 ➔ number of grid cells where the percentage of OSM length satisfying the given accuracy is > 95%
  • 46. 46 ✔ Sensitivity analysis on the Douglas-Peucker generalization threshold: Step 3: grid-based evaluation of OSM accuracy GEOlab, Politecnico di Milano – Como Campus 0 m 0.5 m 1 m 6 m Area 1 10 10 10 Area 2 14 14 14 Area 3 7 7 7 Area 4 8 8 8 8 m Area 1 22 22 22 Area 2 26 26 26 Area 3 37 37 37 Area 4 26 26 26 10 m Area 1 28 28 28 Area 2 28 28 28 Area 3 44 44 44 Area 4 30 30 30 ➔ number of grid cells where the percentage of OSM length satisfying the given accuracy is > 95% ➔ no difference at all!
  • 47. 47 Step 3: grid-based evaluation of OSM accuracy GEOlab, Politecnico di Milano – Como Campus media dev. st. min(abs) max(abs) 6 m -0.010 0.114 0.001 0.510 8 m 0.003 0.080 0.001 0.276 10 m 0.013 0.059 0.001 0.208 ➔ statistics on the differences between the percentages of OSM length satisfying each given accuracy, for the generalization thresholds of 0 m and 1 m (Area 1) ✔ Sensitivity analysis on the Douglas-Peucker generalization threshold:
  • 48. 48 ➔ generalization (within the nominal accuracy of the dataset) does not influence accuracy evaluation results – and allows to save much time! Step 3: grid-based evaluation of OSM accuracy GEOlab, Politecnico di Milano – Como Campus ➔ statistics on the differences between the percentages of OSM length satisfying each given accuracy, for the generalization thresholds of 0 m and 1 m (Area 1) ✔ Sensitivity analysis on the Douglas-Peucker generalization threshold: media dev. st. min(abs) max(abs) 6 m -0.010 0.114 0.001 0.510 8 m 0.003 0.080 0.001 0.276 10 m 0.013 0.059 0.001 0.208
  • 49. 49 ✔ Work in progress, currently available just for Step 1 Transposition of the procedure as a WPS GEOlab, Politecnico di Milano – Como Campus ➔ available at http://131.175.143.84/WPS
  • 50. 50 ✔ User instructions on how to use the tool Transposition of the procedure as a WPS GEOlab, Politecnico di Milano – Como Campus
  • 51. 51 ✔ Geocoding service to move the map to a specified location Transposition of the procedure as a WPS GEOlab, Politecnico di Milano – Como Campus
  • 52. 52 ✔ Upload of IGN road network dataset Transposition of the procedure as a WPS GEOlab, Politecnico di Milano – Como Campus
  • 53. 53 ✔ Visualization of IGN road network dataset Transposition of the procedure as a WPS GEOlab, Politecnico di Milano – Como Campus data © IGN
  • 54. 54 ✔ Upload of OSM road network dataset Transposition of the procedure as a WPS GEOlab, Politecnico di Milano – Como Campus data © IGN
  • 55. 55 ✔ Visualization of OSM road network dataset Transposition of the procedure as a WPS GEOlab, Politecnico di Milano – Como Campus data © IGN
  • 56. 56 ✔ Definition of layers and buffer value for comparison, a PDF is generated Transposition of the procedure as a WPS GEOlab, Politecnico di Milano – Como Campus data © IGN
  • 57. 57 ✔ Retrieval of OSM road network dataset from the current map view Transposition of the procedure as a WPS GEOlab, Politecnico di Milano – Como Campus
  • 58. 58 ✔ Retrieval of OSM road network dataset from the current map view Transposition of the procedure as a WPS GEOlab, Politecnico di Milano – Como Campus
  • 59. 59 Transposition of the procedure as a WPS GEOlab, Politecnico di Milano – Como Campus ✔ Retrieval of OSM road network dataset from a rectangle drawn on the map or from the bounding box of an uploaded layer
  • 60. 60 ✔ Methodology for comparing OSM and authoritative road datasets: Conclusions ✔ Future work: ➔ understand the influence of parameters through sensitivity analysis ➔ reduce computational time (especially for Step 2) ➔ increase usability through a WPS implementation (also for Steps 2 & 3) ➔ extend the procedure to also compare attributes ➔ test on different datasets (any test dataset is welcome)! ➔ (hopefully) fills a gap in literature ➔ not tied to a specific reference dataset ➔ generic, flexible and adaptable to any reference dataset ➔ users have a key role in driving the procedure ➔ parameter values should reflect the characteristic of the reference dataset involved (e.g. nominal scale and accuracy) GEOlab, Politecnico di Milano – Como Campus
  • 61. 61 Links & publications GEOlab, Politecnico di Milano – Como Campus ✔ Related publications: ➔ Brovelli M. A., Minghini M., Molinari M. and Mooney P (in press). Towards an automated comparison of OpenStreetMap with authoritative road datasets. Transactions in GIS. ➔ Antunes F., Fonte C. C., Brovelli M. A., Minghini M., Molinari M. and Mooney P. (2015) Assessing OSM Road Positional Quality with Authoritative Data. Proceedings of the VIII Conferência Nacional de Cartografia e Geodesia, Lisbon (Portugal), October 29-30, 2015. ➔ Brovelli M. A., Minghini M., Molinari M. and Mooney P. (2015) A FOSS4G-based procedure to compare OpenStreetMap and authoritative road network datasets. Geomatics Workbooks 12, 235-238, ISSN 1591-092X. ✔ Links: ➔ source code:https://github.com/MoniaMolinari/OSM-roads-comparison ➔ WPS client: http://131.175.143.84/WPS
  • 62. 62 Contacts Thanks for your attention! Politecnico di Milano GEOlab – Polo Territoriale di Como Via Valleggio 11, 22100 Como (Italy) marco.minghini@polimi.it @MarcoMinghini Marco Minghini GEOlab, Politecnico di Milano – Como Campus