SlideShare a Scribd company logo
Managing Data Quality in
         OpenStreetMap


TOOLS FOR AN ACTIVE
MAPPING COMMUNITY

NC GIS CONFERENCE 2013



    This document licensed in entirety by Creative Commons CC-by-SA. For specific terms of license, see:
    http://creativecommons.org/licenses/by-sa/3.0/
Overview
                            2

 The Short History of the OpenStreetMap
   Revolution

 Assessing Open Source Data Quality


 Overview of Tools


 Creating Tools that Matter


NC GIS Conference 2013                     23 February 2013
Overview: Key Questions
                                    3

 How can crowd-sourced projects manage data
   quality effectively?

 What tools exist for monitoring data quality in
   OpenStreetMap?

 What conclusions can be drawn about existing tools?


 What is the future of data quality in crowd-sourced
   projects?
NC GIS Conference 2013                             23 February 2013
OpenStreetMap is…
                                 4




 A freely-editable map of the world
   unconstrained by proprietary ownership

 “Wikipedia for maps”




NC GIS Conference 2013                       23 February 2013
The Origins of OpenStreetMap
                              5



 OpenStreetMap.org domain registered by Steve
  Coast in 2004
 Project originated in the United Kingdom, where…
   Crown copyright on geospatial data

   Little, or no public domain data

 Simple goal to create a free, publicly-available
  database of street centerlines


NC GIS Conference 2013                      23 February 2013
OpenStreetMap is…
                                 6




 A freely-editable map of the world
   unconstrained by proprietary ownership

 “Wikipedia for maps”




NC GIS Conference 2013                       23 February 2013
Looks like…a wiki
                                 7




NC GIS Conference 2013                       23 February 2013
Wiki-based Documentation!
                         8




NC GIS Conference 2013

                                         23 February 2013
Milestones in OpenStreetMap History
                             9

 2004 - OpenStreetMap.org registered by Steve Coast
 2005 – Map Limehouse, 1st OpenStreetMap mapping
    party
   2005 – 1000 registered OpenStreetMap users
   2006 – OpenStreetMap Foundation established
   2007 – 5 million ways in OSM database
   2007 – 10,000 registered OpenStreetMap users
   2008 - TIGER data import for the US completed
   2009 - 100,000 registered OpenStreetMap users
   2010 - 200,000 registered OpenStreetMap users
   2012 – ~670,000 registered OpenStreetMap users

NC GIS Conference 2013                          23 February 2013
OpenStreetMap User Growth
                                          10
One million registered users worldwide!




 NC GIS Conference 2013                         23 February 2013
OpenStreetMap Growth in User Edits
                         11




NC GIS Conference 2013                 23 February 2013
OpenStreetMap Database Growth
                           12




NC GIS Conference 2013                  23 February 2013
Data Quality in Crowd-sourced Projects
                                                            13

 Goodchild & Li: Identified three mechanisms for
   Quality Assurance

       Crowd-sourcing

       Social

       Geographic


Goodchild, Michael F., and Linna Li. "Assuring the quality of volunteered geographic information."
Spatial Statistics 1 (2012): 110-120.


NC GIS Conference 2013                                                                               23 February 2013
Crowd-sourced Approach to Data Quality
                                                        14

 Based on Surowiecki’s “Wisdom of the Crowd”
   Multiple users converge around consensus solutions that
    might escape an individual
   Many independent observations reinforce the validity of a
    single observation
   Concurrence on observed features (e.g. “It’s a bridge.”)

   Convergence on the truth



      The group validates observations & corrects errors



   Surowiecki, J., 2005. The Wisdom of Crowds. Anchor, New York.

NC GIS Conference 2013                                             23 February 2013
Social Approach to Data Quality
                             15

 Through practices, users acquire reputations
 Users with good reputations are trusted
 Trust and reputation are indicators of stewardship
 As the project evolves, social leadership becomes
   more formalized.

 The Data Working Group of OpenStreetMap fullfills
  this function
 Email lists supplement social stewardship


NC GIS Conference 2013                        23 February 2013
Geographic Tools for Data Quality
                                   16

 Geographic approach draws on formal geographic
   theory:
      Spatial neighbors & auto-correlation (Moran statistics)
      Christaller’s Central Place Theory
      Descriptive Statistics
      Inferential Statistics & Analysis of Variance (ANOVA)
      Richardson plots of linear measurements
      Cluster analysis, e.g. k-means
 These approaches have not been widely adopted for
   use in the OpenStreetMap project…yet

NC GIS Conference 2013                                     23 February 2013
A Quick Survey of Data Quality Tools
                               17

 Two types of tools are in widespread use:


      Error Detection Tools

      Monitoring Tools




NC GIS Conference 2013                        23 February 2013
Error Detection Tools: Keep Right
                             18




NC GIS Conference 2013                      23 February 2013
Error Detection Tools: Map Dust
                             19




NC GIS Conference 2013                     23 February 2013
Error Detection Tools: OpenStreetBugs




NC GIS Conference 2013                 23 February 2013
Error Detection Tools: No Name
                             21




NC GIS Conference 2013                     23 February 2013
Error Detection Tools: MapRoulette
                           22




NC GIS Conference 2013                    23 February 2013
Monitoring Tools
                                23




NC GIS Conference 2013                      23 February 2013
Monitoring Tools: OpenStreetMap Watch List
                  (OWL)
                         24




NC GIS Conference 2013            23 February 2013
Monitoring Tools: GeoFabrik Map Compare
                         25




NC GIS Conference 2013           23 February 2013
Monitoring Tools: Who Did It
                               26




NC GIS Conference 2013                           23 February 2013
Monitoring Tools: ITO TIGER Reviewed
                         27




NC GIS Conference 2013              23 February 2013
Monitoring Tools: ITO TIGER Reviewed
                         28




NC GIS Conference 2013              23 February 2013
Monitoring Tools: Green Means Go
                          29




NC GIS Conference 2013                  23 February 2013
Monitoring Tools: Who’s Around Me
                          30




NC GIS Conference 2013                  23 February 2013
Social Controls
                                31

 OpenStreetMap - Data Working Group (DWG)
   Resolving disputes between users

   Processes & protocols for data imports

   Investigates copyright infringement

   Deals with issues of vandalism and fraud

   Suspends or closes user accounts (in case of abuse)

   IP blocking (in case of abuse)




NC GIS Conference 2013                              23 February 2013
How do Social Methods Treat Vandalism?
                                32

 OpenStreetMap is not immune from malicious intent
   Copyright infringement (e.g. copying from Google Maps)

   Graffiti

   Disputes & “Edit Wars” (e.g. Kashmir region, Palestine)

   Spam

 Tools for Managing Vandalism
   Detect using daily diffs

   UserActivity – batch comparison of two versions of the
    database
   Revert – undo changeset to previous version

   Virtual Ban


NC GIS Conference 2013                                 23 February 2013
Summary Review
                                 33

 Three methods for data quality control
   Crowd-sourced

   Social

   Geographic

 OpenStreetMap has crowd-sourced and social tools
   for managing data quality
      Error & Monitoring tools
      Data Working Group - Social
 Geographic methods are experimental at this time
 Increasingly complete geographic features will lead
   to better tools
NC GIS Conference 2013                        23 February 2013
Lessons Learned about OSM Data Quality
                                                       34

 Successive editing by multiple users can improve
   accuracy…up to a point
      Haklay suggests that few improvements are made beyond the
       13th edit
      Semantic differences are not easy to resolve – “Tag wars”
      Obscure edits do not always get corrected if there are no local
       mappers that take ownership
 Social approaches will acquire more authority
   Are part-time, volunteer staffers enough to guarantee data
    quality?
   What are appropriate metrics for trust and reputation?

     Haklay, M. 2010. How Good is volunteered geographical information? a comparative study of OpenStreetMap and
     Ordnance Survey Datasets. Environment & Planning B: Planning and Design 37 (4), 682-703g
NC GIS Conference 2013                                                                           23 February 2013
Thank You
                                                                   35

 Questions?




 Steven Johnson
   (e) stevejohnson@deloitte.com

   (t) @geomantic




             This document licensed in entirety by Creative Commons CC-by-SA. For specific terms of license, see:
             http://creativecommons.org/licenses/by-sa/3.0/




NC GIS Conference 2013                                                                                              23 February 2013

More Related Content

PPTX
OpenStreetMap in Government: US Census Bureau Experience
PPT
Digitization and Landscape-Research Nexus
PDF
Exploratory Analysis of Massive Movement Data (RGS-IBG GIScience Research Gro...
PDF
GEO-N-VIRON webinar talk on Geospatial Technology in Sustainable Environment
PPTX
Lect 1 & 2 introduction to gis & rs
PPTX
Geoscience Australia National Map
PPT
Introduction To Gis With Employment Info
DOCX
Smart city and gis
OpenStreetMap in Government: US Census Bureau Experience
Digitization and Landscape-Research Nexus
Exploratory Analysis of Massive Movement Data (RGS-IBG GIScience Research Gro...
GEO-N-VIRON webinar talk on Geospatial Technology in Sustainable Environment
Lect 1 & 2 introduction to gis & rs
Geoscience Australia National Map
Introduction To Gis With Employment Info
Smart city and gis

What's hot (20)

PDF
Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08
PPTX
Geographic information system
PDF
The Application of GIS in Urban Planning
PDF
Geodatabase with GIS & RS
PPTX
Future of GIS, Moving to the Enterprise Platform
PPT
Geographic information system
PPT
GIS and Petroleum Land Management
PPT
Introduction To GIS
PPTX
Geographical information system in transportation planning
PPT
Open Source GIS
PPTX
Gis powerpoint
PPTX
Open source health gis presentation final
PDF
Why Does GIS Matter
DOCX
survey paper 2
PPT
Geographic Information Systems in the Oil & Gas Industry
PPTX
Get Big Geo Data
PDF
A Study of the Development and Distribution of Open Geospatial Data in Japane...
PPTX
MODERN trends of GIS
PDF
CKANへの空間情報機能拡張実装の試み
Movement Data in GIS - Geobeer Lightning Talk, 2021-03-08
Geographic information system
The Application of GIS in Urban Planning
Geodatabase with GIS & RS
Future of GIS, Moving to the Enterprise Platform
Geographic information system
GIS and Petroleum Land Management
Introduction To GIS
Geographical information system in transportation planning
Open Source GIS
Gis powerpoint
Open source health gis presentation final
Why Does GIS Matter
survey paper 2
Geographic Information Systems in the Oil & Gas Industry
Get Big Geo Data
A Study of the Development and Distribution of Open Geospatial Data in Japane...
MODERN trends of GIS
CKANへの空間情報機能拡張実装の試み
Ad

Similar to OpenStreetMap Data Quality (20)

PDF
OpenStreetMap and CycleStreets: collaborative map-making and cartography in t...
PDF
Lessons Learned From Neogeography Nc Gis 2009
PPTX
OpenStreetMap
PDF
Philippine Geospatial Forum Presentation 20130311
PDF
Crowdsourced mapping for open collaboration: A story of Taiwan so far
PPT
Crowdsourcing and Participation in Cartography (G572 Guest Lecture)
KEY
Intro to OpenStreetMap- UC Merced 4.22.09
PDF
B00624300_EGM701_MSc ResearchPaper_AlfredoConetta_03-May-15
KEY
Stanford Presentation to GISSIG
PDF
This is not your grandmother's online map: Advancing your mission with GIS tools
PDF
070928 Collaborative Geospatial Mapping And Data Authorization
PPTX
Geospatial Data Insfrastructures, Cybercartography and Open Data: The Need f...
PPTX
Geospatial Data Insfrastructures, Cybercartography and Open Data: The Need f...
PDF
Talk: "Using Open Data and Crowdsourcing to develop CycleStreets"
PPTX
Tnmc mc andrew_sotmus13_rev2
PDF
Open Data and Open Software Geospatial Applications
PDF
The road to opening up governmental geospatial data in taiwan
PDF
Maptivism reloaded: Open Data for Development @oddc
PDF
Open Access to Multi-Domain Collaborative Geospatial Analysis - AGU 2009
OpenStreetMap and CycleStreets: collaborative map-making and cartography in t...
Lessons Learned From Neogeography Nc Gis 2009
OpenStreetMap
Philippine Geospatial Forum Presentation 20130311
Crowdsourced mapping for open collaboration: A story of Taiwan so far
Crowdsourcing and Participation in Cartography (G572 Guest Lecture)
Intro to OpenStreetMap- UC Merced 4.22.09
B00624300_EGM701_MSc ResearchPaper_AlfredoConetta_03-May-15
Stanford Presentation to GISSIG
This is not your grandmother's online map: Advancing your mission with GIS tools
070928 Collaborative Geospatial Mapping And Data Authorization
Geospatial Data Insfrastructures, Cybercartography and Open Data: The Need f...
Geospatial Data Insfrastructures, Cybercartography and Open Data: The Need f...
Talk: "Using Open Data and Crowdsourcing to develop CycleStreets"
Tnmc mc andrew_sotmus13_rev2
Open Data and Open Software Geospatial Applications
The road to opening up governmental geospatial data in taiwan
Maptivism reloaded: Open Data for Development @oddc
Open Access to Multi-Domain Collaborative Geospatial Analysis - AGU 2009
Ad

Recently uploaded (20)

PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PDF
Heart disease approach using modified random forest and particle swarm optimi...
PDF
Approach and Philosophy of On baking technology
PPTX
TLE Review Electricity (Electricity).pptx
PPTX
A Presentation on Artificial Intelligence
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PPTX
Chapter 5: Probability Theory and Statistics
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PDF
A novel scalable deep ensemble learning framework for big data classification...
PDF
Hybrid model detection and classification of lung cancer
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
A Presentation on Touch Screen Technology
PDF
A comparative study of natural language inference in Swahili using monolingua...
PPTX
Tartificialntelligence_presentation.pptx
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
Assigned Numbers - 2025 - Bluetooth® Document
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
Heart disease approach using modified random forest and particle swarm optimi...
Approach and Philosophy of On baking technology
TLE Review Electricity (Electricity).pptx
A Presentation on Artificial Intelligence
Programs and apps: productivity, graphics, security and other tools
Hindi spoken digit analysis for native and non-native speakers
Unlocking AI with Model Context Protocol (MCP)
DP Operators-handbook-extract for the Mautical Institute
Univ-Connecticut-ChatGPT-Presentaion.pdf
Chapter 5: Probability Theory and Statistics
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
A novel scalable deep ensemble learning framework for big data classification...
Hybrid model detection and classification of lung cancer
Digital-Transformation-Roadmap-for-Companies.pptx
A Presentation on Touch Screen Technology
A comparative study of natural language inference in Swahili using monolingua...
Tartificialntelligence_presentation.pptx

OpenStreetMap Data Quality

  • 1. Managing Data Quality in OpenStreetMap TOOLS FOR AN ACTIVE MAPPING COMMUNITY NC GIS CONFERENCE 2013 This document licensed in entirety by Creative Commons CC-by-SA. For specific terms of license, see: http://creativecommons.org/licenses/by-sa/3.0/
  • 2. Overview 2  The Short History of the OpenStreetMap Revolution  Assessing Open Source Data Quality  Overview of Tools  Creating Tools that Matter NC GIS Conference 2013 23 February 2013
  • 3. Overview: Key Questions 3  How can crowd-sourced projects manage data quality effectively?  What tools exist for monitoring data quality in OpenStreetMap?  What conclusions can be drawn about existing tools?  What is the future of data quality in crowd-sourced projects? NC GIS Conference 2013 23 February 2013
  • 4. OpenStreetMap is… 4  A freely-editable map of the world unconstrained by proprietary ownership  “Wikipedia for maps” NC GIS Conference 2013 23 February 2013
  • 5. The Origins of OpenStreetMap 5  OpenStreetMap.org domain registered by Steve Coast in 2004  Project originated in the United Kingdom, where…  Crown copyright on geospatial data  Little, or no public domain data  Simple goal to create a free, publicly-available database of street centerlines NC GIS Conference 2013 23 February 2013
  • 6. OpenStreetMap is… 6  A freely-editable map of the world unconstrained by proprietary ownership  “Wikipedia for maps” NC GIS Conference 2013 23 February 2013
  • 7. Looks like…a wiki 7 NC GIS Conference 2013 23 February 2013
  • 8. Wiki-based Documentation! 8 NC GIS Conference 2013 23 February 2013
  • 9. Milestones in OpenStreetMap History 9  2004 - OpenStreetMap.org registered by Steve Coast  2005 – Map Limehouse, 1st OpenStreetMap mapping party  2005 – 1000 registered OpenStreetMap users  2006 – OpenStreetMap Foundation established  2007 – 5 million ways in OSM database  2007 – 10,000 registered OpenStreetMap users  2008 - TIGER data import for the US completed  2009 - 100,000 registered OpenStreetMap users  2010 - 200,000 registered OpenStreetMap users  2012 – ~670,000 registered OpenStreetMap users NC GIS Conference 2013 23 February 2013
  • 10. OpenStreetMap User Growth 10 One million registered users worldwide! NC GIS Conference 2013 23 February 2013
  • 11. OpenStreetMap Growth in User Edits 11 NC GIS Conference 2013 23 February 2013
  • 12. OpenStreetMap Database Growth 12 NC GIS Conference 2013 23 February 2013
  • 13. Data Quality in Crowd-sourced Projects 13  Goodchild & Li: Identified three mechanisms for Quality Assurance  Crowd-sourcing  Social  Geographic Goodchild, Michael F., and Linna Li. "Assuring the quality of volunteered geographic information." Spatial Statistics 1 (2012): 110-120. NC GIS Conference 2013 23 February 2013
  • 14. Crowd-sourced Approach to Data Quality 14  Based on Surowiecki’s “Wisdom of the Crowd”  Multiple users converge around consensus solutions that might escape an individual  Many independent observations reinforce the validity of a single observation  Concurrence on observed features (e.g. “It’s a bridge.”)  Convergence on the truth  The group validates observations & corrects errors Surowiecki, J., 2005. The Wisdom of Crowds. Anchor, New York. NC GIS Conference 2013 23 February 2013
  • 15. Social Approach to Data Quality 15  Through practices, users acquire reputations  Users with good reputations are trusted  Trust and reputation are indicators of stewardship  As the project evolves, social leadership becomes more formalized.  The Data Working Group of OpenStreetMap fullfills this function  Email lists supplement social stewardship NC GIS Conference 2013 23 February 2013
  • 16. Geographic Tools for Data Quality 16  Geographic approach draws on formal geographic theory:  Spatial neighbors & auto-correlation (Moran statistics)  Christaller’s Central Place Theory  Descriptive Statistics  Inferential Statistics & Analysis of Variance (ANOVA)  Richardson plots of linear measurements  Cluster analysis, e.g. k-means  These approaches have not been widely adopted for use in the OpenStreetMap project…yet NC GIS Conference 2013 23 February 2013
  • 17. A Quick Survey of Data Quality Tools 17  Two types of tools are in widespread use:  Error Detection Tools  Monitoring Tools NC GIS Conference 2013 23 February 2013
  • 18. Error Detection Tools: Keep Right 18 NC GIS Conference 2013 23 February 2013
  • 19. Error Detection Tools: Map Dust 19 NC GIS Conference 2013 23 February 2013
  • 20. Error Detection Tools: OpenStreetBugs NC GIS Conference 2013 23 February 2013
  • 21. Error Detection Tools: No Name 21 NC GIS Conference 2013 23 February 2013
  • 22. Error Detection Tools: MapRoulette 22 NC GIS Conference 2013 23 February 2013
  • 23. Monitoring Tools 23 NC GIS Conference 2013 23 February 2013
  • 24. Monitoring Tools: OpenStreetMap Watch List (OWL) 24 NC GIS Conference 2013 23 February 2013
  • 25. Monitoring Tools: GeoFabrik Map Compare 25 NC GIS Conference 2013 23 February 2013
  • 26. Monitoring Tools: Who Did It 26 NC GIS Conference 2013 23 February 2013
  • 27. Monitoring Tools: ITO TIGER Reviewed 27 NC GIS Conference 2013 23 February 2013
  • 28. Monitoring Tools: ITO TIGER Reviewed 28 NC GIS Conference 2013 23 February 2013
  • 29. Monitoring Tools: Green Means Go 29 NC GIS Conference 2013 23 February 2013
  • 30. Monitoring Tools: Who’s Around Me 30 NC GIS Conference 2013 23 February 2013
  • 31. Social Controls 31  OpenStreetMap - Data Working Group (DWG)  Resolving disputes between users  Processes & protocols for data imports  Investigates copyright infringement  Deals with issues of vandalism and fraud  Suspends or closes user accounts (in case of abuse)  IP blocking (in case of abuse) NC GIS Conference 2013 23 February 2013
  • 32. How do Social Methods Treat Vandalism? 32  OpenStreetMap is not immune from malicious intent  Copyright infringement (e.g. copying from Google Maps)  Graffiti  Disputes & “Edit Wars” (e.g. Kashmir region, Palestine)  Spam  Tools for Managing Vandalism  Detect using daily diffs  UserActivity – batch comparison of two versions of the database  Revert – undo changeset to previous version  Virtual Ban NC GIS Conference 2013 23 February 2013
  • 33. Summary Review 33  Three methods for data quality control  Crowd-sourced  Social  Geographic  OpenStreetMap has crowd-sourced and social tools for managing data quality  Error & Monitoring tools  Data Working Group - Social  Geographic methods are experimental at this time  Increasingly complete geographic features will lead to better tools NC GIS Conference 2013 23 February 2013
  • 34. Lessons Learned about OSM Data Quality 34  Successive editing by multiple users can improve accuracy…up to a point  Haklay suggests that few improvements are made beyond the 13th edit  Semantic differences are not easy to resolve – “Tag wars”  Obscure edits do not always get corrected if there are no local mappers that take ownership  Social approaches will acquire more authority  Are part-time, volunteer staffers enough to guarantee data quality?  What are appropriate metrics for trust and reputation? Haklay, M. 2010. How Good is volunteered geographical information? a comparative study of OpenStreetMap and Ordnance Survey Datasets. Environment & Planning B: Planning and Design 37 (4), 682-703g NC GIS Conference 2013 23 February 2013
  • 35. Thank You 35  Questions?  Steven Johnson  (e) stevejohnson@deloitte.com  (t) @geomantic This document licensed in entirety by Creative Commons CC-by-SA. For specific terms of license, see: http://creativecommons.org/licenses/by-sa/3.0/ NC GIS Conference 2013 23 February 2013