SlideShare a Scribd company logo
Wide access to spatial
Citizen Science data
ECSA 2016, Berlin
Paul van Genuchten, Lieke Verhelst, Clemens Portele
Wide access to spatial Citizen Science data - ECSA Berlin 2016
About the authors
Paul van Genuchten is a software engineer at “GeoCat BV”, supporting
governments to publish (spatial/open) data on the web.
Lieke Verhelst is owner of “Linked Data Factory”. Lieke is a linked data expert
and has developed multiple ontologies in the scope of food-safety, soil science,
nature reserves, water management
Clemens Portele is managing director of “interactive instruments GmbH”.
interactive instruments is a software engineering company in the spatial data
infrastructure domain and is an active contributor to multiple OGC standards.
COBWEB
COBWEB is a research project to empower citizens with the ability to collect
environmental information using mobile devices, which will then be made suitable
for use in research, decision making and policy formation.
GeoCat improves GeoNetwork opensource, targeting citizen science data
discovery and visualisation in the scope of the COBWEB FP7 project.
The project has received funding from the European Union under grant agreement
No 308513
Wide access to spatial Citizen Science data - ECSA Berlin 2016
Wide access to spatial Citizen Science data - ECSA Berlin 2016
Wide access to spatial Citizen Science data - ECSA Berlin 2016
Wide access to spatial Citizen Science data - ECSA Berlin 2016
Wide access to spatial Citizen Science data - ECSA Berlin 2016
The open data challenges
- Discovery; people can’t find the data
- Format; the data is exposed in complex services/formats
- License; the license is restrictive
- Aggregation level; “raw data now” *
* Rufus Pollock, 2007 http://blog.okfn.org/2007/11/07/give-us-the-data-raw-and-give-it-to-us-now/
Background
One of the objectives of COBWEB is to publish citizen science data to GEOSS
GEOSS has a focus on spatial standards (CSW, SensorWeb, WMS/WFS)
Major part of citizen science community is not aware of these standards
Average users use search engines to discover data and common formats to
analyse data
How to bridge the gap between services in GEOSS and search engines
Wide access to spatial Citizen Science data - ECSA Berlin 2016
Wide access to spatial Citizen Science data - ECSA Berlin 2016
Wide access to spatial Citizen Science data - ECSA Berlin 2016
Geonovum testbed
The gap between OGC and WEB standards is a general challenge
W3C and OGC have set up a joint working group to develop best practices
At the start of 2016 Geonovum (dutch national government) organised a testbed to
move the ‘spatial data on the web’ best practices forward.
What search engines expect
HTML (text) output on unique persistent url’s
An index that lists links to all url’s to discover
HTML documents annotated with “schema.org”-markup transform web pages into
structured data
Wide access to spatial Citizen Science data - ECSA Berlin 2016
Wide access to spatial Citizen Science data - ECSA Berlin 2016
Schema.org and Citizen Science
The Schema.org ontology currently does not provide classes for citizen science
projects and observations
An extension to schema.org can be proposed to model citizen science
communities and observations, for example based on schema.org/Measurement
Wide access to spatial Citizen Science data - ECSA Berlin 2016
A proxy approach
A proxy layer transforms WFS/CSW requests to HTML annotated with schema.org
The CSW proxy approach is implemented in GeoNetwork opensource
For the WFS proxy approach a new open source product has been released by
interactive instruments, called ‘LDproxy’
Wide access to spatial Citizen Science data - ECSA Berlin 2016
Wide access to spatial Citizen Science data - ECSA Berlin 2016
Wide access to spatial Citizen Science data - ECSA Berlin 2016
Wide access to spatial Citizen Science data - ECSA Berlin 2016
{image of google structured data testing tool}
Wide access to spatial Citizen Science data - ECSA Berlin 2016
A proxy approach to reach other communities
A similar approach can be used to expose OGC services to other communities,
such as citizen science developer community
- CSW/iso19139 metadata exposed as DCAT/VOID in RDFa or rdf/xml
- SOS/WFS/GML exposed as Darwin Core in RDFa or json-ld
- A json API for web developers
Also interesting would be to look at a vice versa approach, in which a proxy is
used to expose unstructured citizen science data to the geoss community as
WFS/SOS.
Privacy and the search engines
Some of the search engines are generally percieved as a challenge for privacy
However; in this case it is the campaign organiser that should take measures
A complicating factor is that citizens tend to like to advertise that they made a
contribution, or even claim ownership of a contribution
Privacy by design
Minimise the transport and storage (timespan) of data that could be used to derive
identity (minimise, separate, aggregate & hide*)
Communicate transparently about the transport and storage strategy
Offer users the ability to review and remove their personal data
Transport a location/timestamp to the level of detail that is required for the use
case
Use a wallet with reliability-credits instead of keeping a user history for reliability
assessment
* https://www.pilab.nl/wp-content/uploads/2013/12/Privacy-design-strategies-JHH-5-12-2013.pdf
“Privacy awareness is growing,
it’s comparable with the stage of environmental awareness 40 years ago” *
*Jaap-Henk Hoepman, Privacy & Identity Lab, Radboud University Nijmegen
Conclusions
A proxy approach for CSW is a good way to make existing published datasets
more widely discoverable via alternative channels
A proxy approach for WFS/SOS has potential to bridge the gap between OGC
services and search engines, however currently the search engines have limited
implementations for using the schema.org annotations
Adopting an established standard helps in making data more widely available.
There’s a growing number of tools available to facilitate to engage with open data

More Related Content

PPTX
20191119_The OpenAIRE Research Graph
PPTX
OpenAIRE Open Innovation call: Next Generation Repositories
PPTX
SafeShare - Networkshop44
PPTX
Research data spring - Jisc Digital Festival 2015
PPT
Museum Collections Management: Possibilities for Access and Use with Linked D...
PPTX
Uncovering research - what's the standard - Jisc Digital Festival 2015
PPT
Open Science at the European Commission
PPTX
Application of Assent in the safe - Networkshop44
20191119_The OpenAIRE Research Graph
OpenAIRE Open Innovation call: Next Generation Repositories
SafeShare - Networkshop44
Research data spring - Jisc Digital Festival 2015
Museum Collections Management: Possibilities for Access and Use with Linked D...
Uncovering research - what's the standard - Jisc Digital Festival 2015
Open Science at the European Commission
Application of Assent in the safe - Networkshop44

What's hot (20)

PPTX
Demonstration of the 4C cost comparison tool
PPTX
Engage Project on Open Data
PPTX
WikiRate - Data Liberation and Radical Transparency
PPTX
Winning Horizon 2020 with Open Science
PPTX
WikiRate: Stakeholder Perspectives - NGOs and Academics
PPTX
Research data spring: filling in the digital preservation gap
PPTX
Towards a National Data Infrastructure. First Insights Regarding Its Design a...
PPT
Bristol's Research Data Service - Debra Hiom - Jisc Digital Festival 2014
PPTX
Kit-Catalogue - Discovering the Value of Equipment Sharing - Universities UK ...
PPTX
Research data management in UK universities: A collaborative venture
PPT
More with Less? Collaborative Trends in Research Data Management
PPTX
Susanna Sansone at the Knowledge Dialogues/ODHK "Beyond Open"event
PPTX
European open science cloud
PPT
Societal Challnge 5 and Big Data Europe 1st hangout
PPT
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
PDF
Open APC Data in Germany - A Contribution to Open Access Monitoring
PPTX
Showcasing research data tools - Jisc Digifest 2016
PPT
Enabling Data-Intensive Science Through Data Infrastructures
PPTX
OpenAIRE-connect: Services for open science
PDF
MOVING presentation at JSI
Demonstration of the 4C cost comparison tool
Engage Project on Open Data
WikiRate - Data Liberation and Radical Transparency
Winning Horizon 2020 with Open Science
WikiRate: Stakeholder Perspectives - NGOs and Academics
Research data spring: filling in the digital preservation gap
Towards a National Data Infrastructure. First Insights Regarding Its Design a...
Bristol's Research Data Service - Debra Hiom - Jisc Digital Festival 2014
Kit-Catalogue - Discovering the Value of Equipment Sharing - Universities UK ...
Research data management in UK universities: A collaborative venture
More with Less? Collaborative Trends in Research Data Management
Susanna Sansone at the Knowledge Dialogues/ODHK "Beyond Open"event
European open science cloud
Societal Challnge 5 and Big Data Europe 1st hangout
Roadmaps, Roles and Re-engineering: Developing Data Informatics Capability in...
Open APC Data in Germany - A Contribution to Open Access Monitoring
Showcasing research data tools - Jisc Digifest 2016
Enabling Data-Intensive Science Through Data Infrastructures
OpenAIRE-connect: Services for open science
MOVING presentation at JSI
Ad

Viewers also liked (16)

PPT
La tradición de la rosca de reyes
DOCX
PPTX
Rosca de reyes
PPTX
Novena de navidad 2016
PPT
La tradición de la rosca de reyes
PPT
Ponentes 2016
DOCX
δημοσιευσεις με ριγανελαια
PDF
Data, Data, Data - Geoforum 2016 - Phil Bartie
PPT
Aesthetics in orhtododntics /certified fixed orthodontic courses by Indian d...
PDF
Industrial Hygiene
PPT
Orthodontic treament in mixed dentition
PDF
Useful vocabulary
PPT
Early and interceptive orthodontic treatment /certified fixed orthodontic cou...
PPTX
Teaching GIS to Non-Geographers
La tradición de la rosca de reyes
Rosca de reyes
Novena de navidad 2016
La tradición de la rosca de reyes
Ponentes 2016
δημοσιευσεις με ριγανελαια
Data, Data, Data - Geoforum 2016 - Phil Bartie
Aesthetics in orhtododntics /certified fixed orthodontic courses by Indian d...
Industrial Hygiene
Orthodontic treament in mixed dentition
Useful vocabulary
Early and interceptive orthodontic treatment /certified fixed orthodontic cou...
Teaching GIS to Non-Geographers
Ad

Similar to Wide access to spatial Citizen Science data - ECSA Berlin 2016 (20)

PPTX
COBWEB technology platform and future development needs
PDF
COBWEB technology platform and future development needs, ISPRA 2016
PPTX
COBWEB Summit at the OGC TC Dublin, 2016
ODP
Citizen science, vgi, geo crowd sourcing, big geo data how they matter to th...
PPTX
The Open Landscape of Geospatial Information:
PDF
Open Data and Open Software Geospatial Applications
PDF
COBWEB (presentation from Citizens’ Science and Smart Cities Summit) - Chris ...
PDF
The role of geospatial information in a hyper connected society
PDF
The role of geospatial information in a hyper connected society
PDF
The role of geospatial information in a hyper connected society
PPT
Towards the Open Geospatial Web
PDF
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
PPTX
DGI 2015 - London, UK
PPT
Towards the Open Geospatial Web (eurogeographics edition)
PPTX
COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...
PPTX
COBWEB Smart Technology = Smart Data? Citizen Science in the Dyfi Biosphere R...
PPTX
Comprehensive Overview of the Geoweb
 
PDF
EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...
PPTX
Geo know general presentation 2013
PPTX
IoT Meets Geo
COBWEB technology platform and future development needs
COBWEB technology platform and future development needs, ISPRA 2016
COBWEB Summit at the OGC TC Dublin, 2016
Citizen science, vgi, geo crowd sourcing, big geo data how they matter to th...
The Open Landscape of Geospatial Information:
Open Data and Open Software Geospatial Applications
COBWEB (presentation from Citizens’ Science and Smart Cities Summit) - Chris ...
The role of geospatial information in a hyper connected society
The role of geospatial information in a hyper connected society
The role of geospatial information in a hyper connected society
Towards the Open Geospatial Web
Semantics-enhanced Geoscience Interoperability, Analytics, and Applications
DGI 2015 - London, UK
Towards the Open Geospatial Web (eurogeographics edition)
COBWEB - infrastructure and platform for Environmental Crowd Sensing and Big ...
COBWEB Smart Technology = Smart Data? Citizen Science in the Dyfi Biosphere R...
Comprehensive Overview of the Geoweb
 
EEO/AGI-Scotland 2015: Citizen Science and GIScience - background and common ...
Geo know general presentation 2013
IoT Meets Geo

More from COBWEB Project (20)

PPTX
COBWEB A quality assurance workflow authoring tool for citizen science and cr...
PPTX
COBWEB - Semantics
PPT
COBWEB: Towards an Optimised Interoperability Framework for Citizen Science
PPT
COBWEB: Privacy and Security
PDF
Cobweb: In Pursuit of Conclusions
PPT
Cobweb: Using citizen science data to support flood risk modelling
PPTX
A Standardized Encoding to Exchange Citizen Science Data - ESCA 2015
PPTX
Learning in, about and for the Dyfi Biosphere - Kirsten Manley
PPTX
COBWEB RDA Plenery 5 - Joint meeting of IG Geospatial & IG Big Data - Didier...
PPTX
Citizen Observatories: A Standards Based Architecture - Dr Ingo Simonis, OGCE...
PDF
COBWEB - Chris Higgins, EDINA
PPT
COBWEB - Existing Work and Future Plans - Presentation by James Hodges of the...
PPTX
Coetiroedd Dyfi Woodlands Presentation by Kirsten Manley from COBWEB Workshop...
PPTX
Introduction to COBWEB - Chris Higgins, COBWEB
PPTX
COBWEB: helping to map vegetation - work with Aberystwyth University - Crona ...
PPTX
WP6 Demonstrators Estimating inundation extent from a distance - Brewar, Evan...
PPTX
Attention Citizens! Presentation as part of the Citizen Science Workshop - Ni...
PPTX
Ensuring the Citizen is at the heart of the COBWEB - Citizen Observatory Web ...
PPTX
COBWEB: Citizen Observatories Web Ecology meets the crowd - Crona Hodges
PPTX
General Overview of the COBWEB Project - Bart De Lathouwer and Chris Higgins
COBWEB A quality assurance workflow authoring tool for citizen science and cr...
COBWEB - Semantics
COBWEB: Towards an Optimised Interoperability Framework for Citizen Science
COBWEB: Privacy and Security
Cobweb: In Pursuit of Conclusions
Cobweb: Using citizen science data to support flood risk modelling
A Standardized Encoding to Exchange Citizen Science Data - ESCA 2015
Learning in, about and for the Dyfi Biosphere - Kirsten Manley
COBWEB RDA Plenery 5 - Joint meeting of IG Geospatial & IG Big Data - Didier...
Citizen Observatories: A Standards Based Architecture - Dr Ingo Simonis, OGCE...
COBWEB - Chris Higgins, EDINA
COBWEB - Existing Work and Future Plans - Presentation by James Hodges of the...
Coetiroedd Dyfi Woodlands Presentation by Kirsten Manley from COBWEB Workshop...
Introduction to COBWEB - Chris Higgins, COBWEB
COBWEB: helping to map vegetation - work with Aberystwyth University - Crona ...
WP6 Demonstrators Estimating inundation extent from a distance - Brewar, Evan...
Attention Citizens! Presentation as part of the Citizen Science Workshop - Ni...
Ensuring the Citizen is at the heart of the COBWEB - Citizen Observatory Web ...
COBWEB: Citizen Observatories Web Ecology meets the crowd - Crona Hodges
General Overview of the COBWEB Project - Bart De Lathouwer and Chris Higgins

Recently uploaded (20)

DOCX
Epoxy Coated Steel Bolted Tanks for Agricultural Waste Biogas Digesters Turns...
PPTX
Disposal Of Wastes.pptx according to community medicine
PPTX
Making GREEN and Sustainable Urban Spaces
PDF
Tree Biomechanics, a concise presentation
PDF
Effect of salinity on biochimical and anatomical characteristics of sweet pep...
PDF
Bai bao Minh chứng sk2-DBTrong-003757.pdf
DOCX
Epoxy Coated Steel Bolted Tanks for Farm Digesters Supports On-Farm Organic W...
PPTX
Holticulture, floriculte oleiriculture.pptx
PDF
Session7 Outlines of AR7 Reports Working Group III
PPTX
FIRE SAFETY SEMINAR SAMPLE FOR EVERYONE.pptx
DOCX
Epoxy Coated Steel Bolted Tanks for Beverage Wastewater Storage Manages Liqui...
PDF
Session 1 Introduction to the IPCC - Programme Officer M Shongwe
PPTX
The age of Artificial Intelligence and our future
PDF
Urban Hub 50: Spirits of Place - & the Souls' of Places
PPTX
Delivery census may 2025.pptxMNNN HJTDV U
DOCX
Epoxy Coated Steel Bolted Tanks for Fish Farm Water Provides Reliable Water f...
PPTX
Session8a AR6 Findings Working Group I Vice-Chair Nana Ama Browne Klutse
PPTX
carbon footprint, emissioncontrol and carbon tax
DOCX
Epoxy Coated Steel Bolted Tanks for Anaerobic Digestion (AD) Plants Core Comp...
PPTX
Concept of Safe and Wholesome Water.pptx
Epoxy Coated Steel Bolted Tanks for Agricultural Waste Biogas Digesters Turns...
Disposal Of Wastes.pptx according to community medicine
Making GREEN and Sustainable Urban Spaces
Tree Biomechanics, a concise presentation
Effect of salinity on biochimical and anatomical characteristics of sweet pep...
Bai bao Minh chứng sk2-DBTrong-003757.pdf
Epoxy Coated Steel Bolted Tanks for Farm Digesters Supports On-Farm Organic W...
Holticulture, floriculte oleiriculture.pptx
Session7 Outlines of AR7 Reports Working Group III
FIRE SAFETY SEMINAR SAMPLE FOR EVERYONE.pptx
Epoxy Coated Steel Bolted Tanks for Beverage Wastewater Storage Manages Liqui...
Session 1 Introduction to the IPCC - Programme Officer M Shongwe
The age of Artificial Intelligence and our future
Urban Hub 50: Spirits of Place - & the Souls' of Places
Delivery census may 2025.pptxMNNN HJTDV U
Epoxy Coated Steel Bolted Tanks for Fish Farm Water Provides Reliable Water f...
Session8a AR6 Findings Working Group I Vice-Chair Nana Ama Browne Klutse
carbon footprint, emissioncontrol and carbon tax
Epoxy Coated Steel Bolted Tanks for Anaerobic Digestion (AD) Plants Core Comp...
Concept of Safe and Wholesome Water.pptx

Wide access to spatial Citizen Science data - ECSA Berlin 2016

  • 1. Wide access to spatial Citizen Science data ECSA 2016, Berlin Paul van Genuchten, Lieke Verhelst, Clemens Portele
  • 3. About the authors Paul van Genuchten is a software engineer at “GeoCat BV”, supporting governments to publish (spatial/open) data on the web. Lieke Verhelst is owner of “Linked Data Factory”. Lieke is a linked data expert and has developed multiple ontologies in the scope of food-safety, soil science, nature reserves, water management Clemens Portele is managing director of “interactive instruments GmbH”. interactive instruments is a software engineering company in the spatial data infrastructure domain and is an active contributor to multiple OGC standards.
  • 4. COBWEB COBWEB is a research project to empower citizens with the ability to collect environmental information using mobile devices, which will then be made suitable for use in research, decision making and policy formation. GeoCat improves GeoNetwork opensource, targeting citizen science data discovery and visualisation in the scope of the COBWEB FP7 project. The project has received funding from the European Union under grant agreement No 308513
  • 10. The open data challenges - Discovery; people can’t find the data - Format; the data is exposed in complex services/formats - License; the license is restrictive - Aggregation level; “raw data now” * * Rufus Pollock, 2007 http://blog.okfn.org/2007/11/07/give-us-the-data-raw-and-give-it-to-us-now/
  • 11. Background One of the objectives of COBWEB is to publish citizen science data to GEOSS GEOSS has a focus on spatial standards (CSW, SensorWeb, WMS/WFS) Major part of citizen science community is not aware of these standards Average users use search engines to discover data and common formats to analyse data How to bridge the gap between services in GEOSS and search engines
  • 15. Geonovum testbed The gap between OGC and WEB standards is a general challenge W3C and OGC have set up a joint working group to develop best practices At the start of 2016 Geonovum (dutch national government) organised a testbed to move the ‘spatial data on the web’ best practices forward.
  • 16. What search engines expect HTML (text) output on unique persistent url’s An index that lists links to all url’s to discover HTML documents annotated with “schema.org”-markup transform web pages into structured data
  • 19. Schema.org and Citizen Science The Schema.org ontology currently does not provide classes for citizen science projects and observations An extension to schema.org can be proposed to model citizen science communities and observations, for example based on schema.org/Measurement
  • 21. A proxy approach A proxy layer transforms WFS/CSW requests to HTML annotated with schema.org The CSW proxy approach is implemented in GeoNetwork opensource For the WFS proxy approach a new open source product has been released by interactive instruments, called ‘LDproxy’
  • 26. {image of google structured data testing tool}
  • 28. A proxy approach to reach other communities A similar approach can be used to expose OGC services to other communities, such as citizen science developer community - CSW/iso19139 metadata exposed as DCAT/VOID in RDFa or rdf/xml - SOS/WFS/GML exposed as Darwin Core in RDFa or json-ld - A json API for web developers Also interesting would be to look at a vice versa approach, in which a proxy is used to expose unstructured citizen science data to the geoss community as WFS/SOS.
  • 29. Privacy and the search engines Some of the search engines are generally percieved as a challenge for privacy However; in this case it is the campaign organiser that should take measures A complicating factor is that citizens tend to like to advertise that they made a contribution, or even claim ownership of a contribution
  • 30. Privacy by design Minimise the transport and storage (timespan) of data that could be used to derive identity (minimise, separate, aggregate & hide*) Communicate transparently about the transport and storage strategy Offer users the ability to review and remove their personal data Transport a location/timestamp to the level of detail that is required for the use case Use a wallet with reliability-credits instead of keeping a user history for reliability assessment * https://www.pilab.nl/wp-content/uploads/2013/12/Privacy-design-strategies-JHH-5-12-2013.pdf
  • 31. “Privacy awareness is growing, it’s comparable with the stage of environmental awareness 40 years ago” * *Jaap-Henk Hoepman, Privacy & Identity Lab, Radboud University Nijmegen
  • 32. Conclusions A proxy approach for CSW is a good way to make existing published datasets more widely discoverable via alternative channels A proxy approach for WFS/SOS has potential to bridge the gap between OGC services and search engines, however currently the search engines have limited implementations for using the schema.org annotations Adopting an established standard helps in making data more widely available. There’s a growing number of tools available to facilitate to engage with open data