SlideShare a Scribd company logo
A linked open data based system for flexible
delineation of geographic areas
Peter van den Besselaar, Ali Khalili and Klaas Andries de Graaf
August 2017
Artificial Intelligence Section
Department of Computer Science
Faculty of Science
Department of Organization Sciences
Faculty of Social Science
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 2
SMS Platform Focus Goal
How to capture new insights
by integrating data from
multiple heterogeneous data
sources in the STI domain?
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 2
SMS Platform Focus Goal
How to capture new insights
by integrating data from
multiple heterogeneous data
sources in the STI domain?
Linked Data Creation
Linked Data Services
Applications
Use Cases
Data Ingestion
RISISPublicData
RISISPrivateData
OpenDataontheWeb
combine	one	or	more	SMS	services	with	other	
existing	services	and	applications	to	build	novel	
and	innovative	applications.
a
b
c
d
a) Dataset Metadata Editor
b) RISIS Datasets Portal
c) Spreadsheet add-on
d) Data Linking UI
In order to enable batch processing of data,
SMS provides a Google spreadsheet add-on.
This add-on allows users to enrich their data
directly in their spreadsheets.
A set of user interfaces to allow users
create their lenticular lenses –
different views on entity linking.
The RISIS dataset holders all have to describe their
datasets in a detailed, consistent, and uniform way.
To achieve this goal, RDF data model is used to describe
the RISIS datasets. To stimulate non-Semantic Web
users to generate valid RDF metadata descriptions, we
designed a novel user-friendly editor which hides the
complexity of RDF from non-technical users. The metadata
editor exploits the state of the art Web technologies to provide
user-friendly component to view and edit the metadata.
In order to exploit the generated metadata, RISIS datasets
portal brings a user interface to view and browse the
metadata. Faceted	browsing	allows	users	to	explore	the	
dataset	via	multiple	entry	points,	or	when	users	do	not	
know	what	they	are	looking	 for	beforehand. The portal also
handles user registration and supports the process of
reviewing visit/access requests to certain
RISIS datasets.
convert	unstructured	and	structured	data	to	RDF.
expose	the	functionality	of	the	SMS	platform	to	third-party	
users	by	standard	Application	Programming	Interfaces	(APIs).
Identity	Resolution	Services
Named	Entity	Recognition	Services
Metadata	Services
Category	Services
Basic	Geo	Services
Innovative	Geo	Services
Integration	Services
handle the entity disambiguation
problem and manage the same or
similar entities found in different
datasets.
extract and classify
named entities (e.g.
people, places,
organizations) in
unstructured text.
resolve the mapping
between heterogeneous
classification schemas
(e.g. WoS and FoS, or
different ISO codes for
countries).
allow search on datasets based on
the description of datasets in
multiple metadata categories (e.g.
language, time coverage, etc.).
deal with basic representation of
geographical data together with
geocoding functions and
identifying the boundaries
containing a given point.
provide innovative services based
on new notions of distance (e.g.
traffic congestion, language factor,
flight routes, etc.)
allow integration of data with RISIS public and
private datasets as well as open datasets and social
data available on the Web.
DBpedia
Wikidata
…
OrgRef
GRID
FundRef
Geoname
ISNI
VIAF
Cordis
?
Lenticular Lenses
When	comparing	two	entities,	
depending	on	the	user’s	
perspective	and	the	context	of	
study,	they	might	be	considered	
the	same,	similar	or	different.
Sometimes	two	organizations	
(e.g.	departments)	can	be	the	
same	– because	they	are	parts	
of	the	same	organization	
(university).	But	if	one	wants	to	
compare	departments,	this	is	
not	the	case.
Lenticular	lenses	support	linking	
the	entities	based	on	the	
context	of	use	and	provided	
features	of	data.
access	and	retrieve	heterogeneous	data.
Access Control Points (ACPs)
- provide	standard	interfaces	to	reduce	
technical	difficulties	of	accessing	data.
- provide	a	mechanism	to	coordinate	access	
to	data	based	on	the	user	role	and	the	
datasets	owner’s	requirements.
WWW
e) Faceted browser
Allows browsing a dataset using a set
of pre-defined facets
e
http://sms.risis.eu
The Semantically Mapping
Science (SMS)
Platform
RISIS Datasets: Entity Types
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 4
SMS Platform Data
Organization Product Agreement
Person Policy
Policy
Evaluation
Location
CIB ETER EUPRO JOREP Leiden-Ranking
MORE I Nano Profile SIPER VICO
Higher
Education
Firm
Funding
Body
Publication
Patent
Project
Investment
Funding
Program
Integration
RISIS Datasets: Entity Types
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 4
SMS Platform Data
Organization Product Agreement
Person Policy
Policy
Evaluation
Location
CIB ETER EUPRO JOREP Leiden-Ranking
MORE I Nano Profile SIPER VICO
Higher
Education
Firm
Funding
Body
Publication
Patent
Project
Investment
Funding
Program
Integration
RISIS WP9 Vision
	Proposing	S&T	map	of	Europe
Administrative Geo Boundaries
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 6
SMS Platform Geo Services
Municipalities
Functional Urban Areas (FUAs)
Functional Urban Areas (FUAs)
defined by OECD in collaboration with EC/Eurostat
consider factors beyond the predefined city boundaries to better
reflect the economic geography of where people live and work
Functional Urban Areas (FUAs)
OECD Metropolitan eXplorer: http://measuringurban.oecd.org
defined by OECD in collaboration with EC/Eurostat
consider factors beyond the predefined city boundaries to better
reflect the economic geography of where people live and work
population
area
GDP
environment (CO2 emissions and air pollution)
labour market (employment and unemployment growth)
innovation (patent intensity)
urban form and territorial organization
Functional Urban Areas (FUAs)
OECD Metropolitan eXplorer: http://measuringurban.oecd.org
Functional Urban Areas (FUAs)
Problem
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 10
SMS Platform Geo Services
Address FUA
?
Problem
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 10
SMS Platform Geo Services
Address FUA
?
• Vrije Universiteit Amsterdam
• De Boelelaan 1105, 1081 HV Amsterdam
Amsterdam (NL002)
Problem
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 10
SMS Platform Geo Services
Address FUA
?
• Vrije Universiteit Amsterdam
• De Boelelaan 1105, 1081 HV Amsterdam
Amsterdam (NL002)
OECD FUAs List
Problem
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 10
SMS Platform Geo Services
Address FUA
?
• Vrije Universiteit Amsterdam
• De Boelelaan 1105, 1081 HV Amsterdam
Amsterdam (NL002)
- Geocode to LAU (municipality)
OECD FUAs List
Problem
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 10
SMS Platform Geo Services
Address FUA
?
• Vrije Universiteit Amsterdam
• De Boelelaan 1105, 1081 HV Amsterdam
Amsterdam (NL002)
- Geocode to LAU (municipality)
- Shapefiles for FUAs or LAUs?
OECD FUAs List
Problem
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 10
SMS Platform Geo Services
Address FUA
?
• Vrije Universiteit Amsterdam
• De Boelelaan 1105, 1081 HV Amsterdam
Amsterdam (NL002)
- Geocode to LAU (municipality)
- Shapefiles for FUAs or LAUs?
OECD FUAs List
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 11
SMS Platform Geo Services
Building a Linked Open Data Space
for Flexible Delineation of
Geographic Areas
Goal
Geo
Boundaries
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 12
SMS Platform Geo Services Adaptive FUAs
Address
Flexible Geographic Areas
Administrative Boundaries
Coordinates
geocode
Geo Statistical
DataFGA
LifeCycle
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 13
SMS Platform Geo Services Linked Data
LifeCycle
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 13
SMS Platform Geo Services
Data Discovery &
Collection
Data Extraction
& Conversion
Service to
Application
Data Storage &
Querying
Data to Service
Data Linkage
Linked Data
LifeCycle
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 13
SMS Platform Geo Services
Data Discovery &
Collection
Data Extraction
& Conversion
Service to
Application
Data Storage &
Querying
Data to Service
Data Linkage
Linked Data
DATA DISCOVERY & COLLECTION
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 14
SMS Platform Geo Services
• OpenStreepMap (OSM)
• Database of Global Administrative Areas (GADM)
• Flickr Shapefiles Dataset
• Published Shapefiles for Individual Countries
• Published Geospatial RDF Datasets
Open Administrative Boundaries
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 15
SMS Platform Geo Services
• Level 1: super-national
administrations e.g.
European Union.
• Level 2: country borders
based on the political
entities listed on the ISO
3166 standard.
• Level 3 to 11: subnational
borders such as ``state'',
``province'', ``region'' and
``district''.
• Level 0: countries.
• Level 1 to 5: lower level
subdivisions such as
provinces, departments,
counties, etc.
depending on the size
and availability of data
for the underlying
country.
• Level 1: country
• Level 2: region
• Level 3: county
• Level 4: locality
• Level 5:
neighborhood
Open Administrative Boundaries: Example
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 16
SMS Platform Geo Services
DATA EXTRACTION & CONVERSION
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 17
SMS Platform Geo Services
GeoJSON
Enrichment
Functions
Mapping
Configurations
OSM XML
PBF
ESRI shapes
triplify
mapshaper
osmtogeojson
osmosis
DATA EXTRACTION & CONVERSION
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 18
SMS Platform Geo Services
Metadata about different levels provided by OSM
http://wiki.openstreetmap.org/wiki/Tag:boundary%3Dadministrative
DATA STORAGE & QUERYING
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 19
SMS Platform Geo Services
Virtuoso Geo Spatial
Geometry as SMS
internal representation
for Geo-data in RDF
DATA LINKAGE
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 20
SMS Platform Geo Services
- Query on metadata about the
administrative boundaries
- Find the alignment between levels
in different datasets
DATA LINKAGE
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 21
SMS Platform Geo Services
- used the possible mappings between datasets at different levels.
- check the overlaps of areas at the similar level, and for the matching areas apply
string matching to make sure that they refer to the same administrative boundary.
DATA LINKAGE
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 22
SMS Platform Geo Services
WikiData
GADM
Flickr
Shapes
OpenStreetMap
Administrative
Boundaries
DBpedia
StatisticalDataontheWeb
58,561
71,724
76,366 (288,667)
(276,975)
(344,269)
162,059
25,440
FUAs
DATA LINKAGE
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 22
SMS Platform Geo Services
WikiData
GADM
Flickr
Shapes
OpenStreetMap
Administrative
Boundaries
DBpedia
StatisticalDataontheWeb
58,561
71,724
76,366 (288,667)
(276,975)
(344,269)
162,059
25,440
FUAs
DATA LINKAGE
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 22
SMS Platform Geo Services
WikiData
GADM
Flickr
Shapes
OpenStreetMap
Administrative
Boundaries
DBpedia
StatisticalDataontheWeb
58,561
71,724
76,366 (288,667)
(276,975)
(344,269)
162,059
25,440
FUAs
http://api.sms.risis.eu/
DATA TO SERVICE
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 23
SMS Platform Geo Services
SERVICE TO APPLICATION
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 24
SMS Platform Geo Services
https://youtu.be/qZGDD5RN7pI?list=PLSBPxopOi20XPOn1sGBthbNtXIUOqM_4b
SMS LD Geo-Enrichment Tool
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 25
SMS Platform Geo Services
https://youtu.be/FFy4-Zlt_ak?list=PLSBPxopOi20XPOn1sGBthbNtXIUOqM_4b
Geo-data Exploration
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 26
SMS Platform Geo Enrichment
Investigating the effect of regional socio-economic properties
innovative activities, as stimulated by recent RTD policies in the
Netherlands.
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 27
SMS Platform Geo Services
Address
FUA
Administrative Boundaries
Coordinates
geocode
RVO Dataset
(research) and innovation
subsidies for organizations
and companies in the
Netherlands
GADM Dataset
CBS Dataset
statistical information on dimensions such
as labour and income, economy, society
and regional aspects of municipalities and
regions in the Netherlands.
Use Case
Iden5fying	innova5ve	areas
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 28
SMS Platform
CBS-NL
RVO-NL
• Couple	open	data	on	the	innovation	projects	with	the	
theoretically	defined	geo-boundaries	….
• ….	to	investigate	the	geography of	innovation
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 29
SMS Platform Geo Services Use Case
People Hybrid OECD FUAsBusinesses
People Hybrid OECD FUAsBusinesses
FGAs
Projects mapped to FGAs
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 30
SMS Platform
(3)	Statistical	data	about	
boundaries to	create	an	own	
geo-classification,	e.g.	CBS-NL
(2)	Open	boundaries
e.g. OpenStreetMap
(1) e.g.	innovation	project	
in	RVO-NL database
(4)	Distribution	 of	innovation	projects	over	
self	(theoretically)	defined	 area’s
Overview
(5)	Link	to	open	statistical	
data	:	e.g.,	Statistics	
Netherlands	or OECD
- a	wealth	of	contextual	
variables
(6)	Link	to	open	data	on	
organizations:	ORGREF,	
ETER,	CORDIS
AREAS
Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 31
SMS Platform Geo Services
Any questions? comments?
http://sms.risis.eu

More Related Content

PDF
Cloud Interoperability Infrastructures for Governments: The Government Servic...
PDF
WHITE PAPER: Data Harmonization & Interoperability in OpenTransportNet
PPS
2011 ITS World Congress - GO-Sync - A Framework to Synchronize Transit Agency...
PDF
OpenGovIntelligence Workshop at NTTS2017
PDF
OpenTransportNet: Stimulating Innovation with Open Geographic Information
PPT
[2015 e-Government Program] Action Plan : Warsaw(Poland)
PDF
Big data traffic management in vehicular ad-hoc network
Cloud Interoperability Infrastructures for Governments: The Government Servic...
WHITE PAPER: Data Harmonization & Interoperability in OpenTransportNet
2011 ITS World Congress - GO-Sync - A Framework to Synchronize Transit Agency...
OpenGovIntelligence Workshop at NTTS2017
OpenTransportNet: Stimulating Innovation with Open Geographic Information
[2015 e-Government Program] Action Plan : Warsaw(Poland)
Big data traffic management in vehicular ad-hoc network

What's hot (20)

PDF
GeoSEO and Map Series - Discovery Integrated With Geographical Search in Map ...
PPTX
[Srijan Wednesday Webinar] Leveraging the OGD Platform and Visualization Engine
PPT
Track D Lieven Raes - Belgium
PPTX
Dealing with Open Data in Istat
DOC
Notes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
PDF
Jose Arcos DG COMM- Data Visualisation on ec.europa.eu
PPT
What to do with the existing spatial data in planning
PPTX
UrbanIT Partner Presentation
PPTX
DATASTAT HUB
PDF
Tomas Knap: UnifiedViews in COMSODE pilot projects
PDF
Wikidata as a linking hub for knowledge organization systems? Integrating an ...
PPTX
PoolParty Semantic Suite - Solutions for Sustainable Development
PPT
EDF2014: Talk of Ioannis Kotsiopoulos, European Dynamics: Semantics – Interop...
PPTX
The Public Sector DNA on the web: semantically marking up government portals
PPTX
Bde euro proworkshop
PDF
ADEQUATe and CommuniData
PDF
EDF2014: Piek Vossen, Professor Computational Lexicology, VU University Amste...
PDF
2013 06-25 goedertier-inspire-2013
PPTX
Innovative Approaches for the collection of road transport statistics
PPT
Opportunities in Data
GeoSEO and Map Series - Discovery Integrated With Geographical Search in Map ...
[Srijan Wednesday Webinar] Leveraging the OGD Platform and Visualization Engine
Track D Lieven Raes - Belgium
Dealing with Open Data in Istat
Notes for talk on 12th June 2013 to Open Innovation meeting, Glasgow
Jose Arcos DG COMM- Data Visualisation on ec.europa.eu
What to do with the existing spatial data in planning
UrbanIT Partner Presentation
DATASTAT HUB
Tomas Knap: UnifiedViews in COMSODE pilot projects
Wikidata as a linking hub for knowledge organization systems? Integrating an ...
PoolParty Semantic Suite - Solutions for Sustainable Development
EDF2014: Talk of Ioannis Kotsiopoulos, European Dynamics: Semantics – Interop...
The Public Sector DNA on the web: semantically marking up government portals
Bde euro proworkshop
ADEQUATe and CommuniData
EDF2014: Piek Vossen, Professor Computational Lexicology, VU University Amste...
2013 06-25 goedertier-inspire-2013
Innovative Approaches for the collection of road transport statistics
Opportunities in Data
Ad

Similar to ERSA 2017: A linked open data based system for flexible delineation of geographic areas (20)

PDF
Sii-Mobility Km4City Smart City API and App
PPT
Web Mapping
PPT
Web Mapping - exploiting location based information through eGovernment
PPT
02 -how-will-inspire-influence-local-authorities-and-spatial-planning
PPTX
Dublinked tech workshop_15_dec2011
PDF
Snap4City November 2019 Course: Smart City IOT Data Ingestion Interoperabilit...
PDF
Ontology Building vs Data Harvesting and Cleaning for Smart-city Services
PDF
Semantically Mapping Science (SMS) Platform
PPT
Presentationsfk2010
PDF
Analysing Transportation Data with Open Source Big Data Analytic Tools
DOCX
BIG IOT AND SOCIAL NETWORKING DATA FOR SMART CITIES Alg.docx
DOCX
BIG IOT AND SOCIAL NETWORKING DATA FOR SMART CITIES Alg.docx
PPT
GIS 2.0 and Neogeography
PDF
remotesensing-12-01253.pdf
PPTX
BDE_MobilTUMWorkshop
PDF
SC7 Workshop 3: Space-based applications and Big Data
PDF
Maps4Finland 28.8.2012, Jari Reini
PDF
Maps4 finland 28.8.2012, jari reini
PPT
Symposium 2008
PPT
Dotted Eyes - Open Software, Standards and Data
Sii-Mobility Km4City Smart City API and App
Web Mapping
Web Mapping - exploiting location based information through eGovernment
02 -how-will-inspire-influence-local-authorities-and-spatial-planning
Dublinked tech workshop_15_dec2011
Snap4City November 2019 Course: Smart City IOT Data Ingestion Interoperabilit...
Ontology Building vs Data Harvesting and Cleaning for Smart-city Services
Semantically Mapping Science (SMS) Platform
Presentationsfk2010
Analysing Transportation Data with Open Source Big Data Analytic Tools
BIG IOT AND SOCIAL NETWORKING DATA FOR SMART CITIES Alg.docx
BIG IOT AND SOCIAL NETWORKING DATA FOR SMART CITIES Alg.docx
GIS 2.0 and Neogeography
remotesensing-12-01253.pdf
BDE_MobilTUMWorkshop
SC7 Workshop 3: Space-based applications and Big Data
Maps4Finland 28.8.2012, Jari Reini
Maps4 finland 28.8.2012, jari reini
Symposium 2008
Dotted Eyes - Open Software, Standards and Data
Ad

More from Ali Khalili (12)

PDF
FERASAT: A Serendipity-Fostering Faceted Browser for Linked Data
PDF
An introduction to Linked Open Data
PDF
Human-Linked Data Interaction
PDF
WYSIWYQ -- What You See Is What You Query
PPTX
Semantically Mapping Science (SMS)
PDF
Adaptive Linked Data-driven Web Components: Building Flexible and Reusable Se...
PDF
LD-R Presentation at ESWC2016 Developers Hackshop
PDF
Web of Data and its Status on Persian Web Data Space
PDF
An introduction to Linked (Open) Data
PDF
A Semantics-based User Interface Model for Content Annotation, Authoring and ...
PPTX
conTEXT -- Lightweight Text Analytics using Linked Data
PPTX
SlideWiki: Elicitation and Sharing of Knowledge using Presentations
FERASAT: A Serendipity-Fostering Faceted Browser for Linked Data
An introduction to Linked Open Data
Human-Linked Data Interaction
WYSIWYQ -- What You See Is What You Query
Semantically Mapping Science (SMS)
Adaptive Linked Data-driven Web Components: Building Flexible and Reusable Se...
LD-R Presentation at ESWC2016 Developers Hackshop
Web of Data and its Status on Persian Web Data Space
An introduction to Linked (Open) Data
A Semantics-based User Interface Model for Content Annotation, Authoring and ...
conTEXT -- Lightweight Text Analytics using Linked Data
SlideWiki: Elicitation and Sharing of Knowledge using Presentations

Recently uploaded (20)

PDF
Hindi spoken digit analysis for native and non-native speakers
PPT
Module 1.ppt Iot fundamentals and Architecture
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
STKI Israel Market Study 2025 version august
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PDF
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PPTX
Chapter 5: Probability Theory and Statistics
PPTX
The various Industrial Revolutions .pptx
PPTX
Group 1 Presentation -Planning and Decision Making .pptx
PDF
Unlock new opportunities with location data.pdf
PPT
What is a Computer? Input Devices /output devices
DOCX
search engine optimization ppt fir known well about this
PDF
August Patch Tuesday
PDF
Getting Started with Data Integration: FME Form 101
Hindi spoken digit analysis for native and non-native speakers
Module 1.ppt Iot fundamentals and Architecture
Zenith AI: Advanced Artificial Intelligence
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
Univ-Connecticut-ChatGPT-Presentaion.pdf
1 - Historical Antecedents, Social Consideration.pdf
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
Assigned Numbers - 2025 - Bluetooth® Document
STKI Israel Market Study 2025 version august
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
NewMind AI Weekly Chronicles – August ’25 Week III
Chapter 5: Probability Theory and Statistics
The various Industrial Revolutions .pptx
Group 1 Presentation -Planning and Decision Making .pptx
Unlock new opportunities with location data.pdf
What is a Computer? Input Devices /output devices
search engine optimization ppt fir known well about this
August Patch Tuesday
Getting Started with Data Integration: FME Form 101

ERSA 2017: A linked open data based system for flexible delineation of geographic areas

  • 1. A linked open data based system for flexible delineation of geographic areas Peter van den Besselaar, Ali Khalili and Klaas Andries de Graaf August 2017 Artificial Intelligence Section Department of Computer Science Faculty of Science Department of Organization Sciences Faculty of Social Science
  • 2. Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 2 SMS Platform Focus Goal How to capture new insights by integrating data from multiple heterogeneous data sources in the STI domain?
  • 3. Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 2 SMS Platform Focus Goal How to capture new insights by integrating data from multiple heterogeneous data sources in the STI domain?
  • 4. Linked Data Creation Linked Data Services Applications Use Cases Data Ingestion RISISPublicData RISISPrivateData OpenDataontheWeb combine one or more SMS services with other existing services and applications to build novel and innovative applications. a b c d a) Dataset Metadata Editor b) RISIS Datasets Portal c) Spreadsheet add-on d) Data Linking UI In order to enable batch processing of data, SMS provides a Google spreadsheet add-on. This add-on allows users to enrich their data directly in their spreadsheets. A set of user interfaces to allow users create their lenticular lenses – different views on entity linking. The RISIS dataset holders all have to describe their datasets in a detailed, consistent, and uniform way. To achieve this goal, RDF data model is used to describe the RISIS datasets. To stimulate non-Semantic Web users to generate valid RDF metadata descriptions, we designed a novel user-friendly editor which hides the complexity of RDF from non-technical users. The metadata editor exploits the state of the art Web technologies to provide user-friendly component to view and edit the metadata. In order to exploit the generated metadata, RISIS datasets portal brings a user interface to view and browse the metadata. Faceted browsing allows users to explore the dataset via multiple entry points, or when users do not know what they are looking for beforehand. The portal also handles user registration and supports the process of reviewing visit/access requests to certain RISIS datasets. convert unstructured and structured data to RDF. expose the functionality of the SMS platform to third-party users by standard Application Programming Interfaces (APIs). Identity Resolution Services Named Entity Recognition Services Metadata Services Category Services Basic Geo Services Innovative Geo Services Integration Services handle the entity disambiguation problem and manage the same or similar entities found in different datasets. extract and classify named entities (e.g. people, places, organizations) in unstructured text. resolve the mapping between heterogeneous classification schemas (e.g. WoS and FoS, or different ISO codes for countries). allow search on datasets based on the description of datasets in multiple metadata categories (e.g. language, time coverage, etc.). deal with basic representation of geographical data together with geocoding functions and identifying the boundaries containing a given point. provide innovative services based on new notions of distance (e.g. traffic congestion, language factor, flight routes, etc.) allow integration of data with RISIS public and private datasets as well as open datasets and social data available on the Web. DBpedia Wikidata … OrgRef GRID FundRef Geoname ISNI VIAF Cordis ? Lenticular Lenses When comparing two entities, depending on the user’s perspective and the context of study, they might be considered the same, similar or different. Sometimes two organizations (e.g. departments) can be the same – because they are parts of the same organization (university). But if one wants to compare departments, this is not the case. Lenticular lenses support linking the entities based on the context of use and provided features of data. access and retrieve heterogeneous data. Access Control Points (ACPs) - provide standard interfaces to reduce technical difficulties of accessing data. - provide a mechanism to coordinate access to data based on the user role and the datasets owner’s requirements. WWW e) Faceted browser Allows browsing a dataset using a set of pre-defined facets e http://sms.risis.eu The Semantically Mapping Science (SMS) Platform
  • 5. RISIS Datasets: Entity Types Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 4 SMS Platform Data Organization Product Agreement Person Policy Policy Evaluation Location CIB ETER EUPRO JOREP Leiden-Ranking MORE I Nano Profile SIPER VICO Higher Education Firm Funding Body Publication Patent Project Investment Funding Program Integration
  • 6. RISIS Datasets: Entity Types Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 4 SMS Platform Data Organization Product Agreement Person Policy Policy Evaluation Location CIB ETER EUPRO JOREP Leiden-Ranking MORE I Nano Profile SIPER VICO Higher Education Firm Funding Body Publication Patent Project Investment Funding Program Integration
  • 8. Administrative Geo Boundaries Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 6 SMS Platform Geo Services Municipalities
  • 11. defined by OECD in collaboration with EC/Eurostat consider factors beyond the predefined city boundaries to better reflect the economic geography of where people live and work Functional Urban Areas (FUAs) OECD Metropolitan eXplorer: http://measuringurban.oecd.org
  • 12. defined by OECD in collaboration with EC/Eurostat consider factors beyond the predefined city boundaries to better reflect the economic geography of where people live and work population area GDP environment (CO2 emissions and air pollution) labour market (employment and unemployment growth) innovation (patent intensity) urban form and territorial organization Functional Urban Areas (FUAs) OECD Metropolitan eXplorer: http://measuringurban.oecd.org
  • 14. Problem Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 10 SMS Platform Geo Services Address FUA ?
  • 15. Problem Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 10 SMS Platform Geo Services Address FUA ? • Vrije Universiteit Amsterdam • De Boelelaan 1105, 1081 HV Amsterdam Amsterdam (NL002)
  • 16. Problem Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 10 SMS Platform Geo Services Address FUA ? • Vrije Universiteit Amsterdam • De Boelelaan 1105, 1081 HV Amsterdam Amsterdam (NL002) OECD FUAs List
  • 17. Problem Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 10 SMS Platform Geo Services Address FUA ? • Vrije Universiteit Amsterdam • De Boelelaan 1105, 1081 HV Amsterdam Amsterdam (NL002) - Geocode to LAU (municipality) OECD FUAs List
  • 18. Problem Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 10 SMS Platform Geo Services Address FUA ? • Vrije Universiteit Amsterdam • De Boelelaan 1105, 1081 HV Amsterdam Amsterdam (NL002) - Geocode to LAU (municipality) - Shapefiles for FUAs or LAUs? OECD FUAs List
  • 19. Problem Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 10 SMS Platform Geo Services Address FUA ? • Vrije Universiteit Amsterdam • De Boelelaan 1105, 1081 HV Amsterdam Amsterdam (NL002) - Geocode to LAU (municipality) - Shapefiles for FUAs or LAUs? OECD FUAs List
  • 20. Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 11 SMS Platform Geo Services Building a Linked Open Data Space for Flexible Delineation of Geographic Areas Goal
  • 21. Geo Boundaries Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 12 SMS Platform Geo Services Adaptive FUAs Address Flexible Geographic Areas Administrative Boundaries Coordinates geocode Geo Statistical DataFGA
  • 22. LifeCycle Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 13 SMS Platform Geo Services Linked Data
  • 23. LifeCycle Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 13 SMS Platform Geo Services Data Discovery & Collection Data Extraction & Conversion Service to Application Data Storage & Querying Data to Service Data Linkage Linked Data
  • 24. LifeCycle Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 13 SMS Platform Geo Services Data Discovery & Collection Data Extraction & Conversion Service to Application Data Storage & Querying Data to Service Data Linkage Linked Data
  • 25. DATA DISCOVERY & COLLECTION Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 14 SMS Platform Geo Services • OpenStreepMap (OSM) • Database of Global Administrative Areas (GADM) • Flickr Shapefiles Dataset • Published Shapefiles for Individual Countries • Published Geospatial RDF Datasets
  • 26. Open Administrative Boundaries Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 15 SMS Platform Geo Services • Level 1: super-national administrations e.g. European Union. • Level 2: country borders based on the political entities listed on the ISO 3166 standard. • Level 3 to 11: subnational borders such as ``state'', ``province'', ``region'' and ``district''. • Level 0: countries. • Level 1 to 5: lower level subdivisions such as provinces, departments, counties, etc. depending on the size and availability of data for the underlying country. • Level 1: country • Level 2: region • Level 3: county • Level 4: locality • Level 5: neighborhood
  • 27. Open Administrative Boundaries: Example Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 16 SMS Platform Geo Services
  • 28. DATA EXTRACTION & CONVERSION Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 17 SMS Platform Geo Services GeoJSON Enrichment Functions Mapping Configurations OSM XML PBF ESRI shapes triplify mapshaper osmtogeojson osmosis
  • 29. DATA EXTRACTION & CONVERSION Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 18 SMS Platform Geo Services Metadata about different levels provided by OSM http://wiki.openstreetmap.org/wiki/Tag:boundary%3Dadministrative
  • 30. DATA STORAGE & QUERYING Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 19 SMS Platform Geo Services Virtuoso Geo Spatial Geometry as SMS internal representation for Geo-data in RDF
  • 31. DATA LINKAGE Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 20 SMS Platform Geo Services - Query on metadata about the administrative boundaries - Find the alignment between levels in different datasets
  • 32. DATA LINKAGE Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 21 SMS Platform Geo Services - used the possible mappings between datasets at different levels. - check the overlaps of areas at the similar level, and for the matching areas apply string matching to make sure that they refer to the same administrative boundary.
  • 33. DATA LINKAGE Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 22 SMS Platform Geo Services WikiData GADM Flickr Shapes OpenStreetMap Administrative Boundaries DBpedia StatisticalDataontheWeb 58,561 71,724 76,366 (288,667) (276,975) (344,269) 162,059 25,440 FUAs
  • 34. DATA LINKAGE Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 22 SMS Platform Geo Services WikiData GADM Flickr Shapes OpenStreetMap Administrative Boundaries DBpedia StatisticalDataontheWeb 58,561 71,724 76,366 (288,667) (276,975) (344,269) 162,059 25,440 FUAs
  • 35. DATA LINKAGE Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 22 SMS Platform Geo Services WikiData GADM Flickr Shapes OpenStreetMap Administrative Boundaries DBpedia StatisticalDataontheWeb 58,561 71,724 76,366 (288,667) (276,975) (344,269) 162,059 25,440 FUAs
  • 36. http://api.sms.risis.eu/ DATA TO SERVICE Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 23 SMS Platform Geo Services
  • 37. SERVICE TO APPLICATION Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 24 SMS Platform Geo Services https://youtu.be/qZGDD5RN7pI?list=PLSBPxopOi20XPOn1sGBthbNtXIUOqM_4b
  • 38. SMS LD Geo-Enrichment Tool Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 25 SMS Platform Geo Services https://youtu.be/FFy4-Zlt_ak?list=PLSBPxopOi20XPOn1sGBthbNtXIUOqM_4b
  • 39. Geo-data Exploration Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 26 SMS Platform Geo Enrichment
  • 40. Investigating the effect of regional socio-economic properties innovative activities, as stimulated by recent RTD policies in the Netherlands. Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 27 SMS Platform Geo Services Address FUA Administrative Boundaries Coordinates geocode RVO Dataset (research) and innovation subsidies for organizations and companies in the Netherlands GADM Dataset CBS Dataset statistical information on dimensions such as labour and income, economy, society and regional aspects of municipalities and regions in the Netherlands. Use Case
  • 41. Iden5fying innova5ve areas Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 28 SMS Platform CBS-NL RVO-NL • Couple open data on the innovation projects with the theoretically defined geo-boundaries …. • …. to investigate the geography of innovation
  • 42. Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 29 SMS Platform Geo Services Use Case People Hybrid OECD FUAsBusinesses People Hybrid OECD FUAsBusinesses FGAs Projects mapped to FGAs
  • 43. Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 30 SMS Platform (3) Statistical data about boundaries to create an own geo-classification, e.g. CBS-NL (2) Open boundaries e.g. OpenStreetMap (1) e.g. innovation project in RVO-NL database (4) Distribution of innovation projects over self (theoretically) defined area’s Overview (5) Link to open statistical data : e.g., Statistics Netherlands or OECD - a wealth of contextual variables (6) Link to open data on organizations: ORGREF, ETER, CORDIS AREAS
  • 44. Semantically Mapping Science (SMS) Platform: http://sms.risis.eu 31 SMS Platform Geo Services Any questions? comments? http://sms.risis.eu