SlideShare a Scribd company logo
Machine-Interpretable
Dataset and Service Descriptions
for Heterogeneous Data Access & Retrieval
Anastasia Dimou, Ruben Verborgh,
Miel Vander Sande, Erik Mannens, Rik Van de Walle
Anastasia.Dimou@UGent.be @natadimou
Ghent University – iMinds – Multimedia Lab
http://RML.io
Semantic Web enabled applications
rely on data represented as
Linked Open Data
Linked Open Data
describe domain-level knowledge
that is understandable by
both humans and machines
Resource Description Framework (RDF)
is the prevalent data model
for describing Linked Open Data
predicatesubject object
Resource Description Framework (RDF)
ex:1 ex:MMLabex:works
“Anastasia Dimou”
ex:1 ex:MMLabex:works
“Anastasia Dimou”
ex:2 ex:MMLabex:works
“Ruben Verborgh”
ex:1 ex:MMLabex:works
“Anastasia Dimou”
ex:2 ex:MMLabex:works
“Ruben Verborgh”
ex:3 ex:MMLabex:works
“Miel Vander Sande”
ex:1 ex:MMLabex:works
“Anastasia Dimou”
ex:locatedex:MMLab ex:Ghent
ex:2 ex:MMLabex:works
“Ruben Verborgh”
ex:3 ex:MMLabex:works
“Miel Vander Sande”
ex:{id}
ex:{lab}
ex:located
ex:{lab} ex:{city}
sets of triples of a dataset have repetitive patterns
“{firstname}
{surname}”
ex:{id}
ex:{lab}
sets of triples of a dataset have repetitive patterns
“{firstname}
{surname}”
triple-oriented mapping languages
formalize patterns into rules to map data to RDF
ex:located
ex:{lab} ex:{city}
RDF Mapping Language (RML)
map any data to RDF
uniform, integrable, interoperable, extensible
extends the W3C-recommended R2RML
http://RML.io
A. Dimou, M. Vander Sande, P. Colpaert, R. Verborgh, E. Mannens, and R. Van de Walle.
RML: A Generic Language for Integrated RDF Mappings of Heterogeneous Data.
In Proceedings of the 7th Workshop on Linked Data on the Web (LDOW2014), 2014.
RML describes
rules to map any structured data to RDF
RML supports any data independently of
which structure and format they have
where they originally reside
how they are accessed & retrieved
data access and retrieval
is manually performed
remains hard-coded
Mapping data
any data to RDF with RML
Specifying data
which data form a data input
how to reference data input extracts
Accessing & Retrieving data
data input from original source(s)
Mapping data
any data to RDF with RML
Specifying
which data form a data input
how to reference data input extracts
Accessing & Retrieving
data input from original source(s)
rr:constant
ex:located
rr:template
“http://ex.com/{lab}”
rr:template
“http://ex.com/{city}”
rr:template
“http://ex.com/{id}”
rr:template
“http://ex.com/{lab}”
rr:template “{firstname} {surname}”
rr:termType rr:Literal
RDF Mapping Language (RML)
@prefix rr: <http://www.w3.org/ns/r2rml#>
Predicate MapSubject
Map
Object
Map
<#TriplesMap>
RDF Mapping Language (RML)
rr:constant
ex:located
rr:template
“http://ex.com/{lab}”
rr:template
“http://ex.com/{city}”
rr:template
“http://ex.com/{lab}”
rr:template
“http://ex.com/{lab}”
<#ResearcherMap>
<#LabMap>
rr:template “{firstname} {surname}”
rr:termType rr:Literal
Mapping data
data to RDF with RML
Specifying data
which data form a data input
how to reference data input extracts
Accessing & Retrieving data
data input from original source(s)
Triples Map
RDF Mapping Language (RML)
Predicate
Object Map
Subject
Map
Predicate
Map
Object Map
Triples Map
RDF Mapping Language (RML)
Predicate
Object Map
Subject
Map
Predicate
Map
Object Map
Logical
Source
Support data in Heterogeneous Structures
tabular-structured
hierarchical-structured
(semi-)structured
… … …
Support data in Heterogeneous Structures and Formats
tabular-structured
tables in DBs or CSV files …
hierarchical-structured
JSON or XML …
(semi-)structured
HTML …
… … …
rr:template
“http://ex.com/{id}”
rr:template
“http://ex.com/{lab}”
<#ResearcherMap> rr:template “{firstname} {surname}”
rr:termType rr:Literal
id firstname surname lab
1 Anastasia Dimou MMLab
2 Ruben Verborgh MMLab
3 Miel Vander Sande MMLab
support tabular-structured data
rr:constant
ex:located
rr:template
“http://ex.com/
{/labs/lab/short}”
rr:template
“http://ex.com/
{/labs/lab/location/city}”
<#LabMap>
<labs>
<lab>
<short>MMLab</short>
<title>Multimedia Lab</title>
<location>
<city>Ghent</city>
</location>
</lab>
<lab> …. </lab>
…
</labs>
support hierarchical-structured data
rr:constant
ex:located
rr:template
“http://ex.com/
{/labs/lab/short}”
rr:template
“http://ex.com/
{/labs/lab/location/city}”
<#LabMap>
<labs>
<lab>
<short>MMLab</short>
<title>Multimedia Lab</title>
<location>
<city>Ghent</city>
</location>
</lab>
<lab> …. </lab>
…
</labs>
How to reference data extracts?
Triples Map
RDF Mapping Language (RML)
Predicate
Object Map
Subject
Map
Predicate
Map
Object Map
Logical
Source
Reference
Formulation
<labs>
<lab>
<short>MMLab</short>
<title>Multimedia Lab</title>
<location>
<city>Ghent</city>
</location>
</lab>
<lab> …. </lab>
…
</labs>
<#Lab
Logical
Source>
ql:XPath
rr:constant
ex:located
rr:template
“http://ex.com/
{/labs/lab/short}”
rr:template
“http://ex.com/
{/labs/lab/location/city}”
<#LabMap>
<labs>
<lab>
<short>MMLab</short>
<title>Multimedia Lab</title>
<location>
<city>Ghent</city>
</location>
</lab>
<lab> …. </lab>
…
</labs>
<#Lab
Logical
Source>
ql:XPath
rr:constant
ex:located
rr:template
“http://ex.com/
{/labs/lab/short}”
rr:template
“http://ex.com/
{/labs/lab/location/city}”
<#LabMap>
How to iterate over the data?
Triples Map
RDF Mapping Language (RML)
Predicate
Object Map
Subject
Map
Predicate
Map
Object Map
Logical
Source
Reference
Formulation
iterator
<labs>
<lab>
<short>MMLab</short>
<title>Multimedia Lab</title>
<location>
<city>Ghent</city>
</location>
</lab>
<lab> …. </lab>
…
</labs>
<#Lab
Logical
Source>
ql:XPath
“/labs/lab”
rr:constant
ex:located
rr:template
“http://ex.com/
{/labs/lab/short}”
rr:template
“http://ex.com/
{/labs/lab/location/city}”
<#LabMap>
Mapping data
data to RDF with RML
Specifying data
which data form a data source
how to reference data extracts
Accessing & Retrieving data
data from their original sources
Input
data
Input
data
Input
data
Output
RDF
Mapping
module
RML
Processor
Map
doc
Data
source
Access
interface
Input
data
Input
data
Input
data
Output
RDF
Mapping
module
RML
Processor
Map
doc
Data
source
Access
interface
Data
source
Access
interface
Retrieval
module
Source
description
Data
source
Access
interface
Input
data
Input
data
Input
data
Output
RDF
Mapping
module
RML
Processor
Map
doc
Data
source
Access
interface
Data
source
Access
interface
Retrieval
module
Source
description
Where does this data originally come from?
Support different Locations and Access Interfaces
Local File(s)
Database connectivity
Web source(s)
RDF source(s)
Dataset and Service Vocabularies
advertising in machine-interpretable fashion
how to access the underlying data
can also be used in combination with RML
to retrieve the data input to be mapped
from its original source
Support different Locations and Access Interfaces
Local File(s)
Database connectivity
D2RQ
Web source(s) (Web API/service)
DCAT, CSVW, Hydra, VOiD (Dataset)
RDF source(s)
VOiD (Endpoint), SPARQL-SD
Triples Map
RDF Mapping Language (RML)
Predicate
Object Map
Subject
Map
Predicate
Map
Object Map
Logical
Source
Reference
Formulation
iterator
Source
<labs>
<lab>
<short>MMLab</short>
<title>Multimedia Lab</title>
<location>
<city>Ghent</city>
</location>
</lab>
<lab> …. </lab>
…
</labs>
<#Lab
Logical
Source>
ql:XPath
rr:constant
ex:located
rr:template
“http://ex.com/
{/labs/lab/short}”
rr:template
“http://ex.com/
{/labs/lab/location/city}”
<#LabMap>
“/labs/lab”
_:Source
Where does this data originally come from?
file.xml
XML
data
Output
RDF
Mapping
module
RML
Processor
Map
doc
Retrieval
module
Support Local File(s)
<labs>
<lab>
<short>MMLab</short>
<title>Multimedia Lab</title>
<location>
<city>Ghent</city>
</location>
</lab>
<lab> …. </lab>
…
</labs>
<#Lab
Logical
Source>
ql:XPath
rr:constant
ex:located
rr:template
“http://ex.com/
{/labs/lab/short}”
rr:template
“http://ex.com/
{/labs/lab/location/city}”
<#LabMap>
“/labs/lab”
“file.xml”
Support Local File(s)
file.xml
WEBAPI
DCAT
XML
data
Output
RDF
Mapping
module
RML
Processor
Map
doc
Retrieval
module
Source
description
Support file(s) published on the Web
<labs>
<lab>
<short>MMLab</short>
<title>Multimedia Lab</title>
<location>
<city>Ghent</city>
</location>
</lab>
<lab> …. </lab>
…
</labs>
<#Lab
Logical
Source>
ql:XPath
dcat:
distribution
a dcat:
Distribution
“/labs/lab”
_:Source
Support dataset on the Web (DCAT)
_:Source
dcat:Dataset
<http://ex.com/
file.xml>
dcat:
downloadUrl
file.xml
WEBAPI
DCAT
XML
data
JSON
data
Output
RDF
Mapping
module
RML
Processor
Map
doc
Data
repo
WEBAPI
Hydra
Retrieval
module
Source
description
Support data derived from a Web API
<labs>
<lab>
<short>MMLab</short>
<title>Multimedia Lab</title>
<location>
<city>Ghent</city>
</location>
</lab>
<lab> …. </lab>
…
</labs>
<#Lab
Logical
Source>
ql:XPath
hydra:
template
“http://ex.com/lab?
name={labName}”
“/labs/lab”
_:Source
Support data from a Web API (Hydra)
_:Source
hydra:
IriTemplate
file.xml
WEBAPI
DCAT
XML
data
JSON
data
tabular
data
Output
RDF
Mapping
module
RML
Processor
Map
doc
Data
repo
WEBAPI
Hydra
Data
base
JDBC
D2RQ
Retrieval
module
Source
description
rr:template
“http://ex.com/{id}”
rr:template
“http://ex.com/{lab}”
<#ResearcherMap> rr:template “{firstname} {surname}”
rr:termType rr:Literal
id firstname surname lab
1 Anastasia Dimou MMLab
2 Ruben Verborgh MMLab
3 Miel Vander
Sande
MMLab
Support tabular-structured data
<#DB
Logical
Source>
rr:SQL2008
“…”
_:Source
“SELECT …”
rr:template
“http://ex.com/{id}”
rr:template
“http://ex.com/{lab}”
<#ResearcherMap> rr:template “{firstname} {surname}”
rr:termType rr:Literal
Support tabular-structured data
<#DB
Logical
Source>
rr:SQL2008
“…”
_:Source
“SELECT …”
“…”
_:Source
d2rq:Database
“…”
“…”
“…”
file.xml
WEBAPI
DCAT
XML
data
JSON
data
tabular
data
Output
RDF
Mapping
module
RML
Processor
Map
doc
Data
repo
WEBAPI
Hydra
Data
base
JDBC
D2RQ
Retrieval
module
Source
description
Triple
store
SPARQL
ex:located
ex:{lab}
dbpedia:
{city}
ex:located
ex:{lab} ex:{city}
object defined in existing RDF source(s)
<#Lab
Logical
Source>
ql:XPath
rr:constant
ex:located
rr:template
“http://ex.com/
{/labs/lab/short}”
rml:reference “{/…/city}”
rr:termType rr:IRI
<#LabMap>
“/labs/lab”
_:Source
<#Dbpedia
Logical
Source>
ql:XPath
“/…/result”
DBpedia
<#DBpediaMap>
ex:located
ex:{lab}
dbpedia:
{city}
“SELECT …”
<#Lab
Logical
Source>
ql:XPath
rr:constant
ex:located
rr:template
“http://ex.com/
{/labs/lab/short}”
rml:reference “{/…/city}”
rr:termType rr:IRI
<#LabMap>
“/labs/lab”
_:Source
<#Dbpedia
Logical
Source>
ql:XPath
“/…/result”
DBpedia
<#DBpediaMap>
ex:located
ex:{lab}
dbpedia:
{city}
“SELECT …”
RML Editor (http://RML.io/RMLeditor)
Mapping data
any data to RDF with RML
Specifying data
which data form a data input
how to reference data input extracts
Accessing & Retrieving data
data input from original source(s)
Data access, retrieval and mapping descriptions
are machine-interpretable
Granular robust solution based on RML
which further automates and facilitates
the generation of RDF representations
RML.io
Questions?
Anastasia Dimou
@natadimou

More Related Content

PDF
A Generic Language for Integrated RDF Mappings of Heterogeneous Data
PDF
Assessing and Refining Mappings to RDF to Improve Dataset Quality
PDF
DBpedia Mappings Quality Assessment
PDF
Mapping Hierarchical Sources into RDF using the RML Mapping Language
PDF
Mappings Validation
PDF
Introduction to data analysis using R
PPTX
PPT
Achieving time effective federated information from scalable rdf data using s...
A Generic Language for Integrated RDF Mappings of Heterogeneous Data
Assessing and Refining Mappings to RDF to Improve Dataset Quality
DBpedia Mappings Quality Assessment
Mapping Hierarchical Sources into RDF using the RML Mapping Language
Mappings Validation
Introduction to data analysis using R
Achieving time effective federated information from scalable rdf data using s...

What's hot (20)

PDF
Mapping, Interlinking and Exposing MusicBrainz as Linked Data
PDF
PPTX
Timbuctoo 2 EASY
PPT
Scalable Data Analysis in R -- Lee Edlefsen
PPTX
SWT Lecture Session 2 - RDF
PPT
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
PPT
DB and IR Integration
PPT
DB-IR-ranking
PPT
The Network Data Structure in Computing
PDF
Linked (Open) Data
PPT
Tracing Networks: Ontology Software in a Nutshell
PPTX
RDF data model
PPTX
HiBISCuS: Hypergraph-Based Source Selection for SPARQL Endpoint Federation
PPT
Computing with Directed Labeled Graphs
PPTX
Efficient RDF Interchange (ERI) Format for RDF Data Streams
PPTX
FedX - Optimization Techniques for Federated Query Processing on Linked Data
PPTX
Efficient source selection for sparql endpoint federation
PPTX
Visualising the Australian open data and research data landscape
PPTX
Federated Query Formulation and Processing Through BioFed
PPT
Introduction To RDF and RDFS
Mapping, Interlinking and Exposing MusicBrainz as Linked Data
Timbuctoo 2 EASY
Scalable Data Analysis in R -- Lee Edlefsen
SWT Lecture Session 2 - RDF
A Practical Ontology for the Large-Scale Modeling of Scholarly Artifacts and ...
DB and IR Integration
DB-IR-ranking
The Network Data Structure in Computing
Linked (Open) Data
Tracing Networks: Ontology Software in a Nutshell
RDF data model
HiBISCuS: Hypergraph-Based Source Selection for SPARQL Endpoint Federation
Computing with Directed Labeled Graphs
Efficient RDF Interchange (ERI) Format for RDF Data Streams
FedX - Optimization Techniques for Federated Query Processing on Linked Data
Efficient source selection for sparql endpoint federation
Visualising the Australian open data and research data landscape
Federated Query Formulation and Processing Through BioFed
Introduction To RDF and RDFS
Ad

Viewers also liked (20)

PPTX
The Rules - SGS
PPT
Ultimate Platform Hotness Smackdown (Twitter, Facebook, iPhone, Native Web / ...
PDF
William Gross Sues Pimco for Hundreds of Millions
PPTX
teaching methods
PPTX
Latin Dansları
PDF
Interactive big data analytics
PDF
Challenging our Notions of Learning: Understanding How Web 2.0 Technology Wor...
DOC
Guia De Estudio Digestivo
PPT
xoxooo tkmmm
PPTX
Balanceo de una ecuación química
PDF
Moving to the Right Side of Safety
PDF
The State of Facilities at Eastern Region Institutions JUNE16
PDF
Tachyon memory centric, fault tolerance storage for cluster framworks
PPTX
Charitable Giving and Happiness
PDF
How to increase traffic to your WordPress website.
PDF
Social media strategies for libraries poster
PDF
Practica 2 quimica organica -espol
PPT
Jvm mbeans jmxtran
PPTX
God Is Forgiving
PPTX
Torque
The Rules - SGS
Ultimate Platform Hotness Smackdown (Twitter, Facebook, iPhone, Native Web / ...
William Gross Sues Pimco for Hundreds of Millions
teaching methods
Latin Dansları
Interactive big data analytics
Challenging our Notions of Learning: Understanding How Web 2.0 Technology Wor...
Guia De Estudio Digestivo
xoxooo tkmmm
Balanceo de una ecuación química
Moving to the Right Side of Safety
The State of Facilities at Eastern Region Institutions JUNE16
Tachyon memory centric, fault tolerance storage for cluster framworks
Charitable Giving and Happiness
How to increase traffic to your WordPress website.
Social media strategies for libraries poster
Practica 2 quimica organica -espol
Jvm mbeans jmxtran
God Is Forgiving
Torque
Ad

Similar to Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data Access and Retrieval (20)

PDF
Do it on your own - From 3 to 5 Star Linked Open Data with RMLio
PPTX
Triplestore and SPARQL
PDF
Graph databases & data integration v2
PDF
Explicit Semantics in Graph DBs Driving Digital Transformation With Neo4j
PDF
HyperGraphQL
PPTX
SWT Lecture Session 10 R2RML Part 1
PDF
Rdf data-model-and-storage
PDF
Triplewave: a step towards RDF Stream Processing on the Web
ODP
State of the Semantic Web
PPT
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
PPT
On the need for a W3C community group on RDF Stream Processing
ZIP
SemWeb Fundamentals - Info Linking & Layering in Practice
PPTX
Querying the Web of Data
PDF
2011 4IZ440 Semantic Web – RDF, SPARQL, and software APIs
PDF
Applying large scale text analytics with graph databases
PDF
XSPARQL CrEDIBLE workshop
PDF
Translation of Relational and Non-Relational Databases into RDF with xR2RML
PDF
Linked Data Fragments
PPTX
A Little SPARQL in your Analytics
PDF
Data Source API in Spark
Do it on your own - From 3 to 5 Star Linked Open Data with RMLio
Triplestore and SPARQL
Graph databases & data integration v2
Explicit Semantics in Graph DBs Driving Digital Transformation With Neo4j
HyperGraphQL
SWT Lecture Session 10 R2RML Part 1
Rdf data-model-and-storage
Triplewave: a step towards RDF Stream Processing on the Web
State of the Semantic Web
OrdRing 2013 keynote - On the need for a W3C community group on RDF Stream Pr...
On the need for a W3C community group on RDF Stream Processing
SemWeb Fundamentals - Info Linking & Layering in Practice
Querying the Web of Data
2011 4IZ440 Semantic Web – RDF, SPARQL, and software APIs
Applying large scale text analytics with graph databases
XSPARQL CrEDIBLE workshop
Translation of Relational and Non-Relational Databases into RDF with xR2RML
Linked Data Fragments
A Little SPARQL in your Analytics
Data Source API in Spark

More from andimou (7)

PDF
What Factors Influence the Design of a Linked Data Generation Algorithm?
PDF
High quality Linked Data generation for librarians
PDF
iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...
PDF
Towards an Interface for User-Friendly Linked Data Generation Administration
PDF
Test-driven Assessment of [R2]RML Mappings to Improve Dataset Quality
PDF
Extraction and Semantic Annotation of Workshop Proceedings in HTML using RML
PPTX
Visualizing the information of a Linked Open Data enabled Research Informatio...
What Factors Influence the Design of a Linked Data Generation Algorithm?
High quality Linked Data generation for librarians
iLastic: Linked Data Generation Workflow and User Interface for iMinds Schola...
Towards an Interface for User-Friendly Linked Data Generation Administration
Test-driven Assessment of [R2]RML Mappings to Improve Dataset Quality
Extraction and Semantic Annotation of Workshop Proceedings in HTML using RML
Visualizing the information of a Linked Open Data enabled Research Informatio...

Recently uploaded (20)

PDF
Electronic commerce courselecture one. Pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Encapsulation theory and applications.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Machine Learning_overview_presentation.pptx
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
cuic standard and advanced reporting.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Empathic Computing: Creating Shared Understanding
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Electronic commerce courselecture one. Pdf
MIND Revenue Release Quarter 2 2025 Press Release
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Encapsulation theory and applications.pdf
MYSQL Presentation for SQL database connectivity
Machine Learning_overview_presentation.pptx
Network Security Unit 5.pdf for BCA BBA.
Reach Out and Touch Someone: Haptics and Empathic Computing
cuic standard and advanced reporting.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Unlocking AI with Model Context Protocol (MCP)
NewMind AI Weekly Chronicles - August'25-Week II
Chapter 3 Spatial Domain Image Processing.pdf
Review of recent advances in non-invasive hemoglobin estimation
A comparative analysis of optical character recognition models for extracting...
Empathic Computing: Creating Shared Understanding
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
gpt5_lecture_notes_comprehensive_20250812015547.pdf

Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data Access and Retrieval