Publishing and Consuming Linked Data.
(Lessons learnt when using LOD in an application)
Marta Villegas
Universitat Pompeu Fabra
Cercedilla, June 2015
IULA-UPF scenario
OLAC Language Resource Catalogue
OAI-PMH server
Metadata formats: Dublin Core, Metashare, OLAC
Metadata harvesting
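As a rough illustration of the harvesting step, an OAI-PMH harvester asks the server for records with a `ListRecords` request and a `metadataPrefix`. The sketch below only builds such a request URL. The base URL is a placeholder, not the IULA-UPF endpoint. `oai_dc` is the Dublin Core prefix every OAI-PMH server must support; any other prefix depends on what the server actually exposes.

```python
from urllib.parse import urlencode


def list_records_url(base_url, metadata_prefix, set_spec=None):
    """Build an OAI-PMH ListRecords request URL (no network access here)."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    if set_spec:
        # Optional selective harvesting by set.
        params["set"] = set_spec
    return f"{base_url}?{urlencode(params)}"


# Hypothetical endpoint, used only for illustration.
print(list_records_url("http://example.org/oai", "oai_dc"))
```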
IULA-UPF moving to LOD
Objectives:
- Displaying data to the user in a comprehensive way
- Aggregating external data in a sensible manner
- Making hidden implicit relations explicit.
Triple store (Virtuoso) http://lodserver.iula.upf.edu
SPARQL server (Virtuoso) http://lodserver.iula.upf.edu/sparql
Web Browser (RoR + SPARQL) http://lod.iula.upf.edu/
RDF
When the focus shifts from growing the cloud to
deploying applications
• Complex types (identity resolution)
• Simple types (as instances)
• Linking data (linking vs. reusing)
• Data enrichment
• Approach: an incremental process (a first batch, then a curation process)
RDFying – index
RDFying – complex instances
<Document>
<Person>
<Organisation>
<Project>
<LangResourceInfo>
<identificationInfo>
<distributionInfo>
<contactPerson>
<metadataInfo>
<validationInfo>
<resourceDocumentationInfo>
<resourceCreationInfo>
<resourceComponentType>
</LangResourceInfo>
RDFying – complex instances
<langResource-URI-1>
<langResource-URI-2>
<langResource-URI-3>
<langResource-URI-n>
<person-URI-1>
<person-URI-2>
<person-URI-3>
=?
Identity
resolution
<contactPerson>
<surname>Monachini</surname>
<givenName>Monica</givenName>
<communicationInfo>
<email>monica.monachini@ilc.cnr.it</email>
<email>risorse@ilc.cnr.it</email>
<url>http://www.ilc.cnr.it/</url>
<address>Via Giuseppe Moruzzi</address>
<zipCode>56124</zipCode>
<city>Pisa</city>
<country>Italy</country>
</communicationInfo>
<affiliation>
<organizationName>………</organizationName>
<departmentName>Istituto …</departmentName>
<communicationInfo>
</affiliation>
</contactPerson>
http://…/Monica_Monachini
<contactPerson>
<surname>Monachini</surname>
<givenName>Monica</givenName>
<communicationInfo>
<email>monica.monachini@ilc.cnr.it</email>
<email>risorse@ilc.cnr.it</email>
<url>http://www.ilc.cnr.it/</url>
<address>Via Giuseppe Moruzzi</address>
<zipCode>56124</zipCode>
<city>Pisa</city>
<country>Italy</country>
</communicationInfo>
<affiliation>
<organizationName>………</organizationName>
<departmentName>Istituto …</departmentName>
<communicationInfo>
</affiliation>
</contactPerson>
http://…/Monica_Monachini
<fundingProject>
<projectName> Platform for Automatic, Normalised
Annotation and Cost-Effective Acquisition of
Language Resources for Human Languages
Technologies </projectName>
<projectShortName> PANACEA </projectShortName>
<url> http://panacea-lr.eu/ </url>
<fundingType> euFunds </fundingType>
<funder> European Union </funder>
</fundingProject>
<organizationInfo>
<organizationName> Consiglio Nazionale delle
Ricerche. Istituto di Linguistica Computazionale
“Antonio Zampolli” </organizationName>
<organizationShortName>CNR</organizationShortName>
…
For each embedded Project/Person/Organization:
1. Generate a "Subject property URI" triple for the backwards relation.
The URI is built as follows:
– If Person then use “name_givenName”
– If “short name” exists use “shortname”
– Else use the first 20 characters of “long name”
2. Generate "URI property object" triples as the union of all local
declarations (where union removes duplicate triples).
– This requires a final curation task that agrees on node values
in case they differ.
– The preliminary version needs further curation (we used
SPARQL SELECT DISTINCT to identify oddities)
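The two steps above can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the code used at IULA-UPF: the field names (`givenName`, `surname`, `shortName`, `name`) and the given-name-first order for persons (taken from the Monica_Monachini example) are assumptions, and declarations are modelled as plain lists of (property, value) pairs.

```python
from urllib.parse import quote


def local_name(kind, record):
    """Pick the label used to mint a URI, following the three rules above."""
    if kind == "Person":
        # Assumption: given name first, as in the Monica_Monachini example.
        label = f"{record['givenName']}_{record['surname']}"
    elif record.get("shortName", "").strip():
        label = record["shortName"]
    else:
        # Fall back to the first 20 characters of the long name.
        label = record["name"][:20]
    return quote(label.strip().replace(" ", "_"))


def merge_declarations(uri, declarations):
    """Union of all local declarations; a set drops duplicate triples."""
    return {(uri, prop, value) for decl in declarations for prop, value in decl}


def needs_curation(triples):
    """Properties that ended up with more than one distinct value."""
    by_prop = {}
    for _subject, prop, value in triples:
        by_prop.setdefault(prop, set()).add(value)
    return {p: sorted(v) for p, v in by_prop.items() if len(v) > 1}
```

Properties that are legitimately multi-valued (a person with two e-mail addresses, say) will also show up in `needs_curation`, so its output is a list of candidates for a human to review rather than a list of errors.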
RDFying Documents:
- DBLP to get full RDF descriptions
- Google Scholar to get BibTeX descriptions
- For a small dataset this effort is affordable. For big
datasets this needs a lot of work (some automatic
tasks may be defined)
<document>Quochi V, Frontini F, Rubino F. A MWE
Acquisition and Lexicon Builder Web Service. COLING 2012,
24th International Conference on Computational
Linguistics, Proceedings of the Conference: Technical
Papers,8-15 December 2012, Mumbai, India</document>
RDFying - Where to stop?
BIBTEX:
@inproceedings{quochi2012mwe,
title={A MWE Acquisition and Lexicon Builder Web Service.},
author={Quochi, Valeria and Frontini, Francesca and Rubino, Francesco},
booktitle={COLING},
year={2012}}
DBLP
<http://dblp.uni-trier.de/rec/conf/coling/QuochiFR12>
owl:sameAs <http://dblp.org/rec/conf/coling/QuochiFR12> ;
dblp:title "A MWE Acquisition and Lexicon Builder Web Service" ;
dblp:authoredBy <http://dblp.uni-trier.de/pers/q/Quochi:Valeria> ;
dblp:authoredBy <http://dblp.uni-trier.de/pers/f/Frontini:Francesca> ;
dblp:authoredBy <http://dblp.uni-trier.de/pers/r/Rubino:Francesco> ;
dblp:publishedAsPartOf <http://dblp.uni-trier.de/rec/conf/coling/2012> ;
dblp:yearOfPublication "2012" .
Article
title
creator Mikel Forcada
subject discourse analysis, question answering
keywords NER, LMF, ...
references FreeLing, TreeBank, PANACEA ...
language English
RDFying- simple types
<subject>Gender Studies</subject>
<usage>NER</usage>
<format>XCES</format>
<standard>LMF</standard>
Not only Enumerations but also string
elements !!!
RDFying - simple types as instances
Value   Value counter   Resource counter
eng     518             476
en      215             174
EN      120             120
Spa     390             376
es      77              71
ES      10              10
Language codes in MS central node
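To see how much of this variation is purely a matter of letter case, the value counters can be folded together. The sketch below copies the rows from the table and sums the value counters per case-folded code. Resource counters are left out on purpose, since one resource may carry several variants and summing them could count it twice.

```python
from collections import defaultdict

# (value, value counter, resource counter) rows copied from the table above.
ROWS = [
    ("eng", 518, 476),
    ("en", 215, 174),
    ("EN", 120, 120),
    ("Spa", 390, 376),
    ("es", 77, 71),
    ("ES", 10, 10),
]


def fold_case(rows):
    """Sum value counters for codes that differ only in letter case."""
    totals = defaultdict(int)
    for value, value_count, _resource_count in rows:
        totals[value.casefold()] += value_count
    return dict(totals)


print(fold_case(ROWS))
# {'eng': 518, 'en': 335, 'spa': 390, 'es': 87}
```

Case folding merges `en`/`EN` and `es`/`ES`, but `eng` and `en` remain separate values: deciding that they denote the same language still needs a mapping, which is the curation work this deck describes.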
Enumerations:
object property + Class + instances +
checking existing vocabularies
‘free strings’:
1) generate a datatype property + string value.
2) run a curation process that:
a) identifies ‘enumeration-like’ candidates (e.g.
language) and chooses an appropriate vocabulary
b) matches value strings to relevant URIs (DBpedia)
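One possible heuristic for step 2a is to flag string-valued properties that are used often but take only a handful of distinct values. The sketch below is an assumption about how such candidates could be spotted, not the procedure used in the project; the thresholds `max_distinct` and `min_uses` are arbitrary illustrative defaults.

```python
from collections import defaultdict


def enumeration_candidates(pairs, max_distinct=10, min_uses=20):
    """Return properties whose string values look like a closed list.

    `pairs` is an iterable of (property, string value) tuples.
    """
    values = defaultdict(set)
    uses = defaultdict(int)
    for prop, value in pairs:
        # Compare values ignoring surrounding blanks and letter case.
        values[prop].add(value.strip().casefold())
        uses[prop] += 1
    return sorted(
        prop
        for prop in values
        if uses[prop] >= min_uses and len(values[prop]) <= max_distinct
    )
```

A property such as language, with many uses and few distinct values, would be flagged; a free-text description, where nearly every value is unique, would not.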
RDFying - simple types as instances
SELECT DISTINCT ?language
WHERE { ?s ms:languageId ?language }
(eng , en , EN …)
DELETE { ?s ms:language "EN" . }
INSERT { ?s ms:language <http://.../English> . }
WHERE { ?s ms:language "EN" . }
Curation using SPARQL
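When many literal variants map to the same URI, the update statements can be generated from a small mapping table instead of being written by hand. The sketch below only builds SPARQL Update strings and does not talk to any server. The target URIs under `http://example.org/lang/` are placeholders, and the `ms:` prefix is assumed to be declared wherever the updates are run.

```python
# Hypothetical mapping: target URI -> literal variants found in the data.
VARIANTS = {
    "http://example.org/lang/English": ["eng", "en", "EN"],
    "http://example.org/lang/Spanish": ["Spa", "es", "ES"],
}


def build_update(prop, literal, uri):
    """One DELETE/INSERT update replacing a string literal with a URI."""
    escaped = literal.replace("\\", "\\\\").replace('"', '\\"')
    return (
        f'DELETE {{ ?s {prop} "{escaped}" }}\n'
        f"INSERT {{ ?s {prop} <{uri}> }}\n"
        f'WHERE  {{ ?s {prop} "{escaped}" }}'
    )


def build_all(prop, variants):
    """One update per literal variant in the mapping table."""
    return [
        build_update(prop, literal, uri)
        for uri, literals in variants.items()
        for literal in literals
    ]


for statement in build_all("ms:language", VARIANTS):
    print(statement, end="\n\n")
```

Keeping the mapping in one table also documents the curation decisions, so they can be reviewed or re-applied after the next harvest.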
RDFying - simple types as instances
Linking data !!
Person
Organization
Document
Project
Enumerations
String valued
VIAF
ORCID
DBLP
Vocabularies
DBpedia
Linking data !! – linking vs reusing
documentation sameAs
documentation
Linking data !! – linking vs reusing
http://lod.iula.upf.edu/resources/PAN_metadata_MW_ENV_IT
http://lod.iula.upf.edu/resources/doc_37
local
URIs
external
URIs
Core concepts which belong to
some ‘local’ Class.
Instances which belong to some
‘external’ Class:
• Person (FOAF)
• Document (BIBO)
• Organisation (FOAF)
•….
But, some functional reasons:
Why all this ? Is it worth it?
- Displaying data to the user in a comprehensive way
- Aggregating external data in a sensible manner
- Making hidden implicit relations explicit.
<usage>NER</usage>
<format>XCES</format>
<standard>LMF</standard>
Any good article or tool ?
NER
Projects
Services Articles
Reports
Named
Entity
SELECT * WHERE { ?s ?p ms:NER }
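The query above asks for everything that points at the NER instance, whatever the property. The same backward lookup can be illustrated over a small in-memory list of triples. All identifiers in the sample data (`ex:article_1`, `ex:service_7`, `ms:usage`, `ms:keyword`, `ms:format`) are made up for the example.

```python
def incoming(triples, target):
    """All (subject, property) pairs whose object is `target`."""
    return [(s, p) for s, p, o in triples if o == target]


# Hypothetical sample data.
TRIPLES = [
    ("ex:article_1", "ms:keyword", "ms:NER"),
    ("ex:service_7", "ms:usage", "ms:NER"),
    ("ex:service_7", "ms:format", "ms:XCES"),
]

print(incoming(TRIPLES, "ms:NER"))
# [('ex:article_1', 'ms:keyword'), ('ex:service_7', 'ms:usage')]
```

This only works because NER is an instance with its own identifier. If it were stored as a free string, each spelling variant would need its own lookup.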
IULA?
10!
Why all this ? – IULA at MS central node
IULA?
104
[Figure: relational schema. Tables PERSON, ORGANISATION, RESOURCE, LICENSE, DOCUMENT and PROJECT (each with ID, name, description, ...) linked through Has_ tables that hold pairs of IDs]
SELECT * FROM WHERE { … ...}HELP!!
Everything about IULA?
HELP!!
SELECT * WHERE { ?s ?p "IULA" }
sample data
(855 records)
Why all this ? – data Mashups
Backwards relations
• LOD opens new possibilities and SPARQL is a powerful tool
BUT
• Curation task is crucial and effort/time consuming. You can
address it as an incremental process.
Publishing LOD vs. deploying LOD applications
• Until now, the LOD community seems to focus on “growing
the cloud”
• In this scenario, creating new URIs and mapping to existing
URIs is OK but,
• when the focus shifts from growing the cloud to developing
applications, new problems arise: massive redundancy of
URIs, trust in third-party servers/data, …
Conclusions & reflections
