Data Archiving and Networked Services

Linking knowledge
spaces
Christophe Guéret (@cgueret)

DANS is an institute of KNAW and NWO
Take home message
● Best practices for data are so 90’s …
but, no worries, there are alternatives ;-)
● “Linked Data” is not a new data exchange
standard. It is a way to publish and link data
using the Web
● Linked knowledge spaces are richer and
easier to map & explore
Moving back in time…

© Tom Ryan, Flickr
Dealing with documents until 1989
● 4 simple, natural steps (using the Internet):
○ Get a document from a source
○ Find software able to process it
○ Process it and write down links to other documents
○ Keep an eye on updates

● Somewhat cumbersome
○ Authors cannot easily link documents
○ Hard to process & keep up with updates
○ Hard to get the “big picture”
Then came the Web …
● Easy
○ Web browsers display Web documents served by
Web servers and written in a common language

● Convenient
○ Latest version of a document available from the Web
server
○ Links between unique identifiers assigned to Web
documents (Uniform Resource Identifier)

● Scalable
○ Decentralised document publication platform
This was a tremendous success!
● > 40 billion indexed web documents
● Numerous standards and tools
● Dedicated services to find and use
documents
We could hardly go back now
● Would you dare not to create a website for
your research group or yourself?
● Web technologies are reaching out beyond
simple documents
Now it is data that matters

© Luc Legay, Flickr
Dealing with data until, well, now
● 4 simple, natural steps (using the Internet):
○ Get a dataset from a source
○ Find software able to process it
○ Process it and write down links to other datasets
○ Keep an eye on updates

● Somewhat cumbersome
○ Authors cannot easily link datasets
○ Hard to process & keep up with updates
○ Hard to get the “big picture”
Sounds familiar?
● We deal with data the way we dealt with
documents 20 years ago
● Lots of different formats, no links, hard to
have up-to-date data, model de-coupled
from the data...
Linked Data
● 4 design principles, introduced in 2006
○ Use URIs as names for things
○ Use HTTP URIs so that people can look up those
names
○ When someone looks up a URI, provide useful
information, using the standards (RDF*, SPARQL)
○ Include links to other URIs so that they can discover
more things

● Publish data using the Web (not on the Web)
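
As a rough illustration of the second and third principles, the sketch below prepares (without sending) an HTTP look-up of a URI that asks for RDF instead of an HTML page. The DBpedia URI for Lille is the one used in the example later in these slides; the helper name `build_lookup` and the choice of the `text/turtle` media type are assumptions made for this example.

```python
from urllib.request import Request


def build_lookup(uri: str, media_type: str = "text/turtle") -> Request:
    """Prepare an HTTP GET for `uri` that asks for an RDF serialisation.

    Hypothetical helper: nothing is sent over the network until the
    returned request is passed to urllib.request.urlopen().
    """
    return Request(uri, headers={"Accept": media_type})


request = build_lookup("http://dbpedia.org/resource/Lille")
print(request.get_method(), request.full_url)
print("Accept:", request.get_header("Accept"))
```

The same URI names the thing and serves as the address to look it up; the `Accept` header is how a client states which representation it would like back.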
Linked Data
● Packed with good stuff:
○ Open standards
○ HTTP
○ REST
○ De-centralised publication
Concretely...
● Lille is in France and called “Rijsel” in Dutch
○ http://dbpedia.org/resource/Lille → http://dbpedia.org/ontology/country → http://dbpedia.org/resource/France
○ http://dbpedia.org/resource/Lille → http://www.w3.org/2000/01/rdf-schema#label → “Rijsel”@NL
Concretely...
● Lille is in France and called “Rijsel” in Dutch
● The URIs in these statements are links: “Hey! I can click on that too!”
● Part of the data integration is already done!
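
The two statements about Lille can be written down as subject, predicate, object triples. The sketch below is one possible way to hold them in plain Python and print them as N-Triples lines; the variable names and the `to_ntriples` helper are illustrative assumptions, and the helper does not escape special characters in literals.

```python
LILLE = "http://dbpedia.org/resource/Lille"
FRANCE = "http://dbpedia.org/resource/France"
COUNTRY = "http://dbpedia.org/ontology/country"
LABEL = "http://www.w3.org/2000/01/rdf-schema#label"

# Each statement is (subject, predicate, object). An object is either a
# URI string or a (text, language) pair standing for a literal.
statements = [
    (LILLE, COUNTRY, FRANCE),
    (LILLE, LABEL, ("Rijsel", "nl")),
]


def to_ntriples(subject, predicate, obj):
    """Serialise one statement as a line of N-Triples (no escaping)."""
    if isinstance(obj, tuple):
        text, language = obj
        rendered = f'"{text}"@{language}'
    else:
        rendered = f"<{obj}>"
    return f"<{subject}> <{predicate}> {rendered} ."


for statement in statements:
    print(to_ntriples(*statement))
```

Because subject, predicate and the first object are all HTTP URIs, each of them can be looked up in turn, which is what makes the data browsable.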
Linked Data + Open Data = LOD
● 5-star scheme to get from closed data to
open linked data: http://5stardata.info/
LOD + Semantics = Semantic Web
● Say a little about the semantics of your data
and a computer can derive new facts for you
● For instance, “All the cities in France are in
Europe” => “Lille is in Europe”
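
A minimal sketch of this kind of derivation, assuming facts are stored as plain tuples and the rule is hard-coded in a function. The short names (`type`, `country`, `locatedIn`) are stand-ins for full URIs and are not taken from any particular vocabulary; a real system would use a reasoner instead of hand-written rules.

```python
# Facts are (subject, predicate, object) triples; the short names are
# illustrative stand-ins for full URIs.
facts = {
    ("Lille", "type", "City"),
    ("Lille", "country", "France"),
}


def apply_rule(known):
    """Rule: every city whose country is France is located in Europe."""
    cities = {s for (s, p, o) in known if p == "type" and o == "City"}
    derived = {
        (s, "locatedIn", "Europe")
        for (s, p, o) in known
        if p == "country" and o == "France" and s in cities
    }
    # Return the original facts together with the newly derived ones.
    return known | derived


inferred = apply_rule(facts)
print(("Lille", "locatedIn", "Europe") in inferred)
```

The derived statement was never published by anyone; it follows from the two published facts plus the rule.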
Let’s take a step back
● A quick comparison of some features...
Feature          Web of Documents   Web of Data     Any data on the Web
Model            Tree               Statements      Varied
Identifiers      URI                URI             URN + URI
Serialisation    XML                XML, TTL, ...   XML, CSV, ...
Granularity      Page               Statement       Data set
Access           Look up            Look up         Download
Schema           HTML               Varied          Varied
Query language   XQuery / XPath     SPARQL          Varied

● Sweet spot for data integration!
Linking & Mapping knowledge spaces

© Christopher Bulle, Flickr
Mapping knowledge spaces
● Without Linked Data
○ Download individual data sets
○ Integrate them as another data set
○ Map the output
○ (return to the first step on every update)

● With Linked Data
○ Index the different data sources
○ Map the output using “live” data
○ Optionally, cache the data for speed/accessibility
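
The caching step can be sketched as a look-up function wrapped in an in-memory cache. Everything here is an assumption made for illustration: `FAKE_WEB` stands in for the live data sources, `describe` is a hypothetical helper, and a real client would perform an HTTP look-up where the dictionary is read.

```python
from functools import lru_cache

# Stand-in for the Web: maps a URI to the statements a server would return.
# A real client would perform an HTTP look-up here instead.
FAKE_WEB = {
    "http://dbpedia.org/resource/Lille": [
        ("http://dbpedia.org/ontology/country",
         "http://dbpedia.org/resource/France"),
    ],
}
lookups = []  # records every simulated network access


@lru_cache(maxsize=128)
def describe(uri):
    """Return the (predicate, object) pairs for `uri`, caching the result."""
    lookups.append(uri)
    return tuple(FAKE_WEB.get(uri, ()))


describe("http://dbpedia.org/resource/Lille")
describe("http://dbpedia.org/resource/Lille")  # served from the cache
print(len(lookups))
```

The trade-off is freshness: a cached answer is fast and available offline, but it has to be invalidated at some point to keep the map "live".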
Example: Research landscape
● Without: www.narcis.nl
Example: Research landscape
● With: http://narcis-vivo.appspot.com/
○ Dutch + French data
○ Running without data
Live browsing of the Web of Data
● LODLive at http://en.lodlive.it/
Information relevant to FAO efforts
● OpenAGRIS: http://agris.fao.org/openagris/index.do
Take home message
● Modern best practices are so 90’s …
but this can be changed ;-)
● “Linked Data” is not a new data exchange
standard. It is a way to publish and link data
using the Web
● Linked knowledge spaces are richer and
easier to map & explore



