SlideShare a Scribd company logo
Querying the Web of DataKennissystemen, December 2010Rinke Hoekstra
OverviewLinked (Open) DataThe Web of DataScalability issuesTechnologyRDF SyntaxesRDF Storage and QueryingKennissystemen 2010
The Semantic Web IdeologyIdentity is everythingPartial solutions are great too!Layer cakeOWLKennissystemen 2010
The Web of Data… does it exist?Kennissystemen 2010
Linked DataKennissystemen 2010
Semantic WebIntially
`Metadata’ for web pages
Since ~2006
`Web of Data’
Semantic web as data source in its own right
Linked Data
A ‘Databaseesque’ Web
RDF Triple stores
Query languagesKennissystemen 2010
Storage (on the web)As documents.rdf, .n3, .turtle, .htmlRDF triple storesSesame, Joseki, 4Store, AllegroGraph, OpenLink Virtuoso, SDB/TDB, Open Calais, SWI Prolog Reasoners ‘on top’, or via DIGPellet, OWLIM, etc.SPARQL EndpointsResults as JSON, XML, CSV etc.Kennissystemen 2010
Data and the WebNeed to add this ‘meta’ to my ‘data’‘Linking’ data across sitesWeb of Documents and the Web of DataOld fashioned HTML:<link rel='meta' type='application/rdf+xml' href='http://www.leibnizcenter.org/~hoekstra/foaf.rdf' title='FOAF'> URL-basedHTTP 303 `see other’http://www.w3.org/TR/swbp-vocab-pub/RDFaKennissystemen 2010
BBC MusicKennissystemen 2010
Kennissystemen 2010
Kennissystemen 2010
Kennissystemen 2010
Integration: 303 See OtherKennissystemen 2010
Integration: Inline RDFaAttributes on XHTML elementshttp://www.w3.org/TR/xhtml-rdfa-primerKennissystemen 2010
Integration: RDFa ExampleIn XHTML:<?xml version="1.0" encoding="UTF-8"?><!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN"          "http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd"><html xmlns:cal="http://www.w3.org/2002/12/cal/ical#">    <head><title>Jo's Friends and Family Blog</title></head>    <body>    <pinstanceof="cal:Vevent">        I'm holding        <span property="cal:summary">one last summer Barbecue</span>,        on        <span property="cal:dtstart" content="20070916T1600-0500">            September 16th at 4pm.        </span>    </p>    </body></html>In RDF:_:blanknode0rdf:typecal:Vevent; cal:summary ”one last summer Barbecue";cal:dtstart "20070916T1600-0500" .Kennissystemen 2010
Legal InformationRetrievalforLaymenKennissystemen 2010
VoorbeeldKennissystemen 2010
So, where’s that data?I repeat: does it really exist?Kennissystemen 2010
Linked Open DataKennissystemen 2010
November 2009: 13.1 Billion triples, 142 Million links Kennissystemen 2010
September 2010: 25 Miljard triples, 395 Miljoenlinks Kennissystemen 2010
ScalabilityHow to deal with massive amounts of data?Consequences for reasoningBillion Triple Challenge(864.8 Million Triples)Consequences for queryingTable lookups, joins etc.… and what about …Dealing with change, provenance, trust?Kennissystemen 2010
A rough idea…I can crash a DL reasoner using an ontology of ~15 classes and 5 individuals (honestly)What if my ontology contains thousands of classes and billions of individuals?Kennissystemen 2010
ReasoningReasoning with inconsistent knowledgeincomplete knowledgeComplete vs. incomplete reasoningKennissystemen 2010
ReasoningWhen?
Realtime vs. in advance
Lightweight reasoning (RDFS, OWL 2 RL)
Implementable using forward chaining rules
Still problems with scalability
Distributed reasoning (DAS-3)
MaRVIN
‘SpeedDate’ distrubution of triples across nodes
MapReduce
Full closure of BTC in 57 minutes
Output: 30B triples
And what to do with the results?Kennissystemen 2010
2 Degrees from Kevin BaconPREFIX p: http://dbpedia.org/property/SELECT ?film1 ?actor1 ?film2 ?actor2WHERE {  	?film1 p:starring <http://dbpedia.org/resource/Kevin_Bacon> .  	?film1 p:starring ?actor1 .  	?film2 p:starring ?actor1 .  	?film2 p:starring ?actor2 .}DBPedia: 150M triplesKennissystemen 2010
Another rough idea…1 Billion triples in MySQL
Load time
… a couple of hours
Simple table lookup (one-variable query)
… about 5 minutes

More Related Content

PDF
Two graph data models : RDF and Property Graphs
PDF
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
PDF
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
PDF
Debunking some “RDF vs. Property Graph” Alternative Facts
PPTX
4 sw architectures and sparql
PPTX
Semantic web meetup – sparql tutorial
PPTX
A Little SPARQL in your Analytics
PDF
ParlBench: a SPARQL-benchmark for electronic publishing applications.
Two graph data models : RDF and Property Graphs
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
Getty Vocabulary Program LOD: Ontologies and Semantic Representation
Debunking some “RDF vs. Property Graph” Alternative Facts
4 sw architectures and sparql
Semantic web meetup – sparql tutorial
A Little SPARQL in your Analytics
ParlBench: a SPARQL-benchmark for electronic publishing applications.

What's hot (20)

PDF
RDF Tutorial - SPARQL 20091031
PDF
Jesús Barrasa
PDF
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
PDF
Property graph vs. RDF Triplestore comparison in 2020
PDF
LDQL: A Query Language for the Web of Linked Data
PDF
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
PPTX
Triple Stores
PDF
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
PPT
Ks2008 Semanticweb In Action
PDF
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
ODP
Graph Data -- RDF and Property Graphs
PPTX
Semantic Variation Graphs the case for RDF & SPARQL
PDF
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
PPTX
Democratizing Big Semantic Data management
PPT
Ist16-04 An introduction to RDF
PPTX
RDF Data Model
PDF
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
PPTX
Challenges and applications of RDF shapes
PPT
Aidan's PhD Viva
PDF
2011 4IZ440 Semantic Web – RDF, SPARQL, and software APIs
RDF Tutorial - SPARQL 20091031
Jesús Barrasa
Reasoning with Big Knowledge Graphs: Choices, Pitfalls and Proven Recipes
Property graph vs. RDF Triplestore comparison in 2020
LDQL: A Query Language for the Web of Linked Data
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 1 (...
Triple Stores
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 2 (...
Ks2008 Semanticweb In Action
Tutorial "An Introduction to SPARQL and Queries over Linked Data" Chapter 3 (...
Graph Data -- RDF and Property Graphs
Semantic Variation Graphs the case for RDF & SPARQL
Rethinking Online SPARQL Querying to Support Incremental Result Visualization
Democratizing Big Semantic Data management
Ist16-04 An introduction to RDF
RDF Data Model
#sod14 - ok, è un endpoint SPARQL non facciamoci prendere dal panico
Challenges and applications of RDF shapes
Aidan's PhD Viva
2011 4IZ440 Semantic Web – RDF, SPARQL, and software APIs
Ad

Viewers also liked (6)

ODP
Summary of SIGIR 2011 Papers
PDF
Enterprise Use of Twitter by Doug Ross
KEY
Concept- en Definitie Extractie
ODP
SIGIR 2011
PDF
Experience aware Item Recommendation in Evolving Review Communities
PDF
QBer - Connect your data to the cloud
Summary of SIGIR 2011 Papers
Enterprise Use of Twitter by Doug Ross
Concept- en Definitie Extractie
SIGIR 2011
Experience aware Item Recommendation in Evolving Review Communities
QBer - Connect your data to the cloud
Ad

Similar to Querying the Web of Data (20)

PPT
Friday talk 11.02.2011
PPT
Ks2007 Semanticweb In Action
PPTX
Ks2009 Semanticweb In Action
PPTX
Semantic Web and Related Work at W3C
PPT
Integrating Government Data New
ODP
Gist od2-feb-2011
PPTX
Knowledge Graph Introduction
PDF
Data integration with a façade. The case of knowledge graph construction.
PPT
SemanticWeb Nuts 'n Bolts
PPTX
SWT Lecture Session 10 R2RML Part 1
PPTX
A Real-World Implementation of Linked Data
PDF
Short Report Bridges performance gap between Relational and RDF
PPTX
The Web, one huge database ...
PPTX
One day workshop Linked Data and Semantic Web
PPTX
Optimized index structures for querying rdf from the web
PDF
Ivan Herman - Semantic Web Activities @ W3C
PPTX
Using Linked Data to Mine RDF from Wikipedia's Tables
PPT
Re-using Media on the Web: Media fragment re-mixing and playout
PDF
The Semantic Web Client Library - Consuming Linked Data in Your Applications
ODP
Linked opendata parisemantique.fr - 24062011
Friday talk 11.02.2011
Ks2007 Semanticweb In Action
Ks2009 Semanticweb In Action
Semantic Web and Related Work at W3C
Integrating Government Data New
Gist od2-feb-2011
Knowledge Graph Introduction
Data integration with a façade. The case of knowledge graph construction.
SemanticWeb Nuts 'n Bolts
SWT Lecture Session 10 R2RML Part 1
A Real-World Implementation of Linked Data
Short Report Bridges performance gap between Relational and RDF
The Web, one huge database ...
One day workshop Linked Data and Semantic Web
Optimized index structures for querying rdf from the web
Ivan Herman - Semantic Web Activities @ W3C
Using Linked Data to Mine RDF from Wikipedia's Tables
Re-using Media on the Web: Media fragment re-mixing and playout
The Semantic Web Client Library - Consuming Linked Data in Your Applications
Linked opendata parisemantique.fr - 24062011

More from Rinke Hoekstra (20)

PDF
Knowledge Representation on the Web
PDF
Managing Metadata for Science and Technology Studies: the RISIS case
PDF
An Ecosystem for Linked Humanities Data
PDF
Jurix 2014 welcome presentation
PDF
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
PDF
Prov-O-Viz: Interactive Provenance Visualization
PDF
Linkitup: Link Discovery for Research Data
PDF
A Network Analysis of Dutch Regulations - Using the Metalex Document Server
PDF
Linked (Open) Data - But what does it buy me?
PDF
Linked Science - Building a Web of Research Data
PDF
COMMIT/VIVO
PDF
Semantic Representations for Research
PDF
A Slightly Different Web of Data
PDF
The Knowledge Reengineering Bottleneck
PDF
Linked Census Data
KEY
SIKS 2011 Semantic Web Languages
PDF
The MetaLex Document Server - Legal Documents as Versioned Linked Data
PPTX
History of Knowledge Representation (SIKS Course 2010)
PPTX
Making Sense of Design Patterns
PPTX
Publicatie van Linked Open Overheids Data
Knowledge Representation on the Web
Managing Metadata for Science and Technology Studies: the RISIS case
An Ecosystem for Linked Humanities Data
Jurix 2014 welcome presentation
Provenance and Reuse of Open Data (PILOD 2.0 June 2014)
Prov-O-Viz: Interactive Provenance Visualization
Linkitup: Link Discovery for Research Data
A Network Analysis of Dutch Regulations - Using the Metalex Document Server
Linked (Open) Data - But what does it buy me?
Linked Science - Building a Web of Research Data
COMMIT/VIVO
Semantic Representations for Research
A Slightly Different Web of Data
The Knowledge Reengineering Bottleneck
Linked Census Data
SIKS 2011 Semantic Web Languages
The MetaLex Document Server - Legal Documents as Versioned Linked Data
History of Knowledge Representation (SIKS Course 2010)
Making Sense of Design Patterns
Publicatie van Linked Open Overheids Data

Recently uploaded (20)

PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
Big Data Technologies - Introduction.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
Spectroscopy.pptx food analysis technology
PDF
Machine learning based COVID-19 study performance prediction
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
cuic standard and advanced reporting.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Empathic Computing: Creating Shared Understanding
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
“AI and Expert System Decision Support & Business Intelligence Systems”
Big Data Technologies - Introduction.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
Review of recent advances in non-invasive hemoglobin estimation
Encapsulation_ Review paper, used for researhc scholars
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Programs and apps: productivity, graphics, security and other tools
Spectroscopy.pptx food analysis technology
Machine learning based COVID-19 study performance prediction
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Reach Out and Touch Someone: Haptics and Empathic Computing
20250228 LYD VKU AI Blended-Learning.pptx
cuic standard and advanced reporting.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
sap open course for s4hana steps from ECC to s4
Empathic Computing: Creating Shared Understanding
The AUB Centre for AI in Media Proposal.docx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Understanding_Digital_Forensics_Presentation.pptx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...

Querying the Web of Data