SlideShare a Scribd company logo
Frank van Harmelen
All the questions
we couldn’t ask
10 years ago
Creative Commons License:
allowed to share & remix,
but must attribute & non-commercial
The bad news:
you’re going to get 3 talks
1. Where are we now?
– The Semantic Web in 4 principles & a movie
– Did we get anywhere?
2. Now what?
– Questions we couldn’t ask 10 years ago
3. Methodological hobby horse
– Science or engineering?
Semantic Web:
What is it?
a web page
in English
about
Frank
And this
page is
about
LarKC
and another
web page
about
Frank
And this
page is
about
Stefano
This page
is about
the Vrije
Uniersitei
“The Semantic Web” a.k.a. “The Web of Data”
http://www.youtube.com/watch?v=tBSdYi4EY3s
P1. Give all things a name
P2. Relations form a graph
between things
P3. The names are addresses on the Web
x T
[<x> IsOfType <T>]
different
owners & locations
<analgesic>
P1+P2+P3 = Giant Global Graph
P4. explicit & formal semantics
• assign types to things
• assign types to relations
• organise types in a hierarchy
• impose constraints on
possible interpretations
Examples of “semantics”
Frank Lynda
married-to
• Frank is male
• married-to relates
males to females
• married-to relates
1 male to 1 female
• Lynda = Hazel
lowerbound upperbound
Hazel
Semantic Web:
Where are we now?
Did we get anywhere?
• Google = meaningful search
• NXP = data integration
• BBC = content re-use
• Wallmart= SEO (RDF-a)
• data.gov = data-publishing
Semantic Web questions we couldn't ask 10 years ago
NXP: data integration
about 26.000 products
Triple store
Triple store
Departments
Customers
Notice the 3-layer architecture
BBC
Notice the 3-layer architecture
Semantic Web questions we couldn't ask 10 years ago
Semantic Web questions we couldn't ask 10 years ago
Semantic Web questions we couldn't ask 10 years ago
Semantic Web questions we couldn't ask 10 years ago
Did we get anywhere?
• Google = meaningful search
• NXP = data integration
• BBC = content re-use
• BestBuy = SEO (RDF-a)
• data.gov = data-publishing
Oracle DB, IBM DB2
Reuters,
New York Times, Guardian
Sears, Kmart, OverStock,
Volkswagen, Renault
GoodRelations ontology,
schema.org
Size Matters: 25-45 billion facts
The questions
that we couldn’t ask
10 years ago
• Heterogeneity
• Self-organisation, long tails
• Distribution
• Provenance & trust
• Dynamics
• Errors & Noise
• Scale
heterogeneity
is unavoidable
•Linguistic,
•Structural,
•Logical,
•Statistical,
....
Socio-
economic
first to
market
market-
share
Self-organisation
Self-organisation
Self-organisation
Self-organisation
Self-organisation
Bio-medical
ontologies in
Bio-portal > 5 links
Self-organisation
knowledge follows
a long-tail
incidental
or universal?
impact on
mapping?
impact on
reasoning?
impact on
storage?
Distribution
Caching?
Subgraphs?
Payload
priority?
query-
planning?
Provenance
Representation?
From provenance
to trust?
(Re)construction?
knowledge about
knowledge?
Dynamics
Streams? Incremental
reasoning?
Non-
monotonicity?
versioning?
Errors & noise
Maximally
consistent
subsets?
Fuzzy
Semantics?
Uncertainty
Semantics?
Rough
Semantics?
Modules?
Repair?
Argumentation?
Maximally
consistent
subsets?
Modules?
Repair?
Argumentation?
Fuzzy
Semantics?
Uncertainty
Semantics?
Rough
Semantics?
Streams?
Incremental
reasoning?
Non-
monotonicity?
versioning?
Representation?
From provenance
to trust?
(Re)construction?
knowledge about
knowledge?
Caching?
Subgraphs?
Payload
priority?
incidental
or universal?
impact on
mapping?
impact on
reasoning?
impact on
storage?
Socio-
economic
first to
market
market-
share
Methodological
Hobby horse
Laws about the physical universe
Laws about the information universe ?
knowledge follows
a long-tail
Law: F = a-br
Law: |T|<< |A|
T = terminological knowledge
A = assertional knowledge
Dataset Closure of
T
Closure of
T + A
Ratio
LUBM 8sec 1h15min 562
Linked Life Data 332sec 1h05min 11
FactForge 89sec 2h45min 111
We don’t have any good laws on complexity
Semantic Web questions we couldn't ask 10 years ago
Semantic Web questions we couldn't ask 10 years ago

More Related Content

PPTX
The Empirical Turn in Knowledge Representation
PPTX
The Web of Data: do we actually understand what we built?
PPTX
Empirical Semantics
PPTX
The end of the scientific paper as we know it (in 4 easy steps)
PPTX
The end of the scientific paper as we know it (or not...)
PDF
Open data and linked data
PPTX
2011 05-02 linked data intro
PPTX
2011 05-01 linked data
The Empirical Turn in Knowledge Representation
The Web of Data: do we actually understand what we built?
Empirical Semantics
The end of the scientific paper as we know it (in 4 easy steps)
The end of the scientific paper as we know it (or not...)
Open data and linked data
2011 05-02 linked data intro
2011 05-01 linked data

What's hot (19)

PPTX
Development of Semantic Web based Disaster Management System
PPTX
Australian Open government and research data pilot survey 2017
PPTX
Visualising the Australian open data and research data landscape
PPTX
Consuming Linked Data SemTech2010
PPTX
Developing Linked Data and Semantic Web-based Applications (Expotec 2015)
PDF
Linked Data Snowball, or Why We Need Reconciliation
PDF
Big Data LDN 2017: Machine Learning on Structured Data. Why Is Learning Rules...
PDF
Pandas, Data Wrangling & Data Science
PPT
The Semantic Web: 2010 Update
PPT
Information Extraction and Linked Data Cloud
PDF
Web of Data and its Status on Persian Web Data Space
PPTX
How the Web can change social science research (including yours)
ODP
What the Adoption of schema.org Tells about Linked Open Data
PDF
Network Analysis: People and Open Source Communities - LinuxCon Seattle and D...
PDF
Network Relationships and Job Changes of Software Developers at Sunbelt 2016
PDF
R, Data Wrangling & Kaggle Data Science Competitions
PDF
Linked Data Tutorial
PDF
From the Semantic Web to the Web of Data: ten years of linking up
PDF
Linked Open Data
Development of Semantic Web based Disaster Management System
Australian Open government and research data pilot survey 2017
Visualising the Australian open data and research data landscape
Consuming Linked Data SemTech2010
Developing Linked Data and Semantic Web-based Applications (Expotec 2015)
Linked Data Snowball, or Why We Need Reconciliation
Big Data LDN 2017: Machine Learning on Structured Data. Why Is Learning Rules...
Pandas, Data Wrangling & Data Science
The Semantic Web: 2010 Update
Information Extraction and Linked Data Cloud
Web of Data and its Status on Persian Web Data Space
How the Web can change social science research (including yours)
What the Adoption of schema.org Tells about Linked Open Data
Network Analysis: People and Open Source Communities - LinuxCon Seattle and D...
Network Relationships and Job Changes of Software Developers at Sunbelt 2016
R, Data Wrangling & Kaggle Data Science Competitions
Linked Data Tutorial
From the Semantic Web to the Web of Data: ten years of linking up
Linked Open Data
Ad

Similar to Semantic Web questions we couldn't ask 10 years ago (20)

PDF
Contextual Computing: Laying a Global Data Foundation
PDF
Contextual Computing - Knowledge Graphs & Web of Entities
PPTX
What happened to the Semantic Web?
PPTX
NISO Virtual Conference: The Semantic Web Coming of Age: Technologies and Imp...
PDF
Schema.org: Where did that come from!
PPTX
Scraping talk public
PPTX
Semantic Web and Schema.org
PPT
20140521 sem-tech-biz-guest-lecture
PPTX
What is the Semantic Web
PPTX
Van de droom van het Semantic Web naar de realiteit van Linked Open
PDF
Structured data: Where did that come from & why are Google asking for it
PPTX
Introduction to Information Retrieval
PDF
GLAMorous LOD and ResearchSpace introduction
PPTX
Mining Web content for Enhanced Search
PPSX
The Web of data and web data commons
PPTX
Text Mining, Term Mining, and Visualization - Improving the Impact of Scholar...
PDF
GLAMorous LOD
PDF
Schema.org Structured data the What, Why, & How
PPT
Semantic Search overview at SSSW 2012
PDF
II-SDV 2012 Text Mining, Term Mining and Visualization - Improving the Impac...
Contextual Computing: Laying a Global Data Foundation
Contextual Computing - Knowledge Graphs & Web of Entities
What happened to the Semantic Web?
NISO Virtual Conference: The Semantic Web Coming of Age: Technologies and Imp...
Schema.org: Where did that come from!
Scraping talk public
Semantic Web and Schema.org
20140521 sem-tech-biz-guest-lecture
What is the Semantic Web
Van de droom van het Semantic Web naar de realiteit van Linked Open
Structured data: Where did that come from & why are Google asking for it
Introduction to Information Retrieval
GLAMorous LOD and ResearchSpace introduction
Mining Web content for Enhanced Search
The Web of data and web data commons
Text Mining, Term Mining, and Visualization - Improving the Impact of Scholar...
GLAMorous LOD
Schema.org Structured data the What, Why, & How
Semantic Search overview at SSSW 2012
II-SDV 2012 Text Mining, Term Mining and Visualization - Improving the Impac...
Ad

More from Frank van Harmelen (20)

PPTX
Neuro-symbolic is not enough, we need neuro-*semantic*
PPTX
The K in "neuro-symbolic" stands for "knowledge"
PPTX
Adoption of Knowledge Graphs, mid 2022 (incomplete)
PPTX
Modular design patterns for systems that learn and reason: a boxology
PPTX
Adoption of Knowledge Graphs, late 2019
PPTX
Adoption of Knowledge Graphs, mid 2019
PPTX
On the nature of AI, and the relation between symbolic and statistical approa...
PPTX
Linked Open Data for Medical Guidelines Interactions
PPT
Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...
PPTX
Informatics is a natural science
PPTX
4 Popular Fallacies about the Semantic Web
PPT
PPT
Het slimme Web 3.0
PPT
OWL briefing
PPT
RDF briefing
PPT
Semantic Web research anno 2006:main streams, popular falacies, current statu...
PPT
Ontology mapping needs context & approximation
PPT
Ontology Mapping - Out Of The Babel Tower
PPT
Where Does It Break?
PPT
LarKC: the large knowledge collider
Neuro-symbolic is not enough, we need neuro-*semantic*
The K in "neuro-symbolic" stands for "knowledge"
Adoption of Knowledge Graphs, mid 2022 (incomplete)
Modular design patterns for systems that learn and reason: a boxology
Adoption of Knowledge Graphs, late 2019
Adoption of Knowledge Graphs, mid 2019
On the nature of AI, and the relation between symbolic and statistical approa...
Linked Open Data for Medical Guidelines Interactions
Knowledge Engineering rediscovered, Towards Reasoning Patterns for the Semant...
Informatics is a natural science
4 Popular Fallacies about the Semantic Web
Het slimme Web 3.0
OWL briefing
RDF briefing
Semantic Web research anno 2006:main streams, popular falacies, current statu...
Ontology mapping needs context & approximation
Ontology Mapping - Out Of The Babel Tower
Where Does It Break?
LarKC: the large knowledge collider

Recently uploaded (20)

PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PPTX
GDM (1) (1).pptx small presentation for students
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PDF
RMMM.pdf make it easy to upload and study
PDF
01-Introduction-to-Information-Management.pdf
PDF
Computing-Curriculum for Schools in Ghana
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
Classroom Observation Tools for Teachers
PDF
Sports Quiz easy sports quiz sports quiz
PDF
Pre independence Education in Inndia.pdf
PPTX
Pharma ospi slides which help in ospi learning
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
TR - Agricultural Crops Production NC III.pdf
PPTX
Cell Structure & Organelles in detailed.
PPTX
Institutional Correction lecture only . . .
PPTX
Final Presentation General Medicine 03-08-2024.pptx
Microbial diseases, their pathogenesis and prophylaxis
Abdominal Access Techniques with Prof. Dr. R K Mishra
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
GDM (1) (1).pptx small presentation for students
2.FourierTransform-ShortQuestionswithAnswers.pdf
RMMM.pdf make it easy to upload and study
01-Introduction-to-Information-Management.pdf
Computing-Curriculum for Schools in Ghana
VCE English Exam - Section C Student Revision Booklet
Classroom Observation Tools for Teachers
Sports Quiz easy sports quiz sports quiz
Pre independence Education in Inndia.pdf
Pharma ospi slides which help in ospi learning
Renaissance Architecture: A Journey from Faith to Humanism
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
TR - Agricultural Crops Production NC III.pdf
Cell Structure & Organelles in detailed.
Institutional Correction lecture only . . .
Final Presentation General Medicine 03-08-2024.pptx

Semantic Web questions we couldn't ask 10 years ago

Editor's Notes

  • #14: @TODO@: do a slide on data-integration at NXP@TODO@: find a slide on RDF-a in Wallmart etc
  • #22: @TODO@: do a slide on data-integration at NXP@TODO@: find a slide on RDF-a in Wallmart etc@TODO@: replace company names with logo’s?
  • #25: @@ Add: trust@@Add: noisy data (inconsistency, misleading, incomplete)
  • #27: Suggests to let a 1000 ontologies blossom, to have lots of connections between lots of datasets.
  • #33: Some known information laws already apply:Zipf law / long tail distributions are everywhere= vast majority of occurrences are caused by a vast minority of itemsthis phenomen is sometimes a blessing, sometimes a cursenice for compressionawful for load balancingand knowing the law helps us deal with the phenomenonthat’s why it’s worth trying to discover these laws.
  • #34: @add another long-tail example@ (e.g. in-degree?)
  • #35: Physical distribution doesn’t work the web is not a database (and never will be)@@ADD: even worse for long tail
  • #41: - Compare to physics laws: gravity F = G m_1 m_2 / r^2 conservation of energy (dE/dt = 0), increase of entropy (dS/dt \geq 0),we cannot yet hope for such beautifully mathematised laws,in such a concise language that fits on a very compact space computer science is like alchemy, a &quot;protoscience&quot;
  • #42: Some known information laws already apply:Zipf law / long tail distributions are everywhere= vast majority of occurrences are caused by a vast minority of itemsthis phenomen is sometimes a blessing, sometimes a cursenice for compressionawful for load balancingand knowing the law helps us deal with the phenomenonthat’s why it’s worth trying to discover these laws.
  • #43: this only works because terminologies are in general only simple hierarchies. (it’s easy to build examples where this doesn’t hold, but in practice it turns out to hold).So, this law depends on the previous lawas an aside: the graph is now big enough to do statistics on it.
  • #45: use complexity” as a measure, not just “size”. spell out LLD,don’t break FactForge
  • #46: - Semantic Web = engineering enterprise.- This talk = what are the scientific observations/facts/theories after 10 yearsWhat are the big CS (or: KR?) lessons we can learn from a decade of SemWeb?(= regard SemWeb adoption as a giant laboratory for CS laws)Did we learn any science? (and of course the laws won’t be specific to SemWeb? Hopefully not. Hopefully they are generic laws about the structure and behaviour of informaiton!)
  • #47: a gazillion new open questionsdon’t just try to build things, also try to understand thingsdon’t just ask how, also ask why