SlideShare a Scribd company logo
Heterogeneity is Here to Stay and
Semantics is not about Agreement
Krzysztof Janowicz
STKO Lab University of California, Santa Barbara, USA
EarthCube All-Hands Meeting
Addressing Data Heterogeneity 2014
June 2014
Heterogeneity is Here to Stay and Semantics is not about Agreement K. Janowicz
The Data Retrieval Problem Is Real
Even the major data hubs such as Data.gov still rely on keyword-based search
and have unreliable, incomplete, and missing metadata. For this type of
retrieval problems, even ’a little semantics goes a long way’ (Hendler 1997).
Heterogeneity is Here to Stay and Semantics is not about Agreement K. Janowicz
Sensemaking is Difficult – Fitness for Puspose is Key
There is no shortage of data, but
finding data that is fit for a certain
purpose is difficult.
Data as statements (think RDF) not
as truth.
Heterogeneity is caused by cultural
differences, progress in science,
viewpoints, granularity, ...
Alchemist Fallacy1; semantics
does not come for free.
Lack of provenance information
Sensemaking requires more
powerful semantic technologies and
ontologies (compared to IR).
1You cannot transmute base metals into gold and even if you could, gold would not be precious anymore. Recall the data citation discussion.
Heterogeneity is Here to Stay and Semantics is not about Agreement K. Janowicz
Meaningful Analysis and Synthesis is Difficult
Ensuring that data is analyzed and
combined in a meaningful way is far
from trivial.
What if the information on how to
use the data would come together
with these data?
Focus on smart data instead of
(merely on) smart applications.
The purpose of ontologies is not to
agree on the meaning of terms but to
make the data provider’s intended
meaning explicit.
A little experiment: The statement all rivers flow into other water bodies
is not useful because it is ’true’2, but because...?
2It is not; rivers can flow into the ground or just dry up entirely before reaching another water body.
Heterogeneity is Here to Stay and Semantics is not about Agreement K. Janowicz

More Related Content

PDF
Multi perspective Ontology Engineering
PDF
Big Geo Data
PDF
AAG 2014 Talk on Ontology Views, Reusue, Alignment
PDF
Why the Data Train Needs Semantic Rails -- The Case of Linked Scientometrics ...
PDF
Ontology alignment representation
PDF
Linked (Data) Scientometrics Keynote
PDF
Debiasing Knowledge Graphs: Why Female Presidents are not like Female Popes
PDF
Golledge Lecture May 2018
Multi perspective Ontology Engineering
Big Geo Data
AAG 2014 Talk on Ontology Views, Reusue, Alignment
Why the Data Train Needs Semantic Rails -- The Case of Linked Scientometrics ...
Ontology alignment representation
Linked (Data) Scientometrics Keynote
Debiasing Knowledge Graphs: Why Female Presidents are not like Female Popes
Golledge Lecture May 2018

More from kjanowicz (15)

PDF
How “Alternative" are Alternative Facts? Towards Measuring Statement Coherenc...
PDF
Geo-Humanities 2017 Keynote at SIGSPATIAL 2017
PDF
Building Blocks for Distributed Geo-Knowledge Graphs
PDF
Ontology Engineering: A View from the Trenches - WOP 2015 Keynote
PDF
Exploring the Data Universe with Semantic Signatures: Plous Lecture 2015
PDF
Pattern-based Ontology Engineering
PDF
GeoVoCamp SB 2015 Welcome Slides
PDF
Ontology Virtualization for Smart Data -- A Semantics Perspective on Open Dat...
PDF
'The Why, What, and How of Geo-Information Observatories' GeoRich2014 Keynote
PDF
A Non-Technical, Example-Driven Introduction to Linked Data
PDF
Please don't agree: Introducing Descartes-Core
PDF
Where is the sweet spot for ontologies?
PDF
GEOSPATIAL SEMANTICS -- PROBLEMS AND PROJECTS
PDF
Semantics and Linked Data for CyberGIS -- AAG 2013 Frontiers and Roadmaps Se...
PDF
Introductory slides into Big Data in Geographic Information Science
How “Alternative" are Alternative Facts? Towards Measuring Statement Coherenc...
Geo-Humanities 2017 Keynote at SIGSPATIAL 2017
Building Blocks for Distributed Geo-Knowledge Graphs
Ontology Engineering: A View from the Trenches - WOP 2015 Keynote
Exploring the Data Universe with Semantic Signatures: Plous Lecture 2015
Pattern-based Ontology Engineering
GeoVoCamp SB 2015 Welcome Slides
Ontology Virtualization for Smart Data -- A Semantics Perspective on Open Dat...
'The Why, What, and How of Geo-Information Observatories' GeoRich2014 Keynote
A Non-Technical, Example-Driven Introduction to Linked Data
Please don't agree: Introducing Descartes-Core
Where is the sweet spot for ontologies?
GEOSPATIAL SEMANTICS -- PROBLEMS AND PROJECTS
Semantics and Linked Data for CyberGIS -- AAG 2013 Frontiers and Roadmaps Se...
Introductory slides into Big Data in Geographic Information Science
Ad

Recently uploaded (20)

PPTX
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
PPTX
The KM-GBF monitoring framework – status & key messages.pptx
PPT
POSITIONING IN OPERATION THEATRE ROOM.ppt
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
PDF
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PPTX
2Systematics of Living Organisms t-.pptx
PPTX
Comparative Structure of Integument in Vertebrates.pptx
PPTX
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
PPTX
2. Earth - The Living Planet Module 2ELS
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PDF
An interstellar mission to test astrophysical black holes
PPTX
neck nodes and dissection types and lymph nodes levels
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
PPTX
Microbiology with diagram medical studies .pptx
PPTX
TOTAL hIP ARTHROPLASTY Presentation.pptx
PDF
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PPTX
INTRODUCTION TO EVS | Concept of sustainability
PDF
Biophysics 2.pdffffffffffffffffffffffffff
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
The KM-GBF monitoring framework – status & key messages.pptx
POSITIONING IN OPERATION THEATRE ROOM.ppt
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
ECG_Course_Presentation د.محمد صقران ppt
2Systematics of Living Organisms t-.pptx
Comparative Structure of Integument in Vertebrates.pptx
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
2. Earth - The Living Planet Module 2ELS
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
An interstellar mission to test astrophysical black holes
neck nodes and dissection types and lymph nodes levels
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
Microbiology with diagram medical studies .pptx
TOTAL hIP ARTHROPLASTY Presentation.pptx
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
INTRODUCTION TO EVS | Concept of sustainability
Biophysics 2.pdffffffffffffffffffffffffff
Ad

Heterogeneity is Here to Stay and Semantics is Not About Agreement

  • 1. Heterogeneity is Here to Stay and Semantics is not about Agreement Krzysztof Janowicz STKO Lab University of California, Santa Barbara, USA EarthCube All-Hands Meeting Addressing Data Heterogeneity 2014 June 2014 Heterogeneity is Here to Stay and Semantics is not about Agreement K. Janowicz
  • 2. The Data Retrieval Problem Is Real Even the major data hubs such as Data.gov still rely on keyword-based search and have unreliable, incomplete, and missing metadata. For this type of retrieval problems, even ’a little semantics goes a long way’ (Hendler 1997). Heterogeneity is Here to Stay and Semantics is not about Agreement K. Janowicz
  • 3. Sensemaking is Difficult – Fitness for Puspose is Key There is no shortage of data, but finding data that is fit for a certain purpose is difficult. Data as statements (think RDF) not as truth. Heterogeneity is caused by cultural differences, progress in science, viewpoints, granularity, ... Alchemist Fallacy1; semantics does not come for free. Lack of provenance information Sensemaking requires more powerful semantic technologies and ontologies (compared to IR). 1You cannot transmute base metals into gold and even if you could, gold would not be precious anymore. Recall the data citation discussion. Heterogeneity is Here to Stay and Semantics is not about Agreement K. Janowicz
  • 4. Meaningful Analysis and Synthesis is Difficult Ensuring that data is analyzed and combined in a meaningful way is far from trivial. What if the information on how to use the data would come together with these data? Focus on smart data instead of (merely on) smart applications. The purpose of ontologies is not to agree on the meaning of terms but to make the data provider’s intended meaning explicit. A little experiment: The statement all rivers flow into other water bodies is not useful because it is ’true’2, but because...? 2It is not; rivers can flow into the ground or just dry up entirely before reaching another water body. Heterogeneity is Here to Stay and Semantics is not about Agreement K. Janowicz