SlideShare a Scribd company logo
Towards Transfer Learning of Link Specications
Axel-Cyrille Ngonga Ngomo

Jens Lehmann

Mofeed Hassan

2013-09-16

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

1 / 29
Outline

1

Motivation

2

Transfer Learning Framework

3

Experimental Setup

4

Results

5

Conclusions and Future Work

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

2 / 29
Outline

1

Motivation

2

Transfer Learning Framework

3

Experimental Setup

4

Results

5

Conclusions and Future Work

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

3 / 29
Why Link Discovery?
1

2

3

Fourth Linked Data
principle
Links are central for
Cross-ontology QA
Data Integration
Reasoning
Federated Queries
...
2011 topology of the
LOD Cloud:
31+ billion triples
≈ 0.5 billion links
owl:sameAs in most
cases

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

4 / 29
Why is it dicult?
Denition (Link Discovery)

S and T of resources and relation R
Find M = {(s , t ) ∈ S × T : R(s , t )}

Given sets

Task:
Common approaches:
Find
= {( , ) ∈
= {( , ) ∈
Find

M
M

s t S × T : σ(s , t ) ≥ θ}
s t S × T : δ(s , t ) ≤ θ}

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

5 / 29
Why is it dicult?
Denition (Link Discovery)

S and T of resources and relation R
Find M = {(s , t ) ∈ S × T : R(s , t )}

Given sets

Task:
Common approaches:
Find
= {( , ) ∈
= {( , ) ∈
Find

M
M

1

s t S × T : σ(s , t ) ≥ θ}
s t S × T : δ(s , t ) ≤ θ}

Time complexity

Large number of triples
Quadratic a-priori runtime
69 days for mapping cities from
DBpedia to Geonames (1ms per
comparison)
Decades for linking DBpedia and LGD
...

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

5 / 29
Why is it dicult?
2

Complexity of specications

Combination of several attributes required for high precision
Tedious discovery of most adequate mapping
Dataset-dependent similarity functions

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

6 / 29
LIMES Framework

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

7 / 29
Link Specication
Detection of accurate link specication is key
Link Specications has three components:
Two sets of restrictions RS ... RS resp. RT ... RT that specify the
m
1
1
k
sets resp. ,
A specication of a complex similarity metric σ via the combination of
several atomic similarity measures σ1 , ..., σn and
A set of thresholds τ1 , ..., τn such that τi is the threshold for σi .

S

T

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

8 / 29
Transfer Learning
Classical Learning of Link Specs

Transfer Learning of Link Specs
Current Linking Task

Different Linking Tasks

Task Repository

spec accuracy: α
class similarity: ζ
property similarity: π
Learning System

Learning System

Learning System

In our approach we use

Transfer Learning System

Transductive Transfer Learning

Class and property matching is assumed to be known already
(numerous approaches from ontology matching can be employed) the goal is to nd the complex similarity metric

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

9 / 29
Outline

1

Motivation

2

Transfer Learning Framework

3

Experimental Setup

4

Results

5

Conclusions and Future Work

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

10 / 29
Transfer Learning Framework I

Transfer Learning of link specications is reduce to three subproblems:
Restrictions/class similarity ζ : 2C × 2C → [0, 1]
e.g. ζ({City , Village }, {Town}) = 0.6
Property similarity: ξ : 2P × 2P → [0, 1]
e.g. ξ({rdfs : label }, {rdfs : label }) = 1.0

Accuracy of link specications: α : Q → [0, 1]

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

11 / 29
Transfer Learning Framework II
Overall similarity measure for transfer learning:
ω(t , t ) = α(q ) · ζ(ψ(q ), C) · ζ(ψ (q ), C ) · ξ(sp (q ), PL ) · ξ(tp (q ), PL )
(details in paper)
Each similarity measure can be implemented in manifold approaches

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

12 / 29
Transfer Learning Framework II
Overall similarity measure for transfer learning:
ω(t , t ) = α(q ) · ζ(ψ(q ), C) · ζ(ψ (q ), C ) · ξ(sp (q ), PL ) · ξ(tp (q ), PL )
(details in paper)
Each similarity measure can be implemented in manifold approaches
Implementations of class similarity function ζ in framework:
label-based similarity
name-based similarity (URI similarity)
data-centric similarity
Properties similarities ξ are dened analogously

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

12 / 29
Transfer Learning Framework II
Overall similarity measure for transfer learning:
ω(t , t ) = α(q ) · ζ(ψ(q ), C) · ζ(ψ (q ), C ) · ξ(sp (q ), PL ) · ξ(tp (q ), PL )
(details in paper)
Each similarity measure can be implemented in manifold approaches
Implementations of class similarity function ζ in framework:
label-based similarity
name-based similarity (URI similarity)
data-centric similarity
Properties similarities ξ are dened analogously
Similarities between single classes/properties can be extended to sets
(e.g. using arithmetic / geometric mean of max. similarity)

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

12 / 29
Transfer Learning Framework II
Overall similarity measure for transfer learning:
ω(t , t ) = α(q ) · ζ(ψ(q ), C) · ζ(ψ (q ), C ) · ξ(sp (q ), PL ) · ξ(tp (q ), PL )
(details in paper)
Each similarity measure can be implemented in manifold approaches
Implementations of class similarity function ζ in framework:
label-based similarity
name-based similarity (URI similarity)
data-centric similarity
Properties similarities ξ are dened analogously
Similarities between single classes/properties can be extended to sets
(e.g. using arithmetic / geometric mean of max. similarity)
Spec can be transferred by replacing properties with most similar
properties in PL and PL
Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

12 / 29
Example (New Link Task)
Example link specication for mapping drugs in two datasets DBpedia and
Drugbank (DBpedia-Drugbank.xml):

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

13 / 29
Example (Restriction part)
Three parts of link specs:
Restrictions part

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

14 / 29
Example (Properties Part)
Three parts of link specs:
Restrictions part
Properties part

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

15 / 29
Example (Similarities Measures Part)
Three parts of link specs:
Restrictions part
Properties part
Similarity Measures part: similarity metric and thresholds

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

16 / 29
Example (Link Repository)
Transfer learning is applied using a repository → restrictions and relevant
properties are assumed to be known → nd the similarity measure by
comparing with all specs in the repository, e.g. DBpedia-SiderDrugs.xml

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

17 / 29
Example (Restriction Similarities)
Restrictions in both specications les
Type

DBpedia-Drugbank.xml

DBpedia-SiderDrugs.xml

Source
Target

rdf:type dbpedia-owl:Drug
rdf:type drug:drugs

rdf:type dbpedia-owl:Drug
rdf:type sider:drugs

Straightforward label/URI similarity
For instance, trigram metric in URI similarity without prexes:
ζ({dbpedia-owl:Drug}, {dbpedia-owl:Drug}) = 1.0
ζ({sider:drugs}, {drug:drugs}) = 1.0

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

18 / 29
Example (Restriction Similarities)
Restrictions in both specications les
Type

DBpedia-Drugbank.xml

DBpedia-SiderDrugs.xml

Source
Target

rdf:type dbpedia-owl:Drug
rdf:type drug:drugs

rdf:type dbpedia-owl:Drug
rdf:type sider:drugs

Straightforward label/URI similarity
For instance, trigram metric in URI similarity without prexes:
ζ({dbpedia-owl:Drug}, {dbpedia-owl:Drug}) = 1.0
ζ({sider:drugs}, {drug:drugs}) = 1.0

1
Data-centric: ζd (s , s ) = |P (s )||P (s

sim(x , y ) where
x ∈P (s ) y ∈P (s )
P (s ) = {x : s p x ∧ p rdf:type owl:DatatypeProperty}
(extends similarity to instances)

Ngonga et. al (Univ. Leipzig)

)|

Transfer Learning of Link Specs

2013-09-16

18 / 29
Example (Property Similarities)

type

DBpedia-Drugbank.xml

DBpedia-SiderDrugs.xml

Source

rdfs:label

Target

rdfs:label
drug:genericName

rdfs:label
foaf:name
rdfs:label

Applying similarity function to all properties:
For instance trigram based on URIs and arithmetic mean as
aggregation:
ξ({rdfs : label }, {rdfs : label , foaf : name }) = 0.9
ξ({rdfs : label , drug : genericName }, {rdfs : label }) = 0.8

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

19 / 29
Example (Overall Similarity)

Based on, e.g. F-score assign quality value to q =
DBpedia-SiderDrugs.xml, in our case α(q ) = 0.89
The nal step is calculating the overall similarity measure
ω(DBpedia − Drugbank .xml , DBpedia − SiderDrugs .xml ) =
0.89 * 1.0 * 1.0 * 0.9 * 0.8 = 0.64
The steps are repeated for all link specications in the repository
Most similar link spec can be transferred by replacing its properties
with the most similar ones in the computed property matching

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

20 / 29
Outline

1

Motivation

2

Transfer Learning Framework

3

Experimental Setup

4

Results

5

Conclusions and Future Work

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

21 / 29
Experimental Setup I
The goal of evaluation is two-fold:
Evaluating whether transfer learning can be used to build templates
for link spec
Discover whether the transferred templates can be used directly
113 specications were retrieved from LATC, each has manual links
evaluation
15%

10%

3%

2%
Persons

1%
3%

Events
Locations
Diseases
Drugs
Organizations
Misc
66%

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

22 / 29
Experimental Setup II

Leave-one-out evaluation
1.) Compare top-scored specication (most similar) and check
whether it uses the same combination of similarity functions  assign 1
for match and 0 for no match
2.) Compute F-measure of learned link specs directly  works only on
specs with both endpoints alive (only 12 out of 113)
Used URI similarity

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

23 / 29
Outline

1

Motivation

2

Transfer Learning Framework

3

Experimental Setup

4

Results

5

Conclusions and Future Work

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

24 / 29
First Experiments Set Results
Detecting right specication in 81% of all cases
In geo-spatial domain 91%
In persons domain 58%
100%
90%
80%
70%
60%
50%
40%
30%
20%
10%
0%
Ave

s
s
s
s
e
rag erson Event cation sease
P
Di
Lo

Ngonga et. al (Univ. Leipzig)

gs
ns
Dru izatio
n
rga
O

Transfer Learning of Link Specs

Mis

c

2013-09-16

25 / 29
Second Experiments Set Results
In the second Experiments series, source and target endpoints need to
be alive such that we can execute transferred link spec (12 out of 113)
In general low F-measures
100%

80%

60%

40%
Precision
Recall
F-Measure

20%

s

nt
ry

op
e

ne

i-c
ou

ty
si

ex
t

er

at
at

iv
gu
a-

pe

di
a-

te

nd

un
a-

at

pe
di
db

db

n

on

so

rs

er

pe

-p

d-

on

oo
og
f

-d

db

pe

di
a-

lin

ke

dg

eo
d

nt

or
t
rp
ai

at
a-

m

-r
st
ad
ev

en

ts

ee
r

so
n
-d

er
db

pe

di

ali

nk

ts
e
en

ev

ve

er

-e

-p

fo
od

og

ge
od

l3
s
p_

bl

er

ed

s

ty
ci

ar
ie

K-

ul

vU

ab

go

st

da
ta

di
a-

se

nt

ev
e

db

pe

di
pe

db
e-

im

Ngonga et. al (Univ. Leipzig)

-d

co
u

aco
n

es
-

-e
ss

xe
ra

eu
bc
r
rk

db

lp

-d
at

as

em

an

tic
w

eb
-

ur

re

se
ar

ch

er

nt
ry

0%

Transfer Learning of Link Specs

2013-09-16

26 / 29
Outline

1

Motivation

2

Transfer Learning Framework

3

Experimental Setup

4

Results

5

Conclusions and Future Work

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

27 / 29
Summary
Conclusions:
Detecting right template in 81% of all cases
Transfer learning cannot replace the learning of thresholds in
specications
Future Work:
Combination with machine-learning approaches for link specications
(e.g., EAGLE, COALA), in particular for learning thresholds
More sophisticated class and property similarity approaches

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

28 / 29
The End
Jens Lehmann
lehmann@informatik.uni-leipzig.de
AKSW/Uni Leipzig

Questions

Geo

Know
http://geoknow.eu

Ngonga et. al (Univ. Leipzig)

Transfer Learning of Link Specs

2013-09-16

29 / 29

More Related Content

PDF
A Mathematical Approach to Ontology Authoring and Documentation
PPTX
Probabilistic models (part 1)
PPTX
Tdm probabilistic models (part 2)
PDF
Linked Open (Geo)Data and the Distributed Ontology Language – a perfect match
PDF
Coling2014:Single Document Keyphrase Extraction Using Label Information
DOCX
Clustering sentence level text using a novel fuzzy relational clustering algo...
PDF
Dagstuhl 2013 - Montali - On the Relationship between OBDA and Relational Map...
PPT
Complex Matching of RDF Datatype Properties
A Mathematical Approach to Ontology Authoring and Documentation
Probabilistic models (part 1)
Tdm probabilistic models (part 2)
Linked Open (Geo)Data and the Distributed Ontology Language – a perfect match
Coling2014:Single Document Keyphrase Extraction Using Label Information
Clustering sentence level text using a novel fuzzy relational clustering algo...
Dagstuhl 2013 - Montali - On the Relationship between OBDA and Relational Map...
Complex Matching of RDF Datatype Properties

What's hot (19)

ODP
Topic Modeling
PDF
Multi core processing of xml twig patterns
PDF
A Comparison of Supervised Learning Classifiers for Link Discovery
PDF
Computing probabilistic queries in the presence of uncertainty via probabilis...
PDF
The Web service composition based on colored Petri net
PDF
Text Categorization Using Improved K Nearest Neighbor Algorithm
PPT
Textmining Retrieval And Clustering
PPTX
Topic modeling using big data analytics
PPTX
Document Classification and Clustering
PPTX
Nikolai Blenda - Clusterization of text documents using WordNet and semantic ...
PDF
Interactive Knowledge Discovery over Web of Data.
PDF
Bringing Math to LOD
PPT
Topic Models - LDA and Correlated Topic Models
PPTX
Neural Models for Information Retrieval
PPTX
CSMR: A Scalable Algorithm for Text Clustering with Cosine Similarity and Map...
PDF
Framester: A Wide Coverage Linguistic Linked Data Hub
PDF
Ectel sem_info_rec_learning_resources_v6.0_20120921_ma
PDF
Reference Scope Identification of Citances Using Convolutional Neural Network
PDF
TopicModels_BleiPaper_Summary.pptx
Topic Modeling
Multi core processing of xml twig patterns
A Comparison of Supervised Learning Classifiers for Link Discovery
Computing probabilistic queries in the presence of uncertainty via probabilis...
The Web service composition based on colored Petri net
Text Categorization Using Improved K Nearest Neighbor Algorithm
Textmining Retrieval And Clustering
Topic modeling using big data analytics
Document Classification and Clustering
Nikolai Blenda - Clusterization of text documents using WordNet and semantic ...
Interactive Knowledge Discovery over Web of Data.
Bringing Math to LOD
Topic Models - LDA and Correlated Topic Models
Neural Models for Information Retrieval
CSMR: A Scalable Algorithm for Text Clustering with Cosine Similarity and Map...
Framester: A Wide Coverage Linguistic Linked Data Hub
Ectel sem_info_rec_learning_resources_v6.0_20120921_ma
Reference Scope Identification of Citances Using Convolutional Neural Network
TopicModels_BleiPaper_Summary.pptx
Ad

Similar to Towards Transfer Learning of Link Specifications (20)

PDF
An Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
PDF
Knowledge Graph Embeddings for Recommender Systems
PDF
An experimental evaluation of similarity-based and embedding-based link predi...
PDF
An Experimental Evaluation of Similarity-Based and Embedding-Based Link Predi...
PDF
Entity2rec recsys
PPTX
Knowledge Graph Introduction
PDF
An experimental evaluation of similarity-based and embedding-based link predi...
PDF
Link Discovery Tutorial Introduction
PPTX
Hai huang presentation
PDF
PDF
Learning Commonalities in RDF and SPARQL
PDF
Learning Commonalities in RDF
PPTX
OWL: Yet to arrive on the Web of Data?
PPTX
MMKG: Multi-Modal Knowledge Graphs dddddd
PDF
machine-learning-with-large-networks-of-people-and-places
PDF
graph_embeddings
PDF
Semantic Web Technology
PPTX
Finding Association Rules in Linked Data
PPTX
240325_JW_labseminar[node2vec: Scalable Feature Learning for Networks].pptx
PDF
A comprehensive survey of link mining and anomalies detection
An Empirical Comparison of Knowledge Graph Embeddings for Item Recommendation
Knowledge Graph Embeddings for Recommender Systems
An experimental evaluation of similarity-based and embedding-based link predi...
An Experimental Evaluation of Similarity-Based and Embedding-Based Link Predi...
Entity2rec recsys
Knowledge Graph Introduction
An experimental evaluation of similarity-based and embedding-based link predi...
Link Discovery Tutorial Introduction
Hai huang presentation
Learning Commonalities in RDF and SPARQL
Learning Commonalities in RDF
OWL: Yet to arrive on the Web of Data?
MMKG: Multi-Modal Knowledge Graphs dddddd
machine-learning-with-large-networks-of-people-and-places
graph_embeddings
Semantic Web Technology
Finding Association Rules in Linked Data
240325_JW_labseminar[node2vec: Scalable Feature Learning for Networks].pptx
A comprehensive survey of link mining and anomalies detection
Ad

More from geoknow (13)

PDF
Esta ld -exploring-spatio-temporal-linked-statistical-data
PDF
Sdwwg experiences and outlook
PDF
Spatial data web application in Suppliy Chain Management
PDF
Generator workbench
PDF
Geold2015 wauer
PDF
Facete - Exploring the web of spatial data with facete
PDF
ESTA-LD exploring spatio-temporal linked statistical data
PDF
The Linked Data Lifecycle
PDF
Can we crate better links playing games?
PDF
LinkedGeoData and GeoKnow
ODP
LinkedGeodata (Deutsch)
PPTX
Geo know general presentation 2013
PPTX
Geo know odw13-presentation
Esta ld -exploring-spatio-temporal-linked-statistical-data
Sdwwg experiences and outlook
Spatial data web application in Suppliy Chain Management
Generator workbench
Geold2015 wauer
Facete - Exploring the web of spatial data with facete
ESTA-LD exploring spatio-temporal linked statistical data
The Linked Data Lifecycle
Can we crate better links playing games?
LinkedGeoData and GeoKnow
LinkedGeodata (Deutsch)
Geo know general presentation 2013
Geo know odw13-presentation

Recently uploaded (20)

PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
PPTX
History, Philosophy and sociology of education (1).pptx
PDF
Classroom Observation Tools for Teachers
PDF
IGGE1 Understanding the Self1234567891011
PPTX
Introduction to Building Materials
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PPTX
UV-Visible spectroscopy..pptx UV-Visible Spectroscopy – Electronic Transition...
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Paper A Mock Exam 9_ Attempt review.pdf.
PDF
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
PDF
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PPTX
Lesson notes of climatology university.
PDF
LDMMIA Reiki Yoga Finals Review Spring Summer
PDF
Indian roads congress 037 - 2012 Flexible pavement
PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PPTX
Radiologic_Anatomy_of_the_Brachial_plexus [final].pptx
PDF
Empowerment Technology for Senior High School Guide
PDF
SOIL: Factor, Horizon, Process, Classification, Degradation, Conservation
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
History, Philosophy and sociology of education (1).pptx
Classroom Observation Tools for Teachers
IGGE1 Understanding the Self1234567891011
Introduction to Building Materials
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
UV-Visible spectroscopy..pptx UV-Visible Spectroscopy – Electronic Transition...
Final Presentation General Medicine 03-08-2024.pptx
Paper A Mock Exam 9_ Attempt review.pdf.
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
Supply Chain Operations Speaking Notes -ICLT Program
Lesson notes of climatology university.
LDMMIA Reiki Yoga Finals Review Spring Summer
Indian roads congress 037 - 2012 Flexible pavement
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
Radiologic_Anatomy_of_the_Brachial_plexus [final].pptx
Empowerment Technology for Senior High School Guide
SOIL: Factor, Horizon, Process, Classification, Degradation, Conservation

Towards Transfer Learning of Link Specifications

  • 1. Towards Transfer Learning of Link Specications Axel-Cyrille Ngonga Ngomo Jens Lehmann Mofeed Hassan 2013-09-16 Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 1 / 29
  • 2. Outline 1 Motivation 2 Transfer Learning Framework 3 Experimental Setup 4 Results 5 Conclusions and Future Work Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 2 / 29
  • 3. Outline 1 Motivation 2 Transfer Learning Framework 3 Experimental Setup 4 Results 5 Conclusions and Future Work Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 3 / 29
  • 4. Why Link Discovery? 1 2 3 Fourth Linked Data principle Links are central for Cross-ontology QA Data Integration Reasoning Federated Queries ... 2011 topology of the LOD Cloud: 31+ billion triples ≈ 0.5 billion links owl:sameAs in most cases Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 4 / 29
  • 5. Why is it dicult? Denition (Link Discovery) S and T of resources and relation R Find M = {(s , t ) ∈ S × T : R(s , t )} Given sets Task: Common approaches: Find = {( , ) ∈ = {( , ) ∈ Find M M s t S × T : σ(s , t ) ≥ θ} s t S × T : δ(s , t ) ≤ θ} Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 5 / 29
  • 6. Why is it dicult? Denition (Link Discovery) S and T of resources and relation R Find M = {(s , t ) ∈ S × T : R(s , t )} Given sets Task: Common approaches: Find = {( , ) ∈ = {( , ) ∈ Find M M 1 s t S × T : σ(s , t ) ≥ θ} s t S × T : δ(s , t ) ≤ θ} Time complexity Large number of triples Quadratic a-priori runtime 69 days for mapping cities from DBpedia to Geonames (1ms per comparison) Decades for linking DBpedia and LGD ... Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 5 / 29
  • 7. Why is it dicult? 2 Complexity of specications Combination of several attributes required for high precision Tedious discovery of most adequate mapping Dataset-dependent similarity functions Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 6 / 29
  • 8. LIMES Framework Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 7 / 29
  • 9. Link Specication Detection of accurate link specication is key Link Specications has three components: Two sets of restrictions RS ... RS resp. RT ... RT that specify the m 1 1 k sets resp. , A specication of a complex similarity metric σ via the combination of several atomic similarity measures σ1 , ..., σn and A set of thresholds τ1 , ..., τn such that τi is the threshold for σi . S T Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 8 / 29
  • 10. Transfer Learning Classical Learning of Link Specs Transfer Learning of Link Specs Current Linking Task Different Linking Tasks Task Repository spec accuracy: α class similarity: ζ property similarity: π Learning System Learning System Learning System In our approach we use Transfer Learning System Transductive Transfer Learning Class and property matching is assumed to be known already (numerous approaches from ontology matching can be employed) the goal is to nd the complex similarity metric Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 9 / 29
  • 11. Outline 1 Motivation 2 Transfer Learning Framework 3 Experimental Setup 4 Results 5 Conclusions and Future Work Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 10 / 29
  • 12. Transfer Learning Framework I Transfer Learning of link specications is reduce to three subproblems: Restrictions/class similarity ζ : 2C × 2C → [0, 1] e.g. ζ({City , Village }, {Town}) = 0.6 Property similarity: ξ : 2P × 2P → [0, 1] e.g. ξ({rdfs : label }, {rdfs : label }) = 1.0 Accuracy of link specications: α : Q → [0, 1] Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 11 / 29
  • 13. Transfer Learning Framework II Overall similarity measure for transfer learning: ω(t , t ) = α(q ) · ζ(ψ(q ), C) · ζ(ψ (q ), C ) · ξ(sp (q ), PL ) · ξ(tp (q ), PL ) (details in paper) Each similarity measure can be implemented in manifold approaches Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 12 / 29
  • 14. Transfer Learning Framework II Overall similarity measure for transfer learning: ω(t , t ) = α(q ) · ζ(ψ(q ), C) · ζ(ψ (q ), C ) · ξ(sp (q ), PL ) · ξ(tp (q ), PL ) (details in paper) Each similarity measure can be implemented in manifold approaches Implementations of class similarity function ζ in framework: label-based similarity name-based similarity (URI similarity) data-centric similarity Properties similarities ξ are dened analogously Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 12 / 29
  • 15. Transfer Learning Framework II Overall similarity measure for transfer learning: ω(t , t ) = α(q ) · ζ(ψ(q ), C) · ζ(ψ (q ), C ) · ξ(sp (q ), PL ) · ξ(tp (q ), PL ) (details in paper) Each similarity measure can be implemented in manifold approaches Implementations of class similarity function ζ in framework: label-based similarity name-based similarity (URI similarity) data-centric similarity Properties similarities ξ are dened analogously Similarities between single classes/properties can be extended to sets (e.g. using arithmetic / geometric mean of max. similarity) Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 12 / 29
  • 16. Transfer Learning Framework II Overall similarity measure for transfer learning: ω(t , t ) = α(q ) · ζ(ψ(q ), C) · ζ(ψ (q ), C ) · ξ(sp (q ), PL ) · ξ(tp (q ), PL ) (details in paper) Each similarity measure can be implemented in manifold approaches Implementations of class similarity function ζ in framework: label-based similarity name-based similarity (URI similarity) data-centric similarity Properties similarities ξ are dened analogously Similarities between single classes/properties can be extended to sets (e.g. using arithmetic / geometric mean of max. similarity) Spec can be transferred by replacing properties with most similar properties in PL and PL Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 12 / 29
  • 17. Example (New Link Task) Example link specication for mapping drugs in two datasets DBpedia and Drugbank (DBpedia-Drugbank.xml): Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 13 / 29
  • 18. Example (Restriction part) Three parts of link specs: Restrictions part Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 14 / 29
  • 19. Example (Properties Part) Three parts of link specs: Restrictions part Properties part Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 15 / 29
  • 20. Example (Similarities Measures Part) Three parts of link specs: Restrictions part Properties part Similarity Measures part: similarity metric and thresholds Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 16 / 29
  • 21. Example (Link Repository) Transfer learning is applied using a repository → restrictions and relevant properties are assumed to be known → nd the similarity measure by comparing with all specs in the repository, e.g. DBpedia-SiderDrugs.xml Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 17 / 29
  • 22. Example (Restriction Similarities) Restrictions in both specications les Type DBpedia-Drugbank.xml DBpedia-SiderDrugs.xml Source Target rdf:type dbpedia-owl:Drug rdf:type drug:drugs rdf:type dbpedia-owl:Drug rdf:type sider:drugs Straightforward label/URI similarity For instance, trigram metric in URI similarity without prexes: ζ({dbpedia-owl:Drug}, {dbpedia-owl:Drug}) = 1.0 ζ({sider:drugs}, {drug:drugs}) = 1.0 Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 18 / 29
  • 23. Example (Restriction Similarities) Restrictions in both specications les Type DBpedia-Drugbank.xml DBpedia-SiderDrugs.xml Source Target rdf:type dbpedia-owl:Drug rdf:type drug:drugs rdf:type dbpedia-owl:Drug rdf:type sider:drugs Straightforward label/URI similarity For instance, trigram metric in URI similarity without prexes: ζ({dbpedia-owl:Drug}, {dbpedia-owl:Drug}) = 1.0 ζ({sider:drugs}, {drug:drugs}) = 1.0 1 Data-centric: ζd (s , s ) = |P (s )||P (s sim(x , y ) where x ∈P (s ) y ∈P (s ) P (s ) = {x : s p x ∧ p rdf:type owl:DatatypeProperty} (extends similarity to instances) Ngonga et. al (Univ. Leipzig) )| Transfer Learning of Link Specs 2013-09-16 18 / 29
  • 24. Example (Property Similarities) type DBpedia-Drugbank.xml DBpedia-SiderDrugs.xml Source rdfs:label Target rdfs:label drug:genericName rdfs:label foaf:name rdfs:label Applying similarity function to all properties: For instance trigram based on URIs and arithmetic mean as aggregation: ξ({rdfs : label }, {rdfs : label , foaf : name }) = 0.9 ξ({rdfs : label , drug : genericName }, {rdfs : label }) = 0.8 Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 19 / 29
  • 25. Example (Overall Similarity) Based on, e.g. F-score assign quality value to q = DBpedia-SiderDrugs.xml, in our case α(q ) = 0.89 The nal step is calculating the overall similarity measure ω(DBpedia − Drugbank .xml , DBpedia − SiderDrugs .xml ) = 0.89 * 1.0 * 1.0 * 0.9 * 0.8 = 0.64 The steps are repeated for all link specications in the repository Most similar link spec can be transferred by replacing its properties with the most similar ones in the computed property matching Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 20 / 29
  • 26. Outline 1 Motivation 2 Transfer Learning Framework 3 Experimental Setup 4 Results 5 Conclusions and Future Work Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 21 / 29
  • 27. Experimental Setup I The goal of evaluation is two-fold: Evaluating whether transfer learning can be used to build templates for link spec Discover whether the transferred templates can be used directly 113 specications were retrieved from LATC, each has manual links evaluation 15% 10% 3% 2% Persons 1% 3% Events Locations Diseases Drugs Organizations Misc 66% Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 22 / 29
  • 28. Experimental Setup II Leave-one-out evaluation 1.) Compare top-scored specication (most similar) and check whether it uses the same combination of similarity functions assign 1 for match and 0 for no match 2.) Compute F-measure of learned link specs directly works only on specs with both endpoints alive (only 12 out of 113) Used URI similarity Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 23 / 29
  • 29. Outline 1 Motivation 2 Transfer Learning Framework 3 Experimental Setup 4 Results 5 Conclusions and Future Work Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 24 / 29
  • 30. First Experiments Set Results Detecting right specication in 81% of all cases In geo-spatial domain 91% In persons domain 58% 100% 90% 80% 70% 60% 50% 40% 30% 20% 10% 0% Ave s s s s e rag erson Event cation sease P Di Lo Ngonga et. al (Univ. Leipzig) gs ns Dru izatio n rga O Transfer Learning of Link Specs Mis c 2013-09-16 25 / 29
  • 31. Second Experiments Set Results In the second Experiments series, source and target endpoints need to be alive such that we can execute transferred link spec (12 out of 113) In general low F-measures 100% 80% 60% 40% Precision Recall F-Measure 20% s nt ry op e ne i-c ou ty si ex t er at at iv gu a- pe di a- te nd un a- at pe di db db n on so rs er pe -p d- on oo og f -d db pe di a- lin ke dg eo d nt or t rp ai at a- m -r st ad ev en ts ee r so n -d er db pe di ali nk ts e en ev ve er -e -p fo od og ge od l3 s p_ bl er ed s ty ci ar ie K- ul vU ab go st da ta di a- se nt ev e db pe di pe db e- im Ngonga et. al (Univ. Leipzig) -d co u aco n es - -e ss xe ra eu bc r rk db lp -d at as em an tic w eb - ur re se ar ch er nt ry 0% Transfer Learning of Link Specs 2013-09-16 26 / 29
  • 32. Outline 1 Motivation 2 Transfer Learning Framework 3 Experimental Setup 4 Results 5 Conclusions and Future Work Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 27 / 29
  • 33. Summary Conclusions: Detecting right template in 81% of all cases Transfer learning cannot replace the learning of thresholds in specications Future Work: Combination with machine-learning approaches for link specications (e.g., EAGLE, COALA), in particular for learning thresholds More sophisticated class and property similarity approaches Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 28 / 29
  • 34. The End Jens Lehmann lehmann@informatik.uni-leipzig.de AKSW/Uni Leipzig Questions Geo Know http://geoknow.eu Ngonga et. al (Univ. Leipzig) Transfer Learning of Link Specs 2013-09-16 29 / 29