SlideShare a Scribd company logo
A Review of Relational Machine Learning
for Knowledge Graphs
Present By: Yalda Akbarzadeh
May 2018
A Review of
Relational Machine Learning
for Knowledge Graphs
1. INTRODUCTION
2. KNOWLEDGE
GRAPHS
A. Knowledge
representation
B. Open vs.
closed world
assumption
C. Knowledge
base
construction
3. STATISTICAL
RELATIONAL LEARNING
FOR KNOWLEDGE
GRAPHS
D. Uses of
knowledge
graphs
E. Main tasks in
knowledge graph
construction and
curation
A. Probabilistic
knowledge
graphs
B. Statistical
properties of
knowledge
graphs
C. Types of
SRL models
4. LATENT
FEATURE MODELS
D. Multi-layer
perceptrons
B. Other tensor
factorization
models
C. Matrix
factorization
methods
A. RESCAL: A
bilinear model
E. Neural tensor
networks
F. Latent
distance models
G. Comparison
of models
8. MARKOV
RANDOM
FIELDS
C. Path Ranking
Algorithm
B. Rule Mining and
Inductive Logic
Programming
A. Similarity
measures for uni-
relational data
6. COMBINING
LATENT AND GRAPH
FEATURE MODELS
7. TRAINING SRL
MODELS ON
KNOWLEDGE
GRAPHS 5. GRAPH FEATURE
MODELS
A. Additive
relational
effects model
B. Other
combined
models
A. Penalized
maximum
likelihood
training
B. Where do
the negative
examples come
from?
C. Pairwise loss
training
D. Model
selection
B. Inference
A. Representation
C. Learning
D. Discussion
9. KNOWLEDGE
VAULT
10. EXTENSIONS
AND FUTURE
WORK
D. Discussion
B. Hard
constraints
A. Non-binary
relations
C. Generalizing
to new entities
and relations
D. Querying
probabilistic
knowledge graphs
12. APPENDIX
11. CONCLUDING
REMARKS
D. Querying
probabilistic
knowledge graphs
A Review of Relational Machine Learning for Knowledge Graphs
1. INTRODUCTION
2. STATISTICAL RELATIONAL LEARNING FOR KNOWLEDGE GRAPHS
3. LATENT FEATURE MODELS
4. GRAPH FEATURE MODELS
5. COMBINING LATENT AND GRAPH FEATURE MODELS
6. TRAINING SRL MODELS ON KNOWLEDGE GRAPHS
7. MARKOV RANDOM FIELDS
EXTENSIONS AND FUTURE WORK, COCLUSION
REFERENCES
Traditional machine
Learning Algorithms
feature vector,
numeric or categorical attribute
Input
Mapping
Output Observation
tower
Class labels,
Regression score,
Unsupervised cluster id
Latent vector (embedding)
1. INTRODUCTION  A. Relational Machine Learning 4
feature vector,
numeric or categorical attribute
Representation of an Object
Traditional machine
Learning Algorithms
Graph,
Node and Labeled edges
Berlin
Munich
Düsseldorf
478 Km.
486 Km.
505Km.
1. INTRODUCTION A. Relational Machine Learning 5
Some Canonical Tasks
>> Collective classification
>> Linkprediction
>> Link-based clustering
prediction of
missing edge
clustering nodes
based on their
connectivity pattern
prediction of
properties of
nodes
1. INTRODUCTION A. Relational Machine Learning 6
>> PREDICT(new facts (edges) |existing fact) = LEARN SRL Methods(KGs )
Two main SRL Models
1. INTRODUCTION A. Relational Machine Learning 7
1. INTRODUCTION B. KNOWLEDGE GRAPHS 8
>> Knowledge graphs model information in the form of entities and relationships between them.
Linked Data
(subject, predicate, object) :: (SPO)
(Leonard Nimoy, played, Spock)
 Leonard Nimoy was an actor who played the character
Spock in the science-fiction movie Star Trek
Semantic Web
Linked Data
Resource Description Framework
subject object
predicate
1. INTRODUCTION B. KNOWLEDGE GRAPHS i. Knowledge representation 9
>> closed world assumption (CWA),
non-existing triples indicate false relationships.
Linked Data>> KGs are known to be very incomplete
>> RDF and the Semantic Web make the open-world assumption.
>> open world assumption (OWA),
a non-existing triple is interpreted as unknown.
1. INTRODUCTION B. KNOWLEDGE GRAPHS ii. Open vs. closed world assumption 10
Existing triples are facts
What about non-existing
triples?
Linked Data
1. INTRODUCTION B. KNOWLEDGE GRAPHS iii. Knowledge base construction 11
Linked Data
1. INTRODUCTION B. KNOWLEDGE GRAPHS iv. Uses of knowledge
graphs
12
Linked Data
1. INTRODUCTION  B. KNOWLEDGE GRAPHS  v. Main tasks in KGs construction and curation
Link prediction Entity resolution Link-based clustering
Kate
Ted
Jack
is sister of
isfatherof
isfatherof
Jack
Bonn
uni.
Worksin
uni.
Of
Bonn
Ted
Studiesin
TjLu
Rj
Ed
Lj
Rt
Kj
Jd
Pj
13
2. STATISTICAL RELATIONAL LEARNING FOR KNOWLEDGE GRAPHS 14
2. STATISTICAL RELATIONAL LEARNING FOR KNOWLEDGE GRAPHS A. Probabilistic KGs 15
2. STATISTICAL RELATIONAL LEARNING FOR KNOWLEDGE GRAPHS B. Statistical properties of KGs 16
• Denotes that query answer
should follow a type.
• Tendency of entities to be related
to other entities with similar characteristic.
• All the same blocks members have
the same relation with other block.
2. STATISTICAL RELATIONAL LEARNING FOR KNOWLEDGE GRAPHS C. Types of SRL models 17
M1) latent features and additional parameters
M2) observed graph features and additional parameters
 M3) existence of local interactions (Markov Random Fields)
2. STATISTICAL RELATIONAL LEARNING FOR KNOWLEDGE GRAPHS C. Types of SRL models 18
of M1 and M2:
3. LATENT FEATURE MODELS 19
prestigious awardlatent features -->
Ex) Guinness received the Academy Award
good actor
<- good actor ->
<- prestigious award ->
Relationships between entities can be derived from interactions of their latent features.
3. LATENT FEATURE MODELS A. RESCAL 20
>> RESCAL is a bilinear relational latent feature model which explains triples via pairwise interactions of latent
features.
This importance is used here:
Relationships between entities can be derived
from interactions of their latent features.
3. LATENT FEATURE MODELS B. Matrix factorization methods 21
>> Approach for learning from knowledge graphs is based on matrix factorization
memory
complexity
Lose
information
Reshape
3. LATENT FEATURE MODELS C. Multi-layer-perceptrons 22
L:
3. LATENT FEATURE MODELS C. Multi-layer-perceptrons 23
>> E-MLP model :
02
03
04
Σ
3. LATENT FEATURE MODELS C. Multi-layer-perceptrons 24
It can reduce the number of required parametersIt can reduce the number of required parameters
Advantage of E-MLP
Disadvantage of E-MLP
02
03
04
Σ
3. LATENT FEATURE MODELS D. Neural tensor networks 25
02
03
04
>> Neural Tensor Network (NTN):
Combines traditional MLPs with bilinear models
3. LATENT FEATURE MODELS E. Latent distance models 26
02
03
04
>> Latent distance models or latent space models in social network analysis:
 Derive the probability of relationships from the distance between latent representations of entities
#--------------------------------------------------------------------------------------------------------------------------------------
4. GRAPH FEATURE MODELS A. Similarity measures for uni-relational data 27
>> Existence of an edge can be predicted by extracting features from the observed edges in the graph.
>> GFMs Widely used for link prediction in uni-relational graphs.
>> intuition:
 similar entities are likely to be related (homophily)
 the similarity of entities can be derived from:
• neighborhood of nodes
• existence of paths between nodes
>> Approaches to measure the similarity of entities:
John Mary
Parent of
Anne
Parent of
(John, marriedTo, Mary)
Predict
Local similarity
Global similarity
Quasi-local similarity
4. GRAPH FEATURE MODELS  B. Path Ranking Algorithm 28
>> Using random walks of bounded lengths for predicting links
in multi-relational knowledge graphs.
4. GRAPH FEATURE MODELS  B. Path Ranking Algorithm 29
EXAMPLES OF PATHS LEARNED BY PRA ON FREEBASE
TO PREDICT WHICH COLLEGE A PERSON ATTENDED
Relation Path Weight
(draftedBy, school) 2.62
(sibling(s), sibling, education, institution) 1.88
(parents, education, institution) 1.37
(children, education, institution) 1.85
>> key idea in PRA:
Use path probabilities as features for predicting the probability of missing edges.
5. COMBINING LATENT AND GRAPH FEATURE MODEL 30
>>Latent feature models:
 well-suited for modeling global relational patterns via newly
introduced latent variables.
 They are computationally efficient if triples can be explained
with a small number of latent variables.
>>Graph feature models:
 are well-suited for modeling local and quasi-local graphs
patterns.
 They are computationally efficient if triples can be explained
from the neighborhood of entities or from short paths in
the graph
5. COMBINING LATENT AND GRAPH FEATURE MODEL  Additive Relational Effects models 31
6. TRAINING SRL MODELS ON KNOWLEDGE GRAPHS  A. Penalized maximum likelihood training 32
02
03
04
posterior priorlikelihood
6. TRAINING SRL MODELS ON KNOWLEDGE GRAPHS  B. Where do the negative examples come from? 33
>> Most knowledge graphs only contain positive training examples.
6. TRAINING SRL MODELS ON KNOWLEDGE GRAPHS 34
02
03
04
7. MARKOV RANDOM FIELDS 35
 3-vertex cliques
7. MARKOV RANDOM FIELDS 36
02
03
04
>> Form of Markov Random Fields (MRFs):
EXTENSIONS AND FUTURE WORK 37
A. Non-binary relations
B. Hard constraints: types, functional constraints, and others
C. Generalizing to new entities and relations
D. Querying probabilistic knowledge graphs
E. Trustworthiness of knowledge graphs
 Conclusion 38
review of state-of-the-art statistical relational learning (SRL) methods applied to very large knowledge graphs
KGs have found important applications in question answering, structured search, exploratory search
showed how to create a truly massive, machine-interpretable “semantic memory” of facts
REFERENCES
39
[1] M. Nickel, K. Murphy, V. Tresp, and E. Gabrilovich, “A review of relational machine learning for knowledge
graphs,” Proceedings of the IEEE, vol. 104, pp. 11-33, 2016.
[2] N. Lao, T. Mitchell, and W. W. Cohen, “Random walk inference and learning in a large scale knowledge base,”
in Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2011, pp. 529-539.
[3] “An Introduction to Statistical Relational Learning” at: www.cs.kuleuven.be/~lucdr/salvador.pdf.
[4] “sampling procedure” at: https://www.slideshare.net/anasomoray/chapter-4-sampling-procedure
[5] “Non-Hierarchical Clustering” at: https://courses.cs.washington.edu/courses/cse527/01au/oct25/oct25.html
[6] https://en.wikipedia.org/
[7] N. Feldman, Learning and overgeneralization patterns in a connectionist model of the German plural: Cite seer,
2005.
[8] “Introduction to: Open World Assumption vs Closed World Assumption”, at:
http://www.dataversity.net/introduction-to-open-world-assumption-vs-closed-world-assumption/
[9] “Introducing the Knowledge Graph ”, at: https://www.youtube.com/watch?v=mmQl6VGvX-c
[10] “What is Linked Data?”, at: https://www.w3.org/standards/semanticweb/data
[11] “Resource Description Framework (RDF)”, at: https://www.w3.org/TR/rdf-concepts/
THE END
Thank you
40
?

More Related Content

PDF
A Graph-based Model for Multimodal Information Retrieval
DOCX
Automatic face naming by learning discriminative affinity matrices from weakl...
DOCX
Automatic face naming by learning discriminative
DOCX
AUTOMATIC FACE NAMING BY LEARNING DISCRIMINATIVE AFFINITY MATRICES FROM WEAKL...
PDF
Kerala university m.sc. computer science syllabus
PDF
Classifier fusion method to recognize
PDF
Automatic digital terrain modelling
PDF
The Framework of Image Recognition based on Modified Freeman Chain Code
A Graph-based Model for Multimodal Information Retrieval
Automatic face naming by learning discriminative affinity matrices from weakl...
Automatic face naming by learning discriminative
AUTOMATIC FACE NAMING BY LEARNING DISCRIMINATIVE AFFINITY MATRICES FROM WEAKL...
Kerala university m.sc. computer science syllabus
Classifier fusion method to recognize
Automatic digital terrain modelling
The Framework of Image Recognition based on Modified Freeman Chain Code

What's hot (10)

PDF
Ijarcet vol-2-issue-2-337-341
PDF
M.Phil Computer Science Image Processing Projects
PDF
M.Phil Computer Science Image Processing Projects
PDF
A Hybrid Technique for Shape Matching Based on chain code and DFS Tree
PDF
M.E Computer Science Image Processing Projects
PDF
Role Model of Graph Coloring Application in Labeled 2D Line Drawing Object
PPTX
The state of the art in handwriting synthesis
PPTX
Std 12 Computer Chapter 6 object oriented concept important mcqs
PDF
A study and implementation of the transit route network design problem for a ...
PDF
AUTOMATIC TRAINING DATA SYNTHESIS FOR HANDWRITING RECOGNITION USING THE STRUC...
Ijarcet vol-2-issue-2-337-341
M.Phil Computer Science Image Processing Projects
M.Phil Computer Science Image Processing Projects
A Hybrid Technique for Shape Matching Based on chain code and DFS Tree
M.E Computer Science Image Processing Projects
Role Model of Graph Coloring Application in Labeled 2D Line Drawing Object
The state of the art in handwriting synthesis
Std 12 Computer Chapter 6 object oriented concept important mcqs
A study and implementation of the transit route network design problem for a ...
AUTOMATIC TRAINING DATA SYNTHESIS FOR HANDWRITING RECOGNITION USING THE STRUC...
Ad

Similar to A Review of Relational Machine Learning(SRL) for Knowledge Graphs (20)

PDF
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
PDF
Learning Graph Representation for Data-Efficiency RL
PPTX
GRAPHICAL MODELS (22-5F6) machine learning
PDF
Lecture 07 leonidas guibas - networks of shapes and images
PDF
(DL輪読)Matching Networks for One Shot Learning
PDF
Laplacian-regularized Graph Bandits
PPTX
Deep Graph Contrastive Representation Learning.pptx
PDF
Linearity of Feature Extraction Techniques for Medical Images by using Scale ...
PDF
From Signal to Symbols
PPTX
Responsible AI in Industry: Practical Challenges and Lessons Learned
PDF
Objects Clustering of Movie Using Graph Mining Technique
PDF
algorithms
PPTX
[20240819_LabSeminar_Huy]Learning Decomposed Spatial Relations for Multi-Vari...
PDF
A Novel Feature Selection with Annealing For Computer Vision And Big Data Lea...
PPTX
[20240513_LabSeminar_Huy]GraphFewShort_Transfer.pptx
PPTX
Exploring the potential of deep learning for map generalization
PDF
An experimental evaluation of similarity-based and embedding-based link predi...
PDF
What Makes Graph Queries Difficult?
PDF
A Graph-based Model for multimodal Information Retrieval (partially presented)
PPTX
Attentive Relational Networks for Mapping Images to Scene Graphs
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
Learning Graph Representation for Data-Efficiency RL
GRAPHICAL MODELS (22-5F6) machine learning
Lecture 07 leonidas guibas - networks of shapes and images
(DL輪読)Matching Networks for One Shot Learning
Laplacian-regularized Graph Bandits
Deep Graph Contrastive Representation Learning.pptx
Linearity of Feature Extraction Techniques for Medical Images by using Scale ...
From Signal to Symbols
Responsible AI in Industry: Practical Challenges and Lessons Learned
Objects Clustering of Movie Using Graph Mining Technique
algorithms
[20240819_LabSeminar_Huy]Learning Decomposed Spatial Relations for Multi-Vari...
A Novel Feature Selection with Annealing For Computer Vision And Big Data Lea...
[20240513_LabSeminar_Huy]GraphFewShort_Transfer.pptx
Exploring the potential of deep learning for map generalization
An experimental evaluation of similarity-based and embedding-based link predi...
What Makes Graph Queries Difficult?
A Graph-based Model for multimodal Information Retrieval (partially presented)
Attentive Relational Networks for Mapping Images to Scene Graphs
Ad

Recently uploaded (20)

PDF
. Radiology Case Scenariosssssssssssssss
PPTX
2Systematics of Living Organisms t-.pptx
PDF
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PPTX
Introduction to Fisheries Biotechnology_Lesson 1.pptx
PPTX
Cell Membrane: Structure, Composition & Functions
PPTX
Microbiology with diagram medical studies .pptx
PPTX
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PPTX
BIOMOLECULES PPT........................
PDF
HPLC-PPT.docx high performance liquid chromatography
PPTX
Introduction to Cardiovascular system_structure and functions-1
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
PDF
bbec55_b34400a7914c42429908233dbd381773.pdf
PPTX
Comparative Structure of Integument in Vertebrates.pptx
PPTX
2. Earth - The Living Planet earth and life
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PDF
AlphaEarth Foundations and the Satellite Embedding dataset
PDF
Phytochemical Investigation of Miliusa longipes.pdf
PPTX
Derivatives of integument scales, beaks, horns,.pptx
. Radiology Case Scenariosssssssssssssss
2Systematics of Living Organisms t-.pptx
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
Introduction to Fisheries Biotechnology_Lesson 1.pptx
Cell Membrane: Structure, Composition & Functions
Microbiology with diagram medical studies .pptx
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
BIOMOLECULES PPT........................
HPLC-PPT.docx high performance liquid chromatography
Introduction to Cardiovascular system_structure and functions-1
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
bbec55_b34400a7914c42429908233dbd381773.pdf
Comparative Structure of Integument in Vertebrates.pptx
2. Earth - The Living Planet earth and life
ECG_Course_Presentation د.محمد صقران ppt
AlphaEarth Foundations and the Satellite Embedding dataset
Phytochemical Investigation of Miliusa longipes.pdf
Derivatives of integument scales, beaks, horns,.pptx

A Review of Relational Machine Learning(SRL) for Knowledge Graphs

  • 1. A Review of Relational Machine Learning for Knowledge Graphs Present By: Yalda Akbarzadeh May 2018
  • 2. A Review of Relational Machine Learning for Knowledge Graphs 1. INTRODUCTION 2. KNOWLEDGE GRAPHS A. Knowledge representation B. Open vs. closed world assumption C. Knowledge base construction 3. STATISTICAL RELATIONAL LEARNING FOR KNOWLEDGE GRAPHS D. Uses of knowledge graphs E. Main tasks in knowledge graph construction and curation A. Probabilistic knowledge graphs B. Statistical properties of knowledge graphs C. Types of SRL models 4. LATENT FEATURE MODELS D. Multi-layer perceptrons B. Other tensor factorization models C. Matrix factorization methods A. RESCAL: A bilinear model E. Neural tensor networks F. Latent distance models G. Comparison of models 8. MARKOV RANDOM FIELDS C. Path Ranking Algorithm B. Rule Mining and Inductive Logic Programming A. Similarity measures for uni- relational data 6. COMBINING LATENT AND GRAPH FEATURE MODELS 7. TRAINING SRL MODELS ON KNOWLEDGE GRAPHS 5. GRAPH FEATURE MODELS A. Additive relational effects model B. Other combined models A. Penalized maximum likelihood training B. Where do the negative examples come from? C. Pairwise loss training D. Model selection B. Inference A. Representation C. Learning D. Discussion 9. KNOWLEDGE VAULT 10. EXTENSIONS AND FUTURE WORK D. Discussion B. Hard constraints A. Non-binary relations C. Generalizing to new entities and relations D. Querying probabilistic knowledge graphs 12. APPENDIX 11. CONCLUDING REMARKS D. Querying probabilistic knowledge graphs
  • 3. A Review of Relational Machine Learning for Knowledge Graphs 1. INTRODUCTION 2. STATISTICAL RELATIONAL LEARNING FOR KNOWLEDGE GRAPHS 3. LATENT FEATURE MODELS 4. GRAPH FEATURE MODELS 5. COMBINING LATENT AND GRAPH FEATURE MODELS 6. TRAINING SRL MODELS ON KNOWLEDGE GRAPHS 7. MARKOV RANDOM FIELDS EXTENSIONS AND FUTURE WORK, COCLUSION REFERENCES
  • 4. Traditional machine Learning Algorithms feature vector, numeric or categorical attribute Input Mapping Output Observation tower Class labels, Regression score, Unsupervised cluster id Latent vector (embedding) 1. INTRODUCTION  A. Relational Machine Learning 4
  • 5. feature vector, numeric or categorical attribute Representation of an Object Traditional machine Learning Algorithms Graph, Node and Labeled edges Berlin Munich Düsseldorf 478 Km. 486 Km. 505Km. 1. INTRODUCTION A. Relational Machine Learning 5
  • 6. Some Canonical Tasks >> Collective classification >> Linkprediction >> Link-based clustering prediction of missing edge clustering nodes based on their connectivity pattern prediction of properties of nodes 1. INTRODUCTION A. Relational Machine Learning 6
  • 7. >> PREDICT(new facts (edges) |existing fact) = LEARN SRL Methods(KGs ) Two main SRL Models 1. INTRODUCTION A. Relational Machine Learning 7
  • 8. 1. INTRODUCTION B. KNOWLEDGE GRAPHS 8
  • 9. >> Knowledge graphs model information in the form of entities and relationships between them. Linked Data (subject, predicate, object) :: (SPO) (Leonard Nimoy, played, Spock)  Leonard Nimoy was an actor who played the character Spock in the science-fiction movie Star Trek Semantic Web Linked Data Resource Description Framework subject object predicate 1. INTRODUCTION B. KNOWLEDGE GRAPHS i. Knowledge representation 9
  • 10. >> closed world assumption (CWA), non-existing triples indicate false relationships. Linked Data>> KGs are known to be very incomplete >> RDF and the Semantic Web make the open-world assumption. >> open world assumption (OWA), a non-existing triple is interpreted as unknown. 1. INTRODUCTION B. KNOWLEDGE GRAPHS ii. Open vs. closed world assumption 10 Existing triples are facts What about non-existing triples?
  • 11. Linked Data 1. INTRODUCTION B. KNOWLEDGE GRAPHS iii. Knowledge base construction 11
  • 12. Linked Data 1. INTRODUCTION B. KNOWLEDGE GRAPHS iv. Uses of knowledge graphs 12
  • 13. Linked Data 1. INTRODUCTION  B. KNOWLEDGE GRAPHS  v. Main tasks in KGs construction and curation Link prediction Entity resolution Link-based clustering Kate Ted Jack is sister of isfatherof isfatherof Jack Bonn uni. Worksin uni. Of Bonn Ted Studiesin TjLu Rj Ed Lj Rt Kj Jd Pj 13
  • 14. 2. STATISTICAL RELATIONAL LEARNING FOR KNOWLEDGE GRAPHS 14
  • 15. 2. STATISTICAL RELATIONAL LEARNING FOR KNOWLEDGE GRAPHS A. Probabilistic KGs 15
  • 16. 2. STATISTICAL RELATIONAL LEARNING FOR KNOWLEDGE GRAPHS B. Statistical properties of KGs 16 • Denotes that query answer should follow a type. • Tendency of entities to be related to other entities with similar characteristic. • All the same blocks members have the same relation with other block.
  • 17. 2. STATISTICAL RELATIONAL LEARNING FOR KNOWLEDGE GRAPHS C. Types of SRL models 17 M1) latent features and additional parameters M2) observed graph features and additional parameters  M3) existence of local interactions (Markov Random Fields)
  • 18. 2. STATISTICAL RELATIONAL LEARNING FOR KNOWLEDGE GRAPHS C. Types of SRL models 18 of M1 and M2:
  • 19. 3. LATENT FEATURE MODELS 19 prestigious awardlatent features --> Ex) Guinness received the Academy Award good actor <- good actor -> <- prestigious award -> Relationships between entities can be derived from interactions of their latent features.
  • 20. 3. LATENT FEATURE MODELS A. RESCAL 20 >> RESCAL is a bilinear relational latent feature model which explains triples via pairwise interactions of latent features. This importance is used here: Relationships between entities can be derived from interactions of their latent features.
  • 21. 3. LATENT FEATURE MODELS B. Matrix factorization methods 21 >> Approach for learning from knowledge graphs is based on matrix factorization memory complexity Lose information Reshape
  • 22. 3. LATENT FEATURE MODELS C. Multi-layer-perceptrons 22 L:
  • 23. 3. LATENT FEATURE MODELS C. Multi-layer-perceptrons 23 >> E-MLP model : 02 03 04 Σ
  • 24. 3. LATENT FEATURE MODELS C. Multi-layer-perceptrons 24 It can reduce the number of required parametersIt can reduce the number of required parameters Advantage of E-MLP Disadvantage of E-MLP 02 03 04 Σ
  • 25. 3. LATENT FEATURE MODELS D. Neural tensor networks 25 02 03 04 >> Neural Tensor Network (NTN): Combines traditional MLPs with bilinear models
  • 26. 3. LATENT FEATURE MODELS E. Latent distance models 26 02 03 04 >> Latent distance models or latent space models in social network analysis:  Derive the probability of relationships from the distance between latent representations of entities #--------------------------------------------------------------------------------------------------------------------------------------
  • 27. 4. GRAPH FEATURE MODELS A. Similarity measures for uni-relational data 27 >> Existence of an edge can be predicted by extracting features from the observed edges in the graph. >> GFMs Widely used for link prediction in uni-relational graphs. >> intuition:  similar entities are likely to be related (homophily)  the similarity of entities can be derived from: • neighborhood of nodes • existence of paths between nodes >> Approaches to measure the similarity of entities: John Mary Parent of Anne Parent of (John, marriedTo, Mary) Predict Local similarity Global similarity Quasi-local similarity
  • 28. 4. GRAPH FEATURE MODELS  B. Path Ranking Algorithm 28 >> Using random walks of bounded lengths for predicting links in multi-relational knowledge graphs.
  • 29. 4. GRAPH FEATURE MODELS  B. Path Ranking Algorithm 29 EXAMPLES OF PATHS LEARNED BY PRA ON FREEBASE TO PREDICT WHICH COLLEGE A PERSON ATTENDED Relation Path Weight (draftedBy, school) 2.62 (sibling(s), sibling, education, institution) 1.88 (parents, education, institution) 1.37 (children, education, institution) 1.85 >> key idea in PRA: Use path probabilities as features for predicting the probability of missing edges.
  • 30. 5. COMBINING LATENT AND GRAPH FEATURE MODEL 30 >>Latent feature models:  well-suited for modeling global relational patterns via newly introduced latent variables.  They are computationally efficient if triples can be explained with a small number of latent variables. >>Graph feature models:  are well-suited for modeling local and quasi-local graphs patterns.  They are computationally efficient if triples can be explained from the neighborhood of entities or from short paths in the graph
  • 31. 5. COMBINING LATENT AND GRAPH FEATURE MODEL  Additive Relational Effects models 31
  • 32. 6. TRAINING SRL MODELS ON KNOWLEDGE GRAPHS  A. Penalized maximum likelihood training 32 02 03 04 posterior priorlikelihood
  • 33. 6. TRAINING SRL MODELS ON KNOWLEDGE GRAPHS  B. Where do the negative examples come from? 33 >> Most knowledge graphs only contain positive training examples.
  • 34. 6. TRAINING SRL MODELS ON KNOWLEDGE GRAPHS 34 02 03 04
  • 35. 7. MARKOV RANDOM FIELDS 35  3-vertex cliques
  • 36. 7. MARKOV RANDOM FIELDS 36 02 03 04 >> Form of Markov Random Fields (MRFs):
  • 37. EXTENSIONS AND FUTURE WORK 37 A. Non-binary relations B. Hard constraints: types, functional constraints, and others C. Generalizing to new entities and relations D. Querying probabilistic knowledge graphs E. Trustworthiness of knowledge graphs
  • 38.  Conclusion 38 review of state-of-the-art statistical relational learning (SRL) methods applied to very large knowledge graphs KGs have found important applications in question answering, structured search, exploratory search showed how to create a truly massive, machine-interpretable “semantic memory” of facts
  • 39. REFERENCES 39 [1] M. Nickel, K. Murphy, V. Tresp, and E. Gabrilovich, “A review of relational machine learning for knowledge graphs,” Proceedings of the IEEE, vol. 104, pp. 11-33, 2016. [2] N. Lao, T. Mitchell, and W. W. Cohen, “Random walk inference and learning in a large scale knowledge base,” in Proceedings of the Conference on Empirical Methods in Natural Language Processing, 2011, pp. 529-539. [3] “An Introduction to Statistical Relational Learning” at: www.cs.kuleuven.be/~lucdr/salvador.pdf. [4] “sampling procedure” at: https://www.slideshare.net/anasomoray/chapter-4-sampling-procedure [5] “Non-Hierarchical Clustering” at: https://courses.cs.washington.edu/courses/cse527/01au/oct25/oct25.html [6] https://en.wikipedia.org/ [7] N. Feldman, Learning and overgeneralization patterns in a connectionist model of the German plural: Cite seer, 2005. [8] “Introduction to: Open World Assumption vs Closed World Assumption”, at: http://www.dataversity.net/introduction-to-open-world-assumption-vs-closed-world-assumption/ [9] “Introducing the Knowledge Graph ”, at: https://www.youtube.com/watch?v=mmQl6VGvX-c [10] “What is Linked Data?”, at: https://www.w3.org/standards/semanticweb/data [11] “Resource Description Framework (RDF)”, at: https://www.w3.org/TR/rdf-concepts/