SlideShare a Scribd company logo
Type-aware Entity Retrieval
Darío Garigliotti
IAI
Universitetet i Stavanger (UiS)
NTNU - March 3, 2016
From Information Retrieval to Entity
Retrieval
• Traditional Information Retrieval recently extended
to an Entity-oriented Search
• Satisfaction of more complex information needs
• Current support on search engines
From Information Retrieval to Entity
Retrieval
From Information Retrieval to Entity
Retrieval
Countries where one can pay with the euro
Entity Retrieval
Countries where one can pay with the euro
• Related entities (via a relation or predicate)
• Types or categories or classes
Entity Retrieval
Countries where one can pay with the euro
Impressionist art museums in The Netherlands
• Related entities (via a relation or predicate)
• Types or categories or classes
Entity Retrieval
Countries where one can pay with the euro
Impressionist art museums in The Netherlands
• Related entities (via a relation or predicate)
• Types or categories or classes
Entity Retrieval
Countries where one can pay with the euro
Impressionist art museums in The Netherlands
• Related entities (via a relation or predicate)
• Types or categories or classes
Entity Retrieval
Entity Retrieval
Evaluated tasks
• Entity ranking (given a textual query and target categories)
• List completion (given Q and entity examples, and? types)
• Related entity finding (given entity E, relation R and type T)
e.g. E = "Schumacher", R = "His teammates when he was on
Ferrari", T = "Person"
from Q = "Schumacher teammates when he was on Ferrari"
Type-aware entity retrieval
Our research questions
1. How to represent type-based information?
2. How to combine type-based and textual
information?
3. How to estimate type-based information?
Type-aware entity retrieval
RQ2. How to combine type-based and textual information?
• Basics: term-based models
• A variety of related tasks across the literature
• Entity retrieval approaches
• Where to look for entities? How to find them? How to rank
them?
• Major model families
• Common main insight: types help!
Type-aware entity retrieval
RQ1. How to represent type-based information?
• Dimensions we identified
• type taxonomies
• hierarchical structure
• dataset version
• Minimal concerning in the related work
Type taxonomies
• We consider four well-known type taxonomies
Type system Wikipedia DBpedia Freebase YAGO
#types 753,524 591 1719 568,672
#top-level types NA 58 92 61
#most-specific-level types 753,524 472 1626 549,623
depth NA 7 2 19
entities w/ type 4.12M 3.24M 3.77M 2.89M
avg #types/entity 4.02 6.30 9.57 16.44
Type representation
• We consider different ways of modeling type
assignments:
Top level, most specific level, and path-to-top
r
e
r
e
r
e
Experimental setup
• Our experimental environment looks like this:
Term-based representation
Query model Entity model Query model Entity model
Type-based representation
P(e|q) / P(q|e)P(e)
p(t|✓T
e )p(t|✓T
q )
KL(✓T
q ||✓T
e )
p(t0
| ✓T 0
e )p(t0
| ✓T 0
q )
KL(✓T 0
q k ✓T 0
e )
P(q | e) = (1 )P(✓T 0
q | ✓T 0
e ) + P(✓T
q | ✓T
e )
Experimental setup
• Term-based component: Mixture of LM method
• We obtain combinations of these elements:
• Type taxonomies
• Models
• Type-based representations
Ingredients
• Model instantiations for
• M1 (Mixture):
• M2 (Multiplicative):
• M3 (Filtering):
P(e | q) / P(q | e)P(e)
P(q | e) = (1 )P(✓T 0
q | ✓T 0
e ) + P(✓T
q | ✓T
e )
P(q | e) = P(✓T 0
q | ✓T 0
e )P(✓T
q | ✓T
e )
P(✓T
q | ✓T
e ) 2 {0, 1}
Ingredients
• Query model for the type-based representation
is provided by a target types oracle
P(t|✓T
q )
Query: guitar origin blues DBpedia Types:
<dbo:Album>: 4
<dbo:MusicalArtist>: 43
...
Freebase Types:
<fb:music.group_member>: 34
<fb:people.deceased_person>: 17
...
Wikipedia Categories:
<dbpedia:Category:Blues_musicians_from_New_Orleans,_Louisiana>: 2
<dbpedia:Category:Blues_songs>: 2
...
Ingredients
• Our experimental environment looks like this:
Query model Entity model
Type-based representation
P(e|q) / P(q|e)P(e)
p(t|✓T
e )p(t|✓T
q )
KL(✓T
q ||✓T
e )
P(q | e) = (1 )P(✓T 0
q | ✓T 0
e ) + P(✓T
q | ✓T
e )
Ingredients
• Entity model for the type-based representation is a
distribution estimated through the entity types
Query: guitar origin blues
Relevant entities:
<dbpedia:The_Merle_Travis_Guitar>
<dbpedia:Blues_Breakers_with_Eric_Clapton>
<dbpedia:Poor_Boy_Blues>
...
...
Freebase Types:
... DBpedia Types:
<dbo:Album>
<dbo:MusicalWork>
...
...
Freebase Types:
... Wikipedia Categories:
<Category:1950_albums>
<Category:Merle_Travis_albums>
...
Results (1)
RQ1. How to represent
type-based information?
Type representation - Model M1
MAP
0
0.058
0.115
0.173
0.23
all assigned types most specific level path-to-top top level
YAGO Freebase Wikipedia DBpedia
Type representation - Model M2
MAP
0
0.045
0.09
0.135
0.18
all assigned types most specific level path-to-top top level
YAGO Freebase Wikipedia DBpedia
Type representation - Model M3
MAP
0
0.055
0.11
0.165
0.22
all assigned types most specific level path-to-top top level
YAGO Freebase Wikipedia DBpedia
Results (2)
RQ2. How to combine type-based and textual information?
Combining information - All assigned types
MAP
0
0.06
0.12
0.18
0.24
YAGO Freebase Wikipedia DBpedia
M1 M2 M3
Combining information - Most-specific-level types
MAP
0
0.06
0.12
0.18
0.24
YAGO Freebase Wikipedia DBpedia
M1 M2 M3
Future work
RQ3: How to estimate type-based information?
Term-based representation
Query model Entity model Query model Entity model
Type-based representation
P(e|q) / P(q|e)P(e)
p(t|✓T
e )p(t|✓T
q )
KL(✓T
q ||✓T
e )
p(t0
| ✓T 0
e )p(t0
| ✓T 0
q )
KL(✓T 0
q k ✓T 0
e )
P(q | e) = (1 )P(✓T 0
q | ✓T 0
e ) + P(✓T
q | ✓T
e )
Future work
• Main focus will be on query typing, but eventually
on entity typing as well
• How to take the best from different type taxonomies

More Related Content

PPT
How to Use WorldCat
PPTX
Semantic Application for Healthcare
PPTX
Issues and activities in authoring ontologies
PDF
Presentation shexer
PPTX
Nursing under graduate 2014
PPTX
Using OWL for the RESO Data Dictionary
PPTX
LIS 653, Session 3: Principles and Standards
PPTX
Library resources & skills - 2019
How to Use WorldCat
Semantic Application for Healthcare
Issues and activities in authoring ontologies
Presentation shexer
Nursing under graduate 2014
Using OWL for the RESO Data Dictionary
LIS 653, Session 3: Principles and Standards
Library resources & skills - 2019

What's hot (20)

PPT
Importing life science at a into Neo4j
PPTX
LIS 653, Session 8: Radical Cataloging
PPT
Web ontology language (owl)
PPTX
Kampmeier ecn 2012
PPTX
LIS 653, Session 6: FRBR & Relationships
PPTX
Semantics as a service at EMBL-EBI
PPTX
Advanced information and research skills for music
PPTX
Semantic web Technology
PPTX
How to access databases
PPT
NCBO SPARQL Endpoint
PPTX
New INSPIRE (basic) 12-2015
PPTX
Ontology and Ontology Libraries: a critical study
PPTX
Domain Identification for Linked Open Data
PDF
Lo5
PDF
Orcid charleston presentation 110410
PPTX
Historic collections for researchers (November 2013)
PPTX
Presentation of the INVENiT Expert Meeting on Monday 16 February 2015
PPSX
Introduction to RDF
PPT
Search First Steps
PDF
Changing Data: Implementing Primo for the Tri University Group of Libraries (...
Importing life science at a into Neo4j
LIS 653, Session 8: Radical Cataloging
Web ontology language (owl)
Kampmeier ecn 2012
LIS 653, Session 6: FRBR & Relationships
Semantics as a service at EMBL-EBI
Advanced information and research skills for music
Semantic web Technology
How to access databases
NCBO SPARQL Endpoint
New INSPIRE (basic) 12-2015
Ontology and Ontology Libraries: a critical study
Domain Identification for Linked Open Data
Lo5
Orcid charleston presentation 110410
Historic collections for researchers (November 2013)
Presentation of the INVENiT Expert Meeting on Monday 16 February 2015
Introduction to RDF
Search First Steps
Changing Data: Implementing Primo for the Tri University Group of Libraries (...
Ad

Similar to Type-Aware Entity Retrieval (20)

PDF
On Type-Aware Entity Retrieval
PDF
Type-Aware Entity Retrieval
PDF
Type-Aware Entity Retrieval
PDF
Type-Aware Entity Retrieval
PDF
Task-Based Support in Search Engines
PDF
Task-Based Information Retrieval
PDF
Entity Retrieval (WSDM 2014 tutorial)
PDF
Entity Retrieval (WWW 2013 tutorial)
PDF
A Semantic Search Approach to Task-Completion Engines
PDF
Improving Entity Retrieval on Structured Data
PDF
Entity Retrieval (SIGIR 2013 tutorial)
PDF
Type Information in Entity Retrieval
PDF
Entity Retrieval (tutorial organized by Radialpoint in Montreal)
PDF
Evaluation Initiatives for Entity-oriented Search
PDF
Multilingual Fine-grained Entity Typing
PDF
Entity Search: The Last Decade and the Next
PPT
Addmi 07-taxonomy
PPT
Extensible and Dynamic Topic Types For DDS (out of date)
PDF
Entity Typing Using Distributional Semantics and DBpedia
PDF
Parameterized Fielded Term Dependence Models for Ad-hoc Entity Retrieval from...
On Type-Aware Entity Retrieval
Type-Aware Entity Retrieval
Type-Aware Entity Retrieval
Type-Aware Entity Retrieval
Task-Based Support in Search Engines
Task-Based Information Retrieval
Entity Retrieval (WSDM 2014 tutorial)
Entity Retrieval (WWW 2013 tutorial)
A Semantic Search Approach to Task-Completion Engines
Improving Entity Retrieval on Structured Data
Entity Retrieval (SIGIR 2013 tutorial)
Type Information in Entity Retrieval
Entity Retrieval (tutorial organized by Radialpoint in Montreal)
Evaluation Initiatives for Entity-oriented Search
Multilingual Fine-grained Entity Typing
Entity Search: The Last Decade and the Next
Addmi 07-taxonomy
Extensible and Dynamic Topic Types For DDS (out of date)
Entity Typing Using Distributional Semantics and DBpedia
Parameterized Fielded Term Dependence Models for Ad-hoc Entity Retrieval from...
Ad

More from Darío Garigliotti (20)

PDF
Task Recommendation
PDF
About "Towards Better Text Understanding and Retrieval through Kernel Entity ...
PDF
A Summary of ECIR'18
PDF
A Semantic Search Approach to Task-Completion Engines
PDF
A Knowledge Base of Entity-Oriented Search Intents
PDF
Learning-to-Rank Target Types for Entity-Bearing Queries
PDF
Dive into Deep Learning
PDF
If this is the answer, what was the question?
PDF
Semi-supervised Learning for Word Sense Disambiguation
PDF
Semi-supervised Learning for Word Sense Disambiguation
PDF
Semi-supervised Learning for Word Sense Disambiguation
PDF
FACT-IR. Fairness, Accountability, Confidentiality and Transparency in Inform...
PDF
Machine Learning - Clustering
PDF
Machine Learning - Classification (ctd.)
PDF
Machine Learning - Classification
PDF
Data Mining - Exploring Data
PDF
Data Mining - Introduction and Data
PDF
Predicate Logic
PDF
Patterns, Automata and Regular Expressions
PDF
The List Data Model
Task Recommendation
About "Towards Better Text Understanding and Retrieval through Kernel Entity ...
A Summary of ECIR'18
A Semantic Search Approach to Task-Completion Engines
A Knowledge Base of Entity-Oriented Search Intents
Learning-to-Rank Target Types for Entity-Bearing Queries
Dive into Deep Learning
If this is the answer, what was the question?
Semi-supervised Learning for Word Sense Disambiguation
Semi-supervised Learning for Word Sense Disambiguation
Semi-supervised Learning for Word Sense Disambiguation
FACT-IR. Fairness, Accountability, Confidentiality and Transparency in Inform...
Machine Learning - Clustering
Machine Learning - Classification (ctd.)
Machine Learning - Classification
Data Mining - Exploring Data
Data Mining - Introduction and Data
Predicate Logic
Patterns, Automata and Regular Expressions
The List Data Model

Recently uploaded (20)

PPTX
Microbes in human welfare class 12 .pptx
PPTX
Biomechanics of the Hip - Basic Science.pptx
PPTX
TORCH INFECTIONS in pregnancy with toxoplasma
PPTX
Seminar Hypertension and Kidney diseases.pptx
PPTX
Welcome-grrewfefweg-students-of-2024.pptx
PPT
Heredity-grade-9 Heredity-grade-9. Heredity-grade-9.
PDF
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
PPTX
ap-psych-ch-1-introduction-to-psychology-presentation.pptx
PPTX
POULTRY PRODUCTION AND MANAGEMENTNNN.pptx
PDF
Unit 5 Preparations, Reactions, Properties and Isomersim of Organic Compounds...
PPT
Presentation of a Romanian Institutee 2.
PPTX
INTRODUCTION TO PAEDIATRICS AND PAEDIATRIC HISTORY TAKING-1.pptx
PDF
lecture 2026 of Sjogren's syndrome l .pdf
PPTX
PMR- PPT.pptx for students and doctors tt
PDF
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
PPT
Mutation in dna of bacteria and repairss
PDF
S2 SOIL BY TR. OKION.pdf based on the new lower secondary curriculum
PDF
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
PDF
Communicating Health Policies to Diverse Populations (www.kiu.ac.ug)
PPT
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine
Microbes in human welfare class 12 .pptx
Biomechanics of the Hip - Basic Science.pptx
TORCH INFECTIONS in pregnancy with toxoplasma
Seminar Hypertension and Kidney diseases.pptx
Welcome-grrewfefweg-students-of-2024.pptx
Heredity-grade-9 Heredity-grade-9. Heredity-grade-9.
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
ap-psych-ch-1-introduction-to-psychology-presentation.pptx
POULTRY PRODUCTION AND MANAGEMENTNNN.pptx
Unit 5 Preparations, Reactions, Properties and Isomersim of Organic Compounds...
Presentation of a Romanian Institutee 2.
INTRODUCTION TO PAEDIATRICS AND PAEDIATRIC HISTORY TAKING-1.pptx
lecture 2026 of Sjogren's syndrome l .pdf
PMR- PPT.pptx for students and doctors tt
Is Earendel a Star Cluster?: Metal-poor Globular Cluster Progenitors at z ∼ 6
Mutation in dna of bacteria and repairss
S2 SOIL BY TR. OKION.pdf based on the new lower secondary curriculum
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
Communicating Health Policies to Diverse Populations (www.kiu.ac.ug)
1. INTRODUCTION TO EPIDEMIOLOGY.pptx for community medicine

Type-Aware Entity Retrieval

  • 1. Type-aware Entity Retrieval Darío Garigliotti IAI Universitetet i Stavanger (UiS) NTNU - March 3, 2016
  • 2. From Information Retrieval to Entity Retrieval • Traditional Information Retrieval recently extended to an Entity-oriented Search • Satisfaction of more complex information needs • Current support on search engines
  • 3. From Information Retrieval to Entity Retrieval
  • 4. From Information Retrieval to Entity Retrieval
  • 5. Countries where one can pay with the euro Entity Retrieval
  • 6. Countries where one can pay with the euro • Related entities (via a relation or predicate) • Types or categories or classes Entity Retrieval
  • 7. Countries where one can pay with the euro Impressionist art museums in The Netherlands • Related entities (via a relation or predicate) • Types or categories or classes Entity Retrieval
  • 8. Countries where one can pay with the euro Impressionist art museums in The Netherlands • Related entities (via a relation or predicate) • Types or categories or classes Entity Retrieval
  • 9. Countries where one can pay with the euro Impressionist art museums in The Netherlands • Related entities (via a relation or predicate) • Types or categories or classes Entity Retrieval
  • 10. Entity Retrieval Evaluated tasks • Entity ranking (given a textual query and target categories) • List completion (given Q and entity examples, and? types) • Related entity finding (given entity E, relation R and type T) e.g. E = "Schumacher", R = "His teammates when he was on Ferrari", T = "Person" from Q = "Schumacher teammates when he was on Ferrari"
  • 11. Type-aware entity retrieval Our research questions 1. How to represent type-based information? 2. How to combine type-based and textual information? 3. How to estimate type-based information?
  • 12. Type-aware entity retrieval RQ2. How to combine type-based and textual information? • Basics: term-based models • A variety of related tasks across the literature • Entity retrieval approaches • Where to look for entities? How to find them? How to rank them? • Major model families • Common main insight: types help!
  • 13. Type-aware entity retrieval RQ1. How to represent type-based information? • Dimensions we identified • type taxonomies • hierarchical structure • dataset version • Minimal concerning in the related work
  • 14. Type taxonomies • We consider four well-known type taxonomies Type system Wikipedia DBpedia Freebase YAGO #types 753,524 591 1719 568,672 #top-level types NA 58 92 61 #most-specific-level types 753,524 472 1626 549,623 depth NA 7 2 19 entities w/ type 4.12M 3.24M 3.77M 2.89M avg #types/entity 4.02 6.30 9.57 16.44
  • 15. Type representation • We consider different ways of modeling type assignments: Top level, most specific level, and path-to-top r e r e r e
  • 16. Experimental setup • Our experimental environment looks like this: Term-based representation Query model Entity model Query model Entity model Type-based representation P(e|q) / P(q|e)P(e) p(t|✓T e )p(t|✓T q ) KL(✓T q ||✓T e ) p(t0 | ✓T 0 e )p(t0 | ✓T 0 q ) KL(✓T 0 q k ✓T 0 e ) P(q | e) = (1 )P(✓T 0 q | ✓T 0 e ) + P(✓T q | ✓T e )
  • 17. Experimental setup • Term-based component: Mixture of LM method • We obtain combinations of these elements: • Type taxonomies • Models • Type-based representations
  • 18. Ingredients • Model instantiations for • M1 (Mixture): • M2 (Multiplicative): • M3 (Filtering): P(e | q) / P(q | e)P(e) P(q | e) = (1 )P(✓T 0 q | ✓T 0 e ) + P(✓T q | ✓T e ) P(q | e) = P(✓T 0 q | ✓T 0 e )P(✓T q | ✓T e ) P(✓T q | ✓T e ) 2 {0, 1}
  • 19. Ingredients • Query model for the type-based representation is provided by a target types oracle P(t|✓T q ) Query: guitar origin blues DBpedia Types: <dbo:Album>: 4 <dbo:MusicalArtist>: 43 ... Freebase Types: <fb:music.group_member>: 34 <fb:people.deceased_person>: 17 ... Wikipedia Categories: <dbpedia:Category:Blues_musicians_from_New_Orleans,_Louisiana>: 2 <dbpedia:Category:Blues_songs>: 2 ...
  • 20. Ingredients • Our experimental environment looks like this: Query model Entity model Type-based representation P(e|q) / P(q|e)P(e) p(t|✓T e )p(t|✓T q ) KL(✓T q ||✓T e ) P(q | e) = (1 )P(✓T 0 q | ✓T 0 e ) + P(✓T q | ✓T e )
  • 21. Ingredients • Entity model for the type-based representation is a distribution estimated through the entity types Query: guitar origin blues Relevant entities: <dbpedia:The_Merle_Travis_Guitar> <dbpedia:Blues_Breakers_with_Eric_Clapton> <dbpedia:Poor_Boy_Blues> ... ... Freebase Types: ... DBpedia Types: <dbo:Album> <dbo:MusicalWork> ... ... Freebase Types: ... Wikipedia Categories: <Category:1950_albums> <Category:Merle_Travis_albums> ...
  • 22. Results (1) RQ1. How to represent type-based information? Type representation - Model M1 MAP 0 0.058 0.115 0.173 0.23 all assigned types most specific level path-to-top top level YAGO Freebase Wikipedia DBpedia Type representation - Model M2 MAP 0 0.045 0.09 0.135 0.18 all assigned types most specific level path-to-top top level YAGO Freebase Wikipedia DBpedia Type representation - Model M3 MAP 0 0.055 0.11 0.165 0.22 all assigned types most specific level path-to-top top level YAGO Freebase Wikipedia DBpedia
  • 23. Results (2) RQ2. How to combine type-based and textual information? Combining information - All assigned types MAP 0 0.06 0.12 0.18 0.24 YAGO Freebase Wikipedia DBpedia M1 M2 M3 Combining information - Most-specific-level types MAP 0 0.06 0.12 0.18 0.24 YAGO Freebase Wikipedia DBpedia M1 M2 M3
  • 24. Future work RQ3: How to estimate type-based information? Term-based representation Query model Entity model Query model Entity model Type-based representation P(e|q) / P(q|e)P(e) p(t|✓T e )p(t|✓T q ) KL(✓T q ||✓T e ) p(t0 | ✓T 0 e )p(t0 | ✓T 0 q ) KL(✓T 0 q k ✓T 0 e ) P(q | e) = (1 )P(✓T 0 q | ✓T 0 e ) + P(✓T q | ✓T e )
  • 25. Future work • Main focus will be on query typing, but eventually on entity typing as well • How to take the best from different type taxonomies