SlideShare a Scribd company logo
Context-Enhanced Adaptive
Entity Linking
@giusepperizzo
F. Ilievski, G. Rizzo, M. van Erp, J. Plu, R. Troncy
2http://babelfy.org, as of 2016-05-24
Linguistic approach: A text is parsed by a NER classifier. Entity labels are
used to look up resources in a referent KB. A ranking function is used to
select the best match (relatedness, semantic similarity)
End-to-End approach: A dictionary of mentions and links is built from a
referent KB. A text is split in n-grams that are used to look up candidate
links from the dictionary. A selection function is used to pick the best
match (relatedness, semantic similarity, relevance)
Hybrid approach: combination of both
Current approaches performing the NEL task
3
ranking and selection of the candidate links are led by the
relatedness of the entities in the knowledge base
Henri Leland
db:Henry_M._Leland
Lincoln Motor Company
db:Lincoln_Motor_Company
Joe
?
Lincoln
db:Abraham_Lincoln
Cadillac
db:Cadillac
when the context is
poor, head entities are
favoured
4
?
“Henry Leland
… formed the
Lincoln Motor
Company... Joe
drove a Lincoln
for the first
time in his life”
…
Joe PER NIL
Lincoln PRO
db:Lincoln_Motor
_Company
5
Text as
input
Reranking
with
Context
...
Joe PER NIL
Lincoln PRO
db:Lincoln_Motor_
Company
Resolution
and
Classification
Candidate
Selection
Mention
Extraction
General-purpose Hybrid Annotator
6
Text as
input
Resolution
and
Classification
Candidate
Selection
Mention
Extraction
General-purpose Hybrid Annotator
- Longest match
- Entity type
propagation to the
longest match
- Dictionary fuzzy
match
- Entity
popularity
- Named Entity
Recognition
- Proper Noun
Extractor
e1 c1,1, …, c1,10
...
en cn,1, …, cn,10
7
General-purpose Hybrid Annotator (I)
Mention extraction: proper nouns as classified by Stanford POS Tagger (trained with
english-bidirectional-distsim model) and named entities as classified by Stanford
NERClassifierCombiner (trained with CoNLL 2003, MUC 6, MUC 7 corpora)
Resolution and typing
o When one mention is substring of another we take the longest:
o When a part of one mention is a substring of another we do a merge to create a new
one:
POS: (United States, NNPS)
NER: (United States of America, PLACE)
United States of America, PLACE
POS: (United States, NNPS)
NER: (States of America, PLACE)
United States of America, PLACE
Plu et al., Revealing Entities from Textual Documents Using a Hybrid Approach, (ISWC'15) NLP & DBpedia 2015 8
General-purpose Hybrid Annotator (II)
Candidate selection: fuzzy string match over an index based on DBpedia2015-04
o NIL Clustering when no candidates are found. Exact match of labels within the
boundaries of a sentence
o Candidate Ranking if multiple candidates are found.
9
r(l): the score of the label l
L: the Levenshtein distance
m: the extracted mention
title: the title of the label l
R: the set of redirect pages associated to
the label l
D: the set of disambiguation pages
associated to the label l
PR: Pagerank associated to the label l
a, b and c are weights
following the
properties:
● a > b > c
● a + b + c = 1
Text
as
input
Reranking
with
Context
e1 c1,1, …, c1,10
...
en cn,1, …, cn,10
10
e1 c1,1, c1,2, …, c1,10
...
en cn,1, …, cn,9, cn,10
en+1 cn+1
Reranking with Context
Aim: Adapt the linking task to the textual content that is being
analysed
Approach: Leverage the genre and topic domain information about
the text
Apply: 4 heuristics (H1, H2, H3, H4) in cascade. They take the form
of binary rules
11
H1: Order of processing
Process the running text sequentially, starting from the first
sentence. Process the title at the end
Reasoning: Title is typically ambiguous/catchy. The first
sentences of an article are written most explicitly
12
H2: Coherence
Detect if an entity is co-referential (an abbreviation or a
substring) with an entity that occurs previously in the same
news article
Reasoning: Once the writer has clearly introduced an entity,
she can use abbreviations or more ambiguous ways to refer
to it later in text
13
H3: Domain relevance
Use a contextual knowledge base to examine whether a
mention has been frequently and dominantly associated with
a certain entity within a domain
Reasoning: It is customary that the entities mentioned in
domain-specific text stem from the same domain. Also, within
a domain, a mention is typically associated with one
dominant entity
14
H4: Semantic Typing
Check whether the semantic type of the entity resolved by H2
or H3 fits the textual context
Reasoning: The entity should fit the textual context and fulfil
a certain role in text
15
MEANTIME* AIDA-YAGO2**
benchmark the approach
with a corpus composed
of 4 topic-specific gold
standards
test the generalizability of
the approach
16
*Minard et al., MEANTIME, the newsreader multilingual event and time corpus. LREC 2016
**Hoffart et al., Robust Disambiguation of Named Entities in Text. EMNLP 2011
Benchmark corpora
Number
of
Articles
Number of
Tokens
Number
of Entities
Number of
Links
Number
of NILs
Number of
Entity
Types
MEANTI
ME*
airbus 30 3,620 614 414 200 5
apple 30 3,452 812 525 287 5
gm 30 3,641 760 526 234 5
stock 30 3,362 449 331 118 4
AIDA-
YAGO2**
231 46,435 5,616 4,485 1,131 4
17
Corpora statistics
Experimental results
airbus apple gm stock AIDA-YAGO2
P R F1 P R F1 P R F1 P R F1 P R F1
Hybrid 58.7
4
40.5
8
48 19.7
8
10.0
9
13.3
7
50.3
6
26.8
1
34.9
9
59.1
2
32.3
3
41.8 49.1
4
43.4
1
46.1
Hybrid
+H1+H2
59.0
9
40.8
2
48.2
9
19.7
8
10.0
9
13.3
7
55 29.2
8
38.2
1
59.1
2
32.3
3
41.8 48.6
7
43.0
1
45.6
7
Hybrid
+H1+H3
62.5 43.4
8
51.2
8
20.0
7
10.2
9
13.6 63.5
4
34.7
9
44.9
6
66.3
1
37.4
6
47.8
8
57.8
9
52.0
4
54.8
1
Hybrid
+H1+H2+H3
62.1
5
43.2
4
51 20.0
7
10.2
9
13.6 67.3
6
36.8
8
47.6
7
68.9
8
38.9
7
49.8
1
57.6
5
51.8
4
54.5
9
Hybrid
+H1+H2+H3
+H4
61.4
6
42.7
5
50.4
3
20.0
7
10.2
9
13.6 62.1 33.6
5
43.6
5
63.1 35.6
5
45.5
6
55.2
1
49.6
1
52.2
6
Discussion
Reranking with context is effective and brings improvement over the baseline
for all corpora
Improvement also on AIDA-YAGO2, even though it stems from a neutral topic
domain. This is because MEANTIME and AIDA-YAGO2 share the genre domain,
and many of the entities in MEANTIME stem from the neutral domain as well
H1 (Order of Processing) and H3 (Domain Relevance) with these settings are the
most effective heuristics
H4 (Semantic Typing) requires further investigations
19
Future Work
Model the genre and topic domains to contextualize further the entity linking,
i.e. adding more features to improve our adaptive contextual model
Investigate the dynamic adaptability in different contexts using knowledge
bases as inputs
20
Acknowledgements
21

More Related Content

PDF
inteSearch: An Intelligent Linked Data Information Access Framework
PDF
Matching and merging anonymous terms from web sources
PPT
Scalable Discovery Of Hidden Emails From Large Folders
PDF
AINL 2016: Bastrakova, Ledesma, Millan, Zighed
PDF
Detecting Ontological Conflicts in Protocols between Semantic Web Services
PDF
SemFacet paper
PDF
TEXT PLAGIARISM CHECKER USING FRIENDSHIP GRAPHS
PPTX
An approach to source code plagiarism
inteSearch: An Intelligent Linked Data Information Access Framework
Matching and merging anonymous terms from web sources
Scalable Discovery Of Hidden Emails From Large Folders
AINL 2016: Bastrakova, Ledesma, Millan, Zighed
Detecting Ontological Conflicts in Protocols between Semantic Web Services
SemFacet paper
TEXT PLAGIARISM CHECKER USING FRIENDSHIP GRAPHS
An approach to source code plagiarism

What's hot (19)

PPTX
Topic Extraction on Domain Ontology
PDF
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
PPTX
Language Models for Information Retrieval
PDF
Myanmar Named Entity Recognition with Hidden Markov Model
PPTX
PPTX
Lecture 9 - Machine Learning and Support Vector Machines (SVM)
PDF
Miguel Rios - 2015 - Obtaining SMT dictionaries for related languages
PPT
Email Data Cleaning
PDF
Using Deep Learning for Title-Based Semantic Subject Indexing to Reach Compet...
PPT
Boolean Retrieval
PPTX
Argument extraction from news, blog and social media.
DOCX
IEEE 2014 DOTNET DATA MINING PROJECTS A probabilistic approach to string tran...
PDF
Language and Domain Independent Entity Linking with Quantified Collective Val...
PDF
Language Models for Information Retrieval
PPTX
PPTX
Adversarial and reinforcement learning-based approaches to information retrieval
PPTX
Differentiable reasoning over a virtual knowledge base (Dhingra+ 2020) 文献講読
PPT
07 04-06
PPTX
Text categorization
Topic Extraction on Domain Ontology
Exploring Statistical Language Models for Recommender Systems [RecSys '15 DS ...
Language Models for Information Retrieval
Myanmar Named Entity Recognition with Hidden Markov Model
Lecture 9 - Machine Learning and Support Vector Machines (SVM)
Miguel Rios - 2015 - Obtaining SMT dictionaries for related languages
Email Data Cleaning
Using Deep Learning for Title-Based Semantic Subject Indexing to Reach Compet...
Boolean Retrieval
Argument extraction from news, blog and social media.
IEEE 2014 DOTNET DATA MINING PROJECTS A probabilistic approach to string tran...
Language and Domain Independent Entity Linking with Quantified Collective Val...
Language Models for Information Retrieval
Adversarial and reinforcement learning-based approaches to information retrieval
Differentiable reasoning over a virtual knowledge base (Dhingra+ 2020) 文献講読
07 04-06
Text categorization
Ad

Viewers also liked (20)

PDF
Building a Graph-based Analytics Platform
PPTX
Back to the basics 082815
DOC
Panduan Subsidi Pengawas 2007
PPT
Ethiopia
PDF
PropNex Real Estate Market Analysis July 2009
PDF
INSESA SAS
PPT
Kilifi Pictures
PDF
Edisi 22 Maret Aceh
PPT
great Frank lloyd wright
PPTX
Build Your Pipeline in Web 2.0
PDF
ICS Overview Cycle 1 6 9 1 10
PDF
Ukraine job market overview (Tallinn, June 2014)
PDF
University Of Antwerp The Uk Digital Television Landscape Ken Lawrence
PDF
Edisi 14 april aceh
PDF
Edisi 12 Nov Nas
PPT
How Content Strategy can get you a date - Defuse Dublin 2012
PDF
Edisi 6 Des 2009 Nas
PDF
Binder22
PDF
Edisi18 Sep Nasioanl
PPT
Regex Cards - Powerpoint Format
Building a Graph-based Analytics Platform
Back to the basics 082815
Panduan Subsidi Pengawas 2007
Ethiopia
PropNex Real Estate Market Analysis July 2009
INSESA SAS
Kilifi Pictures
Edisi 22 Maret Aceh
great Frank lloyd wright
Build Your Pipeline in Web 2.0
ICS Overview Cycle 1 6 9 1 10
Ukraine job market overview (Tallinn, June 2014)
University Of Antwerp The Uk Digital Television Landscape Ken Lawrence
Edisi 14 april aceh
Edisi 12 Nov Nas
How Content Strategy can get you a date - Defuse Dublin 2012
Edisi 6 Des 2009 Nas
Binder22
Edisi18 Sep Nasioanl
Regex Cards - Powerpoint Format
Ad

Similar to Context-Enhanced Adaptive Entity Linking (20)

PPTX
Entity Linking in Queries: Tasks and Evaluation
PPTX
2015 07-tuto2-clus type
PDF
Perspectives on mining knowledge graphs from text
PPTX
Mining Web content for Enhanced Search
PDF
Entity Linking
PPTX
2011 03 11 (upm) emadrid lsanchez uc3m anotación semántica de texto
PDF
Entity Linking, Link Prediction, and Knowledge Graph Completion
PDF
Entity Linking Combining Open Source Annotators
PDF
From Linked Data to Semantic Applications
PDF
Revealing Entities From Texts With a Hybrid Approach
PDF
Entity Retrieval (SIGIR 2013 tutorial)
PDF
Entity Retrieval (WSDM 2014 tutorial)
PDF
Mini seminar presentation on context-based NED optimization
PDF
Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge...
PDF
A scalable gibbs sampler for probabilistic entity linking
PPTX
Understanding Queries through Entities
PPTX
NLP & DBpedia
PPTX
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...
PDF
Session 1.2 high-precision, context-free entity linking exploiting unambigu...
PDF
SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...
Entity Linking in Queries: Tasks and Evaluation
2015 07-tuto2-clus type
Perspectives on mining knowledge graphs from text
Mining Web content for Enhanced Search
Entity Linking
2011 03 11 (upm) emadrid lsanchez uc3m anotación semántica de texto
Entity Linking, Link Prediction, and Knowledge Graph Completion
Entity Linking Combining Open Source Annotators
From Linked Data to Semantic Applications
Revealing Entities From Texts With a Hybrid Approach
Entity Retrieval (SIGIR 2013 tutorial)
Entity Retrieval (WSDM 2014 tutorial)
Mini seminar presentation on context-based NED optimization
Lessons Learnt from the Named Entity rEcognition and Linking (NEEL) Challenge...
A scalable gibbs sampler for probabilistic entity linking
Understanding Queries through Entities
NLP & DBpedia
Natural Language Processing and Search Intent Understanding C3 Conductor 2019...
Session 1.2 high-precision, context-free entity linking exploiting unambigu...
SwissLink: High-Precision, Context-Free Entity Linking Exploiting Unambiguous...

More from Giuseppe Rizzo (20)

PDF
Artificial intelligence for social good
PDF
AI in 60 minutes
PDF
COMPRENDE, PERSONALIZZA, INTERAGISCE E IMPARA: L’AI COGNITIVA PER L’HR
PDF
Understand, Answer and Argument: Conversational Agents
PDF
AI For Profiling Your Customers
PDF
AI for Personalized Chatbot
PDF
Tourist Knowledge Graph Creation to Automating Travel Bookings
PDF
The SentiME System at the SSA Challenge Task 1
PDF
From Data to Knowledge for Tourists
PDF
Enabling Visitors to Explore a Smart City
PDF
NEEL2015 challenge summary
PDF
Inductive Entity Typing Alignment
PDF
Benchmarking the Extraction and Disambiguation of Named Entities on the Seman...
PDF
CrossLanguageSpotter: A Library for Detecting Relations in Polyglot Frameworks
PDF
Learning with the Web. Structuring data to ease machine understanding
PDF
Learning with the Web: Spotting Named Entities on the intersection of NERD an...
PDF
NERD meets NIF: Lifting NLP Extraction Results to the Linked Data Cloud
PDF
The NERD project
PDF
L'enorme archivio di dati: il Web
PDF
NERD: Evaluating Named Entity Recognition Tools in the Web of Data
Artificial intelligence for social good
AI in 60 minutes
COMPRENDE, PERSONALIZZA, INTERAGISCE E IMPARA: L’AI COGNITIVA PER L’HR
Understand, Answer and Argument: Conversational Agents
AI For Profiling Your Customers
AI for Personalized Chatbot
Tourist Knowledge Graph Creation to Automating Travel Bookings
The SentiME System at the SSA Challenge Task 1
From Data to Knowledge for Tourists
Enabling Visitors to Explore a Smart City
NEEL2015 challenge summary
Inductive Entity Typing Alignment
Benchmarking the Extraction and Disambiguation of Named Entities on the Seman...
CrossLanguageSpotter: A Library for Detecting Relations in Polyglot Frameworks
Learning with the Web. Structuring data to ease machine understanding
Learning with the Web: Spotting Named Entities on the intersection of NERD an...
NERD meets NIF: Lifting NLP Extraction Results to the Linked Data Cloud
The NERD project
L'enorme archivio di dati: il Web
NERD: Evaluating Named Entity Recognition Tools in the Web of Data

Recently uploaded (20)

PDF
.pdf is not working space design for the following data for the following dat...
PDF
Fluorescence-microscope_Botany_detailed content
PDF
Business Analytics and business intelligence.pdf
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PDF
Mega Projects Data Mega Projects Data
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PDF
annual-report-2024-2025 original latest.
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PPTX
climate analysis of Dhaka ,Banglades.pptx
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
.pdf is not working space design for the following data for the following dat...
Fluorescence-microscope_Botany_detailed content
Business Analytics and business intelligence.pdf
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
Mega Projects Data Mega Projects Data
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
IBA_Chapter_11_Slides_Final_Accessible.pptx
Miokarditis (Inflamasi pada Otot Jantung)
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Introduction to Knowledge Engineering Part 1
Qualitative Qantitative and Mixed Methods.pptx
annual-report-2024-2025 original latest.
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
climate analysis of Dhaka ,Banglades.pptx
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
ISS -ESG Data flows What is ESG and HowHow
Business Ppt On Nestle.pptx huunnnhhgfvu

Context-Enhanced Adaptive Entity Linking

  • 1. Context-Enhanced Adaptive Entity Linking @giusepperizzo F. Ilievski, G. Rizzo, M. van Erp, J. Plu, R. Troncy
  • 3. Linguistic approach: A text is parsed by a NER classifier. Entity labels are used to look up resources in a referent KB. A ranking function is used to select the best match (relatedness, semantic similarity) End-to-End approach: A dictionary of mentions and links is built from a referent KB. A text is split in n-grams that are used to look up candidate links from the dictionary. A selection function is used to pick the best match (relatedness, semantic similarity, relevance) Hybrid approach: combination of both Current approaches performing the NEL task 3
  • 4. ranking and selection of the candidate links are led by the relatedness of the entities in the knowledge base Henri Leland db:Henry_M._Leland Lincoln Motor Company db:Lincoln_Motor_Company Joe ? Lincoln db:Abraham_Lincoln Cadillac db:Cadillac when the context is poor, head entities are favoured 4 ?
  • 5. “Henry Leland … formed the Lincoln Motor Company... Joe drove a Lincoln for the first time in his life” … Joe PER NIL Lincoln PRO db:Lincoln_Motor _Company 5
  • 6. Text as input Reranking with Context ... Joe PER NIL Lincoln PRO db:Lincoln_Motor_ Company Resolution and Classification Candidate Selection Mention Extraction General-purpose Hybrid Annotator 6
  • 7. Text as input Resolution and Classification Candidate Selection Mention Extraction General-purpose Hybrid Annotator - Longest match - Entity type propagation to the longest match - Dictionary fuzzy match - Entity popularity - Named Entity Recognition - Proper Noun Extractor e1 c1,1, …, c1,10 ... en cn,1, …, cn,10 7
  • 8. General-purpose Hybrid Annotator (I) Mention extraction: proper nouns as classified by Stanford POS Tagger (trained with english-bidirectional-distsim model) and named entities as classified by Stanford NERClassifierCombiner (trained with CoNLL 2003, MUC 6, MUC 7 corpora) Resolution and typing o When one mention is substring of another we take the longest: o When a part of one mention is a substring of another we do a merge to create a new one: POS: (United States, NNPS) NER: (United States of America, PLACE) United States of America, PLACE POS: (United States, NNPS) NER: (States of America, PLACE) United States of America, PLACE Plu et al., Revealing Entities from Textual Documents Using a Hybrid Approach, (ISWC'15) NLP & DBpedia 2015 8
  • 9. General-purpose Hybrid Annotator (II) Candidate selection: fuzzy string match over an index based on DBpedia2015-04 o NIL Clustering when no candidates are found. Exact match of labels within the boundaries of a sentence o Candidate Ranking if multiple candidates are found. 9 r(l): the score of the label l L: the Levenshtein distance m: the extracted mention title: the title of the label l R: the set of redirect pages associated to the label l D: the set of disambiguation pages associated to the label l PR: Pagerank associated to the label l a, b and c are weights following the properties: ● a > b > c ● a + b + c = 1
  • 10. Text as input Reranking with Context e1 c1,1, …, c1,10 ... en cn,1, …, cn,10 10 e1 c1,1, c1,2, …, c1,10 ... en cn,1, …, cn,9, cn,10 en+1 cn+1
  • 11. Reranking with Context Aim: Adapt the linking task to the textual content that is being analysed Approach: Leverage the genre and topic domain information about the text Apply: 4 heuristics (H1, H2, H3, H4) in cascade. They take the form of binary rules 11
  • 12. H1: Order of processing Process the running text sequentially, starting from the first sentence. Process the title at the end Reasoning: Title is typically ambiguous/catchy. The first sentences of an article are written most explicitly 12
  • 13. H2: Coherence Detect if an entity is co-referential (an abbreviation or a substring) with an entity that occurs previously in the same news article Reasoning: Once the writer has clearly introduced an entity, she can use abbreviations or more ambiguous ways to refer to it later in text 13
  • 14. H3: Domain relevance Use a contextual knowledge base to examine whether a mention has been frequently and dominantly associated with a certain entity within a domain Reasoning: It is customary that the entities mentioned in domain-specific text stem from the same domain. Also, within a domain, a mention is typically associated with one dominant entity 14
  • 15. H4: Semantic Typing Check whether the semantic type of the entity resolved by H2 or H3 fits the textual context Reasoning: The entity should fit the textual context and fulfil a certain role in text 15
  • 16. MEANTIME* AIDA-YAGO2** benchmark the approach with a corpus composed of 4 topic-specific gold standards test the generalizability of the approach 16 *Minard et al., MEANTIME, the newsreader multilingual event and time corpus. LREC 2016 **Hoffart et al., Robust Disambiguation of Named Entities in Text. EMNLP 2011 Benchmark corpora
  • 17. Number of Articles Number of Tokens Number of Entities Number of Links Number of NILs Number of Entity Types MEANTI ME* airbus 30 3,620 614 414 200 5 apple 30 3,452 812 525 287 5 gm 30 3,641 760 526 234 5 stock 30 3,362 449 331 118 4 AIDA- YAGO2** 231 46,435 5,616 4,485 1,131 4 17 Corpora statistics
  • 18. Experimental results airbus apple gm stock AIDA-YAGO2 P R F1 P R F1 P R F1 P R F1 P R F1 Hybrid 58.7 4 40.5 8 48 19.7 8 10.0 9 13.3 7 50.3 6 26.8 1 34.9 9 59.1 2 32.3 3 41.8 49.1 4 43.4 1 46.1 Hybrid +H1+H2 59.0 9 40.8 2 48.2 9 19.7 8 10.0 9 13.3 7 55 29.2 8 38.2 1 59.1 2 32.3 3 41.8 48.6 7 43.0 1 45.6 7 Hybrid +H1+H3 62.5 43.4 8 51.2 8 20.0 7 10.2 9 13.6 63.5 4 34.7 9 44.9 6 66.3 1 37.4 6 47.8 8 57.8 9 52.0 4 54.8 1 Hybrid +H1+H2+H3 62.1 5 43.2 4 51 20.0 7 10.2 9 13.6 67.3 6 36.8 8 47.6 7 68.9 8 38.9 7 49.8 1 57.6 5 51.8 4 54.5 9 Hybrid +H1+H2+H3 +H4 61.4 6 42.7 5 50.4 3 20.0 7 10.2 9 13.6 62.1 33.6 5 43.6 5 63.1 35.6 5 45.5 6 55.2 1 49.6 1 52.2 6
  • 19. Discussion Reranking with context is effective and brings improvement over the baseline for all corpora Improvement also on AIDA-YAGO2, even though it stems from a neutral topic domain. This is because MEANTIME and AIDA-YAGO2 share the genre domain, and many of the entities in MEANTIME stem from the neutral domain as well H1 (Order of Processing) and H3 (Domain Relevance) with these settings are the most effective heuristics H4 (Semantic Typing) requires further investigations 19
  • 20. Future Work Model the genre and topic domains to contextualize further the entity linking, i.e. adding more features to improve our adaptive contextual model Investigate the dynamic adaptability in different contexts using knowledge bases as inputs 20