SlideShare a Scribd company logo
Finding Support Sentences for Entities 
Roi Blanco and Hugo Zaragoza 
Yahoo! Research Barcelona 
July, SIGIR 2010
Outline 
• Task definition 
• Features for ranking sentences 
• Context 
• Evaluation 
• SW1 dataset 
• TREC Novelty Track 
• Conclusions
Finding support sentences for entities
www.evri.com
Finding support sentences for entities
Finding support sentences for entities
Entity Ranking 
• Given a topic, find relevant entities 
• Evaluated in TREC and INEX campaigns 
• Most well-known: people and expert search 
• Many other applications: dates, events, 
locations, companies, ...
Support Sentences for Entities 
• We introduce the task explaining the 
relationship between a query and an entity 
• Applications for entity retrieval, expert 
finding, object ranking, etc. 
• We don’t focus on entity ranking, just on 
the explanations 
• Support Sentences: Hqe(s) ~ p(R|q,e,s) 
• What makes a good sentence depends on 
how general entities and queries are and 
their relationship
PPPPiiciciccaaaassssssssoooo a aaannnndddd P PPPeeeeaaaacccceeee 
ss11 ((00..9900)) 
ss22 ((00..8866)) 
ss33 ((00..8833)) 
........ 
ss11 ((00..9900)) 
ss22 ((00..8866)) 
ss33 ((00..8833)) 
........ 
ss11:: PPiiccaassssoo ((00..8811)),, 
OOccttoobbeerr 11888811 
((00..5522)) 
ss22:: SSppaanniisshh CCiivviill 
WWaarr ((00..7733)),, 
GGuueerrnniiccaa ((00..6655)) 
ss11:: PPiiccaassssoo ((00..8811)),, 
OOccttoobbeerr 11888811 
((00..5522)) 
ss22:: SSppaanniisshh CCiivviill 
WWaarr ((00..7733)),, 
GGuueerrnniiccaa ((00..6655)) 
11994444 
PPiiccaassssoo 
SSppaanniisshh CCiivviill WWaarr 
11994444 
PPiiccaassssoo 
SSppaanniisshh CCiivviill WWaarr
“Flying Circus”
Examples 
• Query: Picasso and Peace 
• Entity: 1944 
• 
“In 1944 Picasso joined the French Communist 
Party, attended an international peace 
conference in Poland, and in 1950 received the 
Stalin Peace Prize from the Soviet government.”
Examples 
• Query: Picasso and Peace 
• Entity: Northern Spain 
“Although it was not conceived by the 
author as a representation of the disasters of 
war, but the Nazi bombing of Guernica (a town 
in Northern Spain), it is now considered an 
iconic representation of the disasters of war.”
• Top-k sentenCceso ren-rtanekixngt (we don’t issue 
any subsequent queries) 
• Vocabulary mismatch problem (support 
sentences that do not contain any query 
term) 
• Entity supported must be in the sentence 
• Introduce small windows of context 
sentences
TTiittllee 
CCoonntteexxtt 
SSeenntteennccee
Features for Ranking 
• Top-k sentences 
• Augmented 
• Entity-candidate set 
• Using sentence scores: 
• Sentence score for the [query,sentence], 
BM25 
• Sentence score for the [query, sentence 
+ context], BM25F 
• Position:
• Aggregation of entity scores 
• sum, max, min, average, ... 
• Options for the entity ranker score E(q,e) 
• Frequency 
• Rarity 
• Combination 
• KLD
Evaluation Framework 
• Semantically Annotated Snapshot of the 
English Wikipedia (sentences + 
annotations) 
• 12 types from WSJ tag-set 
• Judges produce a set of queries and remove 
non-relevant entities 
• Evaluate a set of sentences using a 4-grade 
scale 
• 226 (entity,query) with 45 unique queries
Results
Results (augmented)
Novelty Track 
2003 2004
Conclusions 
• We introduced the task of finding support 
sentences for entities (aka “entity 
snippets”) 
• We engineered several features based on 
scores of sentences and entities 
• We developed an evaluation dataset 
• http://barcelona.research.yahoo.net/dokuwiki 
• Evaluated the task and the role of context 
sentences

More Related Content

PPT
Werven Utrecht
PPTX
Motion review
PPTX
RESTful Web Services @AnkaraPHP meetup
PPTX
Nme magazine deconstruction
PDF
Gic2011 aula8-ingles
PPTX
Profound logic 2012
PPTX
Slinky presentation
PDF
#ForoEGovAR | Bases para las Políticas para las Sociedades del Conocimiento
Werven Utrecht
Motion review
RESTful Web Services @AnkaraPHP meetup
Nme magazine deconstruction
Gic2011 aula8-ingles
Profound logic 2012
Slinky presentation
#ForoEGovAR | Bases para las Políticas para las Sociedades del Conocimiento

Viewers also liked (14)

PDF
Gic2011 aula1-ingles
PDF
Dissertation lyn woodward cwm jan 2010
PDF
Gic2011 aula10-ingles
PDF
Hellomynameis,lindsayhoward
PDF
How to use spybot search and destroy
PDF
Tsa presentation
PDF
Gender in NTFP value chains in the Congo Basin
PDF
Gic2011 aula9-ingles
PPTX
HRM CityMatters Survey Results - November 2013
PPT
Mayonn Inc Bus Plan Presentation
PPT
Physical Science Chapter 3 sec1
PPTX
Halifax Economic Review and Renewal - September 2010
PDF
Communications Systems Research
PPTX
Windows 8
Gic2011 aula1-ingles
Dissertation lyn woodward cwm jan 2010
Gic2011 aula10-ingles
Hellomynameis,lindsayhoward
How to use spybot search and destroy
Tsa presentation
Gender in NTFP value chains in the Congo Basin
Gic2011 aula9-ingles
HRM CityMatters Survey Results - November 2013
Mayonn Inc Bus Plan Presentation
Physical Science Chapter 3 sec1
Halifax Economic Review and Renewal - September 2010
Communications Systems Research
Windows 8
Ad

Similar to Finding support sentences for entities (20)

PPTX
QAestro semantic based composition of QA pipelines
PDF
[系列活動] 人工智慧與機器學習在推薦系統上的應用
PDF
Temporal models for mining, ranking and recommendation in the Web
PDF
Recommender System with Distributed Representation
PPTX
Entity Linking in Queries: Tasks and Evaluation
PDF
Tutorial 1 (information retrieval basics)
PPTX
AI3391 Artificial Intelligence session 24 knowledge representation.pptx
PDF
Improving Entity Retrieval on Structured Data
PPT
sa-mincut-aditya.ppt
PDF
Recommender Systems, Matrices and Graphs
PPTX
Deep Learning Bangalore meet up
PPTX
DLBLR talk
PPT
ppt on sentiment analysis using various techniques
PDF
Entity Search: The Last Decade and the Next
PDF
Extensible Operators and Literals for JavaScript
PDF
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
PDF
Ph d sem_1@iitm
PDF
Deep Learning for Personalized Search and Recommender Systems
PDF
The Magical Art of Extracting Meaning From Data
QAestro semantic based composition of QA pipelines
[系列活動] 人工智慧與機器學習在推薦系統上的應用
Temporal models for mining, ranking and recommendation in the Web
Recommender System with Distributed Representation
Entity Linking in Queries: Tasks and Evaluation
Tutorial 1 (information retrieval basics)
AI3391 Artificial Intelligence session 24 knowledge representation.pptx
Improving Entity Retrieval on Structured Data
sa-mincut-aditya.ppt
Recommender Systems, Matrices and Graphs
Deep Learning Bangalore meet up
DLBLR talk
ppt on sentiment analysis using various techniques
Entity Search: The Last Decade and the Next
Extensible Operators and Literals for JavaScript
Information to Wisdom: Commonsense Knowledge Extraction and Compilation - Part 3
Ph d sem_1@iitm
Deep Learning for Personalized Search and Recommender Systems
The Magical Art of Extracting Meaning From Data
Ad

More from Roi Blanco (14)

PPTX
From Queries to Answers in the Web
PDF
Entity Linking via Graph-Distance Minimization
PDF
Introduction to Big Data
PPTX
Mining Web content for Enhanced Search
PPTX
Influence of Timeline and Named-entity Components on User Engagement
PPTX
Introduction to Information Retrieval
PPT
Searching over the past, present and future
PPTX
Beyond document retrieval using semantic annotations
PPT
Keyword Search over RDF Graphs
PPTX
Large-Scale Semantic Search
PDF
Extending BM25 with multiple query operators
PPTX
Energy-Price-Driven Query Processing in Multi-center Web Search Engines
PPTX
Effective and Efficient Entity Search in RDF data
PPT
Caching Search Engine Results over Incremental Indices
From Queries to Answers in the Web
Entity Linking via Graph-Distance Minimization
Introduction to Big Data
Mining Web content for Enhanced Search
Influence of Timeline and Named-entity Components on User Engagement
Introduction to Information Retrieval
Searching over the past, present and future
Beyond document retrieval using semantic annotations
Keyword Search over RDF Graphs
Large-Scale Semantic Search
Extending BM25 with multiple query operators
Energy-Price-Driven Query Processing in Multi-center Web Search Engines
Effective and Efficient Entity Search in RDF data
Caching Search Engine Results over Incremental Indices

Recently uploaded (20)

PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPTX
Cloud computing and distributed systems.
PDF
Machine learning based COVID-19 study performance prediction
PPTX
Machine Learning_overview_presentation.pptx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Empathic Computing: Creating Shared Understanding
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
A Presentation on Artificial Intelligence
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
sap open course for s4hana steps from ECC to s4
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PPTX
Spectroscopy.pptx food analysis technology
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
Assigned Numbers - 2025 - Bluetooth® Document
Cloud computing and distributed systems.
Machine learning based COVID-19 study performance prediction
Machine Learning_overview_presentation.pptx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Empathic Computing: Creating Shared Understanding
The Rise and Fall of 3GPP – Time for a Sabbatical?
A Presentation on Artificial Intelligence
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Advanced methodologies resolving dimensionality complications for autism neur...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
sap open course for s4hana steps from ECC to s4
Programs and apps: productivity, graphics, security and other tools
Reach Out and Touch Someone: Haptics and Empathic Computing
NewMind AI Weekly Chronicles - August'25-Week II
Spectroscopy.pptx food analysis technology
Encapsulation_ Review paper, used for researhc scholars
Dropbox Q2 2025 Financial Results & Investor Presentation

Finding support sentences for entities

  • 1. Finding Support Sentences for Entities Roi Blanco and Hugo Zaragoza Yahoo! Research Barcelona July, SIGIR 2010
  • 2. Outline • Task definition • Features for ranking sentences • Context • Evaluation • SW1 dataset • TREC Novelty Track • Conclusions
  • 7. Entity Ranking • Given a topic, find relevant entities • Evaluated in TREC and INEX campaigns • Most well-known: people and expert search • Many other applications: dates, events, locations, companies, ...
  • 8. Support Sentences for Entities • We introduce the task explaining the relationship between a query and an entity • Applications for entity retrieval, expert finding, object ranking, etc. • We don’t focus on entity ranking, just on the explanations • Support Sentences: Hqe(s) ~ p(R|q,e,s) • What makes a good sentence depends on how general entities and queries are and their relationship
  • 9. PPPPiiciciccaaaassssssssoooo a aaannnndddd P PPPeeeeaaaacccceeee ss11 ((00..9900)) ss22 ((00..8866)) ss33 ((00..8833)) ........ ss11 ((00..9900)) ss22 ((00..8866)) ss33 ((00..8833)) ........ ss11:: PPiiccaassssoo ((00..8811)),, OOccttoobbeerr 11888811 ((00..5522)) ss22:: SSppaanniisshh CCiivviill WWaarr ((00..7733)),, GGuueerrnniiccaa ((00..6655)) ss11:: PPiiccaassssoo ((00..8811)),, OOccttoobbeerr 11888811 ((00..5522)) ss22:: SSppaanniisshh CCiivviill WWaarr ((00..7733)),, GGuueerrnniiccaa ((00..6655)) 11994444 PPiiccaassssoo SSppaanniisshh CCiivviill WWaarr 11994444 PPiiccaassssoo SSppaanniisshh CCiivviill WWaarr
  • 11. Examples • Query: Picasso and Peace • Entity: 1944 • “In 1944 Picasso joined the French Communist Party, attended an international peace conference in Poland, and in 1950 received the Stalin Peace Prize from the Soviet government.”
  • 12. Examples • Query: Picasso and Peace • Entity: Northern Spain “Although it was not conceived by the author as a representation of the disasters of war, but the Nazi bombing of Guernica (a town in Northern Spain), it is now considered an iconic representation of the disasters of war.”
  • 13. • Top-k sentenCceso ren-rtanekixngt (we don’t issue any subsequent queries) • Vocabulary mismatch problem (support sentences that do not contain any query term) • Entity supported must be in the sentence • Introduce small windows of context sentences
  • 15. Features for Ranking • Top-k sentences • Augmented • Entity-candidate set • Using sentence scores: • Sentence score for the [query,sentence], BM25 • Sentence score for the [query, sentence + context], BM25F • Position:
  • 16. • Aggregation of entity scores • sum, max, min, average, ... • Options for the entity ranker score E(q,e) • Frequency • Rarity • Combination • KLD
  • 17. Evaluation Framework • Semantically Annotated Snapshot of the English Wikipedia (sentences + annotations) • 12 types from WSJ tag-set • Judges produce a set of queries and remove non-relevant entities • Evaluate a set of sentences using a 4-grade scale • 226 (entity,query) with 45 unique queries
  • 21. Conclusions • We introduced the task of finding support sentences for entities (aka “entity snippets”) • We engineered several features based on scores of sentences and entities • We developed an evaluation dataset • http://barcelona.research.yahoo.net/dokuwiki • Evaluated the task and the role of context sentences