Summary of Papers of  SIGIR 2011 Workshop on Query Representation and Understanding Chetana Gavankar
Ricardo Campos, Alipio Jorge, Gael Dias:  "Using Web Snippets and Query-logs to Measure Implicit Temporal Intents in Queries"
Temporal queries 1.  Atemporal : Queries not sensitive to time like  plan my trip 2. Temporal unambiguous : Queries in concrete time  period. Ex : Haiti earthquake in 2010 3.  Temporal ambiguous : queries with multiple instances over  time. Ex : Cricket worldcup which occurs every four years.
Web snippets and Query Logs Content-Related Resources , based on a web content approach Simply requires the set of web search results. Query-Log Resources , based on similar year-qualified queries Imply that some versions of the query have already been issued.
1. Web snippets ( temporal evidence within web pages): TA(q)= ∑ f ε I  w f  f(q)  I = {Tsnippet(.),TTitle(.),TUrl(.)} Value each feature differently using  w f  18.14 for TTitles, 50.91 for TSnippets and 30.95 for Turl(.) If TA(q) value < 10% then Atemporal.  Dates appearing in query & docs may not match. # Snippets Retrieved with Dates Identifying implicit temporal queries TSnippets = # Snippets Retrieved
Identifying implicit temporal queries 2.Web Query Logs : Temporal activity can be recorded from date & time of request and from user activity.  No. of times query is pre, post qualified by year is WA(q,y)=#(y,q) + #(q,y) α(q) =  ∑ y  WA   (q,y) /  ∑ x #(x,q) +  ∑ x #(q,x) If query qualified with single year then  α(q) =1
Results Temporal information is more frequent in web snippets  than in any of the  query logs  of Google and Yahoo!; Most of the queries have a  TSnippet(.)  value around 20%,  TLogYahoo(.)  and  TLogGoogle(.)  are mostly near to 0%.
Conclusion Future dates common in snippets than query log
Query having dates does not necessarily mean that it has temporal intent (from web query logs of  Google  and yahoo) Ex: October Sky movie
Web snippets statistically more relevant in terms of temporal intent than query logs
Rishiraj Saha Roy, Niloy Ganguly, Monojit Choudhury, Naveen Singh:  &quot;Complex Network Analysis Reveals Kernel-Periphery Structure in Web Search Queries&quot;
Search Queries Search Query language: bag of segments Word  occurrence  n/w: Edge exists if  P ij  > P i  P j Eight complex network models for query logs Query Unrestricted wordnet(local) and (global)
Query Restricted wordnet(local) and (global)
Query Unrestricted SegmentNet(local) and (global)
Query Restricted SegmentNet(local) and (global)
Kernel and Peripheral lexicons Two regimes in DD of word occurrence N/W: 1.K ernel lexicons (K-Lex or modifiers):   Units popular in query (high degrees)
Generic and domain independent
Ex: how to, wikipedia 2.Peripheral lexicon (P-Lex or HEADs): Rare ones with degree much less than those in kernel Ex: Decision Tree algorithm
Degree Disribution |N| = Nodes, |E| = edges C= average clustering coefficient d=mean shortest path between edges C rand  and d rand  are corr. Values in random graph C rand  ~ k'/ |N| ,    d rand  ~ ln(|N|)/ ln(|k'|) k' = average degree of graph Degree distribution= p(k) = nodes with degree k/ total nodes
Two regime power law

More Related Content

ODP
Sigir 2011 proceedings
PPTX
PowerPoint - K-State Laboratory for Knowledge Discovery in ...
PPTX
Duet @ TREC 2019 Deep Learning Track
PPTX
Lecture 9 - Machine Learning and Support Vector Machines (SVM)
PPTX
Dual Embedding Space Model (DESM)
PDF
Can functional programming be liberated from static typing?
PDF
Text Mining Using R
PPT
Understanding WeboNaver
Sigir 2011 proceedings
PowerPoint - K-State Laboratory for Knowledge Discovery in ...
Duet @ TREC 2019 Deep Learning Track
Lecture 9 - Machine Learning and Support Vector Machines (SVM)
Dual Embedding Space Model (DESM)
Can functional programming be liberated from static typing?
Text Mining Using R
Understanding WeboNaver

What's hot (16)

PPTX
Graph Techniques for Natural Language Processing
PPTX
Conversation with-search-engines (Ren et al. 2020)
PPT
Models for Information Retrieval and Recommendation
PPT
Topic Models
PPT
WP3 Further specification of Functionality and Interoperability - Gradmann
PDF
Lecture20 xing
PDF
Crash-course in Natural Language Processing
PDF
Dstc6 an introduction
 
PDF
Text categorization as graph
PPT
Similarity & Recommendation - CWI Scientific Meeting - Sep 27th, 2013
PPT
Collaborative filtering20081111
PDF
Crash Course in Natural Language Processing (2016)
PPTX
hands on: Text Mining With R
PPT
Lec 4,5
ODP
SIGIR 2011
PDF
Natural Language Processing in Practice
Graph Techniques for Natural Language Processing
Conversation with-search-engines (Ren et al. 2020)
Models for Information Retrieval and Recommendation
Topic Models
WP3 Further specification of Functionality and Interoperability - Gradmann
Lecture20 xing
Crash-course in Natural Language Processing
Dstc6 an introduction
 
Text categorization as graph
Similarity & Recommendation - CWI Scientific Meeting - Sep 27th, 2013
Collaborative filtering20081111
Crash Course in Natural Language Processing (2016)
hands on: Text Mining With R
Lec 4,5
SIGIR 2011
Natural Language Processing in Practice
Ad

Similar to Summary of SIGIR 2011 Papers (20)

PDF
Learning to Rank Search Results for Time-Sensitive Queries (poster presentation)
PPTX
Techniques For Deep Query Understanding
PDF
Intent-Aware Temporal Query Modeling for Keyword Suggestion
PDF
Ontological approach for improving semantic web search results
PDF
Ontological approach for improving semantic web search results
PDF
Improving search with neural ranking methods
PDF
Performance Evaluation of Query Processing Techniques in Information Retrieval
PDF
Knowledge discoverylaurahollink
PPT
Improving VIVO search through semantic ranking.
PPTX
Semantic mark-up with schema.org: helping search engines understand the Web
PPTX
Semantic Search at Yahoo
PDF
Web-scale semantic search
PPT
A Pragmatic Approach to Semantic Repositories Benchmarking
PDF
A Network-Aware Approach for Searching As-You-Type in Social Media
PDF
Exploratory computing: designing discovery-driven user experiences
PPTX
News-oriented multimedia search over multiple social networks
PPTX
News-oriented multimedia search over multiple social networks
PPT
Vivo Search
DOCX
List of Journal after read the abstract.docx
PPTX
Beyond document retrieval using semantic annotations
Learning to Rank Search Results for Time-Sensitive Queries (poster presentation)
Techniques For Deep Query Understanding
Intent-Aware Temporal Query Modeling for Keyword Suggestion
Ontological approach for improving semantic web search results
Ontological approach for improving semantic web search results
Improving search with neural ranking methods
Performance Evaluation of Query Processing Techniques in Information Retrieval
Knowledge discoverylaurahollink
Improving VIVO search through semantic ranking.
Semantic mark-up with schema.org: helping search engines understand the Web
Semantic Search at Yahoo
Web-scale semantic search
A Pragmatic Approach to Semantic Repositories Benchmarking
A Network-Aware Approach for Searching As-You-Type in Social Media
Exploratory computing: designing discovery-driven user experiences
News-oriented multimedia search over multiple social networks
News-oriented multimedia search over multiple social networks
Vivo Search
List of Journal after read the abstract.docx
Beyond document retrieval using semantic annotations
Ad

Recently uploaded (20)

PDF
Paper A Mock Exam 9_ Attempt review.pdf.
PPTX
Module on health assessment of CHN. pptx
DOCX
Cambridge-Practice-Tests-for-IELTS-12.docx
PDF
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
PDF
FORM 1 BIOLOGY MIND MAPS and their schemes
PDF
AI-driven educational solutions for real-life interventions in the Philippine...
PDF
International_Financial_Reporting_Standa.pdf
PDF
English Textual Question & Ans (12th Class).pdf
PPTX
Unit 4 Computer Architecture Multicore Processor.pptx
PDF
Environmental Education MCQ BD2EE - Share Source.pdf
PPTX
Core Concepts of Personalized Learning and Virtual Learning Environments
PDF
What if we spent less time fighting change, and more time building what’s rig...
PDF
My India Quiz Book_20210205121199924.pdf
PDF
Journal of Dental Science - UDMY (2021).pdf
PPTX
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
PDF
advance database management system book.pdf
PPTX
What’s under the hood: Parsing standardized learning content for AI
PDF
Skin Care and Cosmetic Ingredients Dictionary ( PDFDrive ).pdf
PDF
LIFE & LIVING TRILOGY - PART - (2) THE PURPOSE OF LIFE.pdf
PDF
BP 505 T. PHARMACEUTICAL JURISPRUDENCE (UNIT 1).pdf
Paper A Mock Exam 9_ Attempt review.pdf.
Module on health assessment of CHN. pptx
Cambridge-Practice-Tests-for-IELTS-12.docx
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
FORM 1 BIOLOGY MIND MAPS and their schemes
AI-driven educational solutions for real-life interventions in the Philippine...
International_Financial_Reporting_Standa.pdf
English Textual Question & Ans (12th Class).pdf
Unit 4 Computer Architecture Multicore Processor.pptx
Environmental Education MCQ BD2EE - Share Source.pdf
Core Concepts of Personalized Learning and Virtual Learning Environments
What if we spent less time fighting change, and more time building what’s rig...
My India Quiz Book_20210205121199924.pdf
Journal of Dental Science - UDMY (2021).pdf
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
advance database management system book.pdf
What’s under the hood: Parsing standardized learning content for AI
Skin Care and Cosmetic Ingredients Dictionary ( PDFDrive ).pdf
LIFE & LIVING TRILOGY - PART - (2) THE PURPOSE OF LIFE.pdf
BP 505 T. PHARMACEUTICAL JURISPRUDENCE (UNIT 1).pdf

Summary of SIGIR 2011 Papers

  • 1. Summary of Papers of SIGIR 2011 Workshop on Query Representation and Understanding Chetana Gavankar
  • 2. Ricardo Campos, Alipio Jorge, Gael Dias: &quot;Using Web Snippets and Query-logs to Measure Implicit Temporal Intents in Queries&quot;
  • 3. Temporal queries 1. Atemporal : Queries not sensitive to time like plan my trip 2. Temporal unambiguous : Queries in concrete time period. Ex : Haiti earthquake in 2010 3. Temporal ambiguous : queries with multiple instances over time. Ex : Cricket worldcup which occurs every four years.
  • 4. Web snippets and Query Logs Content-Related Resources , based on a web content approach Simply requires the set of web search results. Query-Log Resources , based on similar year-qualified queries Imply that some versions of the query have already been issued.
  • 5. 1. Web snippets ( temporal evidence within web pages): TA(q)= ∑ f ε I w f f(q) I = {Tsnippet(.),TTitle(.),TUrl(.)} Value each feature differently using w f 18.14 for TTitles, 50.91 for TSnippets and 30.95 for Turl(.) If TA(q) value < 10% then Atemporal. Dates appearing in query & docs may not match. # Snippets Retrieved with Dates Identifying implicit temporal queries TSnippets = # Snippets Retrieved
  • 6. Identifying implicit temporal queries 2.Web Query Logs : Temporal activity can be recorded from date & time of request and from user activity. No. of times query is pre, post qualified by year is WA(q,y)=#(y,q) + #(q,y) α(q) = ∑ y WA (q,y) / ∑ x #(x,q) + ∑ x #(q,x) If query qualified with single year then α(q) =1
  • 7. Results Temporal information is more frequent in web snippets than in any of the query logs of Google and Yahoo!; Most of the queries have a TSnippet(.) value around 20%, TLogYahoo(.) and TLogGoogle(.) are mostly near to 0%.
  • 8. Conclusion Future dates common in snippets than query log
  • 9. Query having dates does not necessarily mean that it has temporal intent (from web query logs of Google and yahoo) Ex: October Sky movie
  • 10. Web snippets statistically more relevant in terms of temporal intent than query logs
  • 11. Rishiraj Saha Roy, Niloy Ganguly, Monojit Choudhury, Naveen Singh: &quot;Complex Network Analysis Reveals Kernel-Periphery Structure in Web Search Queries&quot;
  • 12. Search Queries Search Query language: bag of segments Word occurrence n/w: Edge exists if P ij > P i P j Eight complex network models for query logs Query Unrestricted wordnet(local) and (global)
  • 16. Kernel and Peripheral lexicons Two regimes in DD of word occurrence N/W: 1.K ernel lexicons (K-Lex or modifiers): Units popular in query (high degrees)
  • 17. Generic and domain independent
  • 18. Ex: how to, wikipedia 2.Peripheral lexicon (P-Lex or HEADs): Rare ones with degree much less than those in kernel Ex: Decision Tree algorithm
  • 19. Degree Disribution |N| = Nodes, |E| = edges C= average clustering coefficient d=mean shortest path between edges C rand and d rand are corr. Values in random graph C rand ~ k'/ |N| , d rand ~ ln(|N|)/ ln(|k'|) k' = average degree of graph Degree distribution= p(k) = nodes with degree k/ total nodes
  • 21. Conclusion Like NL, Queries reflect kernal-periphery distinction
  • 22. Unlike NL, Query N/W lack small word property for quickly retrieving words from mind
  • 23. More difficult to understand context of segment in query.
  • 24. Peripheral N/W consist of large number of small disconnected components
  • 25. Capability of peripheral units to exist by themselves makes POS identification hard in Queries.
  • 26. Socio-cultural factors govern the kernel-periphery distinction in queries
  • 27. Lidong Bing, Wai Lam: &quot;Investigation of Web Query Refinement via Topic Analysis and Learning with Personalization&quot;
  • 28. Web Query Refinement Query Refinement Substitution
  • 34. ...................... Generate some candidate queries first, and score the quality of these candidates.
  • 35. Latent Topic Analysis in Query Log Query log record (user_id, query, clicked_url, time) Pseudo-document generation: Queries related to the same host are aggregated. General sites like “en.wikipedia.org” are not suitable for latent topic analysis & are eliminated Latent Dirichlet Allocation Algorithm) LDA to conduct the latent semantic topic analysis on the collection of host-based pseudo-documents. Z = set of latent topic s z i Each z i is associated with multinomial distribution of terms P ( tk | z i )= prob of term tk given topic z i
  • 36. Personalization π u ={ π u 1 , π u 2 , … , π u |z| } = profile of the user u , π u i = P ( z i | u ) = probability that the user u prefers the topic z i Generate user-based pseudo-document U for user u . { P ( z 1 | U ), P ( z 2 | U ), … , P ( z | Z | | U )} = profile of u . candidate query q : t 1 , … t n Topic of term t r = z r
  • 37. Topic based scoring with personalization Candidate query score: model parameter P ( zj | zi ) captures the relationship of two topics With personal profile P ( z 1 | u ) = probability that user u prefers the topic z 1
  • 38. Conclusion Framework that considers personalization achieves the best performance. With user profiles, the topic-based scoring part is more reliable