In Situ Evaluation of Entity Ranking and Opinion Summarization using www.findilike.com 
Kavita Ganesan & ChengXiang Zhai 
University of Illinois at Urbana-Champaign
What is findilike? 
• Preference-driven search engine 
– Currently works in hotels domain 
– Finds & ranks hotels based on user preferences: 
Structured: price, distance 
Unstructured: “friendly service”, “clean”, “good views” 
(Based on existing user reviews) → UNIQUE 
• Beyond search: Support for analysis of hotels 
– Opinion summaries 
– Tag cloud visualization of reviews
…What is findilike? 
• Developed as part of PhD work: a new system 
(Opinion-Driven Decision Support System, UIUC, 2013) 
• Tracked ~1000 unique users from Jan to Aug '13 
– Working on speed & reaching out to more users
Evaluating Review Summarization 
Mini Test-bed 
• Base code to extend 
• Set of sample sentences 
• Gold standard summary for those sentences 
• ROUGE toolkit to evaluate the results 
• Data set based on Ganesan et al. 2010
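
The comparison against a gold standard summary can be illustrated with a simplified ROUGE-1 computation. This is only a sketch of the unigram-overlap idea: the test bed itself uses the ROUGE toolkit and Java base code, and the function name `rouge1`, the whitespace tokenization, and the single-reference setup are assumptions made for illustration.

```python
from collections import Counter


def rouge1(candidate: str, reference: str) -> dict:
    """Simplified ROUGE-1: unigram overlap between one candidate and one reference.

    Assumption: lowercase + whitespace tokenization, no stemming or stopword
    removal. The real ROUGE toolkit offers more options than this sketch.
    """
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    # Clipped overlap: each word counts at most as often as it occurs in both.
    overlap = sum((cand & ref).values())
    recall = overlap / sum(ref.values()) if ref else 0.0
    precision = overlap / sum(cand.values()) if cand else 0.0
    f1 = 2 * precision * recall / (precision + recall) if overlap else 0.0
    return {"recall": recall, "precision": precision, "f1": f1}


if __name__ == "__main__":
    scores = rouge1("the staff was friendly", "friendly staff and clean rooms")
    print(scores)
```

In this example two of the five reference words are matched, so recall is 0.4, and two of the four candidate words are matched, so precision is 0.5.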
Evaluating Entity Ranking 
Mini Test-bed 
• Base code to extend 
• Terrier Index of hotel reviews 
• Gold standard ranking of hotels 
• Code to generate nDCG scores 
• Raw unindexed data set for reference
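
Scoring a ranking against the gold standard with nDCG can be sketched as follows. This is a minimal illustration, not the test bed's scoring code (which is Java based): the function names, the linear gain, the log2 discount, and the cutoff `k` are assumptions, and the hotel ids and gains are made up.

```python
import math


def dcg(gains):
    """Discounted cumulative gain with a log2 position discount."""
    return sum(g / math.log2(i + 2) for i, g in enumerate(gains))


def ndcg_at_k(ranked_ids, gold_gains, k=10):
    """nDCG@k of a system ranking against gold-standard gains.

    ranked_ids: hotel ids in the order returned by the ranking model.
    gold_gains: hypothetical mapping of hotel id -> relevance gain from the
                gold standard; ids missing from it count as gain 0.
    """
    gains = [gold_gains.get(h, 0.0) for h in ranked_ids[:k]]
    ideal = sorted(gold_gains.values(), reverse=True)[:k]
    ideal_dcg = dcg(ideal)
    return dcg(gains) / ideal_dcg if ideal_dcg > 0 else 0.0


if __name__ == "__main__":
    gold = {"h1": 3, "h2": 2, "h3": 1}
    print(ndcg_at_k(["h1", "h2", "h3"], gold))  # ideal order
    print(ndcg_at_k(["h3", "h2", "h1"], gold))  # reversed order scores lower
```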
Building a new ranking model 
Extend Weighting Model
DEMO
2 components that can be evaluated 
through natural user interaction 
1. Ranking entities based on unstructured user preferences 
Opinion-Based Entity Ranking (Ganesan & Zhai 2012) 
2. Summarization of reviews 
Generating short phrases summarizing key opinions (Ganesan et al. 2010, 2012)
Evaluation of entity ranking 
• Retrieval 
– Interleave results: balanced interleaving (T. Joachims, 2002) 
– A click indicates preference… 
[Figure: results from Base and DirichletLM interleaved into a single result list]
Snapshot of pairwise comparison 
results for entity ranking 
Algorithms: DirichletLM (DLM), Base, PL2 
A     B      CA > CB      CB > CA      CA = CB > 0   CA = CB = 0   Total 
             (A Better)   (B Better)   (Tie) 
DLM   Base   30           35           2             5             72 
PL2   Base   10           28           3             7             48 
…     …      …            …            …             …             … 
CA > CB: # queries where A is better 
CB > CA: # queries where B is better
Snapshot of pairwise comparison 
results for entity ranking 
A     B      CA > CB      CB > CA      CA = CB > 0   CA = CB = 0   Total 
             (A Better)   (B Better)   (Tie) 
DLM   Base   30           35           2             5             72 
PL2   Base   10           28           3             7             48 
…     …      …            …            …             …             … 
• DLM vs. Base: Base model better, but DLM not too far behind 
• PL2 vs. Base: Base model better; PL2 not too good
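
One way to read the table is to classify each query by its click counts and then look at the share of decided queries won by A. The sketch below uses the DLM and PL2 rows from the table; the `win_share` measure and the function names are assumptions added for illustration, not a statistic reported in the slides.

```python
def outcome(ca, cb):
    """Classify one query from the click counts credited to A and B."""
    if ca > cb:
        return "A better"
    if cb > ca:
        return "B better"
    if ca > 0:
        return "tie"        # CA = CB > 0
    return "no clicks"      # CA = CB = 0


def win_share(a_better, b_better):
    """Fraction of decided queries (ties and no-click queries excluded) won by A."""
    decided = a_better + b_better
    return a_better / decided if decided else 0.0


if __name__ == "__main__":
    # Rows from the table: (A, B, A better, B better, tie, no clicks, total)
    rows = [("DLM", "Base", 30, 35, 2, 5, 72),
            ("PL2", "Base", 10, 28, 3, 7, 48)]
    for a, b, a_wins, b_wins, ties, none, total in rows:
        assert a_wins + b_wins + ties + none == total
        print(f"{a} vs. {b}: {a} wins {win_share(a_wins, b_wins):.1%} of decided queries")
```

On these numbers DLM wins 30 of 65 decided queries (about 46%) against Base, while PL2 wins 10 of 38 (about 26%), which matches the reading that DLM is close to Base and PL2 is not.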
Evaluation of review summarization 
• Randomly mix top N phrases from two algorithms (ALGO1, ALGO2) 
• Monitor click-through on a per-entity basis 
• More clicks on phrases from Algo1 vs. Algo2 → Algo1 better
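
The mixing and click comparison for one entity can be sketched as follows. The slides do not describe how the live system handles a phrase produced by both algorithms; crediting both sources for such a phrase is an assumption here, as are the function names and the sample phrases.

```python
import random
from collections import Counter


def mix_phrases(algo1, algo2, n, rng=random):
    """Randomly mix the top-n phrases of two algorithms for one entity.

    Returns the shuffled list to display and a map from each phrase to the
    algorithm(s) that produced it.
    """
    sources = {}
    for name, phrases in (("ALGO1", algo1), ("ALGO2", algo2)):
        for phrase in phrases[:n]:
            sources.setdefault(phrase, set()).add(name)
    shown = list(sources)
    rng.shuffle(shown)
    return shown, sources


def preferred(sources, clicks):
    """Compare click-through for one entity: more clicks means better."""
    counts = Counter()
    for phrase in clicks:
        for name in sources.get(phrase, ()):
            counts[name] += 1
    c1, c2 = counts["ALGO1"], counts["ALGO2"]
    if c1 > c2:
        return "ALGO1"
    if c2 > c1:
        return "ALGO2"
    return "tie"


if __name__ == "__main__":
    algo1 = ["clean rooms", "friendly staff", "great view"]
    algo2 = ["friendly staff", "noisy street", "small bathroom"]
    shown, sources = mix_phrases(algo1, algo2, n=2, rng=random.Random(0))
    print(shown)
    print(preferred(sources, ["clean rooms", "friendly staff"]))
```

Shuffling the mixed list keeps users from seeing which algorithm produced a phrase or favoring one by position.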
How to submit a new algorithm? 
• Implementation 
– Write Java based code 
– Extend existing code 
• Local performance: test on mini test bed 
– Sample code 
– Test data & gold standard 
– Evaluator (nDCG, ROUGE) 
• Submit code 
• Online performance: performance report 
A     B      CA > CB      CB > CA 
             (A Better)   (B Better) 
DLM   Base   30           35 
PL2   Base   10           28 
…     …      …            …
More information about evaluation… 
eval.findilike.com
Thanks! Questions? 
Links 
• Evaluation: http://eval.findilike.com 
• System: http://hotels.findilike.com/ 
• Related Papers: kavita-ganesan.com
References 
• Ganesan, K. A., C. X. Zhai, and E. Viegas, Micropinion Generation: An Unsupervised 
Approach to Generating Ultra-Concise Summaries of Opinions, Proceedings of the 
21st International Conference on World Wide Web 2012 (WWW '12), 2012. 
• Ganesan, K. A., and C. X. Zhai, Opinion-Based Entity Ranking, Information Retrieval, 
vol. 15, issue 2, 2012 
• Ganesan, K. A., C. X. Zhai, and J. Han, Opinosis: A Graph Based Approach to 
Abstractive Summarization of Highly Redundant Opinions, Proceedings of the 23rd 
International Conference on Computational Linguistics (COLING '10), 2010. 
• T. Joachims. Optimizing search engines using clickthrough data. In Proceedings of 
the eighth ACM SIGKDD international conference on Knowledge discovery and 
data mining, KDD ’02, NY, 2002.
