Enabling Opinion-Driven
Decision Making
Kavita Ganesan
Senior Data Scientist, GitHub Inc.
@kav_gan
#Activate18 #ActivateSearch
Agenda
• Problem Intro
• Demo of FindiLike – System leveraging user reviews
• FindiLike - Search component
• FindiLike - Summarization component
What people think?
INFLUENCES DECISION MAKING
What companies to work for?
Which cool places to visit?
What data science tools to use?
Before Web became popular
– life was simple
Friends and relatives Consumer Reports
Crazy WWW
What people think?
- e-commerce sites
- online directories
- blog articles
- review sites
- social networking sites
…more
TOO MUCH INFORMATION
TOO FEW TOOLS FOR DECISION MAKING
Looking for a hotel in an
unknown city
Looking for a hotel in an unknown city
1 Safe ! - No bullets
Looking for a hotel in an unknown city
1 Safe ! - No bullets
2 Spacious room - Claustrophobic
Looking for a hotel in an unknown city
1 Safe ! - No bullets
2 Spacious room - Claustrophobic
3 Tempurpedic beds
Can we currently find hotels matching
these criteria?
1 Safe ! - No bullets
2 Spacious room - Claustrophobic
3 Tempurpedic beds
Maybe by reading 2000 reviews?
Information is there – somewhere
Read reviews & perform own data mining
FindiLike – Preference Based Hotel Search
• Research prototype
• Showcase algorithms
leveraging user reviews
Different tools to help
with decision making
based on opinions
Can we use this in other
domains?
Product search
T-shirt:
"baggy"
"v-neck"
"doesn't shrink"
Healthcare provider search
Family Doctor:
"speaks fluent spanish"
"good with kids"
"calming clinic"
How to implement such
a system?
1. Preference Based Search
How to recommend relevant
entities using user reviews
and a users’ unstructured
preference?
2. Query Understanding
"good for kids" == "child friendly" ?
"baggy" == "oversized" ?
3. Summarization
How do you provide concise
views of user reviews?
4. Combine structured + unstructured info
How do you combine
structured + unstructured
info to provide a powerful
search experience?
Let's zoom into two components
Preference Based
Search
Summarization
Query
Understanding
Structured +
Unstrcutred Search
Traditional Search
Query (short)
Document
<content>
Document
<content>
Document
<content>
Click to add text
Query (short)
Document
<content>
Document
<content>
Document
<content>
Traditional Search
Click to add text
Query (Preferences)
Entity
<content>
Entity
<content>
Entity
<content>
Preference Based Search
Each entity represented by entity
document
Hotel Los Angeles
(Entity)
Vagabond Inn
(Entity)
Holiday Inn Los
Angeles
(Entity)
Entity Document (content)
review1: this hotel was absolutely...
review2: nice pet friendly hotel...
Entity Document (content)
review1: extremely old and dirty...
review2: nice little place although...
Entity Document (content)
review1: the continental breakfast was...
review2: I will never stay at this place..
Use regular search technologies to index
Entity Document
review1: this hotel was absolutely...
review2: nice pet friendly hotel...
Entity Document
review1: extremely old and dirty...
review2: nice little place although...
Entity Document
review1: the continental breakfast was...
review2: I will never stay at this place..
Hotel Los Angeles
(Entity)
Vagabond Inn
(Entity)
Holiday Inn Los
Angeles
(Entity)
more keyword
matches = better
entity ranks
Entity Document
Review 1: I loved that this hotel was pet friendly...
Review 2: nice pet friendly hotel...
Review 3: pets are welcome at this hotel...
Review 4: I brought my puppy along since this was...
Pet friendlyPreference / query:
Hotel Los Angeles
#1: Uneven Entity Document Length
extremel
Entity Document
(Hotel A)
Review 1: This hotel was absolutely...
Review 2: Nice pet friendly hotel...
Review 3: Beautiful hotel with friendly...
Review 4: My pets were really happy...
Review 5: Our stay was fantastic. We...
......................................
......................................
......................................
......................................
......................................
......................................
Review 1000: Unfortunately, we had a bad...
Entity Document
(Hotel B)
Review 1: We had muffis for bfast...
Review 2: Dull hotel with very little...
......................................
......................................
......................................
Review 10: Unfortunately, we had a bad...
more matches ==> relevance goes up ↑
#1: Uneven Entity Document Length
Solution: similarity models
that have saturation points
#2: Matching similar concepts
#2: Matching similar concepts
extremel
pet friendly
Entity Document
Review 1: I loved that this hotel was pet friendly...
Review 2: nice dog friendly hotel...
Review 3: pets are welcome at this hotel...
Review 4: Brought my kitty along since this is an animal friendly facility...
Preference / query:
#2: Matching similar concepts
extremel
pet friendly
Entity Document
Review 1: I loved that this hotel was pet friendly...
Review 2: nice dog friendly hotel...
Review 3: pets are welcome at this hotel...
Review 4: Brought my kitty along since this is an animal friendly facility...
Preference / query:
Expand query with related concepts
#3: Dealing with negations
#3: Dealing with negations
extremel
pet friendly
This hotel was NOT pet friendly so I could not bring my dog
Preference / query:
#3: Dealing with negations
extremel
pet friendly
This hotel was NOT pet friendly so I could not bring my dog
Preference / query:
Nearby negations should be considered
Opinion-Based Entity Ranking – Ganesan & Zhai 2012
http://kavita-ganesan.com/opinion-based-entity-ranking/
Let's zoom into summarization of
opinions
Preference Based
Search
Summarization
Query
Understanding
Structured +
Unstrcutred Search
What can review summaries
provide that search cannot?
Unexpected Information
That can make or break the deal
Enabling Opinion Driven Decision Making - Kavita Ganesan, GitHub
Deal
breaker!
Lots of summarization options
Concise or aggregated view of what people are saying
Cleanliness
Comfort
Staff
Views
4.0/5
2.0/5
3.0/5
4.5/5
Natural Language Opinion Summary
• Aggregation of
what people think
• Concise, conveys
essential info
• Analytics
Opinosis: Graph Based Sentence
Compression Approach
(Ganesan & Zhai Coling 2010)
Sentence Compression Approach
“The bed was comfortable, I loved it!”
“The bed was really comfortable..”
“The tempurpedic bed was super comfy”
Sentence Compression Approach
“The bed was comfortable, I loved it!”
“The bed was really comfortable..”
“The tempurpedic bed was super comfy”
bed was comfortable (3)
Opinosis: Uses a Word Graph
“The bed was comfortable, I loved it!”
“The bed was really comfortable..”
“The tempurpedic bed was super comfy”
bed I
really
comfortable
.
tempurpedic
the ,
loved itwas
super
.
WORD GRAPH
What's nice about this technique?
1. Really speedy for ad hoc summarization
• No need to pre-compute
• Generate summaries for specific topics
What's nice about this technique?
1. Really speedy for ad hoc summarization
2. Captures redundancies naturally
• Automatically provides the analytics component
What's nice about this technique?
1. Really speedy for ad hoc summarization
2. Captures redundancies naturally
3. Extract phrases from existing sentence structure
• Limiting external dependencies
Try out Web API or JAR File
www. kavita-ganesan.com/opinosis/
In summary…
Enabling Opinion Driven Decision Making - Kavita Ganesan, GitHub
Leverage text data to empower
consumers…
Help users make faster decisions...
Increase conversion rates & loyalty
Thank you!
Kavita Ganesan
@kav_gan
kavita-ganesan.com
#Activate18 #ActivateSearch

More Related Content

PPTX
Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...
PDF
The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...
PPTX
Increase Conversions and Rankings with User Experience (SEM Summit 2016)
PPTX
SEO in the Age of Entities: Using Schema.org for Findability
PDF
Content, Data and Humans
PPTX
Bats, Dolphins and Semantic Search - IAB This is Search Conference 2015
PDF
Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...
PDF
Paper Writer Services
Opinion-based Article Ranking for Information Retrieval Systems: Factoids and...
The Iceberg Approach - Power from what lies beneath in SEO for a mobile-first...
Increase Conversions and Rankings with User Experience (SEM Summit 2016)
SEO in the Age of Entities: Using Schema.org for Findability
Content, Data and Humans
Bats, Dolphins and Semantic Search - IAB This is Search Conference 2015
Voice Search and Conversation Action Assistive Systems - Challenges & Opportu...
Paper Writer Services

Similar to Enabling Opinion Driven Decision Making - Kavita Ganesan, GitHub (20)

PPT
The Architecture of Social Websites: Reputation
PDF
Critical Thinking for Software Testers
KEY
How to get personal with your customers and have them love it
PDF
Balancing the Dimensions of User Intent
PDF
Magic If Play Analysis
PPT
The filter bubble
PPTX
Context, Narratives & Big Data Analytics
PDF
A Multifaceted Look At Faceting - Ted Sullivan, Lucidworks
PPTX
SearchLove San Diego 2017 | Joel Klettke | Don't Buy Your Customer a Beer: Ho...
PDF
Frakture 4 fold webinar presentation 2015-04_23
PDF
Intro to Vita Beans
PPTX
Andy Crestodina — How to Find Blog Topics Your Audience REALLY Cares About
PPTX
Acceptance, accessible, actionable and auditable
PDF
Everything You Wish You Knew About Search
PPTX
lesson 4Contextualized Online Search and Research Skills.pptx
PDF
Human-centered data: using data science and human-centered design to grow you...
PDF
Bearish SEO: Defining the User Experience for Google’s Panda Search Landscape
PPTX
How to Build a Recommendation Engine on Spark
PDF
The Art of Smart Data
PDF
Is Your Website Trauma-Informed?
The Architecture of Social Websites: Reputation
Critical Thinking for Software Testers
How to get personal with your customers and have them love it
Balancing the Dimensions of User Intent
Magic If Play Analysis
The filter bubble
Context, Narratives & Big Data Analytics
A Multifaceted Look At Faceting - Ted Sullivan, Lucidworks
SearchLove San Diego 2017 | Joel Klettke | Don't Buy Your Customer a Beer: Ho...
Frakture 4 fold webinar presentation 2015-04_23
Intro to Vita Beans
Andy Crestodina — How to Find Blog Topics Your Audience REALLY Cares About
Acceptance, accessible, actionable and auditable
Everything You Wish You Knew About Search
lesson 4Contextualized Online Search and Research Skills.pptx
Human-centered data: using data science and human-centered design to grow you...
Bearish SEO: Defining the User Experience for Google’s Panda Search Landscape
How to Build a Recommendation Engine on Spark
The Art of Smart Data
Is Your Website Trauma-Informed?
Ad

More from Lucidworks (20)

PDF
Search is the Tip of the Spear for Your B2B eCommerce Strategy
PDF
Drive Agent Effectiveness in Salesforce
PPTX
How Crate & Barrel Connects Shoppers with Relevant Products
PPTX
Lucidworks & IMRG Webinar – Best-In-Class Retail Product Discovery
PPTX
Connected Experiences Are Personalized Experiences
PDF
Intelligent Insight Driven Policing with MC+A, Toronto Police Service and Luc...
PPTX
[Webinar] Intelligent Policing. Leveraging Data to more effectively Serve Com...
PPTX
Preparing for Peak in Ecommerce | eTail Asia 2020
PPTX
Accelerate The Path To Purchase With Product Discovery at Retail Innovation C...
PPTX
AI-Powered Linguistics and Search with Fusion and Rosette
PDF
The Service Industry After COVID-19: The Soul of Service in a Virtual Moment
PPTX
Webinar: Smart answers for employee and customer support after covid 19 - Europe
PDF
Smart Answers for Employee and Customer Support After COVID-19
PPTX
Applying AI & Search in Europe - featuring 451 Research
PPTX
Webinar: Accelerate Data Science with Fusion 5.1
PDF
Webinar: 5 Must-Have Items You Need for Your 2020 Ecommerce Strategy
PPTX
Where Search Meets Science and Style Meets Savings: Nordstrom Rack's Journey ...
PPTX
Apply Knowledge Graphs and Search for Real-World Decision Intelligence
PPTX
Webinar: Building a Business Case for Enterprise Search
PPTX
Why Insight Engines Matter in 2020 and Beyond
Search is the Tip of the Spear for Your B2B eCommerce Strategy
Drive Agent Effectiveness in Salesforce
How Crate & Barrel Connects Shoppers with Relevant Products
Lucidworks & IMRG Webinar – Best-In-Class Retail Product Discovery
Connected Experiences Are Personalized Experiences
Intelligent Insight Driven Policing with MC+A, Toronto Police Service and Luc...
[Webinar] Intelligent Policing. Leveraging Data to more effectively Serve Com...
Preparing for Peak in Ecommerce | eTail Asia 2020
Accelerate The Path To Purchase With Product Discovery at Retail Innovation C...
AI-Powered Linguistics and Search with Fusion and Rosette
The Service Industry After COVID-19: The Soul of Service in a Virtual Moment
Webinar: Smart answers for employee and customer support after covid 19 - Europe
Smart Answers for Employee and Customer Support After COVID-19
Applying AI & Search in Europe - featuring 451 Research
Webinar: Accelerate Data Science with Fusion 5.1
Webinar: 5 Must-Have Items You Need for Your 2020 Ecommerce Strategy
Where Search Meets Science and Style Meets Savings: Nordstrom Rack's Journey ...
Apply Knowledge Graphs and Search for Real-World Decision Intelligence
Webinar: Building a Business Case for Enterprise Search
Why Insight Engines Matter in 2020 and Beyond
Ad

Recently uploaded (20)

PDF
Five Habits of High-Impact Board Members
PDF
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
PDF
Unlock new opportunities with location data.pdf
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PDF
A comparative study of natural language inference in Swahili using monolingua...
PDF
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PDF
A novel scalable deep ensemble learning framework for big data classification...
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Developing a website for English-speaking practice to English as a foreign la...
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PPTX
The various Industrial Revolutions .pptx
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
Zenith AI: Advanced Artificial Intelligence
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PDF
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
PPT
What is a Computer? Input Devices /output devices
Five Habits of High-Impact Board Members
A Late Bloomer's Guide to GenAI: Ethics, Bias, and Effective Prompting - Boha...
Unlock new opportunities with location data.pdf
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
A comparative study of natural language inference in Swahili using monolingua...
Hybrid horned lizard optimization algorithm-aquila optimizer for DC motor
Final SEM Unit 1 for mit wpu at pune .pptx
A novel scalable deep ensemble learning framework for big data classification...
Assigned Numbers - 2025 - Bluetooth® Document
Developing a website for English-speaking practice to English as a foreign la...
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
Enhancing emotion recognition model for a student engagement use case through...
Univ-Connecticut-ChatGPT-Presentaion.pdf
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
The various Industrial Revolutions .pptx
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
Zenith AI: Advanced Artificial Intelligence
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
TrustArc Webinar - Click, Consent, Trust: Winning the Privacy Game
What is a Computer? Input Devices /output devices

Enabling Opinion Driven Decision Making - Kavita Ganesan, GitHub

  • 1. Enabling Opinion-Driven Decision Making Kavita Ganesan Senior Data Scientist, GitHub Inc. @kav_gan #Activate18 #ActivateSearch
  • 2. Agenda • Problem Intro • Demo of FindiLike – System leveraging user reviews • FindiLike - Search component • FindiLike - Summarization component
  • 4. What companies to work for?
  • 5. Which cool places to visit?
  • 6. What data science tools to use?
  • 7. Before Web became popular – life was simple Friends and relatives Consumer Reports
  • 8. Crazy WWW What people think? - e-commerce sites - online directories - blog articles - review sites - social networking sites …more
  • 9. TOO MUCH INFORMATION TOO FEW TOOLS FOR DECISION MAKING
  • 10. Looking for a hotel in an unknown city
  • 11. Looking for a hotel in an unknown city 1 Safe ! - No bullets
  • 12. Looking for a hotel in an unknown city 1 Safe ! - No bullets 2 Spacious room - Claustrophobic
  • 13. Looking for a hotel in an unknown city 1 Safe ! - No bullets 2 Spacious room - Claustrophobic 3 Tempurpedic beds
  • 14. Can we currently find hotels matching these criteria? 1 Safe ! - No bullets 2 Spacious room - Claustrophobic 3 Tempurpedic beds
  • 15. Maybe by reading 2000 reviews?
  • 16. Information is there – somewhere Read reviews & perform own data mining
  • 17. FindiLike – Preference Based Hotel Search • Research prototype • Showcase algorithms leveraging user reviews
  • 18. Different tools to help with decision making based on opinions
  • 19. Can we use this in other domains?
  • 21. Healthcare provider search Family Doctor: "speaks fluent spanish" "good with kids" "calming clinic"
  • 22. How to implement such a system?
  • 23. 1. Preference Based Search How to recommend relevant entities using user reviews and a users’ unstructured preference?
  • 24. 2. Query Understanding "good for kids" == "child friendly" ? "baggy" == "oversized" ?
  • 25. 3. Summarization How do you provide concise views of user reviews?
  • 26. 4. Combine structured + unstructured info How do you combine structured + unstructured info to provide a powerful search experience?
  • 27. Let's zoom into two components Preference Based Search Summarization Query Understanding Structured + Unstrcutred Search
  • 29. Click to add text Query (short) Document <content> Document <content> Document <content> Traditional Search Click to add text Query (Preferences) Entity <content> Entity <content> Entity <content> Preference Based Search
  • 30. Each entity represented by entity document Hotel Los Angeles (Entity) Vagabond Inn (Entity) Holiday Inn Los Angeles (Entity) Entity Document (content) review1: this hotel was absolutely... review2: nice pet friendly hotel... Entity Document (content) review1: extremely old and dirty... review2: nice little place although... Entity Document (content) review1: the continental breakfast was... review2: I will never stay at this place..
  • 31. Use regular search technologies to index Entity Document review1: this hotel was absolutely... review2: nice pet friendly hotel... Entity Document review1: extremely old and dirty... review2: nice little place although... Entity Document review1: the continental breakfast was... review2: I will never stay at this place.. Hotel Los Angeles (Entity) Vagabond Inn (Entity) Holiday Inn Los Angeles (Entity)
  • 32. more keyword matches = better entity ranks Entity Document Review 1: I loved that this hotel was pet friendly... Review 2: nice pet friendly hotel... Review 3: pets are welcome at this hotel... Review 4: I brought my puppy along since this was... Pet friendlyPreference / query: Hotel Los Angeles
  • 33. #1: Uneven Entity Document Length extremel Entity Document (Hotel A) Review 1: This hotel was absolutely... Review 2: Nice pet friendly hotel... Review 3: Beautiful hotel with friendly... Review 4: My pets were really happy... Review 5: Our stay was fantastic. We... ...................................... ...................................... ...................................... ...................................... ...................................... ...................................... Review 1000: Unfortunately, we had a bad... Entity Document (Hotel B) Review 1: We had muffis for bfast... Review 2: Dull hotel with very little... ...................................... ...................................... ...................................... Review 10: Unfortunately, we had a bad... more matches ==> relevance goes up ↑
  • 34. #1: Uneven Entity Document Length Solution: similarity models that have saturation points
  • 36. #2: Matching similar concepts extremel pet friendly Entity Document Review 1: I loved that this hotel was pet friendly... Review 2: nice dog friendly hotel... Review 3: pets are welcome at this hotel... Review 4: Brought my kitty along since this is an animal friendly facility... Preference / query:
  • 37. #2: Matching similar concepts extremel pet friendly Entity Document Review 1: I loved that this hotel was pet friendly... Review 2: nice dog friendly hotel... Review 3: pets are welcome at this hotel... Review 4: Brought my kitty along since this is an animal friendly facility... Preference / query: Expand query with related concepts
  • 38. #3: Dealing with negations
  • 39. #3: Dealing with negations extremel pet friendly This hotel was NOT pet friendly so I could not bring my dog Preference / query:
  • 40. #3: Dealing with negations extremel pet friendly This hotel was NOT pet friendly so I could not bring my dog Preference / query: Nearby negations should be considered
  • 41. Opinion-Based Entity Ranking – Ganesan & Zhai 2012 http://kavita-ganesan.com/opinion-based-entity-ranking/
  • 42. Let's zoom into summarization of opinions Preference Based Search Summarization Query Understanding Structured + Unstrcutred Search
  • 43. What can review summaries provide that search cannot?
  • 44. Unexpected Information That can make or break the deal
  • 47. Lots of summarization options Concise or aggregated view of what people are saying Cleanliness Comfort Staff Views 4.0/5 2.0/5 3.0/5 4.5/5
  • 48. Natural Language Opinion Summary • Aggregation of what people think • Concise, conveys essential info • Analytics
  • 49. Opinosis: Graph Based Sentence Compression Approach (Ganesan & Zhai Coling 2010)
  • 50. Sentence Compression Approach “The bed was comfortable, I loved it!” “The bed was really comfortable..” “The tempurpedic bed was super comfy”
  • 51. Sentence Compression Approach “The bed was comfortable, I loved it!” “The bed was really comfortable..” “The tempurpedic bed was super comfy” bed was comfortable (3)
  • 52. Opinosis: Uses a Word Graph “The bed was comfortable, I loved it!” “The bed was really comfortable..” “The tempurpedic bed was super comfy” bed I really comfortable . tempurpedic the , loved itwas super . WORD GRAPH
  • 53. What's nice about this technique? 1. Really speedy for ad hoc summarization • No need to pre-compute • Generate summaries for specific topics
  • 54. What's nice about this technique? 1. Really speedy for ad hoc summarization 2. Captures redundancies naturally • Automatically provides the analytics component
  • 55. What's nice about this technique? 1. Really speedy for ad hoc summarization 2. Captures redundancies naturally 3. Extract phrases from existing sentence structure • Limiting external dependencies
  • 56. Try out Web API or JAR File www. kavita-ganesan.com/opinosis/
  • 59. Leverage text data to empower consumers…
  • 60. Help users make faster decisions...