SlideShare a Scribd company logo
Text Analytics
Project
k.Dharmesh
170202011
Agenda
 Objective
 Approach
 Data Preparation
 Model
 Conclusion
Objective
To analyse the “La Binchoise “ béer reviews and provide useful
insights to improve the brand quality.
Approach
 Taking out La Binchoise beer reviews from all the reviews.
 Splitting up positive and negative reviews.
 Removing stop words from all the reviews.
 Categorization positive, negative and all reviews into single
strings each
 Plotting the word cloud to see the more frequently used words.
Data preparation
 We have to subset the dataset into high rating reviews and low rating
reviews
 This high and low rating reviews are classified based on there rating
 High reviews means >3
 Low Reviews means <3
 Removing the stop word
Data Preparation
 Importing regular expression and removing unnecessary symbols and spaces
Eg : ‘n’ ‘---’ ‘____’ ‘***’ etc
 Removing stop words from all the 3 types of reviews
 We will remove stop words from high and low rating reviews
Word Cloud is Generated
Plot of all positive reviews Plot of all negative reviews
Model
 Generating corpus after tokenizing to look for the concordance of specific words
Eg : In high review plot we can see that “light” is used more. To know what exactly people
are talking about “light” we need to read the full sentence. For that we use concordance
 This type analysis is called as polarity analysis, it is useful to give us good context like as
see above
 The polarity of a word can be identified by studying the occurrence frequency of the word
in a large annotated corpus
Model
 Importing genism package to perform Tfidf model
Insights
 Average rating is 4 and mostly this beer has more positive reviews
 Taste is something which most people are talking about and should improve on taste as
it gives lemon and fruit flavour than a usual beer flavour.

More Related Content

DOCX
Searching on google report
PPTX
Communication workshop in nascenia
PPT
Online feedback correlation using clustering
PDF
Experiences with Sentiment Analysis with Peter Zadrozny
PPTX
How Does Customer Feedback Sentiment Analysis Work in Search Marketing?
PPTX
Emotional efficiency Overview
PPTX
Opinion Driven Decision Support System
PDF
AppTweak Masterclass for Clevertap: Planning your ASO strategy from 0 to 100
Searching on google report
Communication workshop in nascenia
Online feedback correlation using clustering
Experiences with Sentiment Analysis with Peter Zadrozny
How Does Customer Feedback Sentiment Analysis Work in Search Marketing?
Emotional efficiency Overview
Opinion Driven Decision Support System
AppTweak Masterclass for Clevertap: Planning your ASO strategy from 0 to 100

Similar to Text analysis (20)

PPS
Top Trans Survey Translation Issues
PPTX
Beyond Engagement: 4 steps to turning digital visitors into customers
PPTX
Sentiment analysis presentation
PDF
E score overview
PDF
Beyond SEO: copywriting for professionals
PDF
Implement BDD with Cucumber and SpecFlow
PDF
Specification-by-Example: A Cucumber Implementation
PPT
Technical Writing For Consultants
PPTX
Top SEO Secrets from the Leading Organizations
PDF
search engine optimization basic seo infromation
PDF
Evidence driven development - a lean methodology to product development
PPTX
SEO Training
PDF
Fake Product Review Monitoring & Removal and Sentiment Analysis of Genuine Re...
PDF
BDD Anti-patterns
PPTX
Workshop 2 - PowerPoint Presentation v10
PPT
Rules to Good Business Writting
PDF
BV - Presentation 101
PDF
COMMTRUST: A MULTI-DIMENSIONAL TRUST MODEL FOR E-COMMERCE APPLICATIONS
PDF
Ecomagination writer's style guide
Top Trans Survey Translation Issues
Beyond Engagement: 4 steps to turning digital visitors into customers
Sentiment analysis presentation
E score overview
Beyond SEO: copywriting for professionals
Implement BDD with Cucumber and SpecFlow
Specification-by-Example: A Cucumber Implementation
Technical Writing For Consultants
Top SEO Secrets from the Leading Organizations
search engine optimization basic seo infromation
Evidence driven development - a lean methodology to product development
SEO Training
Fake Product Review Monitoring & Removal and Sentiment Analysis of Genuine Re...
BDD Anti-patterns
Workshop 2 - PowerPoint Presentation v10
Rules to Good Business Writting
BV - Presentation 101
COMMTRUST: A MULTI-DIMENSIONAL TRUST MODEL FOR E-COMMERCE APPLICATIONS
Ecomagination writer's style guide
Ad

Recently uploaded (20)

PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
PDF
[EN] Industrial Machine Downtime Prediction
PDF
Microsoft 365 products and services descrption
PPTX
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
PPT
DU, AIS, Big Data and Data Analytics.ppt
PPT
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
PPTX
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
PDF
Business Analytics and business intelligence.pdf
PPTX
retention in jsjsksksksnbsndjddjdnFPD.pptx
PDF
Microsoft Core Cloud Services powerpoint
PPTX
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
PPTX
Business_Capability_Map_Collection__pptx
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PDF
Introduction to the R Programming Language
PPTX
New ISO 27001_2022 standard and the changes
PPTX
Leprosy and NLEP programme community medicine
PDF
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
PDF
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
Pilar Kemerdekaan dan Identi Bangsa.pptx
[EN] Industrial Machine Downtime Prediction
Microsoft 365 products and services descrption
QUANTUM_COMPUTING_AND_ITS_POTENTIAL_APPLICATIONS[2].pptx
DU, AIS, Big Data and Data Analytics.ppt
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
(Ali Hamza) Roll No: (F24-BSCS-1103).pptx
Business Analytics and business intelligence.pdf
retention in jsjsksksksnbsndjddjdnFPD.pptx
Microsoft Core Cloud Services powerpoint
Market Analysis -202507- Wind-Solar+Hybrid+Street+Lights+for+the+North+Amer...
Business_Capability_Map_Collection__pptx
STERILIZATION AND DISINFECTION-1.ppthhhbx
Introduction to the R Programming Language
New ISO 27001_2022 standard and the changes
Leprosy and NLEP programme community medicine
Systems Analysis and Design, 12th Edition by Scott Tilley Test Bank.pdf
Votre score augmente si vous choisissez une catégorie et que vous rédigez une...
Optimise Shopper Experiences with a Strong Data Estate.pdf
Ad

Text analysis

  • 2. Agenda  Objective  Approach  Data Preparation  Model  Conclusion
  • 3. Objective To analyse the “La Binchoise “ béer reviews and provide useful insights to improve the brand quality.
  • 4. Approach  Taking out La Binchoise beer reviews from all the reviews.  Splitting up positive and negative reviews.  Removing stop words from all the reviews.  Categorization positive, negative and all reviews into single strings each  Plotting the word cloud to see the more frequently used words.
  • 5. Data preparation  We have to subset the dataset into high rating reviews and low rating reviews  This high and low rating reviews are classified based on there rating  High reviews means >3  Low Reviews means <3  Removing the stop word
  • 6. Data Preparation  Importing regular expression and removing unnecessary symbols and spaces Eg : ‘n’ ‘---’ ‘____’ ‘***’ etc  Removing stop words from all the 3 types of reviews  We will remove stop words from high and low rating reviews
  • 7. Word Cloud is Generated Plot of all positive reviews Plot of all negative reviews
  • 8. Model  Generating corpus after tokenizing to look for the concordance of specific words Eg : In high review plot we can see that “light” is used more. To know what exactly people are talking about “light” we need to read the full sentence. For that we use concordance  This type analysis is called as polarity analysis, it is useful to give us good context like as see above  The polarity of a word can be identified by studying the occurrence frequency of the word in a large annotated corpus
  • 9. Model  Importing genism package to perform Tfidf model
  • 10. Insights  Average rating is 4 and mostly this beer has more positive reviews  Taste is something which most people are talking about and should improve on taste as it gives lemon and fruit flavour than a usual beer flavour.