SlideShare a Scribd company logo
TEXT ANALYTICS
Analysis of reviews fetched
from IMDB for Hobbit Series
1
Submitted By :-
Amrapalli Karan
Kamalika Some
Krishanu Mukherjee
Somenath Sit
Objective
• Web Crawling from IMDB for 3 sequels of The Hobbit.
• Creation of Term Document Matrix and WordCloud
• Dimension Reduction using Latent Semantic Analysis
• Influencing Words in Ratings
• Comparison of sentiments expressed in reviews and ratings
given
5/2/2016 2
Web Crawling
5/2/2016 3
Cleaning of TDM
5/2/2016 4
TDM Dictionary
Final TDM
Filtered TDM
Excluded few common but unnecessary
words like "hobbit", "film", "movie",
"movies“ etc.
Dictionary with
common english
words
Dominating Words in TDM
5/2/2016 5
Hobbit - 2012 Hobbit - 2013 Hobbit - 2014
Dimension Reduction using LSA
5/2/2016 6
TK DK SK
Important Variables in TK matrix
5/2/2016 7
Built a model with “satisfaction” as response variable, to find out which variable are having
more power in predicting the “Ratings
satisfaction=ifelse(Ratings<5,"Dissatisfied",
ifelse(Ratings<7,"Satisfied","Impressed!"))
Dimension Reduction using LSA
5/2/2016 8
• Plotted variable importance with Scree Plot.
• Take optimal no of variables (documents) to filter DK matrix.
DK matrix with important variables
5/2/2016 9
For Hobbit-2012
5/2/2016 10
• Story ,book, like these words
are having deciding power in
“Ratings”.
• People talked more about the
book and the story line.
For Hobbit-2013
5/2/2016 11
• Series ,good , great, story
these words are having
deciding power in “Ratings”.
• In 2013 also viewers were only
impressed with the story,
battles etc.
For Hobbit-2014
5/2/2016 12
• Beside like, good; bad, story
these words are also having
deciding power in “Ratings”.
• Along with the good words,
some negative words have
been used here.
• Story, book these things are not
that effecting in comparison
with previous sequels.
Sentiment Analysis
5/2/2016 13
• A basic task in sentiment analysis is classifying
the polarity of a given text at the document,
sentence, or feature/aspect level — whether
the expressed opinion in a document, a
sentence is positive, negative, or neutral.
• We performed sentiment analysis (polarity) on
the movie reviews of Hobbit and its sequels .
• We used R for the analysis.
Average Ratings
5/2/2016 14
3.65
2.52
4.13
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
Hobbit :An Unexpected Journey (2012) For Hobbit: The Desolation of Smaug
(2013)
For Hobbit: The Battle of the Five
Armies (2014)
Key Findings………
• For Hobbit :An Unexpected Journey (2012),
97% of the negative ratings had a negative
polarity for the corresponding reviews, while
78% of the positive ratings had a positive
polarity for the corresponding reviews.
• 76.5% of the ratings were negative.
• The average polarity of the reviews was
(0.012).
Key Findings….(contd)…
• For Hobbit: The Desolation of Smaug (2013),
85% of the negative ratings had a negative
polarity for the corresponding reviews, while
60% of the positive ratings had a positive
polarity for the corresponding reviews.
• 89% of the ratings were negative.
• The average polarity of the reviews was
(0.004).
Key Findings….(contd)…
• For Hobbit: The Battle of the Five
Armies (2014), only 3% of the negative ratings
had a negative polarity for the corresponding
reviews, while 100% of the positive ratings
had a positive polarity for the corresponding
reviews.
• 68% of the ratings were negative.
• The average polarity of the reviews was
(0.005).
Polarity Comparison - Region wise
5/2/2016 18
Hobbit 2012 Hobbit 2013
Hobbit 2014
Polarity Comparison – Region wise
5/2/2016 19
5/2/2016 20

More Related Content

PPTX
Psychographic Marketing | What You Show Know
PPTX
Outreach campaign status module
PDF
Deep Dive - Consumer Sentiment Rating & Analysis White Paper
PPTX
Sentiment analysis module
PDF
CUbRIK research on social aspects
DOCX
Proposal final
PPTX
Aspect-level sentiment analysis of customer reviews using Double Propagation
PDF
Snapshot of winning submissions- Jigsaw Academy ValueLabs Sentiment Analysis ...
Psychographic Marketing | What You Show Know
Outreach campaign status module
Deep Dive - Consumer Sentiment Rating & Analysis White Paper
Sentiment analysis module
CUbRIK research on social aspects
Proposal final
Aspect-level sentiment analysis of customer reviews using Double Propagation
Snapshot of winning submissions- Jigsaw Academy ValueLabs Sentiment Analysis ...

Viewers also liked (13)

PPTX
Yelp Data Challenge - Discovering Latent Factors using Ratings and Reviews
PDF
Snapchat Group Snaps Proposal
PPT
"Managing User-Generated Reviews" - Jed Nachman (Yelp) - 2009 AIM Conference
PDF
Apache Giraph: Large-scale graph processing done better
PPTX
Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Parinds...
PDF
SentiCheNews - Sentiment Analysis on Newspapers and Tweets
PDF
Yelp Project
PPTX
Yelp final
PPT
How Sentiment Analysis works
PDF
Introduction to Sentiment Analysis
PPTX
Market Segmentation with Examples
PPT
The Marketing Environment
PPT
Opinion Mining Tutorial (Sentiment Analysis)
Yelp Data Challenge - Discovering Latent Factors using Ratings and Reviews
Snapchat Group Snaps Proposal
"Managing User-Generated Reviews" - Jed Nachman (Yelp) - 2009 AIM Conference
Apache Giraph: Large-scale graph processing done better
Jigsaw Mortgage Dex Data Analysis Competition Winner Presentation - Parinds...
SentiCheNews - Sentiment Analysis on Newspapers and Tweets
Yelp Project
Yelp final
How Sentiment Analysis works
Introduction to Sentiment Analysis
Market Segmentation with Examples
The Marketing Environment
Opinion Mining Tutorial (Sentiment Analysis)
Ad

Similar to Sentiment analytics (10)

PPTX
Sediment analysis: what is Sediment analysis
PDF
Effective Cyber Security Report Writing
PDF
Effective Cyber Security Report Writing
PDF
[GAN by Hung-yi Lee]Part 3: The recent research of my group
PDF
Text Mining of Movie Reviews
PPTX
Opinion Mining
PPTX
Movie Review Sentiment Analysis Final Project
PDF
Movie Summarization at TRECVID 2022
PDF
Exploiting User Comments for Audio-visual Content Indexing and Retrieval (ECI...
PDF
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
Sediment analysis: what is Sediment analysis
Effective Cyber Security Report Writing
Effective Cyber Security Report Writing
[GAN by Hung-yi Lee]Part 3: The recent research of my group
Text Mining of Movie Reviews
Opinion Mining
Movie Review Sentiment Analysis Final Project
Movie Summarization at TRECVID 2022
Exploiting User Comments for Audio-visual Content Indexing and Retrieval (ECI...
MongoDB .local San Francisco 2020: Go on a Data Safari with MongoDB Charts!
Ad

More from Kamalika Some (7)

PPT
To predict the academic performance of an elementary school using Linear Regr...
PPTX
Advanced Multiple Regression Analysis
PPTX
Telecom Analytics
PPTX
Market Research Analytics
PPTX
Enterprise Resource Management
PPTX
Human resource analytics
PPTX
Human resource analytics
To predict the academic performance of an elementary school using Linear Regr...
Advanced Multiple Regression Analysis
Telecom Analytics
Market Research Analytics
Enterprise Resource Management
Human resource analytics
Human resource analytics

Recently uploaded (20)

PDF
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
Computer network topology notes for revision
PDF
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PDF
Business Analytics and business intelligence.pdf
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPT
Predictive modeling basics in data cleaning process
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
PPT
Miokarditis (Inflamasi pada Otot Jantung)
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
Managing Community Partner Relationships
PPT
Reliability_Chapter_ presentation 1221.5784
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Capcut Pro Crack For PC Latest Version {Fully Unlocked 2025}
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Computer network topology notes for revision
Data Engineering Interview Questions & Answers Cloud Data Stacks (AWS, Azure,...
Optimise Shopper Experiences with a Strong Data Estate.pdf
oil_refinery_comprehensive_20250804084928 (1).pptx
Business Analytics and business intelligence.pdf
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Predictive modeling basics in data cleaning process
Introduction-to-Cloud-ComputingFinal.pptx
Introduction to Knowledge Engineering Part 1
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
Miokarditis (Inflamasi pada Otot Jantung)
IB Computer Science - Internal Assessment.pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Managing Community Partner Relationships
Reliability_Chapter_ presentation 1221.5784
STUDY DESIGN details- Lt Col Maksud (21).pptx
Supervised vs unsupervised machine learning algorithms
The THESIS FINAL-DEFENSE-PRESENTATION.pptx

Sentiment analytics

  • 1. TEXT ANALYTICS Analysis of reviews fetched from IMDB for Hobbit Series 1 Submitted By :- Amrapalli Karan Kamalika Some Krishanu Mukherjee Somenath Sit
  • 2. Objective • Web Crawling from IMDB for 3 sequels of The Hobbit. • Creation of Term Document Matrix and WordCloud • Dimension Reduction using Latent Semantic Analysis • Influencing Words in Ratings • Comparison of sentiments expressed in reviews and ratings given 5/2/2016 2
  • 4. Cleaning of TDM 5/2/2016 4 TDM Dictionary Final TDM Filtered TDM Excluded few common but unnecessary words like "hobbit", "film", "movie", "movies“ etc. Dictionary with common english words
  • 5. Dominating Words in TDM 5/2/2016 5 Hobbit - 2012 Hobbit - 2013 Hobbit - 2014
  • 6. Dimension Reduction using LSA 5/2/2016 6 TK DK SK
  • 7. Important Variables in TK matrix 5/2/2016 7 Built a model with “satisfaction” as response variable, to find out which variable are having more power in predicting the “Ratings satisfaction=ifelse(Ratings<5,"Dissatisfied", ifelse(Ratings<7,"Satisfied","Impressed!"))
  • 8. Dimension Reduction using LSA 5/2/2016 8 • Plotted variable importance with Scree Plot. • Take optimal no of variables (documents) to filter DK matrix.
  • 9. DK matrix with important variables 5/2/2016 9
  • 10. For Hobbit-2012 5/2/2016 10 • Story ,book, like these words are having deciding power in “Ratings”. • People talked more about the book and the story line.
  • 11. For Hobbit-2013 5/2/2016 11 • Series ,good , great, story these words are having deciding power in “Ratings”. • In 2013 also viewers were only impressed with the story, battles etc.
  • 12. For Hobbit-2014 5/2/2016 12 • Beside like, good; bad, story these words are also having deciding power in “Ratings”. • Along with the good words, some negative words have been used here. • Story, book these things are not that effecting in comparison with previous sequels.
  • 13. Sentiment Analysis 5/2/2016 13 • A basic task in sentiment analysis is classifying the polarity of a given text at the document, sentence, or feature/aspect level — whether the expressed opinion in a document, a sentence is positive, negative, or neutral. • We performed sentiment analysis (polarity) on the movie reviews of Hobbit and its sequels . • We used R for the analysis.
  • 14. Average Ratings 5/2/2016 14 3.65 2.52 4.13 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 Hobbit :An Unexpected Journey (2012) For Hobbit: The Desolation of Smaug (2013) For Hobbit: The Battle of the Five Armies (2014)
  • 15. Key Findings……… • For Hobbit :An Unexpected Journey (2012), 97% of the negative ratings had a negative polarity for the corresponding reviews, while 78% of the positive ratings had a positive polarity for the corresponding reviews. • 76.5% of the ratings were negative. • The average polarity of the reviews was (0.012).
  • 16. Key Findings….(contd)… • For Hobbit: The Desolation of Smaug (2013), 85% of the negative ratings had a negative polarity for the corresponding reviews, while 60% of the positive ratings had a positive polarity for the corresponding reviews. • 89% of the ratings were negative. • The average polarity of the reviews was (0.004).
  • 17. Key Findings….(contd)… • For Hobbit: The Battle of the Five Armies (2014), only 3% of the negative ratings had a negative polarity for the corresponding reviews, while 100% of the positive ratings had a positive polarity for the corresponding reviews. • 68% of the ratings were negative. • The average polarity of the reviews was (0.005).
  • 18. Polarity Comparison - Region wise 5/2/2016 18 Hobbit 2012 Hobbit 2013 Hobbit 2014
  • 19. Polarity Comparison – Region wise 5/2/2016 19