SlideShare a Scribd company logo
Sentiment Analysis
by NLTK
Wei-Ting Kuo
PyconApac2015
http://goo.gl/wJelD4
Sentiment Analysis?
Aim to determine the attitude of a
speaker/writer with respect to some text.
Amazon’s Customer Review
Twitter Follower’s replies
Positive or Negative?
This is a good book!
This is a good book!

I like it!
Positive
The first chapter is good,

but the rest is terrible
Negative
(more)

Positive
This is a bad book! Negative
How to compute it?
This is a good book!
From scouter: This is a postive review
Why is sentiment
analysis useful?
• This is a terrible book. Because it’s important, so I
mention three times, terrible, terrible, terrible!!!
Let’s begin with the
easiest way!
Sentiment Dictionary
like 1

good 2

bad -2

terrible -3
This is a good book!
Dictionary

like 1

good 2

bad -2

terrible -3
2
This is a good book!

I like it!
Positive
3
The first chapter is good,

but the rest is terrible
-1 Negative
(more)

Positive
This is a bad book! -2 Negative
AFINN-111
• http://www2.imm.dtu.dk/pubdb/
views/publication_details.php?
id=6010
• A list of words rated between

-5 (neg) to 5 (pos)
Let’s build the dictionary
in Python
Let’s split the sentence first
And compute the score
•
Recap
What if the text is long?
And have many punctuation?
• Nice book! Though it is lack of advanced topics.
It’s still good for beginners.
Doesn’t work!
Sentiment analysis-by-nltk
NLTK to the rescue
• Natural Language ToolKit
• Works with Python3!
Tokenization
the process of breaking a stream of text up into
words, phrases, symbols, or other meaningful
elements called tokens
Word tokenize
It seems we lose some
information
• Nice book! Though it is lack of advanced
topics. It’s still good for beginners.
Positive: 3
Separate to multiple
sentences first
• Nice book!
• Though it is lack of advanced topics.
• It’s still good for beginners.
Positive
Negative
Positive
How to split the text to
sentences?
Sentence tokenize
Compute score for
each sentence
But we still miss some
information in another case
It’s a bad idea to buy this book.
Negative
Negative
The first chapter is good,

but the rest is terrible and confusing
At least the customer mentioned something good,
but it’s not recorded
Let’s count Pos & Neg
separately
The first chapter is good,

but the rest is terrible and confusing
It’s a bad idea to buy this book.
Neg: -5
Neg: -3
Pos:3
In Python
how about new words?
how about domain specific term?
Machine Learning!!!
Traing Data
(with Labels)
Model
Trained ModelReal Data Prediction
Training Data
• This is a good book! Postive
• This is a awesome book! Postive
• This is a bad book! Negative
• This is a terrible book! Negative
• This is a good book! Postive
• This is a awesome book! Postive
• This is a bad book! Negative
• This is a terrible book Negative
Model
This is a good article
This is a bad article
Trained Model
Postive
Negative
The format NLTK use
Prepare the Training Set
Build the model, and train it!
Real Case
• Movie Review Data

http://www.cs.cornell.edu/people/pabo/movie-review-data/
• 5331 positive reviews & 5331 negative reviews
labelled by human.
Positive Reviews
Negative Reviews
Read our data
Separate the data
Training Data Testing Data
Training data to train the model
Testing data to compute the accuracy
Separate our data
Train the data
Compute the accuracy
accuracy = number of correct / total
How to enhance?
• Use the most frequent 1000 words only
• Use different model, maybe SVC
• Read more paper about the latest research
Q & A

More Related Content

PDF
SENTIMENT ANALYSIS OF TWITTER DATA
PDF
Introduction to Sentiment Analysis
PPTX
Sentiment analysis
PPTX
Sentiment Analysis
PPTX
Sentiment analysis
PPTX
New sentiment analysis of tweets using python by Ravi kumar
PDF
Amazon sentimental analysis
PDF
Sentiment Analysis
SENTIMENT ANALYSIS OF TWITTER DATA
Introduction to Sentiment Analysis
Sentiment analysis
Sentiment Analysis
Sentiment analysis
New sentiment analysis of tweets using python by Ravi kumar
Amazon sentimental analysis
Sentiment Analysis

What's hot (20)

PPTX
Sentiment Analysis using Twitter Data
PDF
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
PPTX
Sentiment analysis of Twitter data using python
PPT
Twitter sentiment-analysis Jiit2013-14
PPTX
Presentation on Sentiment Analysis
PPTX
Approaches to Sentiment Analysis
PPT
Social Media Sentiment Analysis
DOCX
Twitter sentiment analysis project report
PPT
How Sentiment Analysis works
PPTX
social network analysis project twitter sentimental analysis
PPTX
Sentiment Analysis on Twitter
DOCX
Python report on twitter sentiment analysis
PPTX
Twitter sentiment analysis ppt
PDF
Sentiment analysis - Our approach and use cases
PPTX
Sentiment Analysis Using Product Review
PPTX
Sentiment Analysis
PDF
project sentiment analysis
PPTX
Sentiment analysis using imdb 50 k data
PPTX
Tweet sentiment analysis (Data mining)
PPTX
Twitter sentiment analysis ppt
Sentiment Analysis using Twitter Data
Sentiment Analysis Using Hybrid Structure of Machine Learning Algorithms
Sentiment analysis of Twitter data using python
Twitter sentiment-analysis Jiit2013-14
Presentation on Sentiment Analysis
Approaches to Sentiment Analysis
Social Media Sentiment Analysis
Twitter sentiment analysis project report
How Sentiment Analysis works
social network analysis project twitter sentimental analysis
Sentiment Analysis on Twitter
Python report on twitter sentiment analysis
Twitter sentiment analysis ppt
Sentiment analysis - Our approach and use cases
Sentiment Analysis Using Product Review
Sentiment Analysis
project sentiment analysis
Sentiment analysis using imdb 50 k data
Tweet sentiment analysis (Data mining)
Twitter sentiment analysis ppt
Ad

Similar to Sentiment analysis-by-nltk (13)

PPTX
Analogy
PPT
Achieving an a in the reading section
PPT
ER: Extensive Reading/Effective Reading
PPTX
Book Reviews and You!
PPT
Feature writing
PPT
November 5 --28. future and modals of probability and advice
PPTX
Unit 5. hướng dẫn viết bài phân tích pie chart.pptx
PPTX
D38-CREM
PPTX
U72 lesson 09
PPTX
Peer review clickbait
PDF
7 Tips for Writing a Film Review
PPTX
Skellig STAR Lesson
PPTX
Types of Paragraph.pptx
Analogy
Achieving an a in the reading section
ER: Extensive Reading/Effective Reading
Book Reviews and You!
Feature writing
November 5 --28. future and modals of probability and advice
Unit 5. hướng dẫn viết bài phân tích pie chart.pptx
D38-CREM
U72 lesson 09
Peer review clickbait
7 Tips for Writing a Film Review
Skellig STAR Lesson
Types of Paragraph.pptx
Ad

Sentiment analysis-by-nltk