SlideShare a Scribd company logo
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 4746
SENTIMENT ANALYSIS USING TWITTER DATA
Kirti Jain1, Abhishek Singh2, Arushi Yadav3
1Asst. Professor, Dept. Of Computer Science, Inderprastha Engineering College
2, 3Student, Dept. Of Computer Science, Inderprastha Engineering College
Dr. A. P. J. Abdul Kalam Technical University
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract - Sentiment analysis as the name suggest it is the analysis of the sentiments/feelings/expression related to any
topic, it is also known as opinion mining. Here the motive is to find the general sentiment associated to given document. We
try to classify the subjective information gathered from some microblogging site according to its polarity such as positive,
neutral and negative using machine learning and natural language processing. In this project, we chose Twitter as the
microblogging source for getting peoples sentiments and try to classify the tweets into positive, neutral and negative
sentiment.
Key Words: Sentiment Analysis, Polarity, Machine Learning, Natural Language Processing, Twitter, Microblogging.
1.INTRODUCTION
Sentiment analysis studies people judgment or thought towards certain entity. Twitter is a resourceful place to find people
sentiment. Here peoples post their thought/experience/ feelings through tweets which has a character limit of 140.
Twitter has a provision for developers to collect data from twitter by releasing their APIs. In this project we are using one
of the twitter API i.e. streaming API which helps to extract the content in the real time.
Here we perform the linguistic analysis by building the classifier using the several machine learning techniques and
natural language processing by using the collected corpus from the Kaggle as the training data to train our classifier and
use the streamed corpus as the testing data to test the result of our classifier to classify the different sentiments related to
tweets.
In this project we focus on the tweets related to airline as the customer shares their experience on the twitter thorough
their tweets and our analyzer helps the airline company to improve their services by keeping an eye on people’s
sentiments by overcoming their flaws.
2. LITERATURE REVIEW
Table -1: Literature Survey Table
S.
NO
Paper Title Authors
y
e
a
r
Methods Remarks
1.
Sentiment Analysis
on Twitter Data
Varsha Sahayak,
Vijaya Shete,
Apashabi Pathan
2
0
1
5
Naïve Bayes,
Maximum Entropy,
SVM
In the survey, we found
that social media related
features can be used to
predict sentiment in
Twitter.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 4747
2.
Sentiment Analysis
on Twitter Data
Onam Bharti,
Mrs. Monika
Malhotra
2
0
1
6
Naïve Bayes, KNN
Accuracy (%)
Naive Bayes- 79.66
KNN – 83.59
3.
Sentiment analysis
of twitter data
Hamid Bagheri,
Md Johirul Islam
2
0
1
7
Naïve Bayes, Text
Blob
We realized that the
neutral sentiments are
significantly high which
shows there is a need to
improve Twitter
sentiment analysis.
4.
Study of Twitter
Sentiment Analysis
using Machine
Learning
Algorithms on
Python
Bhumika Gupta,
Monika Negi,
Kanika
Vishwakarma,
Goldi Rawat
2
0
1
7
Bayesian logistic
regression
,Naïve Bayes,
Maximum Entropy
Classifier ,
Support Vector
Machine Algorithm
This research topic has
evolved during the last
decade with models
reaching the efficiency of
almost 85%-90%. But it
still lacks the dimension
of diversity in the data.
Along with this it has a
lot of application issues
with the slang used.
5.
Sentiment Analysis
of Twitter Data
through Big Data
Anusha.N,
Divya.G,
Ramya.B
2
0
1
7
Naïve Bayes
Classification,
Training with Mahout
After the training set has
been prepared, data is
analyzed by uploading it
on HDFS and Naïve
Bayes classification is
carried out.
6.
Machine
Learning-Based
Sentiment
Analysis for
Twitter
Ali Hasan,
Sana Moin,
Ahmad Kari
and
Shahaboddin
Shamshirband
2
0
1
8
Naïve Bayes
Classifier, SVM
Classifier
This paper focuses on
the adoption of
machine-learning
algorithms to
determine the highest
accuracy for election
sentiments.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 4748
3. CORE MODULES
These are the modules which are used in our project along with their functions are given in the table below:
Table -2: Core Modules
MODULES FUNCTION
Twitter_data_streaming
It uses tweepy module for streaming live tweets
using Oauth Handler.
Create_dataset
It creates training and testing data set from the
dataset downloaded from kaggle.
Create_validation_dataset
Using the tweets downloaded, validation dataset is
generated.
Feature_extraction_word2vec Finds most similar words present in file.
Term_frequency
Feature extraction using TF-IDF(term frequency–
inverse document frequency)
Term_frequency_computaion Feature extraction using bag of words approach
Naïve_bayes_Classifier Building classifier using naïve bayes algorithm
SVM_classifier Building classifier using SVM approach
Data_setup_neural_network
It adds weight, if positive then 1 then if negative
then 0 if neutral then 2
Neural_networks
Using training data set creates a model using
backward propagation and predicts the results i.e.
by improving weights
Classification
Using training data set model created using SVM,
Naïve Bayes and neural networks and the model is
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 4749
4. RESULT
In this section we present the results of our three classifier i.e. Naïve Bayes, SVM, Neural Network.
The accuracy of all the three classifier is computed and shown below:
Table -3: Results
As the SVM has the highest accuracy that’s why we use this classifier to classify our tweets and the
result is as follows:
tested on testing set and validation set and
accuracy is also computed.
Tweets_extraction
Extract tweets to respective documents
documents.
Classifier Accuracy (%)
SVM Classifier 81.57
Naïve Bayes 71.05
Neural Networks 44.73
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 4750
Chart -1: Sentiment Analysis Bar Graph
The total no. of tweet was 23986 out of which 4153 were positive, 14407 were negative and 5426 were neutral
i.e.17.31%, 60.06% and 22.62% this much percentage hold by positive, negative and neutral tweets out of total
no. of tweets respectively.
5. CONCLUSION
We presented our result on sentiment analysis using twitter data. We use proposed models of classification in
the supervised machine learning i.e. Naïve Bayes, SVM, and Neural Networks. For feature extraction we used
the bag-of-words, term frequency, tf-idf approach. And by combining them we classify the tweets into three
different sentiment i.e. Positive, Neutral and Negative.
In future work we try to improve the accuracy of our classifier which can detect the sarcasm, irony, humor
content in the tweet.
ACKNOWLEDGEMENT
We take this opportunity to thank our teachers and friends who helped us throughout the project. First and
foremost we would like to thank my guide for the project (Ms. Kirti Jain, Assistant Professor, Computer Science
and Engineering) for her valuable advice and time during development of project.
We would also like to thank Dr. Rekha Kashyap (HOD, Computer Science Department) for her constant support
during the development of the project.
REFERENCES
[1] Varsha Sahayak, Vijaya Shete ,Apashabi Pathan,” Sentiment Analysis on Twitter Data”,Department of
Information Technology, Savitribai Phule Pune University, Pune, India,2015.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 4751
[2] Onam Bharti, Mrs. Monika Malhotra,” SENTIMENT ANALYSIS ON TWITTER DATA”, World College of
Technology and Management, Gurgaon , India, 2016.
[3] Bhumika Gupta, Monika Negi, Kanika Vishwakarma, Goldi Rawat, Priyanka Badhani , “ Study of Twitter
Sentiment Analysis using Machine Learning Algorithms on Python ” , C.S.E.D ,G.B.P.E.C, Pauri, Uttarakhand,
India, 2017.
[4] Ali Hasan, Sana Moin, Ahmad Karim and Shahaboddin Shamshirband, “Machine Learning-Based Sentiment
Analysis for Twitter Accounts” , Department for Management of Science and Technology Development, Ton Duc
Thang University,Ho Chi Minh City, Vietnam,2018.
[5] Hamid Bagheri, Md Johirul Islam, “Sentiment analysis of twitter data” , Computer Science Department Iowa
State University,United States of America, 2017.
[6] Anusha.N, Divya.G, Ramya.B, “Sentiment Analysis of Twitter Data through Big Data”, Computer Science and
Engineering Sai Vidya Institute of Technology Bangalore, India, 2017.

More Related Content

PDF
IRJET- Syllabus and Timetable Generation System
PDF
IRJET- E-Attendance Manager: A Review
PDF
IRJET- Identify the Human or Bots Twitter Data using Machine Learning Alg...
PDF
IRJET- Intelligent Laboratory Management System based on Internet of Thin...
PDF
IRJET- College Enquiry Chat-Bot using API.AI
PDF
IRJET- Instant Exam Paper Generator
PDF
IRJET- A Survey on Mining of Tweeter Data for Predicting User Behavior
PDF
IRJET - Student Sentiment Analysis using Android Application
IRJET- Syllabus and Timetable Generation System
IRJET- E-Attendance Manager: A Review
IRJET- Identify the Human or Bots Twitter Data using Machine Learning Alg...
IRJET- Intelligent Laboratory Management System based on Internet of Thin...
IRJET- College Enquiry Chat-Bot using API.AI
IRJET- Instant Exam Paper Generator
IRJET- A Survey on Mining of Tweeter Data for Predicting User Behavior
IRJET - Student Sentiment Analysis using Android Application

What's hot (20)

PDF
IRJET- Intelligence Quotient Tester
PDF
IRJET - College Enquiry Chatbot
PDF
IRJET- Proximity Detection Warning System using Ray Casting
PDF
IRJET- Development of College Enquiry Chatbot using Snatchbot
PDF
Adapting E- learning using Multiagent System
PDF
CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008
PDF
IRJET- Automated CV Classification using Clustering Technique
PDF
Online examination system
DOCX
VTU final year project report
PDF
Mcsp 060 project guidelines july 2012
PDF
IRJET - Online Assignment System
PDF
A DISTRIBUTED MACHINE LEARNING BASED IDS FOR CLOUD COMPUTING
PDF
Specification based testing
PDF
IRJET- Career Counselling Chatbot
PDF
IRJET- Intelligence Extraction using Machine Learning Technics
PDF
Ignou MCA mini project report
PDF
Automatic Query Expansion Using Word Embedding Based on Fuzzy Graph Connectiv...
PDF
An Effective Job Recruitment System Using Content-based Filtering
PDF
IRJET- PDF Extraction using Data Mining Techniques
PDF
IRJET- A Study on Automated Attendance System using Facial Recognition
IRJET- Intelligence Quotient Tester
IRJET - College Enquiry Chatbot
IRJET- Proximity Detection Warning System using Ray Casting
IRJET- Development of College Enquiry Chatbot using Snatchbot
Adapting E- learning using Multiagent System
CHATBOT FOR COLLEGE RELATED QUERIES | J4RV4I1008
IRJET- Automated CV Classification using Clustering Technique
Online examination system
VTU final year project report
Mcsp 060 project guidelines july 2012
IRJET - Online Assignment System
A DISTRIBUTED MACHINE LEARNING BASED IDS FOR CLOUD COMPUTING
Specification based testing
IRJET- Career Counselling Chatbot
IRJET- Intelligence Extraction using Machine Learning Technics
Ignou MCA mini project report
Automatic Query Expansion Using Word Embedding Based on Fuzzy Graph Connectiv...
An Effective Job Recruitment System Using Content-based Filtering
IRJET- PDF Extraction using Data Mining Techniques
IRJET- A Study on Automated Attendance System using Facial Recognition
Ad

Similar to IRJET- Sentiment Analysis using Twitter Data (20)

PDF
IRJET- Sentiment Analysis of Twitter Data using Python
PDF
Sentiment Analysis of Twitter Data
PDF
Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...
PDF
IRJET - Twitter Sentiment Analysis using Machine Learning
PDF
IRJET- Twitter Opinion Mining
PDF
Sentiment Analysis on Twitter Data
PDF
Sentimental Emotion Analysis using Python and Machine Learning
PDF
IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag
PDF
IRJET- Real Time Sentiment Analysis of Political Twitter Data using Machi...
PDF
IRJET-Sentiment Analysis in Twitter
PDF
Twitter Sentiment Analysis
PDF
IRJET- Design and Implementation of Sentiment Analyzer for Top Engineering Co...
PDF
Sentiment Analysis on Twitter data using Machine Learning
PDF
Detailed Investigation of Text Classification and Clustering of Twitter Data ...
PDF
IRJET- Comparative Study of Classification Algorithms for Sentiment Analy...
PDF
Sentiment Analysis and Classification of Tweets using Data Mining
PDF
IRJET - Twitter Sentimental Analysis
PPTX
Svm and maximum entropy model for sentiment analysis of tweets
PDF
IRJET- A Real-Time Twitter Sentiment Analysis and Visualization System: Twisent
PDF
A STUDY ON TWITTER SENTIMENT ANALYSIS USING DEEP LEARNING
IRJET- Sentiment Analysis of Twitter Data using Python
Sentiment Analysis of Twitter Data
Combining Lexicon based and Machine Learning based Methods for Twitter Sentim...
IRJET - Twitter Sentiment Analysis using Machine Learning
IRJET- Twitter Opinion Mining
Sentiment Analysis on Twitter Data
Sentimental Emotion Analysis using Python and Machine Learning
IRJET - Implementation of Twitter Sentimental Analysis According to Hash Tag
IRJET- Real Time Sentiment Analysis of Political Twitter Data using Machi...
IRJET-Sentiment Analysis in Twitter
Twitter Sentiment Analysis
IRJET- Design and Implementation of Sentiment Analyzer for Top Engineering Co...
Sentiment Analysis on Twitter data using Machine Learning
Detailed Investigation of Text Classification and Clustering of Twitter Data ...
IRJET- Comparative Study of Classification Algorithms for Sentiment Analy...
Sentiment Analysis and Classification of Tweets using Data Mining
IRJET - Twitter Sentimental Analysis
Svm and maximum entropy model for sentiment analysis of tweets
IRJET- A Real-Time Twitter Sentiment Analysis and Visualization System: Twisent
A STUDY ON TWITTER SENTIMENT ANALYSIS USING DEEP LEARNING
Ad

More from IRJET Journal (20)

PDF
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
PDF
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
PDF
Kiona – A Smart Society Automation Project
PDF
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
PDF
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
PDF
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
PDF
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
PDF
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
PDF
BRAIN TUMOUR DETECTION AND CLASSIFICATION
PDF
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
PDF
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
PDF
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
PDF
Breast Cancer Detection using Computer Vision
PDF
Auto-Charging E-Vehicle with its battery Management.
PDF
Analysis of high energy charge particle in the Heliosphere
PDF
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
PDF
Auto-Charging E-Vehicle with its battery Management.
PDF
Analysis of high energy charge particle in the Heliosphere
PDF
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
Enhanced heart disease prediction using SKNDGR ensemble Machine Learning Model
Utilizing Biomedical Waste for Sustainable Brick Manufacturing: A Novel Appro...
Kiona – A Smart Society Automation Project
DESIGN AND DEVELOPMENT OF BATTERY THERMAL MANAGEMENT SYSTEM USING PHASE CHANG...
Invest in Innovation: Empowering Ideas through Blockchain Based Crowdfunding
SPACE WATCH YOUR REAL-TIME SPACE INFORMATION HUB
A Review on Influence of Fluid Viscous Damper on The Behaviour of Multi-store...
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...
Explainable AI(XAI) using LIME and Disease Detection in Mango Leaf by Transfe...
BRAIN TUMOUR DETECTION AND CLASSIFICATION
The Project Manager as an ambassador of the contract. The case of NEC4 ECC co...
"Enhanced Heat Transfer Performance in Shell and Tube Heat Exchangers: A CFD ...
Advancements in CFD Analysis of Shell and Tube Heat Exchangers with Nanofluid...
Breast Cancer Detection using Computer Vision
Auto-Charging E-Vehicle with its battery Management.
Analysis of high energy charge particle in the Heliosphere
A Novel System for Recommending Agricultural Crops Using Machine Learning App...
Auto-Charging E-Vehicle with its battery Management.
Analysis of high energy charge particle in the Heliosphere
Wireless Arduino Control via Mobile: Eliminating the Need for a Dedicated Wir...

Recently uploaded (20)

PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
DOCX
573137875-Attendance-Management-System-original
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PPTX
additive manufacturing of ss316l using mig welding
PPTX
Construction Project Organization Group 2.pptx
PPTX
bas. eng. economics group 4 presentation 1.pptx
PPTX
Internet of Things (IOT) - A guide to understanding
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PPTX
Geodesy 1.pptx...............................................
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PPTX
OOP with Java - Java Introduction (Basics)
PPTX
UNIT 4 Total Quality Management .pptx
PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
573137875-Attendance-Management-System-original
R24 SURVEYING LAB MANUAL for civil enggi
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
additive manufacturing of ss316l using mig welding
Construction Project Organization Group 2.pptx
bas. eng. economics group 4 presentation 1.pptx
Internet of Things (IOT) - A guide to understanding
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
Geodesy 1.pptx...............................................
Embodied AI: Ushering in the Next Era of Intelligent Systems
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
OOP with Java - Java Introduction (Basics)
UNIT 4 Total Quality Management .pptx
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd

IRJET- Sentiment Analysis using Twitter Data

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 4746 SENTIMENT ANALYSIS USING TWITTER DATA Kirti Jain1, Abhishek Singh2, Arushi Yadav3 1Asst. Professor, Dept. Of Computer Science, Inderprastha Engineering College 2, 3Student, Dept. Of Computer Science, Inderprastha Engineering College Dr. A. P. J. Abdul Kalam Technical University ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract - Sentiment analysis as the name suggest it is the analysis of the sentiments/feelings/expression related to any topic, it is also known as opinion mining. Here the motive is to find the general sentiment associated to given document. We try to classify the subjective information gathered from some microblogging site according to its polarity such as positive, neutral and negative using machine learning and natural language processing. In this project, we chose Twitter as the microblogging source for getting peoples sentiments and try to classify the tweets into positive, neutral and negative sentiment. Key Words: Sentiment Analysis, Polarity, Machine Learning, Natural Language Processing, Twitter, Microblogging. 1.INTRODUCTION Sentiment analysis studies people judgment or thought towards certain entity. Twitter is a resourceful place to find people sentiment. Here peoples post their thought/experience/ feelings through tweets which has a character limit of 140. Twitter has a provision for developers to collect data from twitter by releasing their APIs. In this project we are using one of the twitter API i.e. streaming API which helps to extract the content in the real time. Here we perform the linguistic analysis by building the classifier using the several machine learning techniques and natural language processing by using the collected corpus from the Kaggle as the training data to train our classifier and use the streamed corpus as the testing data to test the result of our classifier to classify the different sentiments related to tweets. In this project we focus on the tweets related to airline as the customer shares their experience on the twitter thorough their tweets and our analyzer helps the airline company to improve their services by keeping an eye on people’s sentiments by overcoming their flaws. 2. LITERATURE REVIEW Table -1: Literature Survey Table S. NO Paper Title Authors y e a r Methods Remarks 1. Sentiment Analysis on Twitter Data Varsha Sahayak, Vijaya Shete, Apashabi Pathan 2 0 1 5 Naïve Bayes, Maximum Entropy, SVM In the survey, we found that social media related features can be used to predict sentiment in Twitter.
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 4747 2. Sentiment Analysis on Twitter Data Onam Bharti, Mrs. Monika Malhotra 2 0 1 6 Naïve Bayes, KNN Accuracy (%) Naive Bayes- 79.66 KNN – 83.59 3. Sentiment analysis of twitter data Hamid Bagheri, Md Johirul Islam 2 0 1 7 Naïve Bayes, Text Blob We realized that the neutral sentiments are significantly high which shows there is a need to improve Twitter sentiment analysis. 4. Study of Twitter Sentiment Analysis using Machine Learning Algorithms on Python Bhumika Gupta, Monika Negi, Kanika Vishwakarma, Goldi Rawat 2 0 1 7 Bayesian logistic regression ,Naïve Bayes, Maximum Entropy Classifier , Support Vector Machine Algorithm This research topic has evolved during the last decade with models reaching the efficiency of almost 85%-90%. But it still lacks the dimension of diversity in the data. Along with this it has a lot of application issues with the slang used. 5. Sentiment Analysis of Twitter Data through Big Data Anusha.N, Divya.G, Ramya.B 2 0 1 7 Naïve Bayes Classification, Training with Mahout After the training set has been prepared, data is analyzed by uploading it on HDFS and Naïve Bayes classification is carried out. 6. Machine Learning-Based Sentiment Analysis for Twitter Ali Hasan, Sana Moin, Ahmad Kari and Shahaboddin Shamshirband 2 0 1 8 Naïve Bayes Classifier, SVM Classifier This paper focuses on the adoption of machine-learning algorithms to determine the highest accuracy for election sentiments.
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 4748 3. CORE MODULES These are the modules which are used in our project along with their functions are given in the table below: Table -2: Core Modules MODULES FUNCTION Twitter_data_streaming It uses tweepy module for streaming live tweets using Oauth Handler. Create_dataset It creates training and testing data set from the dataset downloaded from kaggle. Create_validation_dataset Using the tweets downloaded, validation dataset is generated. Feature_extraction_word2vec Finds most similar words present in file. Term_frequency Feature extraction using TF-IDF(term frequency– inverse document frequency) Term_frequency_computaion Feature extraction using bag of words approach Naïve_bayes_Classifier Building classifier using naïve bayes algorithm SVM_classifier Building classifier using SVM approach Data_setup_neural_network It adds weight, if positive then 1 then if negative then 0 if neutral then 2 Neural_networks Using training data set creates a model using backward propagation and predicts the results i.e. by improving weights Classification Using training data set model created using SVM, Naïve Bayes and neural networks and the model is
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 4749 4. RESULT In this section we present the results of our three classifier i.e. Naïve Bayes, SVM, Neural Network. The accuracy of all the three classifier is computed and shown below: Table -3: Results As the SVM has the highest accuracy that’s why we use this classifier to classify our tweets and the result is as follows: tested on testing set and validation set and accuracy is also computed. Tweets_extraction Extract tweets to respective documents documents. Classifier Accuracy (%) SVM Classifier 81.57 Naïve Bayes 71.05 Neural Networks 44.73
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 4750 Chart -1: Sentiment Analysis Bar Graph The total no. of tweet was 23986 out of which 4153 were positive, 14407 were negative and 5426 were neutral i.e.17.31%, 60.06% and 22.62% this much percentage hold by positive, negative and neutral tweets out of total no. of tweets respectively. 5. CONCLUSION We presented our result on sentiment analysis using twitter data. We use proposed models of classification in the supervised machine learning i.e. Naïve Bayes, SVM, and Neural Networks. For feature extraction we used the bag-of-words, term frequency, tf-idf approach. And by combining them we classify the tweets into three different sentiment i.e. Positive, Neutral and Negative. In future work we try to improve the accuracy of our classifier which can detect the sarcasm, irony, humor content in the tweet. ACKNOWLEDGEMENT We take this opportunity to thank our teachers and friends who helped us throughout the project. First and foremost we would like to thank my guide for the project (Ms. Kirti Jain, Assistant Professor, Computer Science and Engineering) for her valuable advice and time during development of project. We would also like to thank Dr. Rekha Kashyap (HOD, Computer Science Department) for her constant support during the development of the project. REFERENCES [1] Varsha Sahayak, Vijaya Shete ,Apashabi Pathan,” Sentiment Analysis on Twitter Data”,Department of Information Technology, Savitribai Phule Pune University, Pune, India,2015.
  • 6. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 05 | May 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 4751 [2] Onam Bharti, Mrs. Monika Malhotra,” SENTIMENT ANALYSIS ON TWITTER DATA”, World College of Technology and Management, Gurgaon , India, 2016. [3] Bhumika Gupta, Monika Negi, Kanika Vishwakarma, Goldi Rawat, Priyanka Badhani , “ Study of Twitter Sentiment Analysis using Machine Learning Algorithms on Python ” , C.S.E.D ,G.B.P.E.C, Pauri, Uttarakhand, India, 2017. [4] Ali Hasan, Sana Moin, Ahmad Karim and Shahaboddin Shamshirband, “Machine Learning-Based Sentiment Analysis for Twitter Accounts” , Department for Management of Science and Technology Development, Ton Duc Thang University,Ho Chi Minh City, Vietnam,2018. [5] Hamid Bagheri, Md Johirul Islam, “Sentiment analysis of twitter data” , Computer Science Department Iowa State University,United States of America, 2017. [6] Anusha.N, Divya.G, Ramya.B, “Sentiment Analysis of Twitter Data through Big Data”, Computer Science and Engineering Sai Vidya Institute of Technology Bangalore, India, 2017.