A Clustering Analysis of Tweet Length and its Relation to Sentiment

A CLUSTERING ANALYSIS OF
TWEET LENGTH AND ITS
RELATION TO SENTIMENT

Research Project
Matthew Mayo
School of Computer Science
Columbus State University
Columbus, GA 31907
CPSC 6185 Intelligent Agents
Dr. Rania Hodhod

Twitter
• Popular microblogging web service
• 140 character per message (tweet) limit
• Started in 2006, over 645 million users today*
• 58 million tweets per day*
• 9,000 tweets per minute*
* Source: www.statisticbrain.com/twitter-statistics

Sentiment Analysis
• Identifying, extracting & processing subjective
information from source material
• Subjective information includes attitudes, emotions
& opinions
• Appropriate for binary classification (positive vs.
negative, good vs. bad, etc.)
• Useful for movie reviews, political election opinions,
etc.

Project Aim
Interested in exploring the relationship between:
• Length of tweet (number of characters)
AND
• Sentiment score of tweet

Problem Description
The research project tasks:
1. Capture Twitter data
2. Build custom sentiment dictionary
3. Process tweets
4. Create dataset
5. Cluster tweet data

A Clustering Analysis of Tweet Length and its Relation to Sentiment

More Related Content

What's hot (10)

Recently uploaded (20)

A Clustering Analysis of Tweet Length and its Relation to Sentiment