SlideShare a Scribd company logo
Where is my tweet?
Henok Mengistu
Insight Data Engineering Fellow
Silicon Valley
Summer 2016
Motivation
Motivation
But, this number doesn't show how the tweet spreads-out?
But, a re-tweet graph could show
A Demo
http://52.33.140.25/
http://www.whereismytweet.online/
InsightDataengineering_Henok_Final
InsightDataengineering_Henok_Final
The data
● An original tweet
{ ….
"text": "McAllen, Texas- 8 miles from U.S. - Mexico border ", "id":
743485792585146368, "user": { "id": 25073877,"screen_name":
"realDonaldTrump" },"created_at": "Thu Jun 16 16:50:00 +0000 2016"
…. }
● A re-tweet
{…
"text": “McAllen, Texas- 8 miles from U.S. - Mexico border”
"screen_name": "trinnitythomps1","created_at": "Thu Jun 16
16:50:13 +0000 2016", "user": {"id": 2894078186}
"retweeted_status":{"id": 743485792585146368}
...}
Data Pipeline
Watches out for a tweet by
a specific user
Filter and sort re-tweets
and
put them in Redis
Builds graph
Creates a topic with tweets
as messages
● I am Henok
– Originally, from Ethiopia
– Currently, a PhD student at the University of
Wyoming
● Working on Evolutionary Computation
– I like playing and watching Soccer
– But skiing, not so much so
Thank you!
Queries
● On the re-tweet graph
– who are my audiences?
● Geographically, social groups
– Betweenness centrality
● Who is relevant to spread out my tweet?
● Identify influential followers

More Related Content

PDF
Job shop scheduling
PPT
Sequencing problems in Operations Research
PDF
Insight dataengineering henok_yelpdemo
PDF
Insight dataengineering henok_yelpdemo
PDF
Insight dataengineering henok_rehearsaldemo
PDF
Insight dataengineering henok_YelpDemo
PDF
Human Mobility (with Mobile Devices)
PPT
Coalmine spie 2012 presentation - jsw -d3
Job shop scheduling
Sequencing problems in Operations Research
Insight dataengineering henok_yelpdemo
Insight dataengineering henok_yelpdemo
Insight dataengineering henok_rehearsaldemo
Insight dataengineering henok_YelpDemo
Human Mobility (with Mobile Devices)
Coalmine spie 2012 presentation - jsw -d3

Similar to InsightDataengineering_Henok_Final (9)

PDF
500Startups @ Twitter
PDF
Twitterology - The Science of Twitter
KEY
What's happening here?
PDF
Twitter for CS10 @ Berkeley (Spring 2011)
PDF
CSE5656 Complex Networks - Gathering Data from Twitter
PDF
How to measure Twitter
PDF
Information Visualization Project
PDF
Unleashing Twitter Data for Fun and Insight
PDF
Unleashing twitter data for fun and insight
500Startups @ Twitter
Twitterology - The Science of Twitter
What's happening here?
Twitter for CS10 @ Berkeley (Spring 2011)
CSE5656 Complex Networks - Gathering Data from Twitter
How to measure Twitter
Information Visualization Project
Unleashing Twitter Data for Fun and Insight
Unleashing twitter data for fun and insight
Ad

InsightDataengineering_Henok_Final

  • 1. Where is my tweet? Henok Mengistu Insight Data Engineering Fellow Silicon Valley Summer 2016
  • 3. Motivation But, this number doesn't show how the tweet spreads-out?
  • 4. But, a re-tweet graph could show
  • 8. The data ● An original tweet { …. "text": "McAllen, Texas- 8 miles from U.S. - Mexico border ", "id": 743485792585146368, "user": { "id": 25073877,"screen_name": "realDonaldTrump" },"created_at": "Thu Jun 16 16:50:00 +0000 2016" …. } ● A re-tweet {… "text": “McAllen, Texas- 8 miles from U.S. - Mexico border” "screen_name": "trinnitythomps1","created_at": "Thu Jun 16 16:50:13 +0000 2016", "user": {"id": 2894078186} "retweeted_status":{"id": 743485792585146368} ...}
  • 9. Data Pipeline Watches out for a tweet by a specific user Filter and sort re-tweets and put them in Redis Builds graph Creates a topic with tweets as messages
  • 10. ● I am Henok – Originally, from Ethiopia – Currently, a PhD student at the University of Wyoming ● Working on Evolutionary Computation – I like playing and watching Soccer – But skiing, not so much so
  • 12. Queries ● On the re-tweet graph – who are my audiences? ● Geographically, social groups – Betweenness centrality ● Who is relevant to spread out my tweet? ● Identify influential followers