SlideShare a Scribd company logo
Introduction to software
that can capture data from
Twitter
Wasim Ahmed,
Information School
Email: wahmed1@Sheffield.ac.uk
Aims
• Disclaimer(s) to using Twitter data
• Overview of current Twitter data retrieval
and analysis software which require no
programming knowledge.
• Overview of public engagement work I
have been doing
27/08/2015 © The University of Sheffield
27/08/2015 © The University of Sheffield
Tools Covered in this
Presentation
• TAGS
• NodeXL
• Mozdeh
• COSMOS Project
• Chorus
Ethical, privacy and copyright
issues when using Twitter data
27/08/2015 © The University of Sheffield
Read best practice guidelines
Refer to resources such as:
• Research using Social Media; Users’
Views link here
• COSMOS Online Guide to Social Media
Research and Ethics link here
• Unlocking the value of social media – a
review of research ethics link here
• Association of Internet Researchers
(AoIR) link here
27/08/2015 © The University of Sheffield
Legal issues
• Sharing of Twitter datasets is prohibited
see https://dev.twitter.com/terms/api-terms
• However, sharing Tweet IDs (to look up
the tweets used is permissible). This is
useful for reproducibility.
27/08/2015 © The University of Sheffield
Programming knowledge!
27/08/2015 © The University of Sheffield
Learn a programming language
Check these resources out to learn how to
code:
• Websites such as Code Academy
• Visit library for programming books
• YouTube Videos
27/08/2015 © The University of Sheffield
Why Twitter (data)?
• See my LSE impact blog post
• Twitter is a popular platform in terms of the media attention it receives and it therefore attracts
more research due to its cultural status
• Twitter makes it easier to find and follow conversations (i.e., by both its search feature and by
tweets appearing in Google search results)
• Twitter has hashtag norms which make it easier gathering, sorting, and expanding searches
when collecting data
• Twitter data is easy to retrieve as major incidents, news stories and events on Twitter tend to be
centred around a hashtag
• The Twitter API is more open and accessible compared to other social media platforms, which
makes Twitter more favourable to developers creating tools to access data. This consequently
increases the availability of tools to researchers.
• Many researchers themselves are using Twitter and because of their favourable personal
experiences, they feel more comfortable with researching a familiar platform.
27/08/2015 © The University of Sheffield
Different types of Twitter API
• Twitter’s Search API – focused on relevance
and not completeness, some tweets and users
may be missing from results
• Twitter Streaming API – The Streaming APIs
give developers low latency access to Twitter’s
global stream of tweet data.
• Firehose API – in theory, 100% of Twitter data
27/08/2015 © The University of Sheffield
How do you retrieve data?
• Use a keyword e.g., Ebola
• Use a hashtag e.g., #EbolaOutbreak
• Combine search queries using AND or OR
operators.
27/08/2015 © The University of Sheffield
27/08/2015 © The University of Sheffield
TAGS – Twitter Archiving
Google Sheets
• Created and maintained by Martin Hawksey
(@mhawksey)
• TAGS is a free Google Sheet template which lets you
setup and run automated collection of search results
from Twitter.
• Set up TAGS here https://tags.hawksey.info/get-
tags/
27/08/2015 © The University of Sheffield
TAGS – Twitter Archiving
Google Sheet
TAGS – Twitter Archiving
Google Sheet
• TAGS also allows you to visualize the
connections between users
• There is an excellent video here
27/08/2015 © The University of Sheffield
27/08/2015 © The University of Sheffield
NodeXL
• NodeXL is a Microsoft Excel Plugin.
• The software can be used to obtain data from Twitter,
YouTube, and Flicker.
• NodeXL runs on Windows operating systems.
• Users can download graph options from the NodeXL graph
gallery.
• NodeXL is very easy to use – The MS Paint for network
graphs (Marc Smith)
27/08/2015 © The University of Sheffield
NodeXL: example network graphs
NodeXL, example network graph of @was3210 NodeXL: Example network graph of @was3210 (using a different layout
to the graph on the left)
27/08/2015 © The University of Sheffield
NodeXL tutorials
• Users can download graph options from the NodeXL Graph
Gallery (http://nodexlgraphgallery.org/Pages/Default.aspx)
• The workbooks used to create a graph (i.e., with the settings
intact) are often linked on the bottom of the page. These can
be downloaded, and further customized.
• There are some excellent NodeXL tutorials on YouTube
(https://www.youtube.com/results?search_query=NodeXl)
27/08/2015 © The University of Sheffield
Mozdeh
• Mozdeh is a product of the ‘Statistical Cybermetrics
Research Group’ at the University of
Wolverhampton.
• Mozdeh is a Windows desktop program that can
gather tweets by automatically searching for
keywords associated with a topic.
• It is also very easy to use.
Mozdeh
27/08/2015 © The University of Sheffield
• An example time series graph of 5,055,299 tweets
related to norovirus
Mozdeh Tutorials
• Great user guide here
• Great theoretical overview here
27/08/2015 © The University of Sheffield
27/08/2015 © The University of Sheffield
COSMOS Project
• The Collaborative Online Social Media Observatory
(COSMOS): Social Media and Data Mining is an
ESRC project a part of the strategic Big Data
investment.
• The COSMOS Project (Burnap et al, 2014) uses the
Streaming API
27/08/2015 © The University of Sheffield
COSMOS Project
• Some of the features include generating:
• Word Clouds
• Frequency charts
• Network graphs
• Maps of tweets
27/08/2015 © The University of Sheffield
COSMOS Project Layout
27/08/2015 © The University of Sheffield
COSMOS Tutorials
• Great video tutorial(s) here
27/08/2015 © The University of Sheffield
Chorus Analytics Tweetcatcher
Desktop Edition
• Chorus-TCD is a product of Brunel University.
• Uses Twitter’s Search API
• Searches as many statuses that are available from
the query at the current point of time.
• It is also very easy to use. There is a great video
introduction here.
27/08/2015 © The University of Sheffield
Chorus
• This is the layout of Chorus Tweet Catcher
Chorus
• This is the layout of Chorus Tweet Vis
27/08/2015 © The University of Sheffield
Chorus Tutorials
• Chorus manual here
• Great video overview of Chorus here
27/08/2015 © The University of Sheffield
What if I want data going back
more than 7 days?
• In most instance you will have to pay for it
• I use Texifter(@texifter) with DiscoverText
(@discovertext)
• Can range from not that expensive to
very expensive depending on query and
time
27/08/2015 © The University of Sheffield
DiscoverText Tutorials
• DiscoverText explained
• You can find DiscoverText’s social data
brochure here
27/08/2015 © The University of Sheffield
Public Engagement
• Started to use Twitter when started my
PhD – connected with #NSMNSS and
#PhDChat community
• Started a research blog
27/08/2015 © The University of Sheffield
Public Engagement
Benefits of Twitter include:
• Getting tricky PhD questions answered
• Finding out about conferences
• Networking with other academics, making
new friends
7/08/2015 © The University of Sheffield
Public Engagement
Benefits of a blog include:
• Early feedback on PhD work – my first two
slides!
• More visibility and interest in work
7/08/2015 © The University of Sheffield
Map of my Twitter network
27/08/2015 © The University of Sheffield
Questions?
• Tweet me! @was3210
• Questions related to the tools?
• TAGS = @mhawksey
• NodeXL = @marc_smith
• COSMOS = @pbFeed
• Mozdeh = @mikethelwall
27/08/2015 © The University of Sheffield
To
Discover
And
Understand.

More Related Content

PPTX
Social Media for Marketing An Overview of Specialist Software
PPTX
An overview of Twitter analytics
PPT
Visibrain platform in relation to Starbucks Redcups controversy
PPT
Using Twitter as a data source: An overview of ethical challenges
PPTX
The Role of Social Media for Humanitarian Assistance and Disaster Management
PPTX
Social Media Analytics Department For Work and Pensions Research Seminar
PPTX
An Introduction to NodeXL
PPTX
Life after High Storrs: PhD study and social media research
Social Media for Marketing An Overview of Specialist Software
An overview of Twitter analytics
Visibrain platform in relation to Starbucks Redcups controversy
Using Twitter as a data source: An overview of ethical challenges
The Role of Social Media for Humanitarian Assistance and Disaster Management
Social Media Analytics Department For Work and Pensions Research Seminar
An Introduction to NodeXL
Life after High Storrs: PhD study and social media research

What's hot (20)

PPTX
Social Media: A Practical Approach
PPTX
Communicating Science Through Social Media: Tools and Techniques
PDF
Practical Tools Social Media For Consumer Insight (Guest Lecture)
PPTX
Nordmedia 2013 Villi, Matikainen & Khaldarova
PDF
On the use of social media for evidence-based policing
PPTX
The Shift to Open Access Publishing
PPTX
Collaborative Open Access Publishing: the Ubiquity Partnet Network
PPTX
Don't Mention the G Word - How the University of Sheffield got Googled
PPTX
Keynote Talk - Gaining Powerful Insights into Social Media Listening
PDF
Do You Mind NSA Affair? Does the Global Surveillance Disclosure Impact Our St...
PPTX
A coordinated approach to Library and Information Science Research: the UK ex...
PPTX
Centre for Social Informatics - January 2016
PDF
Let's Work Together: UCD Research, UCD Library & Altmetrics
PPTX
Stop Press: Libraries' Role in the Future of Publishing
PPTX
Best Practices for Linked Data Education
PPTX
From Tweetations to Citations: Social Media and the Researcher
PPTX
Creating a UK-wide network of LIS researchers
PDF
Open Education and Open Development – working together
PPTX
Development of a Linked Data curriculum
PDF
LinkedUp - European Data Forum
Social Media: A Practical Approach
Communicating Science Through Social Media: Tools and Techniques
Practical Tools Social Media For Consumer Insight (Guest Lecture)
Nordmedia 2013 Villi, Matikainen & Khaldarova
On the use of social media for evidence-based policing
The Shift to Open Access Publishing
Collaborative Open Access Publishing: the Ubiquity Partnet Network
Don't Mention the G Word - How the University of Sheffield got Googled
Keynote Talk - Gaining Powerful Insights into Social Media Listening
Do You Mind NSA Affair? Does the Global Surveillance Disclosure Impact Our St...
A coordinated approach to Library and Information Science Research: the UK ex...
Centre for Social Informatics - January 2016
Let's Work Together: UCD Research, UCD Library & Altmetrics
Stop Press: Libraries' Role in the Future of Publishing
Best Practices for Linked Data Education
From Tweetations to Citations: Social Media and the Researcher
Creating a UK-wide network of LIS researchers
Open Education and Open Development – working together
Development of a Linked Data curriculum
LinkedUp - European Data Forum
Ad

Viewers also liked (10)

PPTX
Ethical Challenges of Using Social Media Data In Research
PPTX
An Introduction to NodeXL for Social Scientists
PPTX
Informatics for Disease Surveillance – New Technologies
PPTX
Insights From Social Media
PDF
20130504 - FeWeb - Twitter API
PPTX
Twitter API, Streaming and SharePoint 2013
PPTX
Development of Twitter Application #8 - Streaming API
PDF
The Art of Social Media Analysis with Twitter & Python
PDF
REST to RESTful Web Service
PDF
How to Become a Thought Leader in Your Niche
Ethical Challenges of Using Social Media Data In Research
An Introduction to NodeXL for Social Scientists
Informatics for Disease Surveillance – New Technologies
Insights From Social Media
20130504 - FeWeb - Twitter API
Twitter API, Streaming and SharePoint 2013
Development of Twitter Application #8 - Streaming API
The Art of Social Media Analysis with Twitter & Python
REST to RESTful Web Service
How to Become a Thought Leader in Your Niche
Ad

Similar to Introduction to software that can be used to capture and analyse Twitter data (20)

PPTX
Social Media Analytics Lecture
PPT
2009 Node XL Overview: Social Network Analysis in Excel 2007
PDF
2009-C&T-NodeXL and social queries - a social media network analysis toolkit
PDF
Weller social media as research data_psm15
PDF
Querying open data with R - Talk at April SheffieldR Users Gp
PDF
FAIR data: LOUD for all audiences
PPT
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
PPTX
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
PPTX
2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007
PDF
Here's the Data. Here's what you can do with it - Janani Kalyanam
PDF
CSE5656 Complex Networks - Gathering Data from Twitter
PDF
The data we want
PDF
Social recommendation, influence or else
PPTX
2009 December NodeXL Overview
PPTX
Social Network Analysis with NodeXL Part 1
PPTX
Easy Data, Hard Data? Twitter Research and the Politics of Data Access
PDF
Complex Networks: Science, Programming, and Databases
PPT
Picturing the Social: Talk for Transforming Digital Methods Winter School
PPTX
The Future of LOD
Social Media Analytics Lecture
2009 Node XL Overview: Social Network Analysis in Excel 2007
2009-C&T-NodeXL and social queries - a social media network analysis toolkit
Weller social media as research data_psm15
Querying open data with R - Talk at April SheffieldR Users Gp
FAIR data: LOUD for all audiences
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
Analyzing Social Media with Digital Methods. Possibilities, Requirements, and...
2009 - Node XL v.84+ - Social Media Network Visualization Tools For Excel 2007
Here's the Data. Here's what you can do with it - Janani Kalyanam
CSE5656 Complex Networks - Gathering Data from Twitter
The data we want
Social recommendation, influence or else
2009 December NodeXL Overview
Social Network Analysis with NodeXL Part 1
Easy Data, Hard Data? Twitter Research and the Politics of Data Access
Complex Networks: Science, Programming, and Databases
Picturing the Social: Talk for Transforming Digital Methods Winter School
The Future of LOD

Recently uploaded (20)

PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PPTX
A Presentation on Touch Screen Technology
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Hybrid model detection and classification of lung cancer
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPTX
1. Introduction to Computer Programming.pptx
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PDF
A comparative study of natural language inference in Swahili using monolingua...
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
A Presentation on Artificial Intelligence
PPTX
Tartificialntelligence_presentation.pptx
PDF
A comparative analysis of optical character recognition models for extracting...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
1 - Historical Antecedents, Social Consideration.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
A Presentation on Touch Screen Technology
MIND Revenue Release Quarter 2 2025 Press Release
Hybrid model detection and classification of lung cancer
Unlocking AI with Model Context Protocol (MCP)
Accuracy of neural networks in brain wave diagnosis of schizophrenia
Univ-Connecticut-ChatGPT-Presentaion.pdf
Assigned Numbers - 2025 - Bluetooth® Document
1. Introduction to Computer Programming.pptx
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
A comparative study of natural language inference in Swahili using monolingua...
Programs and apps: productivity, graphics, security and other tools
A Presentation on Artificial Intelligence
Tartificialntelligence_presentation.pptx
A comparative analysis of optical character recognition models for extracting...

Introduction to software that can be used to capture and analyse Twitter data

  • 1. Introduction to software that can capture data from Twitter Wasim Ahmed, Information School Email: wahmed1@Sheffield.ac.uk
  • 2. Aims • Disclaimer(s) to using Twitter data • Overview of current Twitter data retrieval and analysis software which require no programming knowledge. • Overview of public engagement work I have been doing 27/08/2015 © The University of Sheffield
  • 3. 27/08/2015 © The University of Sheffield Tools Covered in this Presentation • TAGS • NodeXL • Mozdeh • COSMOS Project • Chorus
  • 4. Ethical, privacy and copyright issues when using Twitter data 27/08/2015 © The University of Sheffield Read best practice guidelines
  • 5. Refer to resources such as: • Research using Social Media; Users’ Views link here • COSMOS Online Guide to Social Media Research and Ethics link here • Unlocking the value of social media – a review of research ethics link here • Association of Internet Researchers (AoIR) link here 27/08/2015 © The University of Sheffield
  • 6. Legal issues • Sharing of Twitter datasets is prohibited see https://dev.twitter.com/terms/api-terms • However, sharing Tweet IDs (to look up the tweets used is permissible). This is useful for reproducibility. 27/08/2015 © The University of Sheffield
  • 7. Programming knowledge! 27/08/2015 © The University of Sheffield
  • 8. Learn a programming language Check these resources out to learn how to code: • Websites such as Code Academy • Visit library for programming books • YouTube Videos 27/08/2015 © The University of Sheffield
  • 9. Why Twitter (data)? • See my LSE impact blog post • Twitter is a popular platform in terms of the media attention it receives and it therefore attracts more research due to its cultural status • Twitter makes it easier to find and follow conversations (i.e., by both its search feature and by tweets appearing in Google search results) • Twitter has hashtag norms which make it easier gathering, sorting, and expanding searches when collecting data • Twitter data is easy to retrieve as major incidents, news stories and events on Twitter tend to be centred around a hashtag • The Twitter API is more open and accessible compared to other social media platforms, which makes Twitter more favourable to developers creating tools to access data. This consequently increases the availability of tools to researchers. • Many researchers themselves are using Twitter and because of their favourable personal experiences, they feel more comfortable with researching a familiar platform. 27/08/2015 © The University of Sheffield
  • 10. Different types of Twitter API • Twitter’s Search API – focused on relevance and not completeness, some tweets and users may be missing from results • Twitter Streaming API – The Streaming APIs give developers low latency access to Twitter’s global stream of tweet data. • Firehose API – in theory, 100% of Twitter data 27/08/2015 © The University of Sheffield
  • 11. How do you retrieve data? • Use a keyword e.g., Ebola • Use a hashtag e.g., #EbolaOutbreak • Combine search queries using AND or OR operators. 27/08/2015 © The University of Sheffield
  • 12. 27/08/2015 © The University of Sheffield TAGS – Twitter Archiving Google Sheets • Created and maintained by Martin Hawksey (@mhawksey) • TAGS is a free Google Sheet template which lets you setup and run automated collection of search results from Twitter. • Set up TAGS here https://tags.hawksey.info/get- tags/
  • 13. 27/08/2015 © The University of Sheffield TAGS – Twitter Archiving Google Sheet
  • 14. TAGS – Twitter Archiving Google Sheet • TAGS also allows you to visualize the connections between users • There is an excellent video here 27/08/2015 © The University of Sheffield
  • 15. 27/08/2015 © The University of Sheffield NodeXL • NodeXL is a Microsoft Excel Plugin. • The software can be used to obtain data from Twitter, YouTube, and Flicker. • NodeXL runs on Windows operating systems. • Users can download graph options from the NodeXL graph gallery. • NodeXL is very easy to use – The MS Paint for network graphs (Marc Smith)
  • 16. 27/08/2015 © The University of Sheffield NodeXL: example network graphs NodeXL, example network graph of @was3210 NodeXL: Example network graph of @was3210 (using a different layout to the graph on the left)
  • 17. 27/08/2015 © The University of Sheffield NodeXL tutorials • Users can download graph options from the NodeXL Graph Gallery (http://nodexlgraphgallery.org/Pages/Default.aspx) • The workbooks used to create a graph (i.e., with the settings intact) are often linked on the bottom of the page. These can be downloaded, and further customized. • There are some excellent NodeXL tutorials on YouTube (https://www.youtube.com/results?search_query=NodeXl)
  • 18. 27/08/2015 © The University of Sheffield Mozdeh • Mozdeh is a product of the ‘Statistical Cybermetrics Research Group’ at the University of Wolverhampton. • Mozdeh is a Windows desktop program that can gather tweets by automatically searching for keywords associated with a topic. • It is also very easy to use.
  • 19. Mozdeh 27/08/2015 © The University of Sheffield • An example time series graph of 5,055,299 tweets related to norovirus
  • 20. Mozdeh Tutorials • Great user guide here • Great theoretical overview here 27/08/2015 © The University of Sheffield
  • 21. 27/08/2015 © The University of Sheffield COSMOS Project • The Collaborative Online Social Media Observatory (COSMOS): Social Media and Data Mining is an ESRC project a part of the strategic Big Data investment. • The COSMOS Project (Burnap et al, 2014) uses the Streaming API
  • 22. 27/08/2015 © The University of Sheffield COSMOS Project • Some of the features include generating: • Word Clouds • Frequency charts • Network graphs • Maps of tweets
  • 23. 27/08/2015 © The University of Sheffield COSMOS Project Layout
  • 24. 27/08/2015 © The University of Sheffield COSMOS Tutorials • Great video tutorial(s) here
  • 25. 27/08/2015 © The University of Sheffield Chorus Analytics Tweetcatcher Desktop Edition • Chorus-TCD is a product of Brunel University. • Uses Twitter’s Search API • Searches as many statuses that are available from the query at the current point of time. • It is also very easy to use. There is a great video introduction here.
  • 26. 27/08/2015 © The University of Sheffield Chorus • This is the layout of Chorus Tweet Catcher
  • 27. Chorus • This is the layout of Chorus Tweet Vis 27/08/2015 © The University of Sheffield
  • 28. Chorus Tutorials • Chorus manual here • Great video overview of Chorus here 27/08/2015 © The University of Sheffield
  • 29. What if I want data going back more than 7 days? • In most instance you will have to pay for it • I use Texifter(@texifter) with DiscoverText (@discovertext) • Can range from not that expensive to very expensive depending on query and time 27/08/2015 © The University of Sheffield
  • 30. DiscoverText Tutorials • DiscoverText explained • You can find DiscoverText’s social data brochure here 27/08/2015 © The University of Sheffield
  • 31. Public Engagement • Started to use Twitter when started my PhD – connected with #NSMNSS and #PhDChat community • Started a research blog 27/08/2015 © The University of Sheffield
  • 32. Public Engagement Benefits of Twitter include: • Getting tricky PhD questions answered • Finding out about conferences • Networking with other academics, making new friends 7/08/2015 © The University of Sheffield
  • 33. Public Engagement Benefits of a blog include: • Early feedback on PhD work – my first two slides! • More visibility and interest in work 7/08/2015 © The University of Sheffield
  • 34. Map of my Twitter network 27/08/2015 © The University of Sheffield
  • 35. Questions? • Tweet me! @was3210 • Questions related to the tools? • TAGS = @mhawksey • NodeXL = @marc_smith • COSMOS = @pbFeed • Mozdeh = @mikethelwall 27/08/2015 © The University of Sheffield