SlideShare a Scribd company logo
Workshop 1A:
Data Collection & Network Analysis with
@Netlytic & the iGraph R Package
Anatoliy Gruzd
gruzd@ryerson.ca
@gruzd
Associate Professor, Ted Rogers School of Management
Director, Social Media Lab
Ryerson University
#SMSociety15
Toronto, July 27, 2015
Outline
• Making Sense of Social Media Data
• Practice Part 1: Netlytic
• Practice Part 2: R + igraph
Anatoliy Gruzd 3
Growth of Social Media and
Social Networks Data
Facebook
1B+
users
Twitter
500M+
usersSocial Media have
become an integral
part of our daily lives!
How to Make Sense of
Social Media Data?
Anatoliy Gruzd Twitter: @gruzd 5
Social Big Data -> Visualizations -> Understanding
(Development, Application & Validation)
How to Make Sense of
Social Media Data?
Anatoliy Gruzd Twitter: @gruzd 6
How to Make Sense of
Social Media Data?
Social Network Analysis (SNA)
• Nodes = People
• Edges /Ties (lines) = Relations/
“Who retweeted/ replied/
mentioned whom”
Anatoliy Gruzd Twitter: @gruzd 7
Studying Online Social Networks
http://www.visualcomplexity.com/vc
• Forum networks
• Blog networks
• Friends’ networks (Facebook,
Twitter, Google+, etc…)
• Networks of like-minded people
(YouTube, Flickr, etc…)
Anatoliy Gruzd Twitter: @gruzd 8
• Reduce the large quantity of
data into a more concise
representation
• Makes it much easier to
understand what is going on in
a group
Advantages of
Social Network Analysis
Once the network is discovered,
we can find out:
• How do people interact with each
other,
• Who are the most/least active
members of a group,
• Who is influential in a group,
• Who is susceptible to being
influenced, etc…
Anatoliy Gruzd Twitter: @gruzd 9
Workshop on Data Collection & Network Analysis with @Netlytic & the iGraph R Package
Anatoliy Gruzd Twitter: @dalprof2012 Olympics in London
Anatoliy Gruzd Twitter: @dalprof
#tarsand Twitter Community
White, B., Castleden, H., & Gruzd, A. (2015). Talking to Twitter users: Motivations behind
Twitter use on the Alberta oil sands and the Northern Gateway Pipeline. First Monday,
20(1). DOI: 10.5210/fm.v20i1.5404
Common approach for collecting social
network data:
• Self-reported social network data
may not be available/accurate
• Surveys or interviews
Problems with surveys or interviews
• Time-consuming
• Questions can be too sensitive
• Answers are subjective or incomplete
• Participant can forget people and
interactions
• Different people perceive events and
relationships differently
How Do We Collect Information About Online Social Networks?
Anatoliy Gruzd Twitter: @gruzd 14
• Common approach: surveys or interviews
• A sample question about students’ perceived social structures
How Do We Collect Information About Social Networks?
Please indicate on a scale from [1] to [5],
YOUR FRIENDSHIP RELATIONSHIP WITH EACH STUDENT IN THE CLASS
[1] - don’t know this person
[2] - just another member of class
[3] - a slight friendship
[4] - a friend
[5] - a close friend
Alice D. [1] [2] [3] [4] [5]
…
Richard S. [1] [2] [3] [4] [5]
Source: C. Haythornthwaite, 1999
Anatoliy Gruzd Twitter: @gruzd 15
Goal: Automated Networks Discovery
Challenge: Figuring out what content-based features of online interactions can
help to uncover nodes and ties between group members
How Do We Collect Information About Online Social Networks?
16
Automated Discovery of Social Networks
Emails
Nick
Rick
Dick
• Nodes = People
• Ties = “Who talks to whom”
• Tie strength = The number of
messages exchanged between
individuals
17
Automated Discovery of Social Networks
“Many to Many” Communication
ChatMailing listservForum Comments
18
Automated Discovery of Social Networks
Approach 1: Chain Network (Reply-to)
FROM: Sam
PREVIOUS POSTER: Gabriel
....
....
....
Posting
header
Content
19
Automated Discovery of Social Networks
Approach 1: Chain Network (Reply-to)
FROM: Sam
PREVIOUS POSTER: Gabriel
“ Nick, Gina and Gabriel: I apologize for not backing this up
with a good source, but I know from reading about this topic that … ”
Posting
header
Content
Possible Missing Connections:
• Sam -> Nick
• Sam -> Gina
• Nick <-> Gina 20
21
Chain Networks: missed info.
FROM: Eva
REFERENCE CHAIN: Gabriel, Sam, Gina
“ Gina, I owe you a cookie. This is exactly what I wanted to know.
I was already planning on taking 402 next semester,
and now I have something to look forward to! ”
FROM: Fred
“ I wonder if that could be why other libraries
around the world have resisted changing –
it's too much work, and as Dan pointed out, too expensive. ”
Ex.2
Ex.3
21
Automated Discovery of Social Networks
Approach 2: Name Network
FROM: Ann
“Steve and Natasha, I couldn't wait to see your site.
I knew it was going to [be] awesome!”
This approach looks for personal names in the content of the messages to
identify social connections between group members.
22
Chain Network
(less connections)
Name Network
(more connections)
Comparing Chain vs Name Networks
Example: Youtube comments
Chain Network Name Network
23
• Main Communicative Functions of Personal Names (Leech, 1999)
– getting attention and identifying addressee
– maintaining and reinforcing social relationships
• Names are “one of the few textual carriers of identity” in discussions
on the web (Doherty, 2004)
• Their use is crucial for the creation and maintenance of a sense of
community (Ubon, 2005)
Automated Discovery of Social Networks
Approach 2: Name Network
24
Automated Discovery of Social Networks
Name Network Method: Challenges
Kurt Cobain, a lead singer for the
rock band Nirvana
chris is not a group member
Santa Monica Public Library
John Dewey, philosopher &
educator
mark up language
Solution:
- Name alias resolution
25
Example: Twitter Networks
@John
@Peter
@Paul
• Nodes = People
• Ties = “Who retweeted/
replied/mentioned whom”
• Tie strength = The number of
retweets, replies or mentions
How to Make Sense of Social Media Data?
26
Automated Discovery of Social Networks
Twitter Data Example
27
Chain Network ties Name Network ties
none @Cheeflo -> @JoeProf
@Cheeflo -> @VMosco
Automated Discovery of Social Networks
Twitter Data Example
28
Chain Network ties Name Network ties
@gruzd -> @sidneyeve @gruzd -> @sidneyeve
Comparing Chain vs Name Networks
Example: Twitter data - #SMSociety15 hashtag
Chain Network Name Network
10 nodes, 19 ties 105 nodes, 152 ties
Anatoliy Gruzd
Netlytic.org
cloud-based research infrastructure for automated text analysis & discovery
of social networks from social big data
Networks
Stats
Content
30
Tutorial: Analyzing #SMSociety15 on Twitter
https://netlytic.org/home/?p=10676
Anatoliy Gruzd 31
Social Media Research Toolkit
maintained by the Social Media Lab
http://socialmedialab.ca/?page_id=7801
Anatoliy Gruzd 34
TOOLS

More Related Content

PDF
Social Media Data Collection & Network Analysis with Netlytic and R
PDF
Who are We Studying: Humans or Bots?
PDF
The Use of Social Media during the 2014 Crisis In Ukraine
PDF
Social listening: how to do it and how to use (SNA Perspective)
PDF
Predicting what gets ‘Likes’ on Facebook: case study of BlogTO
PPTX
Twitter Data Analytics
PPT
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
PDF
CrowdTruth @VU Faculty Colloquium (June 2015)
Social Media Data Collection & Network Analysis with Netlytic and R
Who are We Studying: Humans or Bots?
The Use of Social Media during the 2014 Crisis In Ukraine
Social listening: how to do it and how to use (SNA Perspective)
Predicting what gets ‘Likes’ on Facebook: case study of BlogTO
Twitter Data Analytics
Twitter analytics: some thoughts on sampling, tools, data, ethics and user re...
CrowdTruth @VU Faculty Colloquium (June 2015)

What's hot (20)

PDF
Social Web 2014: Final Presentations (Part I)
PPTX
20151001 charles university prague - marc smith - node xl-picturing political...
PPTX
2015 pdf-marc smith-node xl-social media sna
PPTX
2017 05-26 NodeXL Twitter search #shakeupshow
PPT
Picturing the Social: Talk for Transforming Digital Methods Winter School
PPTX
20120622 web sci12-won-marc smith-semantic and social network analysis of …
PPTX
Visualizing Co-Retweeting Behavior for Recommending Relevant Real-Time Content
PPTX
20120301 strata-marc smith-mapping social media networks with no coding using...
PDF
OpenThreads: The Community of OpenStreetMap Mailing List
PDF
Identifying Influencers on Social Media Using Social Network Analysis
PDF
Broker Bots: Analyzing automated activity during High Impact Events on Twitter
PPTX
Dynamics of a Scandal: The Centrelink Robodebt Affair on Twitter
PPTX
Infotainment and the Impact of Connective Action: The Case of #MilkedDry
PDF
Instagramming The Ends of Identity: Pre-birth and post-death identity pract...
PPTX
2010 sept - mobile web africa - marc smith - says who - mapping social medi...
PPTX
"Hashtags as Spectacle: #bostonstrong and The Materiality of Metadata" (EGSA ...
PPT
Social media engagement
PPTX
2013 NodeXL Social Media Network Analysis
PPT
here comes social advocacy (the full monty)
PDF
Tumblr 2014 - statistical overview and comparison with popular social services
Social Web 2014: Final Presentations (Part I)
20151001 charles university prague - marc smith - node xl-picturing political...
2015 pdf-marc smith-node xl-social media sna
2017 05-26 NodeXL Twitter search #shakeupshow
Picturing the Social: Talk for Transforming Digital Methods Winter School
20120622 web sci12-won-marc smith-semantic and social network analysis of …
Visualizing Co-Retweeting Behavior for Recommending Relevant Real-Time Content
20120301 strata-marc smith-mapping social media networks with no coding using...
OpenThreads: The Community of OpenStreetMap Mailing List
Identifying Influencers on Social Media Using Social Network Analysis
Broker Bots: Analyzing automated activity during High Impact Events on Twitter
Dynamics of a Scandal: The Centrelink Robodebt Affair on Twitter
Infotainment and the Impact of Connective Action: The Case of #MilkedDry
Instagramming The Ends of Identity: Pre-birth and post-death identity pract...
2010 sept - mobile web africa - marc smith - says who - mapping social medi...
"Hashtags as Spectacle: #bostonstrong and The Materiality of Metadata" (EGSA ...
Social media engagement
2013 NodeXL Social Media Network Analysis
here comes social advocacy (the full monty)
Tumblr 2014 - statistical overview and comparison with popular social services
Ad

Viewers also liked (17)

PDF
Introduction To Igraph and Shiny
PPT
Social Network Analysis in R
PDF
Social Network Analysis With R
PPT
Social Network Analysis
PDF
Examining Coexistence between Splachnaceae Mosses with Individual-Based Model...
PDF
Igraph
PDF
SXSW 2014 | NOTAS SOBRE O FUTURO DO VAREJO
PDF
Manual buzzmonitor versão completa - 28 de agosto de 2014
PDF
3 passos para monitoramento e análise estratégica de redes sociais
PDF
10 métricas para medir o sucesso do seu canal no Youtube
PDF
Social CRM Estratégico
PDF
12 métricas essenciais para gerenciar a presença da sua marca no Facebook
PDF
Social network analysis
PDF
Social Networks and Social Capital
PPT
Social media analysis in R using twitter API
DOCX
Twitter analysis by Kaify Rais
PDF
Text Mining with R -- an Analysis of Twitter Data
Introduction To Igraph and Shiny
Social Network Analysis in R
Social Network Analysis With R
Social Network Analysis
Examining Coexistence between Splachnaceae Mosses with Individual-Based Model...
Igraph
SXSW 2014 | NOTAS SOBRE O FUTURO DO VAREJO
Manual buzzmonitor versão completa - 28 de agosto de 2014
3 passos para monitoramento e análise estratégica de redes sociais
10 métricas para medir o sucesso do seu canal no Youtube
Social CRM Estratégico
12 métricas essenciais para gerenciar a presença da sua marca no Facebook
Social network analysis
Social Networks and Social Capital
Social media analysis in R using twitter API
Twitter analysis by Kaify Rais
Text Mining with R -- an Analysis of Twitter Data
Ad

Similar to Workshop on Data Collection & Network Analysis with @Netlytic & the iGraph R Package (20)

PPTX
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
PDF
Marc Smith - Charting Collections of Connections in Social Media: Creating Ma...
PPTX
20111103 con tech2011-marc smith
PPTX
LSS'11: Charting Collections Of Connections In Social Media
PPTX
20111123 mwa2011-marc smith
PDF
socialnetworkanalysis-100225055227-phpapp02.pdf
PPTX
2015 #MMeasure-Marc Smith-NodeXL Mapping social media using social network ma...
PPTX
20110719 social media research foundation-charting collections of connections
PPTX
AI Class Topic 5: Social Network Graph
PPTX
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
PPT
Social network
PPT
Social network (1)
PPTX
20110128 connected action-node xl-sea of connections
PDF
Exploring Social Media with NodeXL
PDF
Socialnetworkanalysis 100225055227-phpapp02
PDF
Dissertation Social Network Sites
PPTX
Social Media Analytics
PPTX
2014 TheNextWeb-Mapping connections with NodeXL
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
Marc Smith - Charting Collections of Connections in Social Media: Creating Ma...
20111103 con tech2011-marc smith
LSS'11: Charting Collections Of Connections In Social Media
20111123 mwa2011-marc smith
socialnetworkanalysis-100225055227-phpapp02.pdf
2015 #MMeasure-Marc Smith-NodeXL Mapping social media using social network ma...
20110719 social media research foundation-charting collections of connections
AI Class Topic 5: Social Network Graph
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
Social network
Social network (1)
20110128 connected action-node xl-sea of connections
Exploring Social Media with NodeXL
Socialnetworkanalysis 100225055227-phpapp02
Dissertation Social Network Sites
Social Media Analytics
2014 TheNextWeb-Mapping connections with NodeXL

More from Toronto Metropolitan University (20)

PDF
The Fog of War: Examining the Spread of Dis- & Misinformation in the Russia-U...
PDF
Computational Approaches to Studying Anti-Social Behaviour on Social Media
PDF
The Role of Open Access & Social Media in Knowledge Mobilization and Discovery
PDF
Examining toxic interactions and political engagement on Twitter
PDF
Who is Influencing the #GDPR Discussion on Twitter: Implications for Public ...
PDF
#FakeNews Travels Fast — How Social Bots and Trolls Are Reshaping Public Debates
PDF
Research & Teaching in the Social Media Age
PDF
Social Media for Informal Learning: a Case of #Twitterstorians
PDF
The State of Social Media Research After Cambridge Analytica
PDF
Roundtable: Social Media Users' Privacy Expectations & the Ethics of Using Th...
PDF
From 13 Reasons Why to Suicide Watch: Reddit Discussions about the Controvers...
PDF
Introduction to Social Network Analysis
PDF
Learning Analytics Dashboard for Twitter
PDF
Altmetrics: Listening & Giving Voice to Ideas with Social Media Data
PDF
You're Hired: Examining Acceptance of Social Media Screening of Job Applicants
PDF
Studying Online & Offline Communities through the Prism of Social Media Data
PDF
Examining Sentiments and Popularity of Pro- and Anti-Vaccination Videos on Yo...
PDF
Social media data stewardship: The ethics of social media data use for research
PDF
Sampling and recruiting on Facebook
PDF
Research with Social Media Data: Stewardship & Ethical Considerations
The Fog of War: Examining the Spread of Dis- & Misinformation in the Russia-U...
Computational Approaches to Studying Anti-Social Behaviour on Social Media
The Role of Open Access & Social Media in Knowledge Mobilization and Discovery
Examining toxic interactions and political engagement on Twitter
Who is Influencing the #GDPR Discussion on Twitter: Implications for Public ...
#FakeNews Travels Fast — How Social Bots and Trolls Are Reshaping Public Debates
Research & Teaching in the Social Media Age
Social Media for Informal Learning: a Case of #Twitterstorians
The State of Social Media Research After Cambridge Analytica
Roundtable: Social Media Users' Privacy Expectations & the Ethics of Using Th...
From 13 Reasons Why to Suicide Watch: Reddit Discussions about the Controvers...
Introduction to Social Network Analysis
Learning Analytics Dashboard for Twitter
Altmetrics: Listening & Giving Voice to Ideas with Social Media Data
You're Hired: Examining Acceptance of Social Media Screening of Job Applicants
Studying Online & Offline Communities through the Prism of Social Media Data
Examining Sentiments and Popularity of Pro- and Anti-Vaccination Videos on Yo...
Social media data stewardship: The ethics of social media data use for research
Sampling and recruiting on Facebook
Research with Social Media Data: Stewardship & Ethical Considerations

Recently uploaded (20)

PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
A systematic review of self-coping strategies used by university students to ...
PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PPTX
Pharma ospi slides which help in ospi learning
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PPTX
Institutional Correction lecture only . . .
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
Cell Types and Its function , kingdom of life
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
01-Introduction-to-Information-Management.pdf
PPTX
202450812 BayCHI UCSC-SV 20250812 v17.pptx
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PPTX
GDM (1) (1).pptx small presentation for students
PDF
O7-L3 Supply Chain Operations - ICLT Program
102 student loan defaulters named and shamed – Is someone you know on the list?
Final Presentation General Medicine 03-08-2024.pptx
A systematic review of self-coping strategies used by university students to ...
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
Pharma ospi slides which help in ospi learning
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Microbial disease of the cardiovascular and lymphatic systems
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Pharmacology of Heart Failure /Pharmacotherapy of CHF
Institutional Correction lecture only . . .
O5-L3 Freight Transport Ops (International) V1.pdf
Cell Types and Its function , kingdom of life
2.FourierTransform-ShortQuestionswithAnswers.pdf
Microbial diseases, their pathogenesis and prophylaxis
01-Introduction-to-Information-Management.pdf
202450812 BayCHI UCSC-SV 20250812 v17.pptx
Abdominal Access Techniques with Prof. Dr. R K Mishra
GDM (1) (1).pptx small presentation for students
O7-L3 Supply Chain Operations - ICLT Program

Workshop on Data Collection & Network Analysis with @Netlytic & the iGraph R Package

  • 1. Workshop 1A: Data Collection & Network Analysis with @Netlytic & the iGraph R Package Anatoliy Gruzd gruzd@ryerson.ca @gruzd Associate Professor, Ted Rogers School of Management Director, Social Media Lab Ryerson University #SMSociety15 Toronto, July 27, 2015
  • 2. Outline • Making Sense of Social Media Data • Practice Part 1: Netlytic • Practice Part 2: R + igraph Anatoliy Gruzd 3
  • 3. Growth of Social Media and Social Networks Data Facebook 1B+ users Twitter 500M+ usersSocial Media have become an integral part of our daily lives!
  • 4. How to Make Sense of Social Media Data? Anatoliy Gruzd Twitter: @gruzd 5
  • 5. Social Big Data -> Visualizations -> Understanding (Development, Application & Validation) How to Make Sense of Social Media Data? Anatoliy Gruzd Twitter: @gruzd 6
  • 6. How to Make Sense of Social Media Data? Social Network Analysis (SNA) • Nodes = People • Edges /Ties (lines) = Relations/ “Who retweeted/ replied/ mentioned whom” Anatoliy Gruzd Twitter: @gruzd 7
  • 7. Studying Online Social Networks http://www.visualcomplexity.com/vc • Forum networks • Blog networks • Friends’ networks (Facebook, Twitter, Google+, etc…) • Networks of like-minded people (YouTube, Flickr, etc…) Anatoliy Gruzd Twitter: @gruzd 8
  • 8. • Reduce the large quantity of data into a more concise representation • Makes it much easier to understand what is going on in a group Advantages of Social Network Analysis Once the network is discovered, we can find out: • How do people interact with each other, • Who are the most/least active members of a group, • Who is influential in a group, • Who is susceptible to being influenced, etc… Anatoliy Gruzd Twitter: @gruzd 9
  • 10. Anatoliy Gruzd Twitter: @dalprof2012 Olympics in London
  • 11. Anatoliy Gruzd Twitter: @dalprof #tarsand Twitter Community White, B., Castleden, H., & Gruzd, A. (2015). Talking to Twitter users: Motivations behind Twitter use on the Alberta oil sands and the Northern Gateway Pipeline. First Monday, 20(1). DOI: 10.5210/fm.v20i1.5404
  • 12. Common approach for collecting social network data: • Self-reported social network data may not be available/accurate • Surveys or interviews Problems with surveys or interviews • Time-consuming • Questions can be too sensitive • Answers are subjective or incomplete • Participant can forget people and interactions • Different people perceive events and relationships differently How Do We Collect Information About Online Social Networks? Anatoliy Gruzd Twitter: @gruzd 14
  • 13. • Common approach: surveys or interviews • A sample question about students’ perceived social structures How Do We Collect Information About Social Networks? Please indicate on a scale from [1] to [5], YOUR FRIENDSHIP RELATIONSHIP WITH EACH STUDENT IN THE CLASS [1] - don’t know this person [2] - just another member of class [3] - a slight friendship [4] - a friend [5] - a close friend Alice D. [1] [2] [3] [4] [5] … Richard S. [1] [2] [3] [4] [5] Source: C. Haythornthwaite, 1999 Anatoliy Gruzd Twitter: @gruzd 15
  • 14. Goal: Automated Networks Discovery Challenge: Figuring out what content-based features of online interactions can help to uncover nodes and ties between group members How Do We Collect Information About Online Social Networks? 16
  • 15. Automated Discovery of Social Networks Emails Nick Rick Dick • Nodes = People • Ties = “Who talks to whom” • Tie strength = The number of messages exchanged between individuals 17
  • 16. Automated Discovery of Social Networks “Many to Many” Communication ChatMailing listservForum Comments 18
  • 17. Automated Discovery of Social Networks Approach 1: Chain Network (Reply-to) FROM: Sam PREVIOUS POSTER: Gabriel .... .... .... Posting header Content 19
  • 18. Automated Discovery of Social Networks Approach 1: Chain Network (Reply-to) FROM: Sam PREVIOUS POSTER: Gabriel “ Nick, Gina and Gabriel: I apologize for not backing this up with a good source, but I know from reading about this topic that … ” Posting header Content Possible Missing Connections: • Sam -> Nick • Sam -> Gina • Nick <-> Gina 20
  • 19. 21 Chain Networks: missed info. FROM: Eva REFERENCE CHAIN: Gabriel, Sam, Gina “ Gina, I owe you a cookie. This is exactly what I wanted to know. I was already planning on taking 402 next semester, and now I have something to look forward to! ” FROM: Fred “ I wonder if that could be why other libraries around the world have resisted changing – it's too much work, and as Dan pointed out, too expensive. ” Ex.2 Ex.3 21
  • 20. Automated Discovery of Social Networks Approach 2: Name Network FROM: Ann “Steve and Natasha, I couldn't wait to see your site. I knew it was going to [be] awesome!” This approach looks for personal names in the content of the messages to identify social connections between group members. 22
  • 21. Chain Network (less connections) Name Network (more connections) Comparing Chain vs Name Networks Example: Youtube comments Chain Network Name Network 23
  • 22. • Main Communicative Functions of Personal Names (Leech, 1999) – getting attention and identifying addressee – maintaining and reinforcing social relationships • Names are “one of the few textual carriers of identity” in discussions on the web (Doherty, 2004) • Their use is crucial for the creation and maintenance of a sense of community (Ubon, 2005) Automated Discovery of Social Networks Approach 2: Name Network 24
  • 23. Automated Discovery of Social Networks Name Network Method: Challenges Kurt Cobain, a lead singer for the rock band Nirvana chris is not a group member Santa Monica Public Library John Dewey, philosopher & educator mark up language Solution: - Name alias resolution 25
  • 24. Example: Twitter Networks @John @Peter @Paul • Nodes = People • Ties = “Who retweeted/ replied/mentioned whom” • Tie strength = The number of retweets, replies or mentions How to Make Sense of Social Media Data? 26
  • 25. Automated Discovery of Social Networks Twitter Data Example 27 Chain Network ties Name Network ties none @Cheeflo -> @JoeProf @Cheeflo -> @VMosco
  • 26. Automated Discovery of Social Networks Twitter Data Example 28 Chain Network ties Name Network ties @gruzd -> @sidneyeve @gruzd -> @sidneyeve
  • 27. Comparing Chain vs Name Networks Example: Twitter data - #SMSociety15 hashtag Chain Network Name Network 10 nodes, 19 ties 105 nodes, 152 ties
  • 28. Anatoliy Gruzd Netlytic.org cloud-based research infrastructure for automated text analysis & discovery of social networks from social big data Networks Stats Content 30
  • 29. Tutorial: Analyzing #SMSociety15 on Twitter https://netlytic.org/home/?p=10676 Anatoliy Gruzd 31
  • 30. Social Media Research Toolkit maintained by the Social Media Lab http://socialmedialab.ca/?page_id=7801 Anatoliy Gruzd 34 TOOLS