SlideShare a Scribd company logo
20121010 marc smith - mapping collections of connections in social media with node xl
Charting Collections of
                                                       Connections
                                                     In Social Media:
                                                    Creating Maps &
                                                      Measures with
                                                         NodeXL




A project from the Social Media Research Foundation: http://www.smrfoundation.org
About Me
Introductions
Marc A. Smith
Chief Social Scientist
Connected Action Consulting Group
Marc@connectedaction.net
http://www.connectedaction.net
http://www.codeplex.com/nodexl
http://www.twitter.com/marc_smith
http://delicious.com/marc_smith/Paper
http://www.flickr.com/photos/marc_smith
http://www.facebook.com/marc.smith.sociologist
http://www.linkedin.com/in/marcasmith
http://www.slideshare.net/Marc_A_Smith
http://www.smrfoundation.org
Social Media Research Foundation
       http://smrfoundation.org
Social Media
(email, Facebook, Twitter,
YouTube, and more)
is all about
connections

     from people


               to people.

                             5
Patterns are

               left behind
                             6
There are many kinds of ties….
Like, Link, Reply, Rate, Review, Favorite, Friend, Follow, Forward, Edit, Tag, Comment, Check-in…




                                      http://www.flickr.com/photos/stevendepolo/3254238329
“Think Link”
    Nodes & Edges


        Is related to




A                       B
Each contains one or more
                      social networks




World Wide Web
Location, Location, Location
Position, Position, Position
알철수 - Ahn Chul Soo
20121009-NodeXL-Twitter-문재인 - Moon Jae In
20121009-NodeXL-Twitter-박근혜 Park Keun Hye
Strong ties
Weak ties
Strength of Weak ties
p://www.flickr.com/photos/fullaperture/81266869/
20121010 marc smith - mapping collections of connections in social media with node xl
Social
   Networks
• History:
  from the
  dawn of
  time!
• Theory and
  method:
  1934 ->
• Jacob L.
  Moreno
• http://en.wiki
  pedia.org/wiki
  /Jacob_L._Mor
  eno

         Jacob Moreno’s early social network diagram of positive and negative relationships among members of a football
                                                                team.
          Originally published in Moreno, J. L. (1934). Who shall survive? Washington, DC: Nervous and Mental Disease
                                                        Publishing Company.
A nearly social network diagram of relationships among workers in a factory
       illustrates the positions different workers occupy within the workgroup.
Originally published in Roethlisberger, F., and Dickson, W. (1939). Management and
               the worker. Cambridge, UK: Cambridge University Press.
Hubs
Bridges
http://www.flickr.com/photos/storm-crypt/3047698741
http://www.flickr.com/photos/library_of_congress/3295494976/sizes/o/in/photostream/
http://www.flickr.com/photos/amycgx/3119640267/
Network of connections among “#Debate AND Obama” mentioning Twitter users
Like MSPaint™ for graphs.
                    — the Community




Introduction to NodeXL
NodeXL
Network Overview Discovery and Exploration add-in for Excel 2007/2010




              A minimal network can
           illustrate the ways different
         locations have different values
             for centrality and degree
20121010 marc smith - mapping collections of connections in social media with node xl
#teaparty
                                                                       15 November 2011


#occupywallstreet
15 November 2011




http://www.newscientist.com/blogs/onepercent/2011/11/occupy-vs-tea-party-what-their.html
Social Network Theory
http://en.wikipedia.org/wiki/Social_network
• Central tenet
    – Social structure emerges from
    – the aggregate of relationships (ties)
    – among members of a population
• Phenomena of interest
    – Emergence of cliques and clusters
    – from patterns of relationships
    – Centrality (core), periphery (isolates),
                                                 Source: Richards, W.
    – betweenness                                (1986). The NEGOPY
• Methods                                        network analysis
                                                 program. Burnaby, BC:
    – Surveys, interviews, observations,         Department of
                                                 Communication, Simon
      log file analysis, computational           Fraser University. pp.7-
      analysis of matrices                       16


(Hampton &Wellman, 1999; Paolillo, 2001; Wellman, 2001)
SNA 101
                                • Node
                A
                                   – “actor” on which relationships act; 1-mode versus 2-mode networks
                                • Edge
B                                  – Relationship connecting nodes; can be directional
                        C       • Cohesive Sub-Group
                                   – Well-connected group; clique; cluster                  A B D E
                                • Key Metrics
                                   – Centrality (group or individual measure)
    D                                    • Number of direct connections that individuals have with others in the group (usually look at
                                           incoming connections only)
                E                        • Measure at the individual node or group level
                                   – Cohesion (group measure)
                                         • Ease with which a network can connect
                                         • Aggregate measure of shortest path between each node pair at network level reflects
                                           average distance
                                   – Density (group measure)
                                         • Robustness of the network
                                         • Number of connections that exist in the group out of 100% possible
                                   – Betweenness (individual measure)
        F                   G            • # shortest paths between each node pair that a node is on
                                         • Measure at the individual node level
                                • Node roles
                                   – Peripheral – below average centrality      C
            H                      – Central connector – above average centrality                    D
                    I              – Broker – above average betweenness         E
NodeXL
 Free/Open Social Network Analysis add-in for Excel 2007/2010 makes graph
theory as easy as a pie chart, with integrated analysis of social media sources.
                          http://nodexl.codeplex.com
http://www.youtube.com/watch?v=0M3T65Iw3Ac

NodeXL Video
Goal: Make SNA easier
• Existing Social Network Tools are challenging
  for many novice users
• Tools like Excel are widely used
• Leveraging a spreadsheet as a host for SNA
  lowers barriers to network data analysis and
  display
Twitter Network for “Microsoft Research”
              *BEFORE*
Twitter Network for “Microsoft Research”
               *AFTER*
Network Motif Simplification




                 Cody Dunne, University of Maryland
NodeXL
Graph Gallery
Now Available
Communities
in Cyberspace
This graph represents a
     directed network of
      1,360 Twitter users
    whose recent tweets
contained "contraceptive
 OR contraception". The
   network was obtained
 on Friday, 08 June 2012
  at 13:22 UTC. There is
 an edge for each follows
 relationship. There is an
  edge for each "replies-
     to" relationship in a
 tweet. There is an edge
     for each "mentions"
        relationship in a
   tweet. There is a self-
loop edge for each tweet
 that is not a "replies-to"
     or "mentions". The
 tweets were made over
   the 2-day period from
  Thursday, 07 June 2012
  at 18:46 UTC to Friday,
   08 June 2012 at 13:06
       UTC. The graph's
vertices were grouped by
cluster using the Clauset-
 Newman-Moore cluster
    algorithm. The edge
     colors are based on
 relationship values. The
vertex sizes are based on
   each user’s number of
      followers. Table 1
    reports the summary
    network metrics that
      describe the graph.
Summary network metrics
 Table 1. Summary network metrics for the graph in Figure 1
 Network Metric                                      Value
                                  Graph Type      Directed
                                     Vertices        1360
                               Unique Edges          5641
                        Edges With Duplicates         771
                                  Total Edges        6412
                                   Self-Loops        1096
                        Connected Components          427
          Single-Vertex Connected Components          395
  Maximum Vertices in a Connected Component           880
        Max Edges in a Connected Component           5818
        Maximum Geodesic Distance (Diameter)           12
                  Average Geodesic Distance      3.557807
                                Graph Density 0.002705817
                                   Modularity    0.446145
The Vertices spreadsheet lists users who contributed a
       tweet containing the terms “contraception OR
contraceptives” over two days in early June 2012. Users are
 ranked by their computed betweenness centrality within
 the network of follows, replies, and mentions edges. The
 top 10 vertices, ranked by betweenness centrality are the
   accounts at the center of the network. These include:
    @thinkprogress, @gatesfoundation, @SandraFluke,
  @maleeek, @Change, @foxandfriends, @melindagates,
          @AshleyJudd, @cnalive, and @SOHLTC.
Welser, Howard T., Eric Gleave, Danyel
 Fisher, and Marc Smith. 2007. Visualizing the
 Signatures of Social Roles in Online Discussion
 Groups.
 The Journal of Social Structure. 8(2).




Experts and “Answer People”                                 Discussion people, Topic setters


                              Discussion starters, Topic setters
NodeXL calculates
network metrics and
    word pairs
Contrasting groups
The Content summary
 spreadsheet displays the most
frequently used URLs, hashtags,
   and user names within the
 network as a whole and within
   each calculated sub-group.
Contrast hashtags in Groups 2 & 4
Contrasting URL references
Word Pair Contrasts
20121010 marc smith - mapping collections of connections in social media with node xl
NodeXL Ribbon in Excel
NodeXL data import sources
Example NodeXL data importer for Twitter
NodeXL imports “edges” from social media data sources
NodeXL displays subgraph images along with network metadata




NodeXL creates a list of “vertices” from imported social media edges
Perform
                   collections of
                     common
                  operations with
    NodeXL         a single click

  Automation
makes analysis
simple and fast
NodeXL Network Metrics
NodeXL “Autofill columns” simplifies mapping data attributes to display attributes
20121010 marc smith - mapping collections of connections in social media with node xl
NodeXL enables filtering of networks
NodeXL Generates Overall Network Metrics
20121010 marc smith - mapping collections of connections in social media with node xl
20121010 marc smith - mapping collections of connections in social media with node xl
Social Network Maps Reveal


Key influencers in any topic.

        Sub-groups.

          Bridges.
Social Media Research Foundation
    People             Disciplines                Institutions

   University      Computer Science         University of Maryland
    Faculty
   Students            HCI, CSCW            Oxford Internet Institute

   Industry        Machine Learning           Stanford University

  Independent   Information Visualization     Microsoft Research

  Researchers            UI/UX                 Illinois Institute of
                                                    Technology
  Developers    Social Science/Sociology       Connected Action

                   Network Analysis                  Cornell

                    Collective Action        Morningside Analytics
What we are trying to do:
Open Tools, Open Data, Open Scholarship
• Build the “Firefox of GraphML” – open tools for
  collecting and visualizing social media data
• Connect users to network analysis – make
  network charts as easy as making a pie chart
• Connect researchers to social media data sources
• Archive: Be the “Allen Very Large Telescope Array”
  for Social Media data – coordinate and aggregate
  the results of many user’s data collection and
  analysis
• Create open access research papers & findings
• Make “collections of connections” easy for users
  to manage
What we have done: Open Tools
• NodeXL
• Data providers (“spigots”)
  –   ThreadMill Message Board
  –   Exchange Enterprise Email
  –   Voson Hyperlink
  –   SharePoint
  –   Facebook
  –   Twitter
  –   YouTube
  –   Flickr
What we have done: Open Data
• NodeXLGraphGallery.org
  – User generated collection
    of network graphs,
    datasets and annotations
  – Collective repository for
    the research community
  – Published collections of
    data from a range of social
    media data sources to help
    students and researchers
    connect with data of
    interest and relevance
What we have done: Open Scholarship
What we have done: Open Scholarship
What we want to do:
(Build the tools to) map the social web
• Move NodeXL to the web: (Node[NOT]XL)
   – Node for Google Doc Spreadsheets?
   – WebGL Canvas? D3.JS? Sigma.JS
• Connect to more data sources of interest:
   – RDF, MediaWikis, Gmail, NYT, Citation Networks
• Solve hard network manipulation UI problems:
   – Modal transform, Time series, Automated layouts
• Grow and maintain archives of social media network data sets for
  research use.
• Improve network science education:
   – Workshops on social media network analysis
   – Live lectures and presentations
   – Videos and training materials
How you can help
• Sponsor a feature
• Sponsor workshops
• Sponsor a student
• Schedule training
• Sponsor the foundation
• Donate your money, code, computation, storage,
  bandwidth, data or employee’s time
• Help promote the work of the Social Media
  Research Foundation
20121010 marc smith - mapping collections of connections in social media with node xl
Who is the mayor of your hashtag?




                   Find out at: http://netbadges.com
Who is the mayor of your hashtag?




                                    Find out at: http://netbadges.com
Who is the mayor of your hashtag?
         http://netbadges.com




                                Find out at: http://netbadges.com
Charting Collections of
                                                       Connections
                                                     In Social Media:
                                                    Creating Maps &
                                                      Measures with
                                                         NodeXL




A project from the Social Media Research Foundation: http://www.smrfoundation.org
20121010 marc smith - mapping collections of connections in social media with node xl

More Related Content

PPTX
2015 #MMeasure-Marc Smith-NodeXL Mapping social media using social network ma...
PPTX
Think Link: Network Insights with No Programming Skills
PPTX
2016 SocialMedia.Org Marc Smith-NodeXL-Social Media SNA
PPTX
2015 pdf-marc smith-node xl-social media sna
PPTX
2014 TheNextWeb-Mapping connections with NodeXL
PPTX
20120301 strata-marc smith-mapping social media networks with no coding using...
PPTX
20151001 charles university prague - marc smith - node xl-picturing political...
PPTX
20120622 web sci12-won-marc smith-semantic and social network analysis of …
2015 #MMeasure-Marc Smith-NodeXL Mapping social media using social network ma...
Think Link: Network Insights with No Programming Skills
2016 SocialMedia.Org Marc Smith-NodeXL-Social Media SNA
2015 pdf-marc smith-node xl-social media sna
2014 TheNextWeb-Mapping connections with NodeXL
20120301 strata-marc smith-mapping social media networks with no coding using...
20151001 charles university prague - marc smith - node xl-picturing political...
20120622 web sci12-won-marc smith-semantic and social network analysis of …

What's hot (20)

PPTX
2017 05-26 NodeXL Twitter search #shakeupshow
PPTX
2013 passbac-marc smith-node xl-sna-social media-formatted
PPTX
How to use social media network analysis for amplification
PPTX
2010 sept - mobile web africa - marc smith - says who - mapping social medi...
PPTX
2013 NodeXL Social Media Network Analysis
PPTX
20111123 mwa2011-marc smith
PDF
Marc Smith - Charting Collections of Connections in Social Media: Creating Ma...
PPT
2010 june - personal democracy forum - marc smith - mapping political socia...
PPTX
20110128 connected action-node xl-sea of connections
PPT
Where Am I Aiming? We, Me and the Network - TTIX 2010
PPTX
Simplifying Social Network Diagrams
PPTX
20111103 con tech2011-marc smith
PPTX
LSS'11: Charting Collections Of Connections In Social Media
PPT
2008 - ICWSM - Marc Smith - Some Dimensions Of Social Media
PPTX
2009 December NodeXL Overview
PPTX
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
PPT
Jill Freyne - Collecting community wisdom: integrating social search and soci...
PPT
Social Web 2.0 Class Week 4: Social Networks, Privacy
PDF
Ph.D. defense: semantic social network analysis
PPTX
Just What Is Social in Social Media? An Actor-Network Critique of Twitter Age...
2017 05-26 NodeXL Twitter search #shakeupshow
2013 passbac-marc smith-node xl-sna-social media-formatted
How to use social media network analysis for amplification
2010 sept - mobile web africa - marc smith - says who - mapping social medi...
2013 NodeXL Social Media Network Analysis
20111123 mwa2011-marc smith
Marc Smith - Charting Collections of Connections in Social Media: Creating Ma...
2010 june - personal democracy forum - marc smith - mapping political socia...
20110128 connected action-node xl-sea of connections
Where Am I Aiming? We, Me and the Network - TTIX 2010
Simplifying Social Network Diagrams
20111103 con tech2011-marc smith
LSS'11: Charting Collections Of Connections In Social Media
2008 - ICWSM - Marc Smith - Some Dimensions Of Social Media
2009 December NodeXL Overview
20121001 pawcon 2012-marc smith - mapping collections of connections in socia...
Jill Freyne - Collecting community wisdom: integrating social search and soci...
Social Web 2.0 Class Week 4: Social Networks, Privacy
Ph.D. defense: semantic social network analysis
Just What Is Social in Social Media? An Actor-Network Critique of Twitter Age...
Ad

Viewers also liked (13)

PPTX
Ppt definición del emprendedor
PPT
2009 Node XL Overview: Social Network Analysis in Excel 2007
RTF
Chapter 1—the challenge of human resources management
PDF
UXPeople 2015: Юрий Ветров — Платформенное мышление
PPTX
Perfil del emprendedor
PDF
A Pictorial history of the formations of the Waffen SS
PPTX
How to become category captain
PPTX
Caso clinico perinatal
PDF
Working Safely With Container Unloading
PDF
7 ‘Hidden’ Sources of Big Data That You Have
PDF
Diagnóstico tardío y enfermedad avanzada de VIH en pacientes adultos en un ho...
PPTX
Mother Tongue Based - Multilingual Education (MTB-MLE) in Philippines
PDF
Behind the Scenes: Launching HubSpot Tokyo
Ppt definición del emprendedor
2009 Node XL Overview: Social Network Analysis in Excel 2007
Chapter 1—the challenge of human resources management
UXPeople 2015: Юрий Ветров — Платформенное мышление
Perfil del emprendedor
A Pictorial history of the formations of the Waffen SS
How to become category captain
Caso clinico perinatal
Working Safely With Container Unloading
7 ‘Hidden’ Sources of Big Data That You Have
Diagnóstico tardío y enfermedad avanzada de VIH en pacientes adultos en un ho...
Mother Tongue Based - Multilingual Education (MTB-MLE) in Philippines
Behind the Scenes: Launching HubSpot Tokyo
Ad

Similar to 20121010 marc smith - mapping collections of connections in social media with node xl (20)

PPTX
4. social network analysis
PDF
Fbk Seminar Michela Ferron
PPTX
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
PDF
How Important Social Graphs are for DTN Routing
PPTX
20110719 social media research foundation-charting collections of connections
PPTX
Network analysis lecture
PPTX
Social Network Analysis (SNA) 2018
PDF
Oxford Digital Humanities Summer School
PPT
Social Networks of Performance
PDF
Social network analysis basics
PDF
CS6010 Social Network Analysis Unit V
PPTX
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
ZIP
Social Networks and Computer Science
PDF
Document 8 1.pdf
PPTX
Node XL - features and demo
PDF
Jürgens diata12-communities
PDF
Exploring Social Media with NodeXL
PPTX
Social Network Analysis - an Introduction (minus the Maths)
PPTX
Mining and analyzing social media part 2 - hicss47 tutorial - dave king
4. social network analysis
Fbk Seminar Michela Ferron
2010-November-8-NIA - Smart Society and Civic Culture - Marc Smith
How Important Social Graphs are for DTN Routing
20110719 social media research foundation-charting collections of connections
Network analysis lecture
Social Network Analysis (SNA) 2018
Oxford Digital Humanities Summer School
Social Networks of Performance
Social network analysis basics
CS6010 Social Network Analysis Unit V
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Social Networks and Computer Science
Document 8 1.pdf
Node XL - features and demo
Jürgens diata12-communities
Exploring Social Media with NodeXL
Social Network Analysis - an Introduction (minus the Maths)
Mining and analyzing social media part 2 - hicss47 tutorial - dave king

More from Marc Smith (14)

PPTX
Think link what is an edge - NodeXL
PPTX
20130724 ted x-marc smith-digital health futures empowerment or coercion
PDF
2012 ona practitioner-courseflyer
PDF
2011 IEEE Social Computing Nodexl: Group-In-A-Box
PPTX
20110830 Introducing the Social Media Research Foundation
PPTX
Personal Digital Archiving 2011 - Charting Collections of Connections in Soci...
PPT
Analyzing social media networks with NodeXL - Chapter-14 Images
PPT
Analyzing social media networks with NodeXL - Chapter-13 Images
PPT
Analyzing social media networks with NodeXL - Chapter- 12 images
PPT
Analyzing social media networks with NodeXL - Chapter-11 Images
PPT
Analyzing social media networks with NodeXL - Chapter-10 Images
PPT
Analyzing social media networks with NodeXL - Chapter- 09 Images
PPT
Analyzing social media networks with NodeXL - Chapter- 08 images
PPT
Analyzing social media networks with NodeXL - Chapter-07 Images
Think link what is an edge - NodeXL
20130724 ted x-marc smith-digital health futures empowerment or coercion
2012 ona practitioner-courseflyer
2011 IEEE Social Computing Nodexl: Group-In-A-Box
20110830 Introducing the Social Media Research Foundation
Personal Digital Archiving 2011 - Charting Collections of Connections in Soci...
Analyzing social media networks with NodeXL - Chapter-14 Images
Analyzing social media networks with NodeXL - Chapter-13 Images
Analyzing social media networks with NodeXL - Chapter- 12 images
Analyzing social media networks with NodeXL - Chapter-11 Images
Analyzing social media networks with NodeXL - Chapter-10 Images
Analyzing social media networks with NodeXL - Chapter- 09 Images
Analyzing social media networks with NodeXL - Chapter- 08 images
Analyzing social media networks with NodeXL - Chapter-07 Images

Recently uploaded (20)

PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
KodekX | Application Modernization Development
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Approach and Philosophy of On baking technology
PPTX
Cloud computing and distributed systems.
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
NewMind AI Weekly Chronicles - August'25 Week I
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
The AUB Centre for AI in Media Proposal.docx
Mobile App Security Testing_ A Comprehensive Guide.pdf
KodekX | Application Modernization Development
Advanced methodologies resolving dimensionality complications for autism neur...
MIND Revenue Release Quarter 2 2025 Press Release
Understanding_Digital_Forensics_Presentation.pptx
The Rise and Fall of 3GPP – Time for a Sabbatical?
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Encapsulation_ Review paper, used for researhc scholars
MYSQL Presentation for SQL database connectivity
Approach and Philosophy of On baking technology
Cloud computing and distributed systems.
Network Security Unit 5.pdf for BCA BBA.
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Per capita expenditure prediction using model stacking based on satellite ima...

20121010 marc smith - mapping collections of connections in social media with node xl

  • 2. Charting Collections of Connections In Social Media: Creating Maps & Measures with NodeXL A project from the Social Media Research Foundation: http://www.smrfoundation.org
  • 3. About Me Introductions Marc A. Smith Chief Social Scientist Connected Action Consulting Group Marc@connectedaction.net http://www.connectedaction.net http://www.codeplex.com/nodexl http://www.twitter.com/marc_smith http://delicious.com/marc_smith/Paper http://www.flickr.com/photos/marc_smith http://www.facebook.com/marc.smith.sociologist http://www.linkedin.com/in/marcasmith http://www.slideshare.net/Marc_A_Smith http://www.smrfoundation.org
  • 4. Social Media Research Foundation http://smrfoundation.org
  • 5. Social Media (email, Facebook, Twitter, YouTube, and more) is all about connections from people to people. 5
  • 6. Patterns are left behind 6
  • 7. There are many kinds of ties…. Like, Link, Reply, Rate, Review, Favorite, Friend, Follow, Forward, Edit, Tag, Comment, Check-in… http://www.flickr.com/photos/stevendepolo/3254238329
  • 8. “Think Link” Nodes & Edges Is related to A B
  • 9. Each contains one or more social networks World Wide Web
  • 12. 알철수 - Ahn Chul Soo
  • 17. Strength of Weak ties p://www.flickr.com/photos/fullaperture/81266869/
  • 19. Social Networks • History: from the dawn of time! • Theory and method: 1934 -> • Jacob L. Moreno • http://en.wiki pedia.org/wiki /Jacob_L._Mor eno Jacob Moreno’s early social network diagram of positive and negative relationships among members of a football team. Originally published in Moreno, J. L. (1934). Who shall survive? Washington, DC: Nervous and Mental Disease Publishing Company.
  • 20. A nearly social network diagram of relationships among workers in a factory illustrates the positions different workers occupy within the workgroup. Originally published in Roethlisberger, F., and Dickson, W. (1939). Management and the worker. Cambridge, UK: Cambridge University Press.
  • 21. Hubs
  • 26. Network of connections among “#Debate AND Obama” mentioning Twitter users
  • 27. Like MSPaint™ for graphs. — the Community Introduction to NodeXL
  • 28. NodeXL Network Overview Discovery and Exploration add-in for Excel 2007/2010 A minimal network can illustrate the ways different locations have different values for centrality and degree
  • 30. #teaparty 15 November 2011 #occupywallstreet 15 November 2011 http://www.newscientist.com/blogs/onepercent/2011/11/occupy-vs-tea-party-what-their.html
  • 31. Social Network Theory http://en.wikipedia.org/wiki/Social_network • Central tenet – Social structure emerges from – the aggregate of relationships (ties) – among members of a population • Phenomena of interest – Emergence of cliques and clusters – from patterns of relationships – Centrality (core), periphery (isolates), Source: Richards, W. – betweenness (1986). The NEGOPY • Methods network analysis program. Burnaby, BC: – Surveys, interviews, observations, Department of Communication, Simon log file analysis, computational Fraser University. pp.7- analysis of matrices 16 (Hampton &Wellman, 1999; Paolillo, 2001; Wellman, 2001)
  • 32. SNA 101 • Node A – “actor” on which relationships act; 1-mode versus 2-mode networks • Edge B – Relationship connecting nodes; can be directional C • Cohesive Sub-Group – Well-connected group; clique; cluster A B D E • Key Metrics – Centrality (group or individual measure) D • Number of direct connections that individuals have with others in the group (usually look at incoming connections only) E • Measure at the individual node or group level – Cohesion (group measure) • Ease with which a network can connect • Aggregate measure of shortest path between each node pair at network level reflects average distance – Density (group measure) • Robustness of the network • Number of connections that exist in the group out of 100% possible – Betweenness (individual measure) F G • # shortest paths between each node pair that a node is on • Measure at the individual node level • Node roles – Peripheral – below average centrality C H – Central connector – above average centrality D I – Broker – above average betweenness E
  • 33. NodeXL Free/Open Social Network Analysis add-in for Excel 2007/2010 makes graph theory as easy as a pie chart, with integrated analysis of social media sources. http://nodexl.codeplex.com
  • 35. Goal: Make SNA easier • Existing Social Network Tools are challenging for many novice users • Tools like Excel are widely used • Leveraging a spreadsheet as a host for SNA lowers barriers to network data analysis and display
  • 36. Twitter Network for “Microsoft Research” *BEFORE*
  • 37. Twitter Network for “Microsoft Research” *AFTER*
  • 38. Network Motif Simplification Cody Dunne, University of Maryland
  • 42. This graph represents a directed network of 1,360 Twitter users whose recent tweets contained "contraceptive OR contraception". The network was obtained on Friday, 08 June 2012 at 13:22 UTC. There is an edge for each follows relationship. There is an edge for each "replies- to" relationship in a tweet. There is an edge for each "mentions" relationship in a tweet. There is a self- loop edge for each tweet that is not a "replies-to" or "mentions". The tweets were made over the 2-day period from Thursday, 07 June 2012 at 18:46 UTC to Friday, 08 June 2012 at 13:06 UTC. The graph's vertices were grouped by cluster using the Clauset- Newman-Moore cluster algorithm. The edge colors are based on relationship values. The vertex sizes are based on each user’s number of followers. Table 1 reports the summary network metrics that describe the graph.
  • 43. Summary network metrics Table 1. Summary network metrics for the graph in Figure 1 Network Metric Value Graph Type Directed Vertices 1360 Unique Edges 5641 Edges With Duplicates 771 Total Edges 6412 Self-Loops 1096 Connected Components 427 Single-Vertex Connected Components 395 Maximum Vertices in a Connected Component 880 Max Edges in a Connected Component 5818 Maximum Geodesic Distance (Diameter) 12 Average Geodesic Distance 3.557807 Graph Density 0.002705817 Modularity 0.446145
  • 44. The Vertices spreadsheet lists users who contributed a tweet containing the terms “contraception OR contraceptives” over two days in early June 2012. Users are ranked by their computed betweenness centrality within the network of follows, replies, and mentions edges. The top 10 vertices, ranked by betweenness centrality are the accounts at the center of the network. These include: @thinkprogress, @gatesfoundation, @SandraFluke, @maleeek, @Change, @foxandfriends, @melindagates, @AshleyJudd, @cnalive, and @SOHLTC.
  • 45. Welser, Howard T., Eric Gleave, Danyel Fisher, and Marc Smith. 2007. Visualizing the Signatures of Social Roles in Online Discussion Groups. The Journal of Social Structure. 8(2). Experts and “Answer People” Discussion people, Topic setters Discussion starters, Topic setters
  • 48. The Content summary spreadsheet displays the most frequently used URLs, hashtags, and user names within the network as a whole and within each calculated sub-group.
  • 49. Contrast hashtags in Groups 2 & 4
  • 55. Example NodeXL data importer for Twitter
  • 56. NodeXL imports “edges” from social media data sources
  • 57. NodeXL displays subgraph images along with network metadata NodeXL creates a list of “vertices” from imported social media edges
  • 58. Perform collections of common operations with NodeXL a single click Automation makes analysis simple and fast
  • 60. NodeXL “Autofill columns” simplifies mapping data attributes to display attributes
  • 63. NodeXL Generates Overall Network Metrics
  • 66. Social Network Maps Reveal Key influencers in any topic. Sub-groups. Bridges.
  • 67. Social Media Research Foundation People Disciplines Institutions University Computer Science University of Maryland Faculty Students HCI, CSCW Oxford Internet Institute Industry Machine Learning Stanford University Independent Information Visualization Microsoft Research Researchers UI/UX Illinois Institute of Technology Developers Social Science/Sociology Connected Action Network Analysis Cornell Collective Action Morningside Analytics
  • 68. What we are trying to do: Open Tools, Open Data, Open Scholarship • Build the “Firefox of GraphML” – open tools for collecting and visualizing social media data • Connect users to network analysis – make network charts as easy as making a pie chart • Connect researchers to social media data sources • Archive: Be the “Allen Very Large Telescope Array” for Social Media data – coordinate and aggregate the results of many user’s data collection and analysis • Create open access research papers & findings • Make “collections of connections” easy for users to manage
  • 69. What we have done: Open Tools • NodeXL • Data providers (“spigots”) – ThreadMill Message Board – Exchange Enterprise Email – Voson Hyperlink – SharePoint – Facebook – Twitter – YouTube – Flickr
  • 70. What we have done: Open Data • NodeXLGraphGallery.org – User generated collection of network graphs, datasets and annotations – Collective repository for the research community – Published collections of data from a range of social media data sources to help students and researchers connect with data of interest and relevance
  • 71. What we have done: Open Scholarship
  • 72. What we have done: Open Scholarship
  • 73. What we want to do: (Build the tools to) map the social web • Move NodeXL to the web: (Node[NOT]XL) – Node for Google Doc Spreadsheets? – WebGL Canvas? D3.JS? Sigma.JS • Connect to more data sources of interest: – RDF, MediaWikis, Gmail, NYT, Citation Networks • Solve hard network manipulation UI problems: – Modal transform, Time series, Automated layouts • Grow and maintain archives of social media network data sets for research use. • Improve network science education: – Workshops on social media network analysis – Live lectures and presentations – Videos and training materials
  • 74. How you can help • Sponsor a feature • Sponsor workshops • Sponsor a student • Schedule training • Sponsor the foundation • Donate your money, code, computation, storage, bandwidth, data or employee’s time • Help promote the work of the Social Media Research Foundation
  • 76. Who is the mayor of your hashtag? Find out at: http://netbadges.com
  • 77. Who is the mayor of your hashtag? Find out at: http://netbadges.com
  • 78. Who is the mayor of your hashtag? http://netbadges.com Find out at: http://netbadges.com
  • 79. Charting Collections of Connections In Social Media: Creating Maps & Measures with NodeXL A project from the Social Media Research Foundation: http://www.smrfoundation.org

Editor's Notes

  • #16: http://www.flickr.com/photos/lizjones/1571656758/sizes/o/
  • #17: http://www.flickr.com/photos/kjander/3123883124/sizes/o/
  • #25: http://www.flickr.com/photos/library_of_congress/3295494976/sizes/o/in/photostream/
  • #26: http://www.flickr.com/photos/amycgx/3119640267/
  • #29: A tutorial on analyzing social media networks is available from: casci.umd.edu/NodeXL_TeachingDifferent positions within a network can be measured using network metrics.
  • #65: Virgin America
  • #66: Dell Listens and Dell Cares