SlideShare a Scribd company logo
Interactive visualization and exploration
of network data with gephi
Bernhard Rieder
Universiteit van Amsterdam
Mediastudies Department
and some conceptual context
Two kinds of mathematics
Can there be data analysis without math? No.
Does this imply epistemological commitments? Yes.
But there are choices, e.g. between:
☉ Confirmatory data analysis => deductive
☉ Exploratory data analysis (Tukey 1962) => inductive
Two kinds of mathematics
Statistics
Observed: objects and properties
Inferred: social forces
Data representation: the table
Visual representation: quantity charts
Grouping: "class" (similar properties)
Graph-theory
Observed: objects and relations
Inferred: structure
Data representation: the matrix
Visual representation: network diagrams
Grouping: "clique" (dense relations)
Graph theory
Leonhard Euler, "Seven Bridges of Königsberg", 1735
Introducing the "point and line" model
Graph theory
Develops over the 20th century, in particular the second half.
Integrates branches of mathematics (topology, geometry, statistics, etc.).
Graph theory is "the mathematics of structure" (Harary 1965), "a
mathematical model for any system involving a binary relation" (Harary
1969); it makes relational structure calculable.
"Perhaps even more than to the contact between mankind and nature, graph theory owes to
the contact of human beings between each other." (König 1936)
Basic ideas
Moreno 1934
Graph theory developed in
exchange with sociometry,
small-group research and
(later) social exchange
theory.
Starting point:
"the sociometric test"
(experimental definition of
"relation")
Basic ideas
Forsythe and Katz, 1946, "adjacency matrix"
Harary, Graph Theory, 1969
Basic ideas
The network singularity
Why do network analysis and visualization? Which arguments are put
forward?
☉ New media: technical and conceptual structures modeled as networks
☉ The network imaginary: networks as analytical device and trending topic
☉ Calculative capacities: powerful techniques and tools
☉ Visualization: the network diagram, "visual analytics"
☉ Logistics: data, software, and hardware are available and cheap
☉ Methodology I: dissatisfaction with statistics => SNA
☉ Methodology II: a "new science of networks" (Watts 2005) emerged
☉ Society: diversification, problems with demographics / statistics / theory
Basic ideas
Adamic and Glance, "Divided They Blog", 2005
Graph theory
Graph theory consists of or provides:
☉ A basic conceptual and formal model (point and line)
☉ Descriptive and analytical language to talk about specific graphs
☉ Extensive calculability of structure
☉ Various “native” (and non-native) forms of visualization
Formalization
"As we have seen, the basic terms of digraph theory are point and line. Thus, if an
appropriate coordination is made so that each entity of an empirical system is identified
with a point and each relationship is identified with a line, then for all true statements
about structural properties of the obtained digraph there are corresponding true statements
about structural properties of the empirical system." (Harary et al. 1965)
There is always an epistemological commitment!
=> What can "carry" the reductionism and formalization?
Much of these data can be
analyzed as graphs.
Social media formalize
interaction at the interface.
Basic ideas
What Kind of Phenomena/Data?
Interactive networks (Watts 2004): link encodes tangible interaction
☉ social network
☉ citation networks
☉ hypertext networks
Symbolic networks (Watts 2004): link is conceptual
☉ co-presence (Tracker Tracker, IMDB, etc.)
☉ co-word
☉ any kind of "structure" that can be formalized as point and line
=> do all kinds of analysis (SNA, transportation, text mining, etc.)
=> analyze structural properties in various ways
Basic ideas
File formats
To be able to begin, we need data in a graph file format. There are a
number of different file formats used to specify graphs.
Different formats have different capacities (e.g. .gexf allows to specify
time intervals).
The guess (.gdf) format:
http://courses.polsys.net/gephi/
Basic ideas
What is a graph?
An abstract representation of nodes connected by links.
Two ways of analyzing graphs:
☉ numerical analysis (graph statistics, structural measures, etc.)
☉ visualization (network diagram, matrix, arc diagram, etc.)
Basic ideas
Wikipedia: Glossary of graph theory
Tools are easy, concepts are hard
http://courses.polsys.net/gephi/
Vertices and edges!
Nodes and lines!
Two main types:
Directed (e.g. Twitter)
Undirected (e.g. Facebook)
Properties of nodes:
degree, centrality, etc.
Properties of edges:
weight, direction, etc.
Properties of the graph:
averages, diameter, communities, etc.
Basic ideas
What is a graph?
A
B
C
D
a-b
b-d
b-c c-d
Nodes, Degree:
A: 1, B: 3, C: 2, D: 2
Nodes, Weighted Degree:
A: 1, B: 3, C: 3, D: 3
Edges, Weight:
a-b: 1, b-c: 1, b-d: 1, c-d: 2
Graph, diameter: 2
Graph, density: 0.667 (4 edges out of 6)
Graph, average shortest path: 1.334
Numbers are great for comparison!
Basic ideas
Basic ideashttp://courses.polsys.net/gephi/
Basic ideas
Interactive visual analytics
Bringing structure to the surface (gephi panel: "layout")
☉ different spatializations (force, geometry, etc.)
Projecting variables into the diagram (gephi panel: "ranking")
☉ Size (nodes, edges, labels, etc.)
☉ Color (nodes, edges, labels, etc.)
Deriving measures (gephi panel: "statistics")
☉ Properties of nodes, edges, structure => new variables
Analysis: e.g. correlation between spatial layout and variables?
Layout algorithms transform n-dimensional
adjacency matrices into two-dimensional diagrams
Every algorithm/technique reveals the structure
of the graph differently, shows different aspects
Basic ideas
Interactive visual analytics
Bringing structure to the surface (gephi panel: "layout")
☉ different spatializations (force, geometry, etc.)
Projecting variables into the diagram (gephi panel: "ranking")
☉ Size (nodes, edges, labels, etc.)
☉ Color (nodes, edges, labels, etc.)
Analysis: e.g. “correlation” between spatial layout and variables?
Basic ideas
Nine measures of centrality (Freeman 1979)
Basic ideas
Interactive visual analytics
Bringing structure to the surface (gephi panel: "layout")
☉ different spatializations (force, geometry, etc.)
Projecting variables into the diagram (gephi panel: "ranking")
☉ Size (nodes, edges, labels, etc.)
☉ Color (nodes, edges, labels, etc.)
Deriving measures (gephi panel: "statistics")
☉ Properties of nodes, edges, structure => new variables
Analysis: e.g. “correlation” between spatial layout and variables?
Basic ideas
Basic ideas
Label PR α=0.85 PR α=0.7 PR α=0.55 PR α=0.4 In-Degree Out-Degree Degree
n34 0.0944 0.0743 0.0584 0.0460 4 1 5
n1 0.0867 0.0617 0.0450 0.0345 1 2 3
n17 0.0668 0.0521 0.0423 0.0355 2 1 3
n39 0.0663 0.0541 0.0453 0.0388 5 1 6
n22 0.0619 0.0506 0.0441 0.0393 5 1 6
n27 0.0591 0.0451 0.0371 0.0318 1 0 1
n38 0.0522 0.0561 0.0542 0.0486 6 0 6
n11 0.0492 0.0372 0.0306 0.0274 3 1 4
FB group "Islam is dangerous"
Friendship network, color: betweenness centrality
2.339 members
Average degree of 39.69
81.7% have at least one friend in the group
55.4% five or more
37.2% have 20 or more
founder and admin has 609 friends
Twitter 1% sample, co-hashtag analysis
227,029 unique hashtags, 1627 displayed (freq >= 50)
Size: frequency
Color: modularity
Size: frequency
Color: user diversity
Twitter 1% sample, co-hashtag analysis
227,029 unique hashtags, 1627 displayed (freq >= 50)
Size: frequency
Color: degree
Twitter 1% sample, co-hashtag analysis
227,029 unique hashtags, 1627 displayed (freq >= 50)
Network statistics
betweenness centrality
degree
Relational elements of graphs can
be represented as tables (nodes
have properties) and analyzed
through statistics.
Network statistics bridge the gap
between individual units and the
structural forms they are
embedded in.
This is currently an extremely
prolific field of research.
Twitter 1% sample
Co-hashtag analysis
Degree vs.
wordFrequency
Degree vs. userDiversity
Twitter 1% sample
Co-hashtag analysis
Basic ideas
PlugIn: Spatial Ranking
Co-like analysis of my personal FB network:
Nodes: users / Links: "liking the same thing"
Example 3: our imagination
Basic ideas
PlugIn: Multimodal Projection
Basic ideas
Basic ideas
PlugIn: GeoLayout
Thank You
rieder@uva.nl
https://www.digitalmethods.net
http://thepoliticsofsystems.net
"Far better an approximate answer to the right question,
which is often vague, than an exact answer to the wrong
question, which can always be made precise. Data
analysis must progress by approximate answers, at best,
since its knowledge of what the problem really is will at
best be approximate." (Tukey 1962)

More Related Content

PPTX
No dues management system prepared by HRITIKA RAJ (Shivalik College of engg.,...
PDF
Intro To Geospatial
PDF
地球地図を利用した地図タイルの作成 - FOSS4G TOKYO 2014 全体セッション2
PDF
IRJET- Fake Profile Identification using Machine Learning
PPTX
Smart Attendance System using QR Code with SMS Notification
PDF
QGIS Module 3
PDF
Driver Drowsiness Detection System using Google ML Kit Face Detection API and...
PPTX
Internship Presentation 1 Web Developer
No dues management system prepared by HRITIKA RAJ (Shivalik College of engg.,...
Intro To Geospatial
地球地図を利用した地図タイルの作成 - FOSS4G TOKYO 2014 全体セッション2
IRJET- Fake Profile Identification using Machine Learning
Smart Attendance System using QR Code with SMS Notification
QGIS Module 3
Driver Drowsiness Detection System using Google ML Kit Face Detection API and...
Internship Presentation 1 Web Developer

What's hot (20)

PPTX
Project presentation
PDF
Chapter ii - Web-based Library Management System of East West Colleges
PDF
Architecting the ArcGIS Platform
PDF
QGIS Tutorial 2
PDF
“Waste Food Management and Donation App”
PPTX
Suggestion box
DOC
Online old books sales by hemraj gahlot
PPTX
Object detection with deep learning
DOCX
Online bus ticket booking
DOC
Online voting system full thesis project by jahir
PPTX
Intoduction to Digilocker
PPTX
Library management system project
PPTX
Vehicle Management - Features, Advantages, Benefits
PPTX
online bus ticket booking system
PPTX
Library Management System.powerpoint.pptx
PDF
Android Fingerprint Authentication
PPTX
Online shopping with shopping cart ppt 1
PPTX
Object extraction from satellite imagery using deep learning
PDF
E office presentation
PDF
IGNOU Assignment Front Page
Project presentation
Chapter ii - Web-based Library Management System of East West Colleges
Architecting the ArcGIS Platform
QGIS Tutorial 2
“Waste Food Management and Donation App”
Suggestion box
Online old books sales by hemraj gahlot
Object detection with deep learning
Online bus ticket booking
Online voting system full thesis project by jahir
Intoduction to Digilocker
Library management system project
Vehicle Management - Features, Advantages, Benefits
online bus ticket booking system
Library Management System.powerpoint.pptx
Android Fingerprint Authentication
Online shopping with shopping cart ppt 1
Object extraction from satellite imagery using deep learning
E office presentation
IGNOU Assignment Front Page
Ad

Viewers also liked (20)

PDF
Gephi short introduction
PPTX
Gephi bbc
ODP
Use case for Using Gephi for Social Network Analysis of facebook
PPTX
Introduction to Network Analysis in Gephi
PDF
Rogers data days_2014_slides_opti
KEY
Web Flags Summer School 2012
PDF
Crawling and Scraping tutorial at the Digital Methods Summer School 2013
PDF
Digital Methods Summer School 2014 Tool Medley
PDF
Rogers studyingpoliticalissues mar2014_optimized_ii_
PDF
Repurposing Wikipedia: Wikipedia as data set and analytical device
PDF
Post-social methods? Issues in live research, by Noortje Marres and Esther We...
PDF
Hashtag lifelines
PDF
National Tracking Ecologies - Digital Methods Summer School 2013
PDF
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
KEY
Traces of the Trackers. Tracking the Trackers: A historical analysis using th...
PDF
Tracking the Trackers tutorial at the Digital Methods Summer School 2013
PDF
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
PDF
Digital Methods Summer School 2015 Tool Medley
PPTX
Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...
PDF
Digital Methods Tool Medley
Gephi short introduction
Gephi bbc
Use case for Using Gephi for Social Network Analysis of facebook
Introduction to Network Analysis in Gephi
Rogers data days_2014_slides_opti
Web Flags Summer School 2012
Crawling and Scraping tutorial at the Digital Methods Summer School 2013
Digital Methods Summer School 2014 Tool Medley
Rogers studyingpoliticalissues mar2014_optimized_ii_
Repurposing Wikipedia: Wikipedia as data set and analytical device
Post-social methods? Issues in live research, by Noortje Marres and Esther We...
Hashtag lifelines
National Tracking Ecologies - Digital Methods Summer School 2013
Cross-Platform Profiling tutorial at the Digital Methods Summer School 2013
Traces of the Trackers. Tracking the Trackers: A historical analysis using th...
Tracking the Trackers tutorial at the Digital Methods Summer School 2013
Rogers digitalmethodsaftersocialmedia nov2013_optimized_
Digital Methods Summer School 2015 Tool Medley
Richard Rogers, Otherwise Engaged: Critical Analytics and the New Meanings of...
Digital Methods Tool Medley
Ad

Similar to Interactive visualization and exploration of network data with Gephi (20)

PPTX
Interactive visualization and exploration of network data with gephi
PPTX
Social Network Analysis Introduction including Data Structure Graph overview.
PPT
SSRI_pt1.ppt
PDF
Graph Analyses with Python and NetworkX
PPTX
Data Structure Graph DMZ #DMZone
PPTX
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
PPTX
Graph-Theory-The-Foundations-of-Modern-Networks.pptx
PDF
Mining Social Graph Data
PPTX
Gephi, Graphx, and Giraph
PPTX
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
PPTX
The Science Of Social Networks
PDF
Descobrindo o tesouro escondido nos seus dados usando grafos.
PPTX
Apache Spark GraphX highlights.
PPTX
lec3_socialnetwork_part1.pptx
PDF
Graph Analytics with Greenplum and Apache MADlib
PPTX
Graph_Theory_and_Applications_Presentation.pptx
PPT
Social Network Based Information Systems (Tin180 Com)
PPTX
Network analysis lecture
PDF
Complex Networks Analysis @ Universita Roma Tre
PDF
Link Analysis in Networks - or - Finding the Terrorists
Interactive visualization and exploration of network data with gephi
Social Network Analysis Introduction including Data Structure Graph overview.
SSRI_pt1.ppt
Graph Analyses with Python and NetworkX
Data Structure Graph DMZ #DMZone
Social Network Analysis - Lecture 4 in Introduction to Computational Social S...
Graph-Theory-The-Foundations-of-Modern-Networks.pptx
Mining Social Graph Data
Gephi, Graphx, and Giraph
Frontiers of Computational Journalism week 8 - Visualization and Network Anal...
The Science Of Social Networks
Descobrindo o tesouro escondido nos seus dados usando grafos.
Apache Spark GraphX highlights.
lec3_socialnetwork_part1.pptx
Graph Analytics with Greenplum and Apache MADlib
Graph_Theory_and_Applications_Presentation.pptx
Social Network Based Information Systems (Tin180 Com)
Network analysis lecture
Complex Networks Analysis @ Universita Roma Tre
Link Analysis in Networks - or - Finding the Terrorists

More from Digital Methods Initiative (14)

PDF
Query Design for Digital Methods by Richard Rogers
PDF
Digital Methods by Richard Rogers
PDF
The Birth of Social Media Methods
PDF
Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...
PDF
Digital Methods Summer School 2013 Tool Medley
PDF
Dmi12 workshops - crawling and scraping
PDF
Digital Methods Tool Medley. Digital Methods Summer School 2012
PDF
Digital Methods Winterschool 2012: API - Interfaces to the Cloud
PDF
DMI Workshop: When Search Becomes Research
PDF
DMI Workshop: Crawling and Scraping
PDF
DMI Workshop: Data visualization. Analytical clouding.
KEY
DMI Workshop: Wikileaks and the Myth of (Data-Driven) Citizen Journalism (wik...
PDF
DMI Workshop. Data visualization: Clouding
PDF
IIPC Dutch Blogosphere
Query Design for Digital Methods by Richard Rogers
Digital Methods by Richard Rogers
The Birth of Social Media Methods
Studying Facebook via Data Extraction: a Netvizz tutorial at the Digital Meth...
Digital Methods Summer School 2013 Tool Medley
Dmi12 workshops - crawling and scraping
Digital Methods Tool Medley. Digital Methods Summer School 2012
Digital Methods Winterschool 2012: API - Interfaces to the Cloud
DMI Workshop: When Search Becomes Research
DMI Workshop: Crawling and Scraping
DMI Workshop: Data visualization. Analytical clouding.
DMI Workshop: Wikileaks and the Myth of (Data-Driven) Citizen Journalism (wik...
DMI Workshop. Data visualization: Clouding
IIPC Dutch Blogosphere

Recently uploaded (20)

PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
Pharma ospi slides which help in ospi learning
PDF
RMMM.pdf make it easy to upload and study
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PPTX
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
Business Ethics Teaching Materials for college
PPTX
Week 4 Term 3 Study Techniques revisited.pptx
PDF
TR - Agricultural Crops Production NC III.pdf
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PPTX
master seminar digital applications in india
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
PDF
Pre independence Education in Inndia.pdf
PDF
O7-L3 Supply Chain Operations - ICLT Program
Abdominal Access Techniques with Prof. Dr. R K Mishra
O5-L3 Freight Transport Ops (International) V1.pdf
Pharma ospi slides which help in ospi learning
RMMM.pdf make it easy to upload and study
Pharmacology of Heart Failure /Pharmacotherapy of CHF
The Healthy Child – Unit II | Child Health Nursing I | B.Sc Nursing 5th Semester
Microbial disease of the cardiovascular and lymphatic systems
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Business Ethics Teaching Materials for college
Week 4 Term 3 Study Techniques revisited.pptx
TR - Agricultural Crops Production NC III.pdf
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Module 4: Burden of Disease Tutorial Slides S2 2025
master seminar digital applications in india
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
Pre independence Education in Inndia.pdf
O7-L3 Supply Chain Operations - ICLT Program

Interactive visualization and exploration of network data with Gephi

  • 1. Interactive visualization and exploration of network data with gephi Bernhard Rieder Universiteit van Amsterdam Mediastudies Department and some conceptual context
  • 2. Two kinds of mathematics Can there be data analysis without math? No. Does this imply epistemological commitments? Yes. But there are choices, e.g. between: ☉ Confirmatory data analysis => deductive ☉ Exploratory data analysis (Tukey 1962) => inductive
  • 3. Two kinds of mathematics Statistics Observed: objects and properties Inferred: social forces Data representation: the table Visual representation: quantity charts Grouping: "class" (similar properties) Graph-theory Observed: objects and relations Inferred: structure Data representation: the matrix Visual representation: network diagrams Grouping: "clique" (dense relations)
  • 4. Graph theory Leonhard Euler, "Seven Bridges of Königsberg", 1735 Introducing the "point and line" model
  • 5. Graph theory Develops over the 20th century, in particular the second half. Integrates branches of mathematics (topology, geometry, statistics, etc.). Graph theory is "the mathematics of structure" (Harary 1965), "a mathematical model for any system involving a binary relation" (Harary 1969); it makes relational structure calculable. "Perhaps even more than to the contact between mankind and nature, graph theory owes to the contact of human beings between each other." (König 1936)
  • 6. Basic ideas Moreno 1934 Graph theory developed in exchange with sociometry, small-group research and (later) social exchange theory. Starting point: "the sociometric test" (experimental definition of "relation")
  • 8. Forsythe and Katz, 1946, "adjacency matrix"
  • 10. Basic ideas The network singularity Why do network analysis and visualization? Which arguments are put forward? ☉ New media: technical and conceptual structures modeled as networks ☉ The network imaginary: networks as analytical device and trending topic ☉ Calculative capacities: powerful techniques and tools ☉ Visualization: the network diagram, "visual analytics" ☉ Logistics: data, software, and hardware are available and cheap ☉ Methodology I: dissatisfaction with statistics => SNA ☉ Methodology II: a "new science of networks" (Watts 2005) emerged ☉ Society: diversification, problems with demographics / statistics / theory
  • 11. Basic ideas Adamic and Glance, "Divided They Blog", 2005
  • 12. Graph theory Graph theory consists of or provides: ☉ A basic conceptual and formal model (point and line) ☉ Descriptive and analytical language to talk about specific graphs ☉ Extensive calculability of structure ☉ Various “native” (and non-native) forms of visualization
  • 13. Formalization "As we have seen, the basic terms of digraph theory are point and line. Thus, if an appropriate coordination is made so that each entity of an empirical system is identified with a point and each relationship is identified with a line, then for all true statements about structural properties of the obtained digraph there are corresponding true statements about structural properties of the empirical system." (Harary et al. 1965) There is always an epistemological commitment! => What can "carry" the reductionism and formalization?
  • 14. Much of these data can be analyzed as graphs. Social media formalize interaction at the interface.
  • 15. Basic ideas What Kind of Phenomena/Data? Interactive networks (Watts 2004): link encodes tangible interaction ☉ social network ☉ citation networks ☉ hypertext networks Symbolic networks (Watts 2004): link is conceptual ☉ co-presence (Tracker Tracker, IMDB, etc.) ☉ co-word ☉ any kind of "structure" that can be formalized as point and line => do all kinds of analysis (SNA, transportation, text mining, etc.) => analyze structural properties in various ways
  • 16. Basic ideas File formats To be able to begin, we need data in a graph file format. There are a number of different file formats used to specify graphs. Different formats have different capacities (e.g. .gexf allows to specify time intervals). The guess (.gdf) format: http://courses.polsys.net/gephi/
  • 17. Basic ideas What is a graph? An abstract representation of nodes connected by links. Two ways of analyzing graphs: ☉ numerical analysis (graph statistics, structural measures, etc.) ☉ visualization (network diagram, matrix, arc diagram, etc.)
  • 18. Basic ideas Wikipedia: Glossary of graph theory Tools are easy, concepts are hard http://courses.polsys.net/gephi/
  • 19. Vertices and edges! Nodes and lines! Two main types: Directed (e.g. Twitter) Undirected (e.g. Facebook) Properties of nodes: degree, centrality, etc. Properties of edges: weight, direction, etc. Properties of the graph: averages, diameter, communities, etc. Basic ideas What is a graph? A B C D a-b b-d b-c c-d Nodes, Degree: A: 1, B: 3, C: 2, D: 2 Nodes, Weighted Degree: A: 1, B: 3, C: 3, D: 3 Edges, Weight: a-b: 1, b-c: 1, b-d: 1, c-d: 2 Graph, diameter: 2 Graph, density: 0.667 (4 edges out of 6) Graph, average shortest path: 1.334 Numbers are great for comparison!
  • 22. Basic ideas Interactive visual analytics Bringing structure to the surface (gephi panel: "layout") ☉ different spatializations (force, geometry, etc.) Projecting variables into the diagram (gephi panel: "ranking") ☉ Size (nodes, edges, labels, etc.) ☉ Color (nodes, edges, labels, etc.) Deriving measures (gephi panel: "statistics") ☉ Properties of nodes, edges, structure => new variables Analysis: e.g. correlation between spatial layout and variables?
  • 23. Layout algorithms transform n-dimensional adjacency matrices into two-dimensional diagrams
  • 24. Every algorithm/technique reveals the structure of the graph differently, shows different aspects
  • 25. Basic ideas Interactive visual analytics Bringing structure to the surface (gephi panel: "layout") ☉ different spatializations (force, geometry, etc.) Projecting variables into the diagram (gephi panel: "ranking") ☉ Size (nodes, edges, labels, etc.) ☉ Color (nodes, edges, labels, etc.) Analysis: e.g. “correlation” between spatial layout and variables?
  • 27. Nine measures of centrality (Freeman 1979)
  • 28. Basic ideas Interactive visual analytics Bringing structure to the surface (gephi panel: "layout") ☉ different spatializations (force, geometry, etc.) Projecting variables into the diagram (gephi panel: "ranking") ☉ Size (nodes, edges, labels, etc.) ☉ Color (nodes, edges, labels, etc.) Deriving measures (gephi panel: "statistics") ☉ Properties of nodes, edges, structure => new variables Analysis: e.g. “correlation” between spatial layout and variables?
  • 31. Label PR α=0.85 PR α=0.7 PR α=0.55 PR α=0.4 In-Degree Out-Degree Degree n34 0.0944 0.0743 0.0584 0.0460 4 1 5 n1 0.0867 0.0617 0.0450 0.0345 1 2 3 n17 0.0668 0.0521 0.0423 0.0355 2 1 3 n39 0.0663 0.0541 0.0453 0.0388 5 1 6 n22 0.0619 0.0506 0.0441 0.0393 5 1 6 n27 0.0591 0.0451 0.0371 0.0318 1 0 1 n38 0.0522 0.0561 0.0542 0.0486 6 0 6 n11 0.0492 0.0372 0.0306 0.0274 3 1 4
  • 32. FB group "Islam is dangerous" Friendship network, color: betweenness centrality 2.339 members Average degree of 39.69 81.7% have at least one friend in the group 55.4% five or more 37.2% have 20 or more founder and admin has 609 friends
  • 33. Twitter 1% sample, co-hashtag analysis 227,029 unique hashtags, 1627 displayed (freq >= 50) Size: frequency Color: modularity
  • 34. Size: frequency Color: user diversity Twitter 1% sample, co-hashtag analysis 227,029 unique hashtags, 1627 displayed (freq >= 50)
  • 35. Size: frequency Color: degree Twitter 1% sample, co-hashtag analysis 227,029 unique hashtags, 1627 displayed (freq >= 50)
  • 36. Network statistics betweenness centrality degree Relational elements of graphs can be represented as tables (nodes have properties) and analyzed through statistics. Network statistics bridge the gap between individual units and the structural forms they are embedded in. This is currently an extremely prolific field of research.
  • 37. Twitter 1% sample Co-hashtag analysis Degree vs. wordFrequency
  • 38. Degree vs. userDiversity Twitter 1% sample Co-hashtag analysis
  • 40. Co-like analysis of my personal FB network: Nodes: users / Links: "liking the same thing" Example 3: our imagination
  • 44. Thank You rieder@uva.nl https://www.digitalmethods.net http://thepoliticsofsystems.net "Far better an approximate answer to the right question, which is often vague, than an exact answer to the wrong question, which can always be made precise. Data analysis must progress by approximate answers, at best, since its knowledge of what the problem really is will at best be approximate." (Tukey 1962)

Editor's Notes

  • #3: Tukey, The Future of Data Analysis, 1962We don’t have rigorous methods for hypothesis testing in network analysis.
  • #4: Allows for all kinds of folding, combinations, etc. – Math is not homogeneous, but sprawling!Different forms of reasoning, different modes of aggregation.These are already analytical frameworks, different ways of formalizing.Statistics: atomism, structure is implicit ("hidden forces", "social forces" cf. Durhkeim) => groups are abstractions, constituted by socioeconomic similaritySocial Network Analysis: atomism, structure is explicit ("dyadic forces") => groups are concrete, constituted by social exchange
  • #9: Now we can calculate (in particular via matrix algebra).
  • #10: Handbooks on graph theory are full of exhaustive discussions of basic graph types. Loads of vocabulary and analytical approaches.
  • #13: Handbooks on graph theory are full of exhaustive discussions of basic graph types. Loads of vocabulary and analytical approaches.
  • #15: Very large scale systems on the one side, but highly concentrated data repositories on the other.The promise of data analysis is, of course, to use that data to make sense of all the complexity.
  • #24: Visualization is, again, one type of analysis.Which properties of the network are "made salient" by an algorithm?http://thepoliticsofsystems.net/2010/10/one-network-and-four-algorithms/Models behind: spring simulation, simulated annealing (http://wiki.cns.iu.edu/pages/viewpage.action?pageId=1704113)
  • #25: Non force-based layouts can be extremely useful. Gephi can produce those as well
  • #28: Network analysis has produced a large number of calculated metrics that take into account the structure of the network."All in all, this process resulted in the specification of nine centrality measures based on three conceptual foundations. Three are based on the degrees of points and are indexes of communication activity. Three are based on the betweenness of points and are indexes of potential for control of communication. And three are based on closeness and are indexes either of independence or efficiency." (Freeman 1979)What concepts are these metrics based on?
  • #32: Network metrics are highly dependent on individual variables. Here: the same network with PageRank with four different values for the dampening parameter alpha. (red=highest PR value, yellow=second highest, turquoise=third highest)See Rieder 2012: http://computationalculture.net/article/what_is_in_pagerank
  • #33: From DMI workshop on anti-Islamism and right-wing extremism.We can also look at interaction patters: activity structure, held together by leaders?
  • #34: Extend word lists (what am I missing?), account for refraction. Rieder & Gerlitz 2013: http://journal.media-culture.org.au/index.php/mcjournal/article/viewArticle/620Rieder 2012: http://firstmonday.org/ojs/index.php/fm/article/view/4199/3359
  • #35: Project variables into the graph User diversity = no of unique users of a hashtag divided by hashtag frequency
  • #36: Larger roles of hashtags, not all are issue markers!
  • #38: There is no need to analyze and visualize a graph as a network.Characterize hashtags in relation to a whole. (their role beyond a particular topic sample), better understand our "fishing pole" (the sample technique) and the weight it carries.Tbt: throwback thursday
  • #41: This is a technical process, but to be a method, there needs to be adequation between a conceptual element and a technical one.These steps translate a large number of commitments to particular ideas.A postdemographic (Rogers) approach.