Network analysis lecture

Network Analysis
Sara Terp, 2015

Network Analysis
• What is a network?
• What features does a network have?
• What analysis is possible with those features?
• How do we explain that analysis?

“Network”
“A group of interconnected people or things”
(Oxford English Dictionary)
Use networks to understand, use and explain
relationships

Infrastructure Networks
NPR: Visualising the US Power Grid
Transport for London: London Underground Map

Social Networks
(Sara’s Facebook friends, in Gephi)

Words
(Wise blogpost on word co-occurance matrices)

Network Analysis
Use networks to understand, use and explain
relationships

Network Features
C
D
A
B
E
F
G
Node
Edge
Directed
edge
Undirected
edge
Clique

Network Representations
• Diagram
• Adjacency matrix
[[ 0, 1, 1, 1, 0, 0, 0, 0, 0, 1],
[ 1, 0, 0, 1, 1, 0, 1, 0, 0, 1],
[ 1, 0, 0, 1, 0, 1, 0, 0, 0, 0],
[ 1, 1, 1, 0, 1, 1, 1, 0, 0, 0],
[ 0, 1, 0, 1, 0, 0, 1, 0, 0, 0],
[ 0, 0, 1, 1, 0, 0, 1, 0, 0, 0],
[ 0, 1, 0, 1, 1, 1, 0, 0, 0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 1, 0],
[ 0, 0, 0, 0, 0, 0, 0, 1, 0, 1],
[ 1, 1, 0, 0, 0, 0, 0, 0, 1, 0]]
• Adjacency list
{0: [1, 2, 3, 9], 1: [0, 9, 3, 4, 6], 2: [0, 3, 5], 3: [0, 1, 2, 4, 5, 6],
4: [1, 3, 6], 5: [2, 3, 6], 6: [1, 3, 4, 5], 7: [8], 8: [9, 7], 9: [8, 1, 0]}
• Edge list
{(0,1),(0,2),(0,3),(0,9),(1,3),(1,4),(1,6),(1,9),(2,3),(2,5),(3,4),
(3,5),(3,6),(4,6),(5,6),(7,8),(8,9)}
• Maths
G = (V,E,e)
3
6
0
1
5
7
2
98
4

The NetworkX Library
• Python network analysis library
import networkx as nx
edgelist =
{(0,1),(0,2),(0,3),(0,9),(1,3),(1,4),(1,6),(1,9),(2,3),(2,5),(3,4),(3,5),(3,6),(4,6),(5,6),(7
,8),(8,9)}
G = nx.Graph()
for edge in edgelist:
G.add_edge(edge[0], edge[1])

Node Centrality
• Finding the most “important”/“influential” nodes
• i.e. how “central” is a node to the network

Degree centrality: “who has
lots of friends?”
3
6
0
1
5
7
2
98
4
3 0.666
0 0.555
1 0.555
5 0.444
6 0.444
2 0.333
4 0.333
9 0.333
8 0.222
7 0.111
nx.degree_centrality(G)
= number of edges directly connected to n

Betweenness centrality: “who
are the bridges”?
3
6
0
1
5
7
2
98
4
9 0.38
0 0.23
1 0.23
8 0.22
3 0.10
5 0.02
6 0.02
2 0.00
4 0.00
7 0.00 nx.betweenness_centrality(G)
= (number of shortest paths including n / total
number of shortest paths) / number of pairs of
nodes

Closeness centrality: “who
are the hubs”?
3
6
0
1
5
7
2
98
4
0 0.64
1 0.64
3 0.60
9 0.60
5 0.52
6 0.52
2 0.50
4 0.50
8 0.42
7 0.31 nx.closeness_centrality(G)
= sum(distance to each other node) / (number of nodes-1)

Eigenvalue centrality “who
has most network influence”?
3
6
0
1
5
7
2
98
4
3 0.48
0 0.39
1 0.39
5 0.35
6 0.35
2 0.28
4 0.28
9 0.19
8 0.04
7 0.01
nx.eigenvector_centrality(G)

Network properties
• Characteristic path length: average shortest
distance between all pairs of nodes
• Clustering coefficient: how likely a network is to
contain highly-connected groups
• Degree distribution: histogram of node degrees

Community Detection
“Are there groups in this network?”
“What can I do with that information?”

Disconnected Networks
• Not all nodes are connected to each other
• Connected component = every node in the
component can be reached from every other node
• Giant component = connected component that
covers most of the network

Cliques and K-Cores
nx.find_cliques(G)
nx.k_clique_communities(G, 3)
3
6
0
1
5
7
2
98
4
3-cores: [[0,2,3,5], [1,3,4,6]]
2-core: [0,1,2,3,4,5,6,9]
4-cliques: [[0,2,3,5],[1,3,4,6]]
3-cliques: [[0,1,3],[0,1,9]]
2-cliques: [[7,8],[8,9]]

Other Clique methods
• N-clique: every node in the clique is connected to all
other nodes by a path of length n or less
• P-clique: each node is connected to at least p% of
the other nodes in the group.

Network Effects
Predict how information or states (e.g. political opinion
or rumours) are most likely to move across a network

Diffusion (Simple contagion)
3
6
0
1
5
7
2
98
4

Complex contagion
3
6
0
1
5
7
2
98
4

Describing Networks
bl.ocks.org/mbostock/4062045
http://bost.ocks.org/mike/uberdata/
http://bl.ocks.org/mbostock/7607999
Network diagram Edge bundling

Network Analysis Tools
• Python libraries:
• NetworkX
• iGraph
• graph-tool
• Matplotlib (visualisation)
• Pygraphviz (visualisation)
• Mayavi (3d visualisation)
Longer list: http://en.wikipedia.org/wiki/Social_network_analysis_software
• Standalone tools:
• SNAP
• GUESS
• NetMiner (free for students)
• Gephi (visualisation)
• GraphViz (visualisation)
• NodeXL (excel add-on)

Network analysis lecture

More Related Content

What's hot (20)

Similar to Network analysis lecture (20)

More from Sara-Jayne Terp (20)

Recently uploaded (20)

Network analysis lecture

Editor's Notes