SlideShare a Scribd company logo
Complex Systems Models
in the Social Sciences
(Lecture 3)
daniel martin katz
illinois institute of technology
chicago kent college of law
@computationaldanielmartinkatz.com computationallegalstudies.com
Back to Where We Ended
Our Last Class
Stanley Milgram’s
Other Experiment
Milgram was interested in the
structure of society
Including the social distance
between individuals
While the term “six degrees” is often
attributed to milgram it can be traced to ideas
from hungarian author Frigyes Karinthy
What is the average distance
between two individuals in
society?
Stanley Milgram’s
Other Experiment
NE
MA
Six Degrees of Separation?
NE
MA
Target person worked in Boston as a stockbroker
296 senders from Boston and Omaha.
20% of senders reached target.
Average chain length = 6.5.
And So the term ...
“Six degrees of Separation”
Six Degrees
Six Degrees is a claim that “average path
length” between two individuals in society
is ~ 6
The idea of ‘Six Degrees’ Popularized
through plays/movies and the kevin bacon
game
http://oracleofbacon.org/
Six Degrees of Kevin Bacon
Visualization Source: Duncan J. Watts, Six Degrees
Six Degrees of Kevin Bacon
But What is Wrong
with Milgram’s Logic?
150(150) = 22,500
150 3 = 3,375,000
150 4 = 506,250,000
150 5= 75,937,500,000
The Strength of ‘Weak’ Ties
Does Milgram get
it right? (Mark Granovetter)
Visualization Source: Early Friendster – MIT Network
www.visualcomplexity.com
Strong and Weak Ties
(Clustered
v.
Spanning)
Clustering ----
My Friends’ Friends
are also likely to
be friends
So Was Milgram Correct?
Small Worlds (i.e. Six Degrees) was a theoretical
and an empirical Claim
The Theoretical Account Was Incorrect
The Empirical Claim was still intact
Query as to how could real social networks
display both small worlds and clustering?
At the Same time, the Strength of Weak Ties was
also an Theoretical and Empirical proposition
Watts and Strogatz (1998)
A few random links in an otherwise clustered
graph yields the types of small world
properties found by Milgram
“Randomness” is key bridge between the small
world result and the clustering that is
commonly observed in real social networks
Watts and Strogatz (1998)
A Small Amount of Random Rewiring or
Something akin to Weak Ties—Allows for
Clustering and Small Worlds
Random Graphlocally Clustered
Different Form of
Network Representation
1 mode
2 mode
2 mode
Actors
and
Movies
Different Forms of
Network Representation
1 mode
Actor to Actor
Could be Binary
(0,1)
Did they
Co-Appear?
Different Forms of
Network Representation
Different Forms of
Network Representation
1 mode
Actor to Actor
Could also be
Weighted
(I.E. Edge Weights by
Number of
Co-Appearences)
Features of Networks
Mesoscopic Community Structures
We will discuss these next week
Macroscopic Graph Level Properties
We will discuss these today
Microscopic Node Level Properties
We will discuss these Next week
Macroscopic Graph
Level Properties
Degree Distributions (Outdegree & Indegree)
Clustering Coefficients
Connected Components
Shortest Paths
Density
Shortest Paths
Shortest Paths
The shortest set of links
connecting two nodes
Also, known as the geodesic path
In many graphs, there are multiple
shortest paths
Shortest Paths
Shortest Paths
A and C are connected by
2 shortest paths
A – E – B - C
A – E – D - C
Diameter: the largest geodesic distance
in the graph
The distance between A and C is
the maximum for the graph: 4
Shortest Paths
I n t h e W a t t s - S t r o g a t z M o d e l
Shortest Paths are reduced by
increasing levels of random rewiring
Clustering Coefficients
Clustering Coefficients
Measure of the tendency of nodes
in a graph to cluster
Both a graph level average for
clustering
Also, a local version which is
interested in cliqueness of a graph
Density
Density = Of the connections
that could exist between n nodes
directed graph: emax = n*(n-1)
(each of the n nodes can connect to (n-1) other nodes)
undirected graph emax = n*(n-1)/2

(since edges are undirected, count each one only once)
What Fraction are Present?
Density
What fraction are present?
density = e / emax
For example, out of 12

possible connections..
this graph
this graph has 7,
giving it a density of 

7/12 = 0.58
A “fully connected graph has a density =1
Connected Components
We are often interested in whether
the graph has a single or multiple
connected components
Strong Components
Giant Component
Weak Components
“Largest Weakly Connected Component” in the
SCOTUS Citation Network
There exist cases that are not in this visual as
they are disconnected as of the year 1830
However, by 2009, 99% of SCOTUS Decisions are
in the Largest Weakly Connected Component
Connected Components
Open “Giant Component” from
the netlogo models Library
Connected Components
Notice the
fraction of
nodes in the
giant component
Notice the Size of
the “Giant
Component”
Model has
been
advanced
25+ Ticks
Connected Components
Model has
been
advanced
80+ Ticks
Notice the
fraction of
nodes in the
giant component
Notice the Size of
the “Giant
Component”
Connected Components
Model has
been
advanced
120+ Ticks
Notice the
fraction of
nodes in the
giant component
Notice the Size of the
“Giant Component”
now = “num-nodes”
in the slider
Degree Distributions
outdegree

how many directed edges (arcs)
originate at a node
indegree

how many directed edges (arcs) are
incident on a node
degree (in or out)

number of edges incident on a node
Indegree=3
Outdegree=2
Degree=5
Node Degree
from
Matrix Values
Outdegree:
outdegree for node 3 = 2,
which we obtain by summing
the number of non-zero
entries in the 3rd row
Indegree:
indegree for node 3 = 1,
which we obtain by summing
the number of non-zero
entries in the 3rd column
Degree Distributions
These are Degree Count for particular nodes
but we are also interested in the distribution
of arcs (or edges) across all nodes
These Distributions are called “degree
distributions”
Degree distribution: A frequency count of
the occurrence of each degree
Degree Distributions
Imagine we have this 8 node network:
In-degree sequence:
[2, 2, 2, 1, 1, 1, 1, 0]
Out-degree sequence:
[2, 2, 2, 2, 1, 1, 1, 0]
(undirected) degree sequence:
[3, 3, 3, 2, 2, 1, 1, 1]
Degree Distributions
Imagine we have this 8 node network:
In-degree distribution:
[(2,3) (1,4) (0,1)]
Out-degree distribution:
[(2,4) (1,3) (0,1)]
(undirected) distribution:
[(3,3) (2,2) (1,3)]
Why are Degree
Distributions Useful?
They are the signature of a dynamic process
We will discuss in greater detail tomorrow
Consider several canonical network models
Canonical Network Models
Erdős-Renyi
Random Network
Highly Clustered
Network
Watts-Strogatz
Small World Network
Barabási-Albert
Preferential
Attachment Network
Why are Degree
Distributions Useful?
Barabási-Albert
Preferential
Attachment Network
Power Law /
Scale Free
Networks
Barabási-Albert
Preferential Attachment
Netlogo Models Library --> Networks --> Preferential Attachment
Watch the Changing
Degree Distribution
Barabási-Albert
Preferential Attachment
Netlogo Models Library --> Networks --> Preferential Attachment
Barabási-Albert
Preferential Attachment
Netlogo Models Library --> Networks --> Preferential Attachment
Barabási-Albert
Preferential Attachment
Netlogo Models Library --> Networks --> Preferential Attachment
Barabási-Albert
Preferential Attachment
Netlogo Models Library --> Networks --> Preferential Attachment
Barabási-Albert
Preferential Attachment
Netlogo Models Library --> Networks --> Preferential Attachment
Readings on Power law /
Scale free Networks
Check out Lada Adamic’s Power Law Tutorial
Describes distinctions between the Zipf,
Power-law and Pareto distribution
http://www.hpl.hp.com/research/idl/papers/ranking/ranking.html
This is the original paper that gave rise to
all of the other power law networks papers:
A.-L. Barabási & R. Albert, Emergence of scaling in random
networks, Science 286, 509–512 (1999)
Power Laws Seem
to be Everywhere
Power Laws Seem
to be Everywhere
How Do I Know Something
is Actually a Power Law?
Clauset, Shalizi & Newman
http://arxiv.org/abs/0706.1062
argues for the use of MLE
instead of linear regression
Demonstrates that a number
of prior papers mistakenly
called their distribution a
power law
Here is why you should use
Maximum Likelihood Estimation
(MLE) instead of linear
regression
You recover the power law
when its present
Notice spread between the
Yellow and red lines
Back to the Random Graph
Models for a Moment
Poisson distribution
Erdos-Renyi is the default random
graph model:
randomly draw E edges
between N nodes
There are no hubs in the network
Rather, there exists a narrow
distribution of connectivities
Back to the Random Graph
Models for a Moment
let there be n people
p is the probability that any two of them are ‘friends’
Binomial Poisson Normal
limit p small Limit large n
Random
Graphs
Power Law
networks
Generating Power Law
Distributed Networks
Pseudocode for the growing power law networks:
Start with small number of nodes
add new vertices one by one
each new edge connects to an existing vertex in
proportion to the number of edges that vertex
already displays (i.e. preferentially attach)
Growing Power Law
Distributed Networks
The previous pseudocode is not a unique solution
A variety of other growth dynamics are possible
In the simple case this is a system that extremely
“sensitive to initial conditions”
upstarts who garner early advantage are able to
extend their relative advantage in later periods
for example, imagine you receive a higher interest
rate the more money you have “rich get richer”

More Related Content

PDF
ICPSR - Complex Systems Models in the Social Sciences - Lecture 2 - Professor...
PDF
ICPSR - Complex Systems Models in the Social Sciences - Lecture 6 - Professor...
PDF
ICPSR - Complex Systems Models in the Social Sciences - Lecture 1 - Professor...
PDF
ICPSR - Complex Systems Models in the Social Sciences - Lecture 4 - Professor...
PDF
ICPSR - Complex Systems Models in the Social Sciences - Lecture 8 and 9 - Pro...
PDF
Network Analysis and Law: Introductory Tutorial @ Jurix 2011 Meeting (Vienna)
PPTX
10 More than a Pretty Picture: Visual Thinking in Network Studies
PPTX
11 Keynote (2017)
ICPSR - Complex Systems Models in the Social Sciences - Lecture 2 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 6 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 1 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 4 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 8 and 9 - Pro...
Network Analysis and Law: Introductory Tutorial @ Jurix 2011 Meeting (Vienna)
10 More than a Pretty Picture: Visual Thinking in Network Studies
11 Keynote (2017)

What's hot (20)

PPTX
13 Community Detection
PPTX
15 Network Visualization and Communities
PDF
Sinks Method Paper Presentation @ Duke Political Networks Conference 2010
PPT
01 Introduction to Networks Methods and Measures
PPTX
01 Network Data Collection (2017)
PPTX
03 Communities in Networks (2017)
PPT
How to conduct a social network analysis: A tool for empowering teams and wor...
PDF
Legal Analytics Course - Class 11 - Network Analysis and Law - Professors Dan...
PPTX
04 Network Data Collection
PPT
The Basics of Social Network Analysis
PPTX
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
PPTX
02 Network Data Collection
PPTX
Social Network Analysis Introduction including Data Structure Graph overview.
PPTX
02 Descriptive Statistics (2017)
PDF
Social Network Analysis
PPTX
04 Data Visualization (2017)
PPTX
04 Diffusion and Peer Influence
PPTX
07 Network Visualization
PDF
05 Communities in Network
PPTX
11 Network Experiments and Interventions
13 Community Detection
15 Network Visualization and Communities
Sinks Method Paper Presentation @ Duke Political Networks Conference 2010
01 Introduction to Networks Methods and Measures
01 Network Data Collection (2017)
03 Communities in Networks (2017)
How to conduct a social network analysis: A tool for empowering teams and wor...
Legal Analytics Course - Class 11 - Network Analysis and Law - Professors Dan...
04 Network Data Collection
The Basics of Social Network Analysis
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
02 Network Data Collection
Social Network Analysis Introduction including Data Structure Graph overview.
02 Descriptive Statistics (2017)
Social Network Analysis
04 Data Visualization (2017)
04 Diffusion and Peer Influence
07 Network Visualization
05 Communities in Network
11 Network Experiments and Interventions
Ad

Viewers also liked (20)

PDF
ICPSR - Complex Systems Models in the Social Sciences - Bonus Content - Profe...
PDF
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 3 - Profe...
PDF
Complex Systems Computing - Webscraping - Bonus Module
PDF
ICPSR - Complex Systems Models in the Social Sciences - Lecture 7 - Professor...
PDF
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 6 - Profe...
PDF
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 5 - Profe...
PDF
ICPSR - Complex Systems Models in the Social Sciences - Lecture 6 - Professor...
PDF
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 9 - Profe...
PDF
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 4 - Profe...
PDF
ICPSR - Complex Systems Models in the Social Sciences - 2013 - Professor Dani...
PDF
Presentation @ 24th International Conference on Legal Knowledge and Informati...
PDF
What is Computational Legal Studies? Presentation @ University of Houston - ...
DOCX
Encuesta informatica
PPTX
Tobacco control in china progress barriers and challenges
PPTX
Using legal challenges, Kirstie Douse
ODT
Empresa de jehimi salva
PDF
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 7, 8 - Pr...
PDF
Go Forth And Code
PDF
Technology, Data and Computation Session @ The World Bank - Law, Justice, and...
ICPSR - Complex Systems Models in the Social Sciences - Bonus Content - Profe...
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 3 - Profe...
Complex Systems Computing - Webscraping - Bonus Module
ICPSR - Complex Systems Models in the Social Sciences - Lecture 7 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 6 - Profe...
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 5 - Profe...
ICPSR - Complex Systems Models in the Social Sciences - Lecture 6 - Professor...
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 9 - Profe...
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 4 - Profe...
ICPSR - Complex Systems Models in the Social Sciences - 2013 - Professor Dani...
Presentation @ 24th International Conference on Legal Knowledge and Informati...
What is Computational Legal Studies? Presentation @ University of Houston - ...
Encuesta informatica
Tobacco control in china progress barriers and challenges
Using legal challenges, Kirstie Douse
Empresa de jehimi salva
ICPSR - Complex Systems Models in the Social Sciences - Lab Session 7, 8 - Pr...
Go Forth And Code
Technology, Data and Computation Session @ The World Bank - Law, Justice, and...
Ad

Similar to ICPSR - Complex Systems Models in the Social Sciences - Lecture 3 - Professor Daniel Martin Katz (20)

PDF
An Introduction to Networks
PPT
Socialnetworkanalysis (Tin180 Com)
PPTX
Slides BARABASI BARA BASA Barabasi.ppt.pptx
PDF
Random graph models
PPT
Microsoft Research, India Social Networks And Their Applications To Web (Ti...
PPTX
Network Measures: Characterizing networks
PPT
It’s a “small world” after all
PDF
Complexity Play&Learn
PDF
Lecture 5 - Qunatifying a Network.pdf
PDF
graph_theory_1-11.pdf___________________
PPT
mathematics of network science: basic definitions
PPTX
Networks
PPT
Tn 110 lecture 8
PPTX
Network properties
PDF
Graph Evolution Models
PDF
SN- Lecture 8
PDF
Link Analysis in Networks - or - Finding the Terrorists
PDF
Minicourse on Network Science
PDF
Exploratory social network analysis with pajek
PPTX
8.-Graphs information technologies graph
An Introduction to Networks
Socialnetworkanalysis (Tin180 Com)
Slides BARABASI BARA BASA Barabasi.ppt.pptx
Random graph models
Microsoft Research, India Social Networks And Their Applications To Web (Ti...
Network Measures: Characterizing networks
It’s a “small world” after all
Complexity Play&Learn
Lecture 5 - Qunatifying a Network.pdf
graph_theory_1-11.pdf___________________
mathematics of network science: basic definitions
Networks
Tn 110 lecture 8
Network properties
Graph Evolution Models
SN- Lecture 8
Link Analysis in Networks - or - Finding the Terrorists
Minicourse on Network Science
Exploratory social network analysis with pajek
8.-Graphs information technologies graph

More from Daniel Katz (20)

PDF
Legal Analytics versus Empirical Legal Studies - or - Causal Inference vs Pre...
PDF
Can Law Librarians Help Law Become More Data Driven ? An Open Question in Ne...
DOCX
Why We Are Open Sourcing ContraxSuite and Some Thoughts About Legal Tech and ...
PDF
Fin (Legal) Tech – Law’s Future from Finance’s Past (Some Thoughts About the ...
PDF
Exploring the Physical Properties of Regulatory Ecosystems - Professors Danie...
PDF
Law + Complexity & Prediction: Toward a Characterization of Legal Systems as ...
PDF
Building Your Personal (Legal) Brand - Some Thoughts for Law Students and Oth...
PDF
Measure Twice, Cut Once - Solving the Legal Profession Biggest Challenges Tog...
PDF
Artificial Intelligence and Law - 
A Primer
PDF
Machine Learning as a Service: #MLaaS, Open Source and the Future of (Legal) ...
PDF
LexPredict - Empowering the Future of Legal Decision Making
PDF
{Law, Tech, Design, Delivery} Observations Regarding Innovation in the Legal ...
PDF
Legal Analytics Course - Class 12 - Data Preprocessing using dPlyR - Professo...
PDF
Legal Analytics Course - Class 10 - Information Visualization + DataViz in R ...
PDF
Legal Analytics Course - Class #4 - Github and RMarkdown Tutorial - Professor...
PDF
Legal Analytics Course - Class 9 - Clustering Algorithms (K-Means & Hierarch...
PDF
Legal Analytics Course - Class 8 - Introduction to Random Forests and Ensembl...
PDF
Legal Analytics Course - Class 7 - Binary Classification with Decision Tree L...
PDF
Legal Analytics Course - Class 6 - Overfitting, Underfitting, & Cross-Validat...
PDF
Legal Analytics Course - Class 5 - Quantitative Legal Prediction + Data Drive...
Legal Analytics versus Empirical Legal Studies - or - Causal Inference vs Pre...
Can Law Librarians Help Law Become More Data Driven ? An Open Question in Ne...
Why We Are Open Sourcing ContraxSuite and Some Thoughts About Legal Tech and ...
Fin (Legal) Tech – Law’s Future from Finance’s Past (Some Thoughts About the ...
Exploring the Physical Properties of Regulatory Ecosystems - Professors Danie...
Law + Complexity & Prediction: Toward a Characterization of Legal Systems as ...
Building Your Personal (Legal) Brand - Some Thoughts for Law Students and Oth...
Measure Twice, Cut Once - Solving the Legal Profession Biggest Challenges Tog...
Artificial Intelligence and Law - 
A Primer
Machine Learning as a Service: #MLaaS, Open Source and the Future of (Legal) ...
LexPredict - Empowering the Future of Legal Decision Making
{Law, Tech, Design, Delivery} Observations Regarding Innovation in the Legal ...
Legal Analytics Course - Class 12 - Data Preprocessing using dPlyR - Professo...
Legal Analytics Course - Class 10 - Information Visualization + DataViz in R ...
Legal Analytics Course - Class #4 - Github and RMarkdown Tutorial - Professor...
Legal Analytics Course - Class 9 - Clustering Algorithms (K-Means & Hierarch...
Legal Analytics Course - Class 8 - Introduction to Random Forests and Ensembl...
Legal Analytics Course - Class 7 - Binary Classification with Decision Tree L...
Legal Analytics Course - Class 6 - Overfitting, Underfitting, & Cross-Validat...
Legal Analytics Course - Class 5 - Quantitative Legal Prediction + Data Drive...

Recently uploaded (20)

PDF
O7-L3 Supply Chain Operations - ICLT Program
PPTX
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
Anesthesia in Laparoscopic Surgery in India
PPTX
Cell Structure & Organelles in detailed.
PDF
Basic Mud Logging Guide for educational purpose
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
01-Introduction-to-Information-Management.pdf
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
Pre independence Education in Inndia.pdf
O7-L3 Supply Chain Operations - ICLT Program
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
Module 4: Burden of Disease Tutorial Slides S2 2025
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Final Presentation General Medicine 03-08-2024.pptx
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
2.FourierTransform-ShortQuestionswithAnswers.pdf
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPH.pptx obstetrics and gynecology in nursing
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Anesthesia in Laparoscopic Surgery in India
Cell Structure & Organelles in detailed.
Basic Mud Logging Guide for educational purpose
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
Microbial disease of the cardiovascular and lymphatic systems
01-Introduction-to-Information-Management.pdf
STATICS OF THE RIGID BODIES Hibbelers.pdf
Pre independence Education in Inndia.pdf

ICPSR - Complex Systems Models in the Social Sciences - Lecture 3 - Professor Daniel Martin Katz

  • 1. Complex Systems Models in the Social Sciences (Lecture 3) daniel martin katz illinois institute of technology chicago kent college of law @computationaldanielmartinkatz.com computationallegalstudies.com
  • 2. Back to Where We Ended Our Last Class
  • 3. Stanley Milgram’s Other Experiment Milgram was interested in the structure of society Including the social distance between individuals While the term “six degrees” is often attributed to milgram it can be traced to ideas from hungarian author Frigyes Karinthy What is the average distance between two individuals in society?
  • 5. Six Degrees of Separation? NE MA Target person worked in Boston as a stockbroker 296 senders from Boston and Omaha. 20% of senders reached target. Average chain length = 6.5. And So the term ... “Six degrees of Separation”
  • 6. Six Degrees Six Degrees is a claim that “average path length” between two individuals in society is ~ 6 The idea of ‘Six Degrees’ Popularized through plays/movies and the kevin bacon game http://oracleofbacon.org/
  • 7. Six Degrees of Kevin Bacon
  • 8. Visualization Source: Duncan J. Watts, Six Degrees Six Degrees of Kevin Bacon
  • 9. But What is Wrong with Milgram’s Logic? 150(150) = 22,500 150 3 = 3,375,000 150 4 = 506,250,000 150 5= 75,937,500,000
  • 10. The Strength of ‘Weak’ Ties Does Milgram get it right? (Mark Granovetter) Visualization Source: Early Friendster – MIT Network www.visualcomplexity.com Strong and Weak Ties (Clustered v. Spanning) Clustering ---- My Friends’ Friends are also likely to be friends
  • 11. So Was Milgram Correct? Small Worlds (i.e. Six Degrees) was a theoretical and an empirical Claim The Theoretical Account Was Incorrect The Empirical Claim was still intact Query as to how could real social networks display both small worlds and clustering? At the Same time, the Strength of Weak Ties was also an Theoretical and Empirical proposition
  • 12. Watts and Strogatz (1998) A few random links in an otherwise clustered graph yields the types of small world properties found by Milgram “Randomness” is key bridge between the small world result and the clustering that is commonly observed in real social networks
  • 13. Watts and Strogatz (1998) A Small Amount of Random Rewiring or Something akin to Weak Ties—Allows for Clustering and Small Worlds Random Graphlocally Clustered
  • 14. Different Form of Network Representation 1 mode 2 mode
  • 15. 2 mode Actors and Movies Different Forms of Network Representation
  • 16. 1 mode Actor to Actor Could be Binary (0,1) Did they Co-Appear? Different Forms of Network Representation
  • 17. Different Forms of Network Representation 1 mode Actor to Actor Could also be Weighted (I.E. Edge Weights by Number of Co-Appearences)
  • 18. Features of Networks Mesoscopic Community Structures We will discuss these next week Macroscopic Graph Level Properties We will discuss these today Microscopic Node Level Properties We will discuss these Next week
  • 19. Macroscopic Graph Level Properties Degree Distributions (Outdegree & Indegree) Clustering Coefficients Connected Components Shortest Paths Density
  • 20. Shortest Paths Shortest Paths The shortest set of links connecting two nodes Also, known as the geodesic path In many graphs, there are multiple shortest paths
  • 21. Shortest Paths Shortest Paths A and C are connected by 2 shortest paths A – E – B - C A – E – D - C Diameter: the largest geodesic distance in the graph The distance between A and C is the maximum for the graph: 4
  • 22. Shortest Paths I n t h e W a t t s - S t r o g a t z M o d e l Shortest Paths are reduced by increasing levels of random rewiring
  • 23. Clustering Coefficients Clustering Coefficients Measure of the tendency of nodes in a graph to cluster Both a graph level average for clustering Also, a local version which is interested in cliqueness of a graph
  • 24. Density Density = Of the connections that could exist between n nodes directed graph: emax = n*(n-1) (each of the n nodes can connect to (n-1) other nodes) undirected graph emax = n*(n-1)/2
 (since edges are undirected, count each one only once) What Fraction are Present?
  • 25. Density What fraction are present? density = e / emax For example, out of 12
 possible connections.. this graph this graph has 7, giving it a density of 
 7/12 = 0.58 A “fully connected graph has a density =1
  • 26. Connected Components We are often interested in whether the graph has a single or multiple connected components Strong Components Giant Component Weak Components
  • 27. “Largest Weakly Connected Component” in the SCOTUS Citation Network There exist cases that are not in this visual as they are disconnected as of the year 1830 However, by 2009, 99% of SCOTUS Decisions are in the Largest Weakly Connected Component
  • 28. Connected Components Open “Giant Component” from the netlogo models Library
  • 29. Connected Components Notice the fraction of nodes in the giant component Notice the Size of the “Giant Component” Model has been advanced 25+ Ticks
  • 30. Connected Components Model has been advanced 80+ Ticks Notice the fraction of nodes in the giant component Notice the Size of the “Giant Component”
  • 31. Connected Components Model has been advanced 120+ Ticks Notice the fraction of nodes in the giant component Notice the Size of the “Giant Component” now = “num-nodes” in the slider
  • 32. Degree Distributions outdegree
 how many directed edges (arcs) originate at a node indegree
 how many directed edges (arcs) are incident on a node degree (in or out)
 number of edges incident on a node Indegree=3 Outdegree=2 Degree=5
  • 33. Node Degree from Matrix Values Outdegree: outdegree for node 3 = 2, which we obtain by summing the number of non-zero entries in the 3rd row Indegree: indegree for node 3 = 1, which we obtain by summing the number of non-zero entries in the 3rd column
  • 34. Degree Distributions These are Degree Count for particular nodes but we are also interested in the distribution of arcs (or edges) across all nodes These Distributions are called “degree distributions” Degree distribution: A frequency count of the occurrence of each degree
  • 35. Degree Distributions Imagine we have this 8 node network: In-degree sequence: [2, 2, 2, 1, 1, 1, 1, 0] Out-degree sequence: [2, 2, 2, 2, 1, 1, 1, 0] (undirected) degree sequence: [3, 3, 3, 2, 2, 1, 1, 1]
  • 36. Degree Distributions Imagine we have this 8 node network: In-degree distribution: [(2,3) (1,4) (0,1)] Out-degree distribution: [(2,4) (1,3) (0,1)] (undirected) distribution: [(3,3) (2,2) (1,3)]
  • 37. Why are Degree Distributions Useful? They are the signature of a dynamic process We will discuss in greater detail tomorrow Consider several canonical network models
  • 38. Canonical Network Models Erdős-Renyi Random Network Highly Clustered Network Watts-Strogatz Small World Network Barabási-Albert Preferential Attachment Network
  • 39. Why are Degree Distributions Useful? Barabási-Albert Preferential Attachment Network
  • 40. Power Law / Scale Free Networks
  • 41. Barabási-Albert Preferential Attachment Netlogo Models Library --> Networks --> Preferential Attachment Watch the Changing Degree Distribution
  • 42. Barabási-Albert Preferential Attachment Netlogo Models Library --> Networks --> Preferential Attachment
  • 43. Barabási-Albert Preferential Attachment Netlogo Models Library --> Networks --> Preferential Attachment
  • 44. Barabási-Albert Preferential Attachment Netlogo Models Library --> Networks --> Preferential Attachment
  • 45. Barabási-Albert Preferential Attachment Netlogo Models Library --> Networks --> Preferential Attachment
  • 46. Barabási-Albert Preferential Attachment Netlogo Models Library --> Networks --> Preferential Attachment
  • 47. Readings on Power law / Scale free Networks Check out Lada Adamic’s Power Law Tutorial Describes distinctions between the Zipf, Power-law and Pareto distribution http://www.hpl.hp.com/research/idl/papers/ranking/ranking.html This is the original paper that gave rise to all of the other power law networks papers: A.-L. Barabási & R. Albert, Emergence of scaling in random networks, Science 286, 509–512 (1999)
  • 48. Power Laws Seem to be Everywhere
  • 49. Power Laws Seem to be Everywhere
  • 50. How Do I Know Something is Actually a Power Law?
  • 51. Clauset, Shalizi & Newman http://arxiv.org/abs/0706.1062 argues for the use of MLE instead of linear regression Demonstrates that a number of prior papers mistakenly called their distribution a power law Here is why you should use Maximum Likelihood Estimation (MLE) instead of linear regression You recover the power law when its present Notice spread between the Yellow and red lines
  • 52. Back to the Random Graph Models for a Moment Poisson distribution Erdos-Renyi is the default random graph model: randomly draw E edges between N nodes There are no hubs in the network Rather, there exists a narrow distribution of connectivities
  • 53. Back to the Random Graph Models for a Moment let there be n people p is the probability that any two of them are ‘friends’ Binomial Poisson Normal limit p small Limit large n
  • 55. Generating Power Law Distributed Networks Pseudocode for the growing power law networks: Start with small number of nodes add new vertices one by one each new edge connects to an existing vertex in proportion to the number of edges that vertex already displays (i.e. preferentially attach)
  • 56. Growing Power Law Distributed Networks The previous pseudocode is not a unique solution A variety of other growth dynamics are possible In the simple case this is a system that extremely “sensitive to initial conditions” upstarts who garner early advantage are able to extend their relative advantage in later periods for example, imagine you receive a higher interest rate the more money you have “rich get richer”