SlideShare a Scribd company logo
NETWORK ANALYSIS:
PEOPLE AND OPEN
SOURCE COMMUNITIES
Dawn M. Foster
@geekygirldawn	
  
dawn@dawnfoster.com	
  
fastwonderblog.com
PhD	
  Student	
  
University	
  of	
  Greenwich	
  
London,	
  UK
WHOAMI
• Geek, traveler, reader
• 20 year tech career. Past 15
years doing community &
open source (Intel, Jive,
Puppet Labs, etc.)
• PhD student at University of
Greenwich researching Linux
kernel
Photos by Josh Bancroft, Don Park
WHAT IS NETWORK
ANALYSIS?
Studies relationships
between units and looks for
patterns and structure in
those relationships
Image from ANAMIA Project
AGENDA AND INFO
• Gathering your data
• Data manipulation for
network analysis
• Visualization
• What else can you do?
Image from a Northern Marina Islands Network
Scripts, Data, and More:

github.com/geekygirldawn/oscon_2015
I 💖 METRICS GRIMOIRE
MailingListStats aka MLStats
CVSAnalY - repos
Bicho - bugs
More
Photo by Bitergia
http://metricsgrimoire.github.io/
MLSTATS
a) Install mlstats
$ python setup.py install
b) Create database
mysql> create database mlstats;
c) Import data by running mlstats
$ mlstats --db-user=USERNAME --db-password=PASS http://URLOFYOURLIST
EXTRACT DATA
SELECT mp.email_address AS sender,

(SELECT mp2.email_address FROM 

messages m2, messages_people mp2 WHERE
m2.is_response_of=m.is_response_of 

AND mp2.message_id=m2.is_response_of limit 1)
AS receiver FROM messages_people mp, messages m
WHERE YEAR(m.first_date)=2015 AND
MONTH(m.first_date)=1 AND
mp.message_id=m.message_id;
people
sending emails
subquery: who
they replied to
limittime
formanageable
data
Output:
sender@example.com in_reply_to@example.com
sender1@example.com in_reply_to1@example.com
sender2@example.com in_reply_to2@example.com
...
EXTRACT DATA: SCRIPTS
Reformat / clean up data
Reproducible
Reduce human error
oscon.py script
Image from Mark Grealish
github.com/geekygirldawn/oscon_2015
R / VISONE / GOURCE
Convert data for better use
with network analysis
Visualize data using

RStudio, Visone, and Gource
Image from WebOps.com
WHAT ELSE?
So many visualization tools
Python network packages
Network analysis is more
than just pretty pictures!
Dawn Foster
University of Greenwich
Centre for Business Network Analysis
www2.gre.ac.uk/about/faculty/business/research/centres/cbna/home
@geekygirldawn, dawn@dawnfoster.com
fastwonderblog.com
THANK YOU

More Related Content

PDF
Network Analysis: Tech Evangelism London Meetup
PDF
Network Relationships and Job Changes of Software Developers at Sunbelt 2016
PDF
Network Analysis: People and Open Source Communities - LinuxCon Seattle and D...
PDF
Open source: A Job and an Adventure - FLOSSUK 2016
PPT
Semantic Wiki Based Collaborative Scientific Modeling Infrastructure
PPTX
Commit yourself already: Contributing to open source
PPTX
Predicting Discussions on the Social Semantic Web
PPT
Webometrics 2.0: Blogometrics, Wikimetrics, Tagometrics, and Sociometrics Rev...
Network Analysis: Tech Evangelism London Meetup
Network Relationships and Job Changes of Software Developers at Sunbelt 2016
Network Analysis: People and Open Source Communities - LinuxCon Seattle and D...
Open source: A Job and an Adventure - FLOSSUK 2016
Semantic Wiki Based Collaborative Scientific Modeling Infrastructure
Commit yourself already: Contributing to open source
Predicting Discussions on the Social Semantic Web
Webometrics 2.0: Blogometrics, Wikimetrics, Tagometrics, and Sociometrics Rev...

What's hot (20)

PDF
The Social Semantic Web
PPTX
The end of the scientific paper as we know it (in 4 easy steps)
PPTX
Linked Data past, present and futures
PDF
An Automated Snowball Census of the Political Web - JITP 2011
PPTX
The end of the scientific paper as we know it (or not...)
PPT
Webometrics 1.0 from AltaVista to Small Worlds and Genre Drift
PPTX
Semantic Web questions we couldn't ask 10 years ago
PDF
Charleston Conference 2013 - 13_1107
PPTX
Mining the Go Developer Community
PPT
Broad Data
PPT
Sharing ideas and sharing data: Researchers and Web 2.0
PDF
Designing a second generation of open data platforms
PDF
Osw Digital Humanities
PDF
BCcampus a-great-babbling-bazaar
 
PPTX
The Web of Data: do we actually understand what we built?
PPTX
Dan Needham & Phil Cross (mimas) – Names Project
PDF
Franck Rebillard, Professeur Université Paris 3
PDF
Digital Methods by Richard Rogers
PDF
Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...
PDF
Rogers studyingpoliticalissues mar2014_optimized_ii_
The Social Semantic Web
The end of the scientific paper as we know it (in 4 easy steps)
Linked Data past, present and futures
An Automated Snowball Census of the Political Web - JITP 2011
The end of the scientific paper as we know it (or not...)
Webometrics 1.0 from AltaVista to Small Worlds and Genre Drift
Semantic Web questions we couldn't ask 10 years ago
Charleston Conference 2013 - 13_1107
Mining the Go Developer Community
Broad Data
Sharing ideas and sharing data: Researchers and Web 2.0
Designing a second generation of open data platforms
Osw Digital Humanities
BCcampus a-great-babbling-bazaar
 
The Web of Data: do we actually understand what we built?
Dan Needham & Phil Cross (mimas) – Names Project
Franck Rebillard, Professeur Université Paris 3
Digital Methods by Richard Rogers
Academic Libraries and Big Data: Trends in Collection, Publication, Preservat...
Rogers studyingpoliticalissues mar2014_optimized_ii_
Ad

Similar to Network analysis: People and open source communities (11)

PPTX
(Icmia 2013) personalized community detection using collaborative similarity ...
DOC
Final report on sgi social networking
PDF
Social Media Analytics with a pinch of semantics
PDF
Neo4j GraphTalks Zürich - Einführung
PDF
Road to NODES 2023: Graphing Relational Databases
PPT
MCPL2013 - Social network analyses in organizations: challenges and approache...
PPT
MiningEmailSocialNetworks
PPTX
Graph Database Query Languages
PPTX
Neo4j GraphTalk Wien - Einführung
PPTX
Graphs in the Real World
PDF
Introductory Talk on Social Network Analysis at Facebook Developer Circle Me...
(Icmia 2013) personalized community detection using collaborative similarity ...
Final report on sgi social networking
Social Media Analytics with a pinch of semantics
Neo4j GraphTalks Zürich - Einführung
Road to NODES 2023: Graphing Relational Databases
MCPL2013 - Social network analyses in organizations: challenges and approache...
MiningEmailSocialNetworks
Graph Database Query Languages
Neo4j GraphTalk Wien - Einführung
Graphs in the Real World
Introductory Talk on Social Network Analysis at Facebook Developer Circle Me...
Ad

More from Dawn Foster (20)

PDF
CHAOSS Metrics Overview and Examples
PDF
Be a Good Corporate Citizen in Kubernetes
PDF
Overcoming Imposter Syndrome to Become a Conference Speaker!
PDF
How to Be a Good Corporate Citizen in Open Source
PDF
Open Source Collaboration and Companies: Finding the Right Balance
PDF
Navigating Open Source Risk
PDF
Measuring Project Health at VMware
PDF
Navigating Open Source Risk
PDF
Collaborative Leadership: Governance Beyond Company Affiliation
PDF
Collaborative Leadership: Governance Beyond Company Affiliation
PDF
Collaborative Leadership: Governance Beyond Company Affiliation
PDF
Collaborative Leadership: Governance Beyond Company Affiliation
PDF
Is this Open Source Project Healthy or Lifeless?
PDF
Collaboration in Linux Kernel Mailing Lists
PDF
Be a Good Corporate Citizen in Kubernetes
PDF
Being a Good Corporate Citizen in Open Source
PDF
Building Community for your Company’s OSS Projects
PDF
Building Community for your Company’s OSS Project
PDF
How to be a terrible hiring manager
PDF
A week in the Life of Kubernetes
CHAOSS Metrics Overview and Examples
Be a Good Corporate Citizen in Kubernetes
Overcoming Imposter Syndrome to Become a Conference Speaker!
How to Be a Good Corporate Citizen in Open Source
Open Source Collaboration and Companies: Finding the Right Balance
Navigating Open Source Risk
Measuring Project Health at VMware
Navigating Open Source Risk
Collaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company Affiliation
Is this Open Source Project Healthy or Lifeless?
Collaboration in Linux Kernel Mailing Lists
Be a Good Corporate Citizen in Kubernetes
Being a Good Corporate Citizen in Open Source
Building Community for your Company’s OSS Projects
Building Community for your Company’s OSS Project
How to be a terrible hiring manager
A week in the Life of Kubernetes

Recently uploaded (20)

PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PPTX
MYSQL Presentation for SQL database connectivity
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PPTX
Spectroscopy.pptx food analysis technology
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Empathic Computing: Creating Shared Understanding
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Encapsulation theory and applications.pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
cuic standard and advanced reporting.pdf
PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Mobile App Security Testing_ A Comprehensive Guide.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
“AI and Expert System Decision Support & Business Intelligence Systems”
MYSQL Presentation for SQL database connectivity
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
The AUB Centre for AI in Media Proposal.docx
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Spectroscopy.pptx food analysis technology
sap open course for s4hana steps from ECC to s4
Empathic Computing: Creating Shared Understanding
NewMind AI Weekly Chronicles - August'25 Week I
Encapsulation theory and applications.pdf
Understanding_Digital_Forensics_Presentation.pptx
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
cuic standard and advanced reporting.pdf
Spectral efficient network and resource selection model in 5G networks
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx

Network analysis: People and open source communities

  • 1. NETWORK ANALYSIS: PEOPLE AND OPEN SOURCE COMMUNITIES Dawn M. Foster @geekygirldawn   dawn@dawnfoster.com   fastwonderblog.com PhD  Student   University  of  Greenwich   London,  UK
  • 2. WHOAMI • Geek, traveler, reader • 20 year tech career. Past 15 years doing community & open source (Intel, Jive, Puppet Labs, etc.) • PhD student at University of Greenwich researching Linux kernel Photos by Josh Bancroft, Don Park
  • 3. WHAT IS NETWORK ANALYSIS? Studies relationships between units and looks for patterns and structure in those relationships Image from ANAMIA Project
  • 4. AGENDA AND INFO • Gathering your data • Data manipulation for network analysis • Visualization • What else can you do? Image from a Northern Marina Islands Network Scripts, Data, and More:
 github.com/geekygirldawn/oscon_2015
  • 5. I 💖 METRICS GRIMOIRE MailingListStats aka MLStats CVSAnalY - repos Bicho - bugs More Photo by Bitergia http://metricsgrimoire.github.io/
  • 6. MLSTATS a) Install mlstats $ python setup.py install b) Create database mysql> create database mlstats; c) Import data by running mlstats $ mlstats --db-user=USERNAME --db-password=PASS http://URLOFYOURLIST
  • 7. EXTRACT DATA SELECT mp.email_address AS sender,
 (SELECT mp2.email_address FROM 
 messages m2, messages_people mp2 WHERE m2.is_response_of=m.is_response_of 
 AND mp2.message_id=m2.is_response_of limit 1) AS receiver FROM messages_people mp, messages m WHERE YEAR(m.first_date)=2015 AND MONTH(m.first_date)=1 AND mp.message_id=m.message_id; people sending emails subquery: who they replied to limittime formanageable data Output: sender@example.com in_reply_to@example.com sender1@example.com in_reply_to1@example.com sender2@example.com in_reply_to2@example.com ...
  • 8. EXTRACT DATA: SCRIPTS Reformat / clean up data Reproducible Reduce human error oscon.py script Image from Mark Grealish github.com/geekygirldawn/oscon_2015
  • 9. R / VISONE / GOURCE Convert data for better use with network analysis Visualize data using
 RStudio, Visone, and Gource
  • 11. WHAT ELSE? So many visualization tools Python network packages Network analysis is more than just pretty pictures!
  • 12. Dawn Foster University of Greenwich Centre for Business Network Analysis www2.gre.ac.uk/about/faculty/business/research/centres/cbna/home @geekygirldawn, dawn@dawnfoster.com fastwonderblog.com THANK YOU