SlideShare a Scribd company logo
Research Overview
The research uses five dimensions of
proximity theory to explore this question:
“How do participants, who are paid by
firms, collaborate within a fluid
organization?”
Despite increased participation from paid
software developers, little research has been
conducted to investigate collaboration as it
relates contributors who are employed by
firms to work within a fluid organization.
Research Setting
Linux Kernel Community Case Study1:
•  Open source software
•  Over 85% of contributors paid
•  Neutral: competing companies
•  19M lines of code
•  11K developers
•  1200 organisations
References
1.  Corbet, J., Kroah-Hartman, G. & McPherson, A., 2015. Linux Kernel Development:
How Fast is it Going, Who is Doing It, What Are They Doing and Who is Sponsoring
the Work, Available at: http://www.linuxfoundation.org/publications/linux-foundation/
who-writes-linux-2015.
2.  March, J.G. & Simon, H.A., 1993. Organizations Second Ed., Malden, MA: Blackwell.
3.  Dobusch, L. & Schoeneborn, D., 2015. Fluidity, Identity, and Organizationality: The
Communicative Constitution of Anonymous. Journal of Management Studies, 52(8),
pp.1005–1035.
4.  Glance, N.S. & Huberman, B.A., 1994. Social dilemmas and fluid organizations,
Hillsdale, NJ: Lawrence Erlbaum.
5.  Balland, P.A., 2012. Proximity and the Evolution of Collaboration Networks: Evidence
from Research and Development Projects within the Global Navigation Satellite
System (GNSS) Industry. Regional Studies, 46(6), pp.741–756.
6.  Crescenzi, R., Nathan, M. & Rodríguez-Pose, A., 2016. Do inventors talk to strangers?
On proximity and collaborative knowledge creation. Research Policy, 45(1), pp.177–
194.
7.  Knoben, J. & Oerlemans, L. a G., 2006. Proximity and inter-organizational
collaboration: A literature review. International Journal of Management Reviews, 8(2),
pp.71–89.
8.  Cantner, U. & Graf, H., 2006. The network of innovators in Jena: An application of
social network analysis. Research Policy, 35(4), pp.463–480.
9.  Boschma, R., 2005. Proximity and Innovation: A Critical Assessment. Regional
Studies, 39(1), pp. 61–74.
10.  Butts, C.T., 2008. A relational event framework for social action. Sociological
Methodology, 38(1), pp.155-200.
11.  Quintane, E., Pattison, P.E., Robins, G.L. and Mol, J.M., 2013. Short-and long-term
stability in organizational networks: Temporal structures of project teams. Social
Networks, 35(4), pp.528-540.
12.  Opsahl, T. and Hogan, B., 2011. Modeling the evolution of continuously-observed
networks: Communication in a Facebook-like community. arXiv preprint arXiv:
1010.2141.
Method
Relational Event Framework
•  Predicting events in an ordinal sequence is
product of multinomial likelihoods.10
•  Ordinal model estimated using Multinomial
Conditional Logistic Regression, specifically
Cox regression estimated using MLE.11
•  Using clogit in R, which is based on coxph.
•  Realized event compared to 3 randomly
sampled possible events.12
•  10 day moving window.
Background
March and Simon2 define organizations as systems for coordinating activities between individuals
to facilitate cooperation with a focus on supporting decision-making processes. The notion of
organization can be expanded to include fluid organizations that emerge when people collaborate
and make decisions within a community that is recognized by its collective identity.3
Collaboration between individuals occurs within these fluid organizations; however, collaboration
within fluid organizations has been shown to reveal complex behavior with many dimensions.4
Proximity theory can been used to investigate various dimensions of collaboration5,6,7 and other
complex topics related to collaboration, such as knowledge transfer and innovation.8,9
There are several approaches to proximity theory7, and this research uses five dimensions:
cognitive, organizational, social, institutional and geographical.9
Collaboration between Software Developers
and the Impact of Proximity
Dawn M. Foster, Guido Conaldi, Riccardo De Vita
Business School, Centre for Business Network Analysis
Data
Descriptive Statistics
•  Dataset: USB Mailing List (linux-usb) 2013-11-01 - 2015-11-01
•  Messages (Events): 7799 in 3264 threads
•  Ties: based on Ego replying to a message from Alter
•  Actors: 882 (Egos: 691, Alters: 717)
Variable Operationalization
Proximity:
•  Geographic: time zone similarity (temporal geo prox)
•  Organizational: both work for same firm
•  Social prox: # of times dyad participated in same thread
•  Cognitive prox: contribute to same source code subsystems
•  Institutional prox: both employed by firms
Dyadic-Level Covariates:
•  Is Maintainer: one or both are in leadership (maintainer) position
•  Is Committer: one or both have made code contributions
•  Alter Maintainer: Alter is in a leadership (maintainer) position
Network-Level Covariates:
•  Transitive closure: num of x’s ego replied to where x has replied to alter
•  Cyclic closure: num of x’s alter replied to where x has replied to ego
•  Shared partnership in: same x replies to both ego and alter
•  Shared partnership out: ego and alter reply to messages by same x
•  Repeated events: number of times ego replied to messages by alter
•  Recency effect: 1/n with n as number of people alter emailed before ego
•  Participation shift: 1 if last person alter replied to on mailing list was ego
xe a
xe a
e a
e a
a
1/3
1/2
1
xa e
xe a
XXXVII Sunbelt Conference
30 May 2017 – 4 June 2017
Beijing, China
Preliminary Results
•  Proximity is relevant in explaining
collaboration ties within a fluid
organization.
•  Preliminary results are aligned with
qualitative analysis from interviews
with software developers in this
setting.
•  Further Research: Expand beyond 2
years of data from one mailing list to
see if the same results hold for other
mailing lists.
coef exp(coef) se(coef)
org proximity 5.763e-01 1.779e+00 6.280e-02 ***
social prox 3.369e+01 4.290e+14 1.047e+00 ***
cognitive prox -4.620e-01 6.301e-01 1.237e-01 ***
geo proximity 1.756e-01 1.192e+00 9.354e-02 .
inst prox (corp)2.597e-01 1.297e+00 4.535e-02 ***
is maintainer 5.128e-01 1.670e+00 1.167e-01 ***
is committer 3.335e-01 1.396e+00 5.548e-02 ***
alter maint -6.667e-01 5.134e-01 3.894e-01 .
cyclic closure 1.685e+01 2.080e+07 7.209e-01 ***
shared part in -3.263e+01 6.721e-15 1.020e+00 ***
shared part out-2.713e+01 1.653e-12 1.095e+00 ***
transitive clsr 1.060e+00 2.885e+00 5.555e-01 .
repeated events 1.684e+01 2.051e+07 5.773e-01 ***
recency effect 6.070e+00 4.326e+02 2.362e-01 ***
particip shift -3.090e+00 4.550e-02 2.386e-01 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1

More Related Content

PDF
Benchmarking the Privacy-­Preserving People Search
PPTX
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
PPT
How to conduct a social network analysis: A tool for empowering teams and wor...
PPTX
Conversation graphs in Online Social Media
PPTX
Aahb workshop
PDF
Social Network Analysis
PPTX
Community analysis using graph representation learning on social networks
PPT
The Basics of Social Network Analysis
Benchmarking the Privacy-­Preserving People Search
Social Network Analysis: What It Is, Why We Should Care, and What We Can Lear...
How to conduct a social network analysis: A tool for empowering teams and wor...
Conversation graphs in Online Social Media
Aahb workshop
Social Network Analysis
Community analysis using graph representation learning on social networks
The Basics of Social Network Analysis

What's hot (20)

PDF
Fuzzy AndANN Based Mining Approach Testing For Social Network Analysis
PPTX
A comparative study of social network analysis tools
PPTX
20 Network Experiments
PDF
Social Network Analysis (SNA) Made Easy
PPTX
Social network analysis
PPTX
18 Diffusion Models and Peer Influence
PPTX
11 Network Experiments and Interventions
PPTX
Mining and analyzing social media part 2 - hicss47 tutorial - dave king
PPTX
04 Diffusion and Peer Influence
PPT
Social network analysis course 2010 - 2011
PPTX
12 SN&H Keynote: Thomas Valente, USC
PPTX
Who creates trends in online social media
PPTX
Social Network Analysis (SNA) 2018
PPT
Kostas Zafiropoulos - Discussion of eParticipation topics in Greek political ...
PPT
Social Network Analysis
PDF
Social Contagion Theory
PPTX
00 Social Influence Effects on Men's HIV Testing
PPTX
Comparison of Online Social Relations in terms of Volume vs. Interaction: A C...
PPT
2009 Node XL Overview: Social Network Analysis in Excel 2007
Fuzzy AndANN Based Mining Approach Testing For Social Network Analysis
A comparative study of social network analysis tools
20 Network Experiments
Social Network Analysis (SNA) Made Easy
Social network analysis
18 Diffusion Models and Peer Influence
11 Network Experiments and Interventions
Mining and analyzing social media part 2 - hicss47 tutorial - dave king
04 Diffusion and Peer Influence
Social network analysis course 2010 - 2011
12 SN&H Keynote: Thomas Valente, USC
Who creates trends in online social media
Social Network Analysis (SNA) 2018
Kostas Zafiropoulos - Discussion of eParticipation topics in Greek political ...
Social Network Analysis
Social Contagion Theory
00 Social Influence Effects on Men's HIV Testing
Comparison of Online Social Relations in terms of Volume vs. Interaction: A C...
2009 Node XL Overview: Social Network Analysis in Excel 2007
Ad

Similar to Collaboration between Software Developers and the Impact of Proximity (20)

PDF
Understanding Collaboration in Fluid Organizations, a Proximity Approach
PDF
Multilevel Collaboration between Software Developers and the Impact of Proxim...
PPTX
Enabling Multiple Dimensions of Proximity to Sustain Cross-sector Networks fo...
PPT
PPT
Network Analysis Lim 97
PPTX
SMART Seminar: Massively Interacting Systems
PPTX
Supporting team coordination across organizational boundary in GSD
PDF
Proximity Thinking
PPTX
DOC
Indicators Of Community Table
PDF
Collaborative Innovation Networks, Virtual Communities, and Geographical Clus...
PDF
Enterprise 2.0 - Efficient Collaboration and Knowledge Exchange
PPT
What Gives Life to our Community
PPT
Toward Hybrid Computing
PPT
Friendsters @ Work (SDForum)
PPT
The Emerge Show02 Ng Ti P
PPTX
Collaboration Recommender
PDF
Ph.D. defense: semantic social network analysis
PDF
Proximity Thinking Quick Intro
PDF
Managing Creativity: Oxymoron or Necessity?
Understanding Collaboration in Fluid Organizations, a Proximity Approach
Multilevel Collaboration between Software Developers and the Impact of Proxim...
Enabling Multiple Dimensions of Proximity to Sustain Cross-sector Networks fo...
Network Analysis Lim 97
SMART Seminar: Massively Interacting Systems
Supporting team coordination across organizational boundary in GSD
Proximity Thinking
Indicators Of Community Table
Collaborative Innovation Networks, Virtual Communities, and Geographical Clus...
Enterprise 2.0 - Efficient Collaboration and Knowledge Exchange
What Gives Life to our Community
Toward Hybrid Computing
Friendsters @ Work (SDForum)
The Emerge Show02 Ng Ti P
Collaboration Recommender
Ph.D. defense: semantic social network analysis
Proximity Thinking Quick Intro
Managing Creativity: Oxymoron or Necessity?
Ad

More from Dawn Foster (20)

PDF
CHAOSS Metrics Overview and Examples
PDF
Be a Good Corporate Citizen in Kubernetes
PDF
Overcoming Imposter Syndrome to Become a Conference Speaker!
PDF
How to Be a Good Corporate Citizen in Open Source
PDF
Open Source Collaboration and Companies: Finding the Right Balance
PDF
Navigating Open Source Risk
PDF
Measuring Project Health at VMware
PDF
Navigating Open Source Risk
PDF
Collaborative Leadership: Governance Beyond Company Affiliation
PDF
Collaborative Leadership: Governance Beyond Company Affiliation
PDF
Collaborative Leadership: Governance Beyond Company Affiliation
PDF
Collaborative Leadership: Governance Beyond Company Affiliation
PDF
Is this Open Source Project Healthy or Lifeless?
PDF
Collaboration in Linux Kernel Mailing Lists
PDF
Be a Good Corporate Citizen in Kubernetes
PDF
Being a Good Corporate Citizen in Open Source
PDF
Building Community for your Company’s OSS Projects
PDF
Building Community for your Company’s OSS Project
PDF
How to be a terrible hiring manager
PDF
A week in the Life of Kubernetes
CHAOSS Metrics Overview and Examples
Be a Good Corporate Citizen in Kubernetes
Overcoming Imposter Syndrome to Become a Conference Speaker!
How to Be a Good Corporate Citizen in Open Source
Open Source Collaboration and Companies: Finding the Right Balance
Navigating Open Source Risk
Measuring Project Health at VMware
Navigating Open Source Risk
Collaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company Affiliation
Collaborative Leadership: Governance Beyond Company Affiliation
Is this Open Source Project Healthy or Lifeless?
Collaboration in Linux Kernel Mailing Lists
Be a Good Corporate Citizen in Kubernetes
Being a Good Corporate Citizen in Open Source
Building Community for your Company’s OSS Projects
Building Community for your Company’s OSS Project
How to be a terrible hiring manager
A week in the Life of Kubernetes

Recently uploaded (20)

PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
sap open course for s4hana steps from ECC to s4
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Spectroscopy.pptx food analysis technology
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Unlocking AI with Model Context Protocol (MCP)
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Electronic commerce courselecture one. Pdf
PPTX
Cloud computing and distributed systems.
PDF
Machine learning based COVID-19 study performance prediction
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Per capita expenditure prediction using model stacking based on satellite ima...
sap open course for s4hana steps from ECC to s4
MYSQL Presentation for SQL database connectivity
Spectroscopy.pptx food analysis technology
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Digital-Transformation-Roadmap-for-Companies.pptx
Unlocking AI with Model Context Protocol (MCP)
The AUB Centre for AI in Media Proposal.docx
20250228 LYD VKU AI Blended-Learning.pptx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
NewMind AI Weekly Chronicles - August'25 Week I
Electronic commerce courselecture one. Pdf
Cloud computing and distributed systems.
Machine learning based COVID-19 study performance prediction
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Review of recent advances in non-invasive hemoglobin estimation
Reach Out and Touch Someone: Haptics and Empathic Computing

Collaboration between Software Developers and the Impact of Proximity

  • 1. Research Overview The research uses five dimensions of proximity theory to explore this question: “How do participants, who are paid by firms, collaborate within a fluid organization?” Despite increased participation from paid software developers, little research has been conducted to investigate collaboration as it relates contributors who are employed by firms to work within a fluid organization. Research Setting Linux Kernel Community Case Study1: •  Open source software •  Over 85% of contributors paid •  Neutral: competing companies •  19M lines of code •  11K developers •  1200 organisations References 1.  Corbet, J., Kroah-Hartman, G. & McPherson, A., 2015. Linux Kernel Development: How Fast is it Going, Who is Doing It, What Are They Doing and Who is Sponsoring the Work, Available at: http://www.linuxfoundation.org/publications/linux-foundation/ who-writes-linux-2015. 2.  March, J.G. & Simon, H.A., 1993. Organizations Second Ed., Malden, MA: Blackwell. 3.  Dobusch, L. & Schoeneborn, D., 2015. Fluidity, Identity, and Organizationality: The Communicative Constitution of Anonymous. Journal of Management Studies, 52(8), pp.1005–1035. 4.  Glance, N.S. & Huberman, B.A., 1994. Social dilemmas and fluid organizations, Hillsdale, NJ: Lawrence Erlbaum. 5.  Balland, P.A., 2012. Proximity and the Evolution of Collaboration Networks: Evidence from Research and Development Projects within the Global Navigation Satellite System (GNSS) Industry. Regional Studies, 46(6), pp.741–756. 6.  Crescenzi, R., Nathan, M. & Rodríguez-Pose, A., 2016. Do inventors talk to strangers? On proximity and collaborative knowledge creation. Research Policy, 45(1), pp.177– 194. 7.  Knoben, J. & Oerlemans, L. a G., 2006. Proximity and inter-organizational collaboration: A literature review. International Journal of Management Reviews, 8(2), pp.71–89. 8.  Cantner, U. & Graf, H., 2006. The network of innovators in Jena: An application of social network analysis. Research Policy, 35(4), pp.463–480. 9.  Boschma, R., 2005. Proximity and Innovation: A Critical Assessment. Regional Studies, 39(1), pp. 61–74. 10.  Butts, C.T., 2008. A relational event framework for social action. Sociological Methodology, 38(1), pp.155-200. 11.  Quintane, E., Pattison, P.E., Robins, G.L. and Mol, J.M., 2013. Short-and long-term stability in organizational networks: Temporal structures of project teams. Social Networks, 35(4), pp.528-540. 12.  Opsahl, T. and Hogan, B., 2011. Modeling the evolution of continuously-observed networks: Communication in a Facebook-like community. arXiv preprint arXiv: 1010.2141. Method Relational Event Framework •  Predicting events in an ordinal sequence is product of multinomial likelihoods.10 •  Ordinal model estimated using Multinomial Conditional Logistic Regression, specifically Cox regression estimated using MLE.11 •  Using clogit in R, which is based on coxph. •  Realized event compared to 3 randomly sampled possible events.12 •  10 day moving window. Background March and Simon2 define organizations as systems for coordinating activities between individuals to facilitate cooperation with a focus on supporting decision-making processes. The notion of organization can be expanded to include fluid organizations that emerge when people collaborate and make decisions within a community that is recognized by its collective identity.3 Collaboration between individuals occurs within these fluid organizations; however, collaboration within fluid organizations has been shown to reveal complex behavior with many dimensions.4 Proximity theory can been used to investigate various dimensions of collaboration5,6,7 and other complex topics related to collaboration, such as knowledge transfer and innovation.8,9 There are several approaches to proximity theory7, and this research uses five dimensions: cognitive, organizational, social, institutional and geographical.9 Collaboration between Software Developers and the Impact of Proximity Dawn M. Foster, Guido Conaldi, Riccardo De Vita Business School, Centre for Business Network Analysis Data Descriptive Statistics •  Dataset: USB Mailing List (linux-usb) 2013-11-01 - 2015-11-01 •  Messages (Events): 7799 in 3264 threads •  Ties: based on Ego replying to a message from Alter •  Actors: 882 (Egos: 691, Alters: 717) Variable Operationalization Proximity: •  Geographic: time zone similarity (temporal geo prox) •  Organizational: both work for same firm •  Social prox: # of times dyad participated in same thread •  Cognitive prox: contribute to same source code subsystems •  Institutional prox: both employed by firms Dyadic-Level Covariates: •  Is Maintainer: one or both are in leadership (maintainer) position •  Is Committer: one or both have made code contributions •  Alter Maintainer: Alter is in a leadership (maintainer) position Network-Level Covariates: •  Transitive closure: num of x’s ego replied to where x has replied to alter •  Cyclic closure: num of x’s alter replied to where x has replied to ego •  Shared partnership in: same x replies to both ego and alter •  Shared partnership out: ego and alter reply to messages by same x •  Repeated events: number of times ego replied to messages by alter •  Recency effect: 1/n with n as number of people alter emailed before ego •  Participation shift: 1 if last person alter replied to on mailing list was ego xe a xe a e a e a a 1/3 1/2 1 xa e xe a XXXVII Sunbelt Conference 30 May 2017 – 4 June 2017 Beijing, China Preliminary Results •  Proximity is relevant in explaining collaboration ties within a fluid organization. •  Preliminary results are aligned with qualitative analysis from interviews with software developers in this setting. •  Further Research: Expand beyond 2 years of data from one mailing list to see if the same results hold for other mailing lists. coef exp(coef) se(coef) org proximity 5.763e-01 1.779e+00 6.280e-02 *** social prox 3.369e+01 4.290e+14 1.047e+00 *** cognitive prox -4.620e-01 6.301e-01 1.237e-01 *** geo proximity 1.756e-01 1.192e+00 9.354e-02 . inst prox (corp)2.597e-01 1.297e+00 4.535e-02 *** is maintainer 5.128e-01 1.670e+00 1.167e-01 *** is committer 3.335e-01 1.396e+00 5.548e-02 *** alter maint -6.667e-01 5.134e-01 3.894e-01 . cyclic closure 1.685e+01 2.080e+07 7.209e-01 *** shared part in -3.263e+01 6.721e-15 1.020e+00 *** shared part out-2.713e+01 1.653e-12 1.095e+00 *** transitive clsr 1.060e+00 2.885e+00 5.555e-01 . repeated events 1.684e+01 2.051e+07 5.773e-01 *** recency effect 6.070e+00 4.326e+02 2.362e-01 *** particip shift -3.090e+00 4.550e-02 2.386e-01 *** --- Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1