SlideShare a Scribd company logo
21st International Conference on Mining Software Repositories
Incivility in Open Source Projects:
A Comprehensive Annotated Dataset of Locked GitHub
Issue Threads
Ramtin Ehsani, Mia Mohammad Imran, Robert Zita, Kostadin Damevski, Preetha Chatterjee
Drexel University
Preprint: https://arxiv.org/abs/2402.04183
Virginia Commonwealth
University
Elmhurst University
imranm3@vcu.edu
Motivation and Research Objective
● Fostering healthy collaborations in OSS is challenging
● Understanding and addressing incivility within OSS
discussions
● A lack of a comprehensive approach to address uncivil
interactions
● Lack of large annotated SE datasets
Research Objective: Curating a dataset of locked GitHub
issues enables analyzing incivility in OSS development
Annotated dataset of locked GitHub issue threads with heated discussions
Dataset Annotation
● 404 Locked issue threads from 213 GitHub projects, and 5,961
Individual comments
● Locked as "too heated" or demonstrated clear characteristics
indicative of heated discussions
● A total of 19 annotators
● To further improve the annotation quality, we used GPT-4
● Manually checked the instances of disagreements between GPT-4
and annotators
● Tone Bearing Discussion Features (TBDFs), uncivil features*
○ Bitter frustration, Impatience, Mocking, Irony, Vulgarity, etc
● Triggers*
○ Failed use of code, Technical disagreements, Communication breakdown, etc
● Targets*
○ People, Code/Tool, Company/organization, Undirected
● Consequences*
○ Discontinued further discussion, Escalating further, etc
*
C. Miller, S. Cohen, D. Klug, B. Vasilescu and C. Kästner, "“Did You Miss My Comment or What?” Understanding Toxicity in Open Source Discussions," 2022
*
Isabella Ferreira, Jinghui Cheng, and Bram Adams, The "Shut the f**k up" Phenomenon: Characterizing Incivility in Open Source Code Review Discussions, 2021
*
Jaydeb Sarker, Asif Kamal Turzo, Ming Dong, and Amiangshu Bosu, Automated Identification of Toxic Code Reviews Using ToxiCR, 2023
*
Our open coding process
Annotated Features
Dataset Description
● 1,365 comments annotated with an uncivil feature
● Bitter frustration, Impatience, and Mocking are the most prevalent
TBDFs
● Failed use of tool/code or error messages the most common Trigger
● People are the most common Target
● Discontinued further discussion is the most common Consequence
● A curated dataset of 404 locked issue threads
from 213 GitHub projects [Scan QR Code]
● Bitter frustration, Impatience, and Mocking
are the most prevalent TBDFs
● Failed use of tool/code or error messages
the most common trigger
● People are the most common target
● Discontinued further discussion is the most
common consequence
Preprint: https://arxiv.org/abs/2307.15631
ramtin.ehsani@drexel.edu
Preprint: https://arxiv.org/abs/2402.04183
imranm3@vcu.edu
Summary Research Directions
● Automated moderation bot development
● Impact of incivility on project health
● Effectiveness of moderation strategies
● Early warning systems development
● Underrepresented communities'
experiences
● Predicting heated thread locking
● Identifying productive intervention points

More Related Content

PDF
Incivility in Open Source Projects: A Comprehensive Annotated Dataset of Lock...
PDF
Sentiment Analysis on the Linux Kernel
PDF
Communication between open source developers
PPTX
Leveraging the Crowd: Supporting Newcomers to Build an OSS Community
PDF
WikiLoop: Big tech's Open Knowledge contributions
PDF
A Longitudinal Study on the Maintainers' Sentiment of a Large Scale Open Sour...
PPTX
contributing to open source in just about any skill
PPTX
Emotion Classification In Software Engineering Texts: A Comparative Analysis ...
Incivility in Open Source Projects: A Comprehensive Annotated Dataset of Lock...
Sentiment Analysis on the Linux Kernel
Communication between open source developers
Leveraging the Crowd: Supporting Newcomers to Build an OSS Community
WikiLoop: Big tech's Open Knowledge contributions
A Longitudinal Study on the Maintainers' Sentiment of a Large Scale Open Sour...
contributing to open source in just about any skill
Emotion Classification In Software Engineering Texts: A Comparative Analysis ...

Recently uploaded (20)

PDF
Machine learning based COVID-19 study performance prediction
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPT
Teaching material agriculture food technology
PDF
Empathic Computing: Creating Shared Understanding
PDF
A comparative analysis of optical character recognition models for extracting...
PPTX
Cloud computing and distributed systems.
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Encapsulation theory and applications.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Machine learning based COVID-19 study performance prediction
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Teaching material agriculture food technology
Empathic Computing: Creating Shared Understanding
A comparative analysis of optical character recognition models for extracting...
Cloud computing and distributed systems.
Agricultural_Statistics_at_a_Glance_2022_0.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Digital-Transformation-Roadmap-for-Companies.pptx
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Encapsulation theory and applications.pdf
MYSQL Presentation for SQL database connectivity
Encapsulation_ Review paper, used for researhc scholars
sap open course for s4hana steps from ECC to s4
Assigned Numbers - 2025 - Bluetooth® Document
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Ad
Ad

Incivility in Open Source Projects: A Comprehensive Annotated Dataset of Locked GitHub Issue Threads

  • 1. 21st International Conference on Mining Software Repositories Incivility in Open Source Projects: A Comprehensive Annotated Dataset of Locked GitHub Issue Threads Ramtin Ehsani, Mia Mohammad Imran, Robert Zita, Kostadin Damevski, Preetha Chatterjee Drexel University Preprint: https://arxiv.org/abs/2402.04183 Virginia Commonwealth University Elmhurst University imranm3@vcu.edu
  • 2. Motivation and Research Objective ● Fostering healthy collaborations in OSS is challenging ● Understanding and addressing incivility within OSS discussions ● A lack of a comprehensive approach to address uncivil interactions ● Lack of large annotated SE datasets Research Objective: Curating a dataset of locked GitHub issues enables analyzing incivility in OSS development Annotated dataset of locked GitHub issue threads with heated discussions
  • 3. Dataset Annotation ● 404 Locked issue threads from 213 GitHub projects, and 5,961 Individual comments ● Locked as "too heated" or demonstrated clear characteristics indicative of heated discussions ● A total of 19 annotators ● To further improve the annotation quality, we used GPT-4 ● Manually checked the instances of disagreements between GPT-4 and annotators
  • 4. ● Tone Bearing Discussion Features (TBDFs), uncivil features* ○ Bitter frustration, Impatience, Mocking, Irony, Vulgarity, etc ● Triggers* ○ Failed use of code, Technical disagreements, Communication breakdown, etc ● Targets* ○ People, Code/Tool, Company/organization, Undirected ● Consequences* ○ Discontinued further discussion, Escalating further, etc * C. Miller, S. Cohen, D. Klug, B. Vasilescu and C. Kästner, "“Did You Miss My Comment or What?” Understanding Toxicity in Open Source Discussions," 2022 * Isabella Ferreira, Jinghui Cheng, and Bram Adams, The "Shut the f**k up" Phenomenon: Characterizing Incivility in Open Source Code Review Discussions, 2021 * Jaydeb Sarker, Asif Kamal Turzo, Ming Dong, and Amiangshu Bosu, Automated Identification of Toxic Code Reviews Using ToxiCR, 2023 * Our open coding process Annotated Features
  • 5. Dataset Description ● 1,365 comments annotated with an uncivil feature ● Bitter frustration, Impatience, and Mocking are the most prevalent TBDFs ● Failed use of tool/code or error messages the most common Trigger ● People are the most common Target ● Discontinued further discussion is the most common Consequence
  • 6. ● A curated dataset of 404 locked issue threads from 213 GitHub projects [Scan QR Code] ● Bitter frustration, Impatience, and Mocking are the most prevalent TBDFs ● Failed use of tool/code or error messages the most common trigger ● People are the most common target ● Discontinued further discussion is the most common consequence Preprint: https://arxiv.org/abs/2307.15631 ramtin.ehsani@drexel.edu Preprint: https://arxiv.org/abs/2402.04183 imranm3@vcu.edu Summary Research Directions ● Automated moderation bot development ● Impact of incivility on project health ● Effectiveness of moderation strategies ● Early warning systems development ● Underrepresented communities' experiences ● Predicting heated thread locking ● Identifying productive intervention points