Automatically Identifying the Quality of Developer Chats for Post Hoc Use

Automatically Identifying the Quality of
Developer Chats for Post Hoc Use
0
Preprint: https://preethac.github.io/files/TOSEM21.pdf
@PreethaChatterj
preethac@drexel.edu https://preethac.github.io
Transactions on Software Engineering and Methodology (TOSEM)
Journal-first presentation at ASE 2022
Preetha Chatterjee Kostadin Damevski Nicholas A. Kraft Lori Pollock

1
Why Analyze Quality of Information?
Built-in mechanisms in Q&A forums
Accepted answers, vote counts, user reputation
Prior research [Sillito ‘12, Duijn ‘15, Yang ’16]
Conciseness of answers, code readability
❌
No quality assessment
mechanism for chats
First step to building effective data-driven software tools
 API recommendation systems
 Virtual assistants for programming help
 Enhance quality of search engines

Consider this Conversation
Author Utterance
Alexia Hi, I have a file with following contents
1234 alphabet /vag/one/arun > 1454 bigdata /home/two/ogra > 5684 apple /vinay/three/dire,
but i want the output to be like
1234 alphabet one > 1454 bigdata two > 5684 apple three
Elaina sed −r ’s|(.+)/[ˆ/]+/([ˆ/]+)/.+|12|g’
Corina Even though I dont have anything to do with this question, could you explain the logic behind the
answer? The formatting sentence seem so random
Elaina ‘sed -r‘ is an extended mode, so + is enabled (matches one or more characters, unlike * that matches
zero or more); s///g or s|||g or any symbol instead of | is how a basic replacing expression is
constructed.
The first field is what to match, the second is what to replace it with.;(.+) /[ˆ/]+/([ˆ/]+) /.+ (.+)/
matches anything from the start until the first / and puts found characters in the first group (1);
[ˆ/]+/ matches anything that is not a slash, and then a slash (‘vag/‘ or ‘home/‘); ([ˆ/]+)/ matches the
same thing, but puts the stuff found in-between slashes in the second group 2; and then .+ matches
whatever comes next to the end of line; and the second field tells sed to replace the line with 12,
so our saved groups side-by-side: the first group was everything before the first slash, and the
second group was the stuff between 2nd and 3rd slashes
Corina Ah ok, thanks a lot for the explanation!
2
• Concise
• Details of the problem and solution
• Indication of answer acceptance

Now Consider This Conversation
3
Author Utterance
Cody Hello guys I got a huge problem
Holli Cody: ask away
Cody We’ve been ask as assignment the implementation of Dijkstra’s and Bellman Ford’s algorithm for
calculating the shortest path in a given graph
Holli So what’s the issue?; run into a problem?
Cody I don’t really know how to start and that’s my problem
… ….
Darrin Cody: how much experience do you have writing code?; for example, there are quite a number of
existing examples of the algorithms you’re talking about
Cody basic i’m just starting
Darrin ok; can you describe the steps on how you execute the algorithm?; and do you understand why
those steps are necessary?; if so, then the next step you take is translating your written description
of the process into pseudocode; once you have a reasonable sequence of actions, you then
implement the pseudocode in your language of choice; frankly, the first two items are always the
most difficult; because it requires you to understand the problem domain; once you understand it,
making it work is usually much less effort
Rachel Cody: oof graph theory for a beginner. do you understand how those algorithms work, ?
Cody Yes I understand how those work
Darrin just having trouble translating described steps to code?
• Lengthy
• Lacks relevant details of the problem
• Too much noise

Post Hoc Quality Conversations
A conversation is considered post hoc quality based on
the availability and ease of identifying information
to gain useful software-related knowledge
4
Recruited human judges to
analyze 400 conversations

 Logistic Regression
 Stochastic Gradient
Boosted Trees
 Random Forest
 Sequential Neural
Network
Automatically Identify Post Hoc Quality Conversations
Developer
Chats Extraction of
Features
Classification
 Knowledge
Seeking/Sharing
 Contextual
 Succinct
 Well Written
 Participant Experience
Prediction of
Quality
Binary Prediction
• Post Hoc
• Non Post Hoc

Features to Identify Post Hoc Quality Conversation
Knowledge
Seeking /
Sharing
Succinct
Well
written
Contextual
Attributes of conversation
1. Primary question?
2. Knowledge-seeking question?
3. Accepted answers?
4. #Authors
1. #API Mentions
2. #URL
3. Code
4. Code Description
5. Size of code
6. Error Message
7. #Software Specific terms
1. #Utterances
2. #Sentences
3. #Words
4. Time Span
5. #Text Speaks
6. #Questions
7. Unique Information
8. Avg Shortest Path
9. Avg Graph Degree
Attributes of conversation
1. #Misspellings
2. #Incomplete Sentences
3. Readability Metrics
1. Questioner Experience
2. Participants Experience
Participant
Experience

Gold Set
7
Community
(Slack Channels)
#Conv
pythondev#help 400
clojurians#clojure 400
elmlang#beginners 400
elmlang#general 400
racket#general 400
Total 2k
Evaluation Methodology
# Post Hoc = 1310
# Non Post Hoc = 690

RQ1: How effective are machine learning-based techniques for
automatic identification of post hoc quality developer chats?
8
Evaluation Results
 Stochastic Gradient Boosted Trees (SGBT)
 Logistic Regression (LR)
 Random Forest (RF)
 Sequential Neural Network (SNN)
Baseline: Software-related conversations
based on presence of code

RQ1: How effective are machine learning-based techniques for
automatic identification of post hoc quality developer chats?
9
Machine learning-based techniques outperform heuristic-based baseline
SNN provides best performance, with F1 and AUC = 0.86, MCC = 0.55
Evaluation Results

RQ2: Which features result in more effective automatic identification?
Top 8 Features Information Gain
#Utterances 0.204
#Sentences 0.182
Software-specific Terms 0.174
#Words 0.171
#Authors 0.158
Time Span 0.156
Participants’ Experience 0.151
Avg Graph Degree 0.146
10
Evaluation Results
Length Coherence Topic of discussion Participant knowledge

RQ3: What types of conversations are difficult to automatically
detect as post hoc quality using our techniques?
11
False Negative False Positive
Evaluation Results

RQ3: What types of conversations are difficult to automatically
detect as post hoc quality using our techniques?
12
False Negative False Positive
FP: classifiers struggled distinguishing conversations based on the quality of answers
FN: very short conversations (3-4 utterances), not enough content for our features
Evaluation Results

Summary: Identifying Post Hoc Quality Conversations
Machine learning-based approach to automatically identify
post hoc quality developer conversations
• Best performance using Sequential Neural Network
with F-measure and AUC of 0.86, MCC of 0.55
• Most informative quality features:
Length, coherence, topic of discussion, and participant experience
13
Significance: Advances the field of information mining by using high-quality
information from developer chats
 Efficient information gathering towards building software maintenance tools
 Enrich existing knowledge-bases and community knowledge

Automatically Identifying the Quality of Developer Chats for Post Hoc Use

More Related Content

Similar to Automatically Identifying the Quality of Developer Chats for Post Hoc Use (20)

More from Preetha Chatterjee (7)

Recently uploaded (20)

Automatically Identifying the Quality of Developer Chats for Post Hoc Use

Editor's Notes