SUPPORTING SOFTWARE CHANGE TASKS
USING AUTOMATED QUERY
REFORMULATIONS
Masud Rahman
PhD Candidate
Department of Computer Science
University of Saskatchewan, Canada
Email: masud.rahman@usask.ca
CMPT 470/816: Advanced Software Engineering
A TALE OF SOFTWARE CHANGE
2
Alex
Bob
Code base Customer
Code base
Bug repository
A TALE OF SOFTWARE CHANGE (CONTD.)
3
Alex
Bob
Customer
Buggy
files
Bug
report
Change
implementation
Keywords
Code search
Code
base
TALK OUTLINE
4
Automated Query
Reformulation
Part I: Suggest keywords from
the change request texts
Part II: Reformulate initial
query of developer using
codebase
STRICT: INFORMATION RETRIEVAL
BASED SEARCH TERM IDENTIFICATION
FOR CONCEPT LOCATION
Mohammad Masudur Rahman, Chanchal K. Roy
Department of Computer Science
University of Saskatchewan, Canada
International Conference on Software Analysis, Evolution and
Reengineering (SANER 2017), Klagenfurt, Austria
SOFTWARE CHANGE TASK
6
Task Summary
Task Description
Other Information
SOFTWARE CHANGE TASK:
DOMAIN CONCEPT--ARTIFACT MAPPING
IResource
element
Tree
Level
Provider
7
Domain concepts
Project artifacts
(e.g., classes, methods)
Our
contribution:
Identifying
such concepts
QUIZ TEST-I
8
ID   Query                                                       QE
1.   Custom search results view iresource                        1331
2.   Custom search results search results view                   636
3.   element iresource provider level tree                       01
4.   Custom search results hierarchically java search results    570
EXISTING WORKS
• Query reformulation & expansion
  – Haiduc et al, ICSE 2013
  – Gay et al, ICSM 2009
  – Shepherd et al, AOSD 2007
• Query quality analysis
  – Haiduc et al, ASE 2011
  – Haiduc et al, ICPC 2011
  – Haiduc et al, ICSE 2012
• Software artifact mining
  – Howard et al, MSR 2013
  – Kevic & Fritz, MSR 2014
• Heuristics
  – Kevic & Fritz, ICSE 2014
9
• Most studies expect the developer to provide an initial query
• Developers succeed only in 12.2% of cases (Kevic & Fritz, ICSE 2014)
Initial search query for a change task.
PAGERANK ALGORITHM: WEB LINK ANALYSIS
10
Size of a face ∝ size of the faces pointing to it
Most important face
in this crowd
SEARCH TERM IDENTIFICATION USING
TEXTRANK & POSRANK, TWO VARIANTS OF
PAGERANK
11
SCHEMATIC DIAGRAM: PROPOSED
APPROACH
12
Change
request
Preprocessing
TextRank
calculation
POSRank
calculation
Ranking
Search terms
Focus of this talk
TEXTRANK: TERM IMPORTANCE USING CO-OCCURRENCE (MIHALCEA ET AL, EMNLP 2004)
13
IResource-------IJavaElement, element-----reported
Node = Distinct word
Edge = Two words co-occurring in the same context
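As a rough illustration of this step (a minimal Python sketch; the function and variable names are hypothetical, not from the STRICT implementation), a text graph can be built by connecting words that co-occur within a small window inside each preprocessed sentence:

from collections import defaultdict

def build_text_graph(sentences, window=2):
    # Nodes are distinct words; an edge connects two words that appear
    # within `window` positions of each other in the same sentence.
    graph = defaultdict(set)
    for words in sentences:  # each sentence is a list of tokens
        for i, w in enumerate(words):
            for j in range(i + 1, min(i + window + 1, len(words))):
                if words[j] != w:
                    graph[w].add(words[j])
                    graph[words[j]].add(w)
    return graph

# Toy example with tokens from a change request
text_graph = build_text_graph([
    ["IResource", "IJavaElement", "element", "reported"],
    ["element", "IResource", "provider"],
])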
POSRANK: TERM IMPORTANCE USING SYNTACTIC
DEPENDENCE (BLANCO & LIOMA, INF. RETR. 2012)
14
Edge = Syntactic dependence between various parts of speech in the sentence
Verb-------Noun, Verb---Adjective
Jespersen’s Theory of 3 Ranks
Noun
Verb Adjective
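A comparable sketch for the POS graph, assuming NLTK's off-the-shelf tagger and a coarse reading of the verb–noun and verb–adjective dependencies named above (the edge rules here are a simplification, not the paper's exact implementation):

from collections import defaultdict
import nltk  # assumes the 'averaged_perceptron_tagger' data is installed

DEPENDS = {("VB", "NN"), ("VB", "JJ")}  # verb-noun, verb-adjective

def build_pos_graph(sentences):
    # Connect two words in a sentence when their coarse POS tags form
    # one of the dependence pairs above (Jespersen-style ranks).
    graph = defaultdict(set)
    for words in sentences:
        tagged = nltk.pos_tag(words)  # [(word, POS tag), ...]
        for w1, t1 in tagged:
            for w2, t2 in tagged:
                if w1 != w2 and (t1[:2], t2[:2]) in DEPENDS:
                    graph[w1].add(w2)
                    graph[w2].add(w1)
    return graph

pos_graph = build_pos_graph([["debugger", "reported", "missing", "source"]])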
TERM IMPORTANCE
(ADAPTED FROM PAGERANK)
15
S(V_i) = (1 - d) + d \sum_{j \in In(V_i)} \frac{S(V_j)}{|Out(V_j)|}, \quad 0 \le d \le 1

• V_i – node of interest
• V_j – node connected to V_i through incoming links
• d – damping factor (i.e., probability of choosing a node in the network)
• In(V_i) – incoming nodes to V_i
• Out(V_j) – outgoing nodes from V_j
TERM IMPORTANCE (EXPLAINED)
16
(Figure: node Vi with incoming neighbour nodes Vj1 … Vj6)
Term Score (Vi) = TextRank (Vi) + POSRank (Vi)
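The recursive scoring itself can be sketched as follows (illustrative Python, assuming the undirected graphs from the earlier sketches, the conventional damping factor d = 0.85, and a fixed number of iterations; the paper iterates until the scores converge):

def rank_terms(graph, d=0.85, iters=30):
    # S(Vi) = (1 - d) + d * sum over neighbours Vj of S(Vj) / |Out(Vj)|;
    # in an undirected graph, In(Vi) and Out(Vi) are both the neighbour set.
    scores = {v: 1.0 for v in graph}
    for _ in range(iters):
        scores = {v: (1 - d) + d * sum(scores[u] / len(graph[u])
                                       for u in graph[v])
                  for v in graph}
    return scores

text_rank = rank_terms(text_graph)
pos_rank = rank_terms(pos_graph)
terms = set(text_rank) | set(pos_rank)
total = {t: text_rank.get(t, 0.0) + pos_rank.get(t, 0.0) for t in terms}
suggested_query = sorted(total, key=total.get, reverse=True)[:10]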
EXPERIMENTAL DATASET
17
8 Projects (Apache + Eclipse)
GitHub commits &
Change set
BugZilla + JIRA issues
1,939 change tasks
EXPERIMENTAL SETUP
18
Change
request
Baseline
query
Suggested
query
Code search
Our ranks
Baseline
ranks
Compare
Query Effectiveness
Mean Average Precision
Mean Recall
Top-K Accuracy
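For concreteness, Query Effectiveness (QE) is the rank of the first correct result returned for a query, and Top-K accuracy is the fraction of queries whose first correct result appears within the top K; a toy sketch with hypothetical names:

def query_effectiveness(ranked_files, gold_files):
    # 1-based rank of the first ground-truth file in the result list.
    for rank, f in enumerate(ranked_files, start=1):
        if f in gold_files:
            return rank
    return None  # no relevant result retrieved

def top_k_accuracy(all_results, all_gold, k=10):
    hits = 0
    for ranked, gold in zip(all_results, all_gold):
        qe = query_effectiveness(ranked, gold)
        if qe is not None and qe <= k:
            hits += 1
    return hits / len(all_results)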
EXPERIMENTAL RESULTS
(QUERY EFFECTIVENESS)
19
Query Pairs                      Improved   Worsened   P-value    Preserved   MRD
STRICT vs. Title                 57.84%     34.94%     <0.001*    7.22%       -147
STRICT vs. Title (10 keywords)   62.49%     32.26%     <0.001*    5.25%       -201
STRICT vs. Description           53.84%     38.21%     <0.001*    7.95%       -329
STRICT vs. (Title + Desc.)       52.36%     39.94%     <0.001*    7.70%       -265
*= Significant Difference, MRD = Mean Rank Difference
EXPERIMENTAL RESULTS
(RETRIEVAL PERFORMANCE)
20
*Our performance is significantly higher for each metric
EXPERIMENTAL RESULTS
(RETRIEVAL PERFORMANCE)
21
Our Top-K accuracy is clearly higher for various K-values
COMPARISON WITH EXISTING METHODS
(QUERY EFFECTIVENESS)
22
Technique                     Improved   Worsened   Preserved   MRD
Kevic & Fritz, ICSE 2014      40.09%     53.95%     5.96%       +101
Rocchio's Method, ICSE 2013   37.59%     56.38%     6.03%       +45
STRICT                        57.84%*    34.94%*    7.22%       -147
*= Significant Difference, MRD = Mean Rank Difference
COMPARISON WITH EXISTING METHODS
(RETRIEVAL PERFORMANCE)
23
*Our performance is significantly higher for each metric than the state-of-the-art
COMPARISON WITH EXISTING METHODS
(RETRIEVAL PERFORMANCE)
24
Our Top-K accuracy is clearly higher for various K-values than the state-of-the-art
TAKE-HOME MESSAGES
• Identifying initial search terms is challenging.
• Only 12.20% of developers' search terms are relevant.
• We adapted the PageRank algorithm for term importance.
• We combined TextRank and POSRank to identify important terms.
• Experiments with 1,939 change tasks from 8 systems of Apache & Eclipse.
• 57.84% of queries were improved by STRICT.
• Comparison with the state-of-the-art approach validates our approach.
25
IMPROVED QUERY REFORMULATION FOR
CONCEPT LOCATION USING CODERANK AND
DOCUMENT STRUCTURES
Mohammad Masudur Rahman, Chanchal K. Roy
Department of Computer Science
University of Saskatchewan, Canada
International Conference on Automated Software Engineering
(ASE 2017), Urbana-Champaign, IL, USA
AN EXAMPLE CHANGE REQUEST
27
Field Content
Issue ID 31110
Product eclipse.jdt.debug
Title Debbugger Source Lookup does not work with variables
Description In the Debugger Source Lookup dialog I can also select
variables for source lookup. (Advanced... > Add
Variables). I selected the variable which points to the
archive containing the source file for the type, but the
debugger still claims that he cannot find the source
SEARCH KEYWORD SELECTION
28
Field Content
Issue ID 31110
Product eclipse.jdt.debug
Title Debbugger Source Lookup does not work with
variables
Description In the Debugger Source Lookup dialog I can also
select variables for source lookup. (Advanced... > Add
Variables). I selected the variable which points to the
archive containing the source file for the type, but the
debugger still claims that he cannot find the source.
CHANGE REQUEST TO CODE MAPPING
29
Field Content
Issue ID 31110
Product eclipse.jdt.debug
Title Debbugger Source Lookup does not work with
variables
Description In the Debugger Source Lookup dialog I can also
select variables for source lookup. (Advanced... > Add
Variables). I selected the variable which points to the
archive containing the source file for the type, but the
debugger still claims that
he cannot find the source
BASELINE SEARCH QUERIES
30
Technique   Query                                    QE
Baseline    debugger source lookup                   79
Baseline    debugger source lookup work variables    77
Baseline query → Code search → Top-K documents → Pseudo-relevance Feedback → Baseline + Expansion terms
TRADITIONAL QUERY REFORMULATIONS
31
Technique           Reformulated Query                                                                   QE
RSV 1990            debugger source lookup work variables + launch configuration jdt java debug          30
Sisman & Kak 2013   debugger source lookup work variables + test exception suite core code               51
Refoqus 2013        debugger source lookup work variables + launch jdt configuration classpath project   12

Technique   Query                                    QE
Baseline    debugger source lookup                   79
Baseline    debugger source lookup work variables    77
BIG PICTURE: TERM WEIGHTING
32
TFIDF(t) = \sum_{d \in RF} \left(1 + \log(tf_{t,d})\right) \times \log\frac{D}{n_t}

Baseline query → Baseline + Expansion terms
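Read concretely: run the baseline query, take the top-retrieved documents as the pseudo-relevance feedback set RF, and score every candidate term t by the formula above. A minimal sketch, assuming each feedback document is already reduced to a term-frequency map (the data structures are illustrative):

import math

def tfidf_expansion_weights(feedback_docs, doc_freq, num_docs):
    # Sum (1 + log tf) * log(D / n_t) over the feedback documents;
    # the highest-weighted terms become the expansion terms.
    weights = {}
    for doc in feedback_docs:  # doc: {term: frequency in that document}
        for t, tf in doc.items():
            idf = math.log(num_docs / doc_freq[t])
            weights[t] = weights.get(t, 0.0) + (1 + math.log(tf)) * idf
    return weights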
BIG PICTURE: TERM WEIGHTING
33
TFIDF(t) = \sum_{d \in RF} \left(1 + \log(tf_{t,d})\right) \times \log\frac{D}{n_t}
• Different semantics
• Different structures
OUR CONTRIBUTIONS (2)
• Novel term weighting method – CodeRank
• Novel query reformulation technique – ACER
34
CODERANK: TERM WEIGHTING FOR SOURCE
CODE TERMS
35
CODERANK CALCULATION: STEP I
36
CODERANK CALCULATION: STEP II
37
resolveRuntimeClasspathEntry
Resolve Runtime Classpath Entry
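Step II splits structured tokens such as resolveRuntimeClasspathEntry into their component words before the graph is built; a simple regex-based sketch of this splitting:

import re

def split_camel_case(token):
    # 'resolveRuntimeClasspathEntry' -> ['resolve', 'Runtime', 'Classpath', 'Entry']
    return re.findall(r"[A-Z]?[a-z]+|[A-Z]+(?![a-z])|\d+", token)

print(split_camel_case("resolveRuntimeClasspathEntry"))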
CODERANK CALCULATION: STEP III
38
S(V_i) = (1 - d) + d \sum_{j \in In(V_i)} \frac{S(V_j)}{|Out(V_j)|}, \quad 0 \le d \le 1
Most important face
in this crowd
1. resolve
2. required
3. launch
4. classpath
5. runtime
ACER: QUERY REFORMULATION USING
CODERANK & MACHINE LEARNING
39
ACER: SCHEMATIC DIAGRAM
40
SOURCE DOCUMENT STRUCTURES
41
Class signature
Method signature
Field signature
ACER: SELECTION OF THE BEST QUERY
REFORMULATION
42
Ref. candidate
(method sig.)
Ref. candidate
(field sig.)
Ref. candidate
(method + field sigs)
Data re-sampling
Machine learning (Ensemble learning)
Selection of the best reformulation
Reformulated
query
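The selection step can be pictured as: compute quality features for each reformulation candidate, then let a trained classifier pick the most promising one. A minimal sketch assuming a scikit-learn ensemble model trained offline on query-quality features; the featurize function and the model here are placeholders standing in for the paper's 20 quality metrics and its ensemble learner:

from sklearn.ensemble import RandomForestClassifier

def select_best_reformulation(candidates, featurize, model):
    # Score each candidate query by the classifier's confidence that
    # it is a 'good' query (class 1), and return the best one.
    features = [featurize(q) for q in candidates]
    probs = model.predict_proba(features)[:, 1]
    best = max(range(len(candidates)), key=lambda i: probs[i])
    return candidates[best]

# model = RandomForestClassifier().fit(X_train, y_train)  # trained offline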
ACER: QUERY REFORMULATIONS
43
Technique           Query                                                                                 QE
Baseline            debugger source lookup                                                                79
Baseline            debugger source lookup work variables                                                 77
Refoqus 2013        debugger source lookup work variables + launch jdt configuration classpath project    12
CodeRank (method)   debugger source lookup work variables + launch debug resolve required classpath       02
CodeRank (field)    debugger source lookup work variables + label classpath system resolution launch      06
CodeRank (both)     debugger source lookup work variables + java type launch classpath label              16
ACER (via ML)       debugger source lookup work variables + launch debug resolve required classpath       02
EXPERIMENTAL DATASET
44
8 Projects (Apache + Eclipse)
GitHub commits &
Change set
BugZilla + JIRA issues
1,675 change
requests
EXPERIMENTAL SETUP
45
Change
request
Baseline
query
Reformulated
query
Code search
Our ranks
Baseline
ranks
Compare
Query Effectiveness (QE)
Mean Reciprocal Rank (MRR)
Top-K Accuracy
RESEARCH QUESTIONS (5)
• RQ1: Does ACER improve baseline queries significantly?
• RQ2: Does CodeRank perform better than traditional term weights (e.g., TF-IDF)?
• RQ3: Does document structure make a difference in query reformulation?
• RQ4: How do stemming, query length, and relevance feedback size affect our performance?
• RQ5: Does ACER outperform the state-of-the-art in query reformulation for concept location?
46
ANSWERING RQ1: QUERY EFFECTIVENESS OVER
BASELINE
47
Query Pairs                      Improved (MRD)   Worsened (MRD)   P-value   Preserved
CodeRank (method) vs. Baseline   58.93% (-61)     37.99% (+131)    0.007*    3.08%
CodeRank (field) vs. Baseline    52.51% (-51)     44.57% (+151)    0.063     2.91%
CodeRank (both) vs. Baseline     58.62% (-51)     38.19% (+136)    0.018*    3.20%
ACER vs. Baseline                71.05% (-81)     2.51% (+104)     <0.001*   26.44%
*= Significant difference between improvements and worsening, MRD = Mean Rank Difference
48
TF-IDF
ANSWERING RQ2: CODERANK VS. TRADITIONAL
TERM WEIGHTS
49
ANSWERING RQ3: DO SOURCE DOCUMENT
STRUCTURES MATTER?
50
ANSWERING RQ3: DO SOURCE DOCUMENT
STRUCTURES MATTER?
51
ANSWERING RQ4: IMPACT OF
REFORMULATION LENGTH
52
RQ5: COMPARISON WITH EXISTING METHODS
53
*Our performance is significantly higher for each metric than the state-of-the-art
1. CodeRank
2. Document contexts
3. Data re-sampling
TAKE-HOME MESSAGES
• Reformulating a search query is highly challenging for developers and costs a lot of effort.
• Traditional term weights are not sufficient.
• We provide CodeRank, which exploits source term semantics and source document contexts.
• We provide ACER, which selects the best from a set of reformulation candidates prepared by CodeRank.
• Experiments with 1,675 change requests from 8 OSS systems of Apache & Eclipse.
• 71% of queries improved, only 3% worsened by ACER.
• Comparison with five methods including the state-of-the-art validates our approach.
54
THANK YOU !!! QUESTIONS?
55
More details on CodeRank & ACER:
http://www.usask.ca/~masud.rahman/acer/
Contact: masud.rahman@usask.ca
More details on STRICT:
http://homepage.usask.ca/~masud.rahman/strict/
RQ5: COMPARISON WITH EXISTING METHODS
56
Our Top-K accuracy is clearly higher for various K-values than the state-of-the-art
PROVOCATIVE STATEMENT
• We need better algorithms to overcome the "vocabulary mismatch" issue. Where do we start? Which source/repository is more appropriate besides the project source code?
57
PROBABLE QUESTIONS
• Did you do stemming?
  • No, we didn't, since many recent studies reported negative performance. Stemming especially does not help when the texts contain structured items like camel-case tokens.
• Which one is better, TextRank or POSRank?
  • They performed quite similarly, but we combined them since they convey two distinct aspects of connectivity.
• Which settings did you apply for the ranking algorithm?
  • Details are in the paper. These PageRank-based algorithms tend to converge to similar scores regardless of their initial settings, unlike simple VSM-based models.
• Can this be used for query reformulation?
  • Possibly, if you can convert the artifact into a text graph. We are currently working on that using source code.
58
PROBABLE QUESTIONS
• Recent studies show that IR-based methods are not effective if the bug report is not rich.
  • Yes, that's true. We need more techniques to help developers write better bug reports, plus better methods to address the vocabulary mismatch issue.
• Why didn't you consider anything from the source code?
  • We are suggesting the initial query; the source code will be used for query reformulation. We also showed that our initial query is better than the baselines that developers frequently use.
• What is the cost? How long does it take?
  • It runs in essentially real time. We are currently planning to develop an IDE plug-in.
59

Editor's Notes

  • #2: Introduce yourself and the affiliation. Today I am going to talk about query suggestion for Concept location where we used Information Retrieval methods.
  • #6: Introduce yourself and the affiliation. Today I am going to talk about query suggestion for Concept location where we used Information Retrieval methods.
  • #7: This is a software change request. It has different sections like title, description and others. Now a developer’s task is to identify the most important terms and then use them for finding the source code to change.
  • #8: To model the problem formally, this is a mapping problem. And the mapping is between concepts in the change request and the relevant source artifacts from the codebase. Our job is to identify the appropriate terms from the change request for the successful mapping.
  • #10: There have been some studies on similar problems. However, most of these studies reformulate a given query; that means the developer needs to provide an initial query first. But studies show that choosing that initial query itself is challenging: one study reported that only 12% of the search terms developers chose from the change request were useful. So, our focus is to choose the initial query from a change request rather than reformulate one. The most closely related work used a set of heuristics.
  • #11: While the earlier work used heuristics for the same problem, we used Google's PageRank algorithm to choose the important terms from a body of text. Here, the most important face in the crowd is the face everybody is looking at, right? The same holds for the World Wide Web: a page is reputable if it is referred to by other reputable pages. So, we model our search term identification after this idea.
  • #12: We identify search terms using two variants of PageRank--- They are called TextRank and POSRank in the information retrieval domain.
  • #13: So, these are the fairly straightforward steps of our approach. We take a change request and perform standard NLP (stop-word removal and splitting); we avoided stemming. From the preprocessed texts, we develop two types of graphs – a text graph and a POS graph. We then derive an importance score for each term from those two graphs, do a linear combination, perform ranking, and choose the top words as search terms based on their scores. Now we will zoom into these sections.
  • #14: The idea behind this text graph is word co-occurrence. For example, these two terms – IResource and IJavaElement – occur in the same context across multiple sentences. Another two terms – element and reported – also occur in the same context. Here we define context as a window of two words within a sentence. We encode each co-occurrence as an edge in the text graph. This way, the whole change request can be converted into a text graph.
  • #15: Similarly, we develop the second graph based on syntactic dependence among the parts of speech in a sentence. We apply Jespersen's Theory of Three Ranks (more details in the paper). That is, some parts of speech depend on other parts of speech for their complete meaning; for example, verbs modify nouns and adjectives within the same sentence. We encode such dependencies as connecting edges and develop another text graph. Thus, some terms are more connected than others.
  • #16: Now we have two graphs developed from the change request based on two different dimensions – word co-occurrence and syntactic dependence. We then apply the above algorithms, adapted from PageRank, for scoring. That is, a term's importance is determined by the importance of the surrounding terms, not just the connectivity. This is how Google beats spam pages. We apply the same idea to concept location. This is the first time it has been done for the concept location task, and this is our novelty.
  • #17: So, this is how the score of a term is determined, based on the scores of the surrounding terms. That means, the score of Vi is determined based on the scores of Vj1 to Vj5. We collect scores for the terms from both graphs which we call TextRank and POSRank. We combine them, rank them and collect the top ones as the search terms.
  • #18: For experiments, we select 8 subject systems from Apache and Eclipse. We collect 1939 change requests/bug reports from BugZilla and JIRA, and prepare the gold set by consulting the commit history of those projects from GitHub. For selecting bug fixing commits, we adopted the widely accepted approach. That is, we identify the Bug ID in the commit title, and then extract corresponding change set.
  • #19: For experiments, We collect our queries and the baseline queries (e.g., title or description from the change request), and feed them to a code search engine. Then we collect their results/ranks and compare. For evaluation/validation, we used these four performance metrics.
  • #20: Results show that our method can improve 52%--62% of the baseline queries, which is promising according to relevant literature. We consider various combinations as the baseline queries, and got similar performance. Our improvement and worsening ratios are significantly different according to statistical tests. The mean rank difference also shows that our mean ranks are closer to the top than the baseline.
  • #21: In terms of retrieval performance, precision and recall are not too high: precision is close to 30% and accuracy is close to 45% when the Top-10 results are considered. But I guess that has been the status quo for the last 15 years, so nothing very dramatic. Still, they are quite a bit higher than the baseline performance.
  • #22: When we extend the K-values, we found the accuracy is growing significantly. But, still, our performance remained higher than all the baselines. This shows the potential of our method.
  • #23: We compared with two parallel methods– Kevic & Fritz used heuristics and the second is a classic query reformulation technique. While they were promising, but still our method beat them in all aspects, and the performance is significantly higher as you see.
  • #24: Looking at the box plots, we can see that our median metrics are significantly higher. While they relied on a set of heuristics and term weighting, our PageRank-based model seems to perform better.
  • #25: When we consider various Top-K accuracy, we got similar findings. Our method located concepts correctly for 80% of the change requests whereas they did for 60% of them at best. This shows the potential of our technique.
  • #26: You can simply read out the texts I guess.
  • #27: Good morning, everyone. Introduce yourself. Today, I am going to talk about a query reformulation technique for concept location where we used an advanced term weighting method and performed machine learning.
  • #28: Now, this is a real software change request. Here these two sections are important, and they contain information about the requested change.
  • #29: Now when a request like this is submitted, a developer tries to find out important terms. Then they use those terms for finding the source code to change probably using a search engine like Lucene
  • #30: That is, they try to map the concepts discussed in the change request to appropriate source code sections like this. This is how, the term comes– “concept location” if you want me to define it.
  • #31: But concept location is NOT an easy task. For example, these two very reasonable queries from the change request do not perform well; the second one returns correct results at the 77th position, which is not acceptable of course. So what is needed here is reformulation of the query for the better. Now, there is traditional tool support for doing that: most tools throw the initial query at the search engine, collect the results, and then pick the most important terms from those results to reformulate the initial poor query.
  • #32: Now, these are the reformulated queries from three such existing methods. They made some improvements in the ranking and returned results a bit closer to the top, but as you can see, that is clearly not enough. Developers want the results at the top positions, so these queries are still costly for practical use.
  • #33: Now, we investigated this part of the reformulation process and found that most of the existing techniques use this equation to determine the importance of a term. That is, they select TF-IDF to find the words for query reformulation; in other words, they rely on the frequency of a term as a proxy for its importance.
  • #34: Now, this is a metric that has been in play since the last century; it was proposed in the 70s. It is a good metric, but it was actually proposed for regular texts such as news articles. Here, on the other hand, we are dealing with source code. Regular texts and source code have different semantics and different structures; they are not the same. So, metrics for regular texts are not appropriate for source code – this is our hypothesis.
  • #35: So, we made two contributions here. We propose CodeRank– a novel and appropriate term weighting method for source code. We propose ACER -- a novel query reformulation technique that uses this term weight.
  • #36: First comes CodeRank.
  • #37: Now, what did we do? We extract important artifacts from the source code, such as method signatures, formal parameters, and field signatures, mostly using AST parsing and regular expressions. The idea is that signatures capture richer intent than other texts; for example, a method signature states the intent, whereas the method body implements the intent with lots of noise.
  • #38: Now once such items are extracted, we split them. Now as we see, these single terms share some kind of semantics to convey a broader semantic. That is, they complement each other in this context. Now, we capture such semantic dependencies in the source code, and develop a term graph like this.
  • #39: Now, once the graph is developed, we use a popular graph-based algorithm called PageRank to determine node importance. OK, let's go visual. In a crowd, the most important person is the one everybody is looking at; it can also be seen as votes – the person voted for the most is the leader. We follow that concept in the context of our term graph: the term that is connected the most with other terms is an important term. Since this scoring is a recursive process, we finally get a ranked list of important terms which can be used as reformulation terms.
  • #40: Now comes the ACER, the second contribution.
  • #41: This is the schematic diagram of our approach. So far we talked about these parts of our approach. Now we will zoom in this part.
  • #43: Once the CodeRank is calculated, we collect multiple reformulation candidates for a given initial query. As we discussed, a source document has various contexts– method signature, field signature and so on. We make use of such contexts, and develop multiple reformulation candidates. Now, since we have multiple options, we have to choose the best reformulation. In order to do that, we apply machine learning. In particular, we determine the quality of each candidate using 20 quality metrics that mostly came from IR domain. Then we use a regression-tree based classifier and suggest the best reformulated query.
  • #44: Now let's see the outcome. Here, we have created three reformulation candidates using CodeRank and source document contexts. Our ML classifier then returns the best option, which returns the result at the 2nd position. If we look closely, our technique identifies two unique terms which made the real difference in performance.
  • #45: For experiments, we select 8 subject systems from Apache and Eclipse. We collect 1,675 change requests/bug reports from BugZilla and JIRA, use the report title as our query, and prepare the gold set by consulting the commit history of those projects from GitHub. This is the widely accepted way to do experiments in this area.
  • #46: For experiments, We collect our queries and the baseline queries , and feed them to a code search engine. Then we collect their results/ranks and compare. For evaluation/validation, we used these four performance metrics.
  • #47: Now, in our experiment, we answer these five research questions.
  • #48: In the first research question, we compare our queries with the baseline queries. As we see, the method signature based reformulation performs better than the other two options. However, machine learning selects the best of the three and provides the best performance. For example, our reformulation improves 71% of the queries, preserves 26%, and degrades only 3% of the queries. So, obviously, we improve far more queries than we degrade.
  • #50: In the second research question, we compare CodeRank with the traditional term weights – term frequency and TF-IDF. We see that TF performs better than TF-IDF, which is interesting. Anyway, when compared with our CodeRank, TF performs better initially but CodeRank outperforms it later, especially for 10-15 reformulation terms. That is, a few highly frequent terms are really important, but CodeRank is more reliable than term frequency for term importance.
  • #51: In the third research question, we show how document structures/contexts make a difference. These are the numbers of improved queries for the various reformulation candidates. We see that 19% of the total improvements are unique to individual contexts. That is, if we consider only method signatures for query reformulation, we miss the improvements made by field signature based reformulations. Again, if we consider the whole texts rather than the signatures, we also miss some query improvements. This holds not only for CodeRank but also when we employ term frequency in those contexts. Thus, document contexts matter for query reformulation.
  • #52: Now, when we compare the query improvements by ACER and term frequency in a Venn diagram, we find a 66% overlap, but ACER provides a unique set of improvements that is three times that of TF. ACER exploits document structures and TF does not, and we see the difference here.
  • #53: In the fourth research question, we calibrate the reformulation length. We found the best performance is achieved when the reformulation length is between 10 and 15; this is where CodeRank saturates.
  • #54: In the fifth research question, we compare our query improvement and worsening ratios with the existing methods. Our median improvement is much higher than the others', and more importantly, we degrade very few queries compared to the others. These measures are significantly higher. Thus, according to our investigation, ACER is the winner. But we must also admit that the ML-based approach is less scalable, and we are now working on the tool.
  • #55: These are the take-home messages. Query reformulation is a challenging task for developers; Google does not work on a local source code repository. Traditional term weights are clearly not sufficient or appropriate for source code. We provide CodeRank, a novel term weight for source code, and ACER, an improved reformulation technique. Our technique improves about 71% of the queries and degrades only a handful. Comparison with the state-of-the-art shows the promise of our method.
  • #56: Thanks for your time and attention. I am ready to have a few questions.
  • #57: When we consider various Top-K accuracy, we got similar findings. Our method located concepts correctly for 80% of the change requests whereas they did for 60% of them at best. This shows the potential of our technique.
  • #58: We tried with source code and Stack Overflow to look for semantically similar words. What’s next?