Adaptive User Feedback for IR-based Traceability Recovery

Annibale Panichella Andy ZaidmanAndrea De Lucia
Adaptive User Feedback for
IR-based Traceability Recovery

Traceability Recovery
Search
Information
Retrieval
Method (LSI,
VSM, etc.)

Search Use Case 1 Software
Artefacts
used as
query

Use Case 1
Class1.java
nl.tudelft.package1.class
public class Class1{
private int attribute 1;
private String attribute 2; …
Class5.java
public Class5 (int parameter1, int parameter2, String parameter3)…
Class2.java
public class Class2 extends Class1 {
private int attribute 1; …
Class63.java
public Class63 (Class12 parameter1, int parameter2)…
90%
85%
81%
74%
Class27.java
62%
Search
List of
candidate
links

Use Case 1
Class1.java
Class5.java
Class2.java
Class63.java
Class27.java
✓
✗
✗
✓
✓
Search

Relevance Feedback
Hayes et al., “Advancing Candidate
Link Generation for Requirements
Tracing: the Study of Methods”
IEEE Transaction on Software Engineering
2006

Relevance Feedback
2006
Use Case 1
Class1.java
Class5.java
Class2.java
Class63.java
81%
74%
Class27.java
62%
Search
✓
✗

Relevance Feedback
2006
Use Case 1
Class1.java
Class5.java
Class2.java
Class63.java
Class27.java
✓
✗
Search
81%
74%
62%
Learning Process

Relevance Feedback
2006
Use Case 1
Class1.java
Class5.java
✓
✗
Search
81%
74%
62%
43%
Class2.java
Class27.java
Class63.java
76%
77%

Relevance Feedback
2006
Use Case 1
Class1.java
Class5.java
Class2.java
Class63.java
77%
76%
Class27.java
43%
✓
✗
Search

Relevance Feedback
2006
Use Case 1
Class1.java
Class5.java
Class2.java
Class63.java
Class27.java
✓
✗
✓
✓
✗
Search

Relevance Feedback
2006
“Analyst feedback improves the ﬁnal trace
results” for requirements tracing
(Standard Rocchio)

Relevance Feedback
2006
De Lucia et al., “Incremental
Approach and User Feedback: a
Silver Bullet for Traceability Recovery”
International Conference on Software
Maintenance, 2006
Relevance feedback
does not improve and
sometimes worsens
the accuracy of an IR
method when applied
to different software
artefacts
(Standard Rocchio)

Theory behind Relevance Feedback
Mannin et al, “Introduction to
Information Retrieval”.
Cambridge University Press, 2008.

Assumption 1. Queries contain few words if
compared to the size of documents to retrieve
Short query
Web pages with hundreds of words

Use Case
Test Case
Query?
Query?

In traceability, source artefacts (queries) can be
more verbose than target artefacts (documents)

Assumption 2. Cluster hypothesis: relevant
documents must be similar to each others, i.e.,
they should cluster in the vector space
Relevant
Documents
Non-Relevant
Documents
Query
Term1
Term2

Language used in software artefacts is much
more homogeneous than natural language
Term1
Term2

Language used in software artefacts is much
more homogeneous than natural language
In traceability, source artefacts (queries) can
be more verbose than target artefacts
(documents)

Adaptive Relevance Feedback
List = initial ranked list of candidate links
while not (stopping criterion) {
Get the link (source, target) on top of List
The user classifies (source, target)
Apply the standard Rocchio to source
}
(Standard Rocchio)

if (source < target)
if (target < source)
Apply the standard Rocchio to target
}
Adaptive Standard Rocchio
Apply the relevance
feedback only to the
shortest artefacts
(Assumption 1)

if (source < target && TruePositive(source) > FalsePositive(source))
if (source < target && TruePositive(source) > FalsePositive(source))
Apply the standard Rocchio to target
}
Apply the relevance
feedback only to the
shortest artefacts
(Assumption 1)
Adaptive Standard Rocchio
Apply the relevance feedback if and only
if the number of correct links is >= to the
number of false positives
(Assumption 2)

Empirical Evaluation
Context: three software projects

We investigates the following research questions:
RQ1: Does the adaptive relevance feedback improve the performances of the
Vector Space Model?
RQ2: Does the adaptive relevance feedback outperform the standard relevance
feedback?

We investigates the following research questions:
RQ1: Does the adaptive relevance feedback improve the performances of the
Vector Space Model?
RQ2: Does the adaptive relevance feedback outperform the standard relevance
feedback?
Metrics:
Precision = TP/ (TP+FP)
Recall = TP / (Tot Links)
Wilcoxon Test (non-parametric)

Empirical Results
Easy-Clinic: tracing UC onto CC Easy-Clinic: tracing ID onto CC
Easy-Clinic: tracing TC onto CC

Empirical Results
i-Trust: tracing UC onto JSP Modis: tracing HLR onto LLR

Empirical ResultsAveragePrecision
0
23
45
68
90
UC-CC ID-CC TC-CC i-Trust Modis
VSM RF Adaptive RF
Statistical Signiﬁcance:
AdaptiveRF > VSM = 4/5
AdaptiveRF > RF = 4/5
RF > VSM = 1/5
Wilcoxon test

Adaptive User Feedback for IR-based Traceability Recovery

More Related Content

What's hot (20)

Viewers also liked (17)

Similar to Adaptive User Feedback for IR-based Traceability Recovery (20)

More from Annibale Panichella (20)

Recently uploaded (20)

Adaptive User Feedback for IR-based Traceability Recovery