Evaluating SZZ Implementations Through a Developer-informed Oracle (ICSE 2021)

Evaluating SZZ Implementations
Through a Developer-informed Oracle
Giovanni Rosa, Luca Pascarella, Simone Scalabrino, Rosalia Tufano,
Gabriele Bavota, Michele Lanza, Rocco Oliveto

Find out changes that can lead to a problem
and avoid them in future
Understanding where bugs are introduced allows to…

Estimate how much a program is error-prone

Better allocate resources in testing activities

Śliwerski
Zimmermann
Zeller
@ MSR 2005

Step 1
SZZ in a nutshell
bug report
analysis

Step 1
(A)
Bug-fixing
commit
(B)
git blame
(C)
Buggy
commit
SZZ in a nutshell
bug report
analysis

Step 1
Step 2
Filtering of resulting
commits
SZZ in a nutshell
(A)
Bug-fixing
commit
(B)
git blame
(C)
Buggy
commit
bug report
analysis

Step 1
bug-inducing
commit
Step 2 Step 3
SZZ in a nutshell
commits
(A)
Bug-fixing
commit
(B)
git blame
(C)
Buggy
commit
bug report
analysis

Different SZZ variants proposed

Evaluating and
comparing the SZZ
variants
Da Costa et al. @ TSE 2016

Evaluating and
comparing the SZZ
variants
Small datasets used for evaluation

Evaluating and
comparing the SZZ
variants
Small datasets used for evaluation
Validation manually performed by
researchers

Define a dataset validated by
the developers
The way

fixes a search bug
introduced by 2508e12
and fixes a typo in the
README.md

Heuristic approach
1
keyword-based filter
AI-powered syntax analysis
Duplicate commits removal

Heuristic approach
2
Duplicate commits removal

3 Heuristic approach
duplicate commits removal

Manual validation
False
positives
Bug report
data

Bug report data
fixes #1740 quote pov-ray binary on windows
this fixes a bug introduced by #3523741…
URL
Date when the
issue is reported
https://tracker.freecadweb.org/view.php?id=1740
Commit
message

19,6M
3,6k
1,9k
Analyzed commits:
Extracted commits:
After manual validation:

Top programming languages
0
185
370
C
P
y
t
h
o
n
C
+
+
J
S
J
a
v
a
P
H
P
R
u
b
y
C
#

1,1k
129
Final number of commits:
Commits with issue report:

How do different variants of SZZ
perform in identifying
bug-inducing changes?

B-SZZ
Śliwerski et al. @ MSR 2005

R-SZZ e L-SZZ
B-SZZ
AG-SZZ
DJ-SZZ
Śliwerski et al. @ MSR 2005 Williams and Spacco @ ISSTA 2008
Kim et al. @ ASE 2006 Davies et al. @ JSE 2013

R-SZZ e L-SZZ
B-SZZ
AG-SZZ
MA-SZZ
DJ-SZZ
RA-SZZ
Śliwerski et al. @ MSR 2005 Williams and Spacco @ ISSTA 2008 Da Costa et al. @ TSE 2016
Kim et al. @ ASE 2006 Davies et al. @ JSE 2013 Neto et al. @ SANER 2018

Open-Source implementations
SZZ Unleashed
(DJ-SZZ)
OpenSZZ
(B-SZZ)
PyDriller
(AG-SZZ)
RA-SZZ
(RA-SZZ)

Step 1
bug-inducing
commit
Step 2 Step 3
Our experiment
commits
(A)
Bug-fixing
commit
(B)
git blame
(C)
Buggy
commit
bug report
analysis

Results
0.66 (R-SZZ)
Precision
Recall
F1-score
0.72 (SZZ@UNL)
0.61 (R-SZZ)

Results
0.66 (R-SZZ)
Precision
Recall
F1-score
0.72 (SZZ@UNL)
0.61 (R-SZZ)
0.09 (SZZ@UNL)
0.19 (SZZ@OPN)
Java only
0.16 (SZZ@UNL)

“ The buggy line is
not always impacted
in the bug-fix „
Lesson 1

“ SZZ is sensible to
history rewritings „
Lesson 2

“ Looking at the
big picture in
code changes „
Lesson 3

Take a look at our SZZ implementation!
https://github.com/grosa1/pyszz

Evaluating SZZ Implementations Through a Developer-informed Oracle (ICSE 2021)

More Related Content

What's hot (20)

Similar to Evaluating SZZ Implementations Through a Developer-informed Oracle (ICSE 2021) (20)

More from Giovanni Rosa (7)

Recently uploaded (20)

Evaluating SZZ Implementations Through a Developer-informed Oracle (ICSE 2021)