SlideShare a Scribd company logo
AN EMPIRICAL STUDY OF THE RELATION
BETWEEN STRONG CHANGE COUPLING
AND DEFECTS USING HISTORY AND SOCIAL
METRICS IN THE APACHE ARIES PROJECT
Igor
Wiese
Rodrigo
Kuroda
Reginaldo
Ré
Gustavo
Oliva
Marco
Gerosa
2
MOTIVATION
+ Dependencies
+ Maintenance effort
[Banker et al.1998]
+ Defects
[D’ambros et al. 2009b, Cataldo et al. 2009, Kirbas et. al 2014]
+ ripple effects
[Pressman, 2001]
Software systems comprise
artifacts that depend on each other
3
MOTIVATION SCENARIO
Subsystem A
Class a
Class c
Class b
Subsystem B
Class d
Class e
CHANGE COUPLING
4
Artifacts a1
Artifacts a2
time
Co-change
commit
A change coupling indicates that two artifacts
changed together (co-changed) in the past,
making them evolutionarily connected.
WHY IS CHANGE COUPLING IMPORTANT?
5
Correlation with
Defects
[D’ambros et al. 2009b,
Kirbas et. al 2014]
Change
Coupling
Change Impact Analysis
[Zimmermann et al. 2005, Zhou et al.
2008, Hassan 2009]
Code
Refactoring
[D’ambros et al. 2009a]
Understand
the Sotware
Architecture
[Zimmermann et al.
2003]
Modularity
Assessment
[[[Silva et al. 2014]
Trace
Requiriments
[Ali et al. 2013]
Change couplings reveal relationships not present in
the code or in the documentation.
6
THE MAIN GOALS OF THIS PAPER ARE:
1
2
investigate the relation between
strong change couplings and
the number of defects (RQ1)
characterize strong change couplings
using historical and social metrics (RQ2)
3
predict defects associated with strong
change couplings (RQ3)
7
METHODOLOGY
Data
Collection
Identify the strong
change couplings
Compute the
set of Metrics
Build the
Classifiers
RQ1: Are strong change couplings
related to defects?
RQ2: Can historical and social
metrics identify if a change
coupling is strong?
RQ3: Can we predict defects
associated with strong change
couplings?
8
IDENTIFYING STRONG CHANGE COUPLINGS
Support
Number of transactions when both artifacts were changed together (support = 3).
Confidence
The confidence for the change coupling from A to B would be 3/9 = 0.33.
In turn, the confidence for the change coupling from B to A would be 3/4 = 0.75.
Strong change coupling
After ranking all change couplings by support value, we used a quartile analysis to
determine the relevant change couplings. All couplings with support value higher
than the third quartile were labeled as “strong”.
Artifact A
Artifact B
time
Co-change
commit
9
APACHE ARIES
The Aries project delivers a set of pluggable Java
components enabling an enterprise OSGi
application programming model
..has had 3,901 commits made by 35 contributors
representing 189,241 lines of code
397 Commits Up +77 (24%) from previous 12 months
11 Contributors Up +1 (10%) from previous 12 months
We collected data from six releases, 556
issue reports, five years of software
developments history
10
RQ1: ARE STRONG CHANGE COUPLINGS
RELATED TO DEFECTS?
The majority of the strong change couplings could be
associated with at least one defect.
strong change couplings are moderately correlated (rho 0.46, p < 0.001)
11
RQ2: CAN HISTORICAL AND SOCIAL METRICS
IDENTIFY IF A CHANGE COUPLING IS STRONG?
To answer this question, we conducted two studies:
•  Cross-validation for each release;
•  Using one release to train, the consecutive release to test
Cross Validation Results:
•  Models with high
Accuracy
•  AUC ranging 88% - 99%
•  F-measure from
70%-99%
Inter-Release Train/Test
•  Models can correctly
predicted strong change
couplings range from 56%
to 100%.
•  Problems when the train
release have few
instances
12
RQ2: WHICH ARE THE BEST METRICS TO IDENTIFY
STRONG CHANGE COUPLINGS?
•  In average 5 metrics were used (models were
built with 16 metrics)
•  The best subset selected:
•  The length of discussion (number of words) – 4
releases
•  The number of distinct committers – 3 releases
•  The experience of committers – 2 releases
•  The number of defect issues associated with – 2
releases
•  The number of weeks between the first and last
commit – 2 releases
13
RQ3: CAN WE PREDICT DEFECTS ASSOCIATED
WITH STRONG CHANGE COUPLINGS?
We found 81 strong change couplings in the post-release
to fix defects. We correctly predicted 37 strong change
couplings (45.67%).
CONCLUSIONS
•  We studied the relationship between strong change couplings
and defects
•  We developed models with high accuracy to identify strong
change couplings
•  In some cases, it is necessary to group the data.
•  In average, 5 metrics were necessary to get better results
•  6 metrics were selected at least in two releases.
•  We correctly predict 45.67% of strong change couplings changed
•  We want to go deeper and investigate the ways in which strong
change couplings can influence code quality
•  Build tools to monitor and track the “damage” caused by these
couplings;
•  Change propagation analysis using contextual information;
14
http://lapessc.ime.usp.br/
Thank you
igor@utfpr.edu.br

More Related Content

PPTX
On correctness in RDF stream processor benchmarking
PPTX
Less is More: Building Selective Anomaly Ensembles with Application to Event...
PPT
Что не так с рекламным текстом? Презентация с вебинара
DOC
Jason Brown Resume-1
PPTX
Clip 4
PPTX
El jorobado de notredame
PDF
Fgxpress apresentação fácil de entender - copia (22) - copia
On correctness in RDF stream processor benchmarking
Less is More: Building Selective Anomaly Ensembles with Application to Event...
Что не так с рекламным текстом? Презентация с вебинара
Jason Brown Resume-1
Clip 4
El jorobado de notredame
Fgxpress apresentação fácil de entender - copia (22) - copia

Viewers also liked (10)

DOC
доклад текст
PPTX
Remajaislamideologis 130627023248-phpapp02
PDF
Fundraising Shops
PPTX
Construccion de cuerpos geométricos
ODP
Biztanleria eta lana
PDF
Case Study - Electric Star Ltd
PDF
Property for sale in dubai
PPTX
Tecnologia de la informacion 701 yuranis y camila
PDF
Review paper
PPTX
Nano teks editor
доклад текст
Remajaislamideologis 130627023248-phpapp02
Fundraising Shops
Construccion de cuerpos geométricos
Biztanleria eta lana
Case Study - Electric Star Ltd
Property for sale in dubai
Tecnologia de la informacion 701 yuranis y camila
Review paper
Nano teks editor
Ad

Similar to AN EMPIRICAL STUDY OF THE RELATION BETWEEN STRONG CHANGE COUPLING AND DEFECTS USING HISTORY AND SOCIAL METRICS IN THE APACHE ARIES PROJECT (20)

PDF
USING STRUCTURAL HOLES METRICS FROM COMMUNICATION NETWORKS TO PREDICT CHANGE ...
PDF
Early Detection of Collaboration Conflicts & Risks in Software Development
PDF
Csmr13c.ppt
PDF
130404 fehmi jaafar - on the relationship between program evolution and fau...
PPTX
Beyond Strong Consistency
PDF
An Implementation on Effective Robot Mission under Critical Environemental Co...
PDF
Is robustness really robust? how different definitions of robustness impact d...
PDF
Mining and Untangling Change Genealogies (PhD Defense Talk)
PPTX
[ASE2014] An Empirical Study on Reducing Omission Errors in Practice
PDF
Pairwise testing
PPTX
AIAA-Aviation-VariableFidelity-2014-Mehmani
PDF
Team Nile EE416 Final Report
PPTX
nature inspired algorithms
PDF
DEFECT PREDICTION USING ORDER STATISTICS
PDF
A new development in the hierarchical clustering of repertory grid data
PDF
International Journal of Engineering Research and Development
PDF
F1083644
PDF
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
PDF
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
PPT
2Regression testing refers to a software testing technique that re-runs non-f...
USING STRUCTURAL HOLES METRICS FROM COMMUNICATION NETWORKS TO PREDICT CHANGE ...
Early Detection of Collaboration Conflicts & Risks in Software Development
Csmr13c.ppt
130404 fehmi jaafar - on the relationship between program evolution and fau...
Beyond Strong Consistency
An Implementation on Effective Robot Mission under Critical Environemental Co...
Is robustness really robust? how different definitions of robustness impact d...
Mining and Untangling Change Genealogies (PhD Defense Talk)
[ASE2014] An Empirical Study on Reducing Omission Errors in Practice
Pairwise testing
AIAA-Aviation-VariableFidelity-2014-Mehmani
Team Nile EE416 Final Report
nature inspired algorithms
DEFECT PREDICTION USING ORDER STATISTICS
A new development in the hierarchical clustering of repertory grid data
International Journal of Engineering Research and Development
F1083644
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
2Regression testing refers to a software testing technique that re-runs non-f...
Ad

Recently uploaded (20)

PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
TR - Agricultural Crops Production NC III.pdf
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PPTX
Pharma ospi slides which help in ospi learning
PDF
Classroom Observation Tools for Teachers
PPTX
Lesson notes of climatology university.
PPTX
GDM (1) (1).pptx small presentation for students
PDF
Basic Mud Logging Guide for educational purpose
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PDF
Pre independence Education in Inndia.pdf
PDF
Complications of Minimal Access Surgery at WLH
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
Computing-Curriculum for Schools in Ghana
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Anesthesia in Laparoscopic Surgery in India
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
TR - Agricultural Crops Production NC III.pdf
Microbial diseases, their pathogenesis and prophylaxis
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
102 student loan defaulters named and shamed – Is someone you know on the list?
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
Pharma ospi slides which help in ospi learning
Classroom Observation Tools for Teachers
Lesson notes of climatology university.
GDM (1) (1).pptx small presentation for students
Basic Mud Logging Guide for educational purpose
Pharmacology of Heart Failure /Pharmacotherapy of CHF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
Pre independence Education in Inndia.pdf
Complications of Minimal Access Surgery at WLH
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Computing-Curriculum for Schools in Ghana
Final Presentation General Medicine 03-08-2024.pptx
Anesthesia in Laparoscopic Surgery in India

AN EMPIRICAL STUDY OF THE RELATION BETWEEN STRONG CHANGE COUPLING AND DEFECTS USING HISTORY AND SOCIAL METRICS IN THE APACHE ARIES PROJECT

  • 1. AN EMPIRICAL STUDY OF THE RELATION BETWEEN STRONG CHANGE COUPLING AND DEFECTS USING HISTORY AND SOCIAL METRICS IN THE APACHE ARIES PROJECT Igor Wiese Rodrigo Kuroda Reginaldo Ré Gustavo Oliva Marco Gerosa
  • 2. 2 MOTIVATION + Dependencies + Maintenance effort [Banker et al.1998] + Defects [D’ambros et al. 2009b, Cataldo et al. 2009, Kirbas et. al 2014] + ripple effects [Pressman, 2001] Software systems comprise artifacts that depend on each other
  • 3. 3 MOTIVATION SCENARIO Subsystem A Class a Class c Class b Subsystem B Class d Class e
  • 4. CHANGE COUPLING 4 Artifacts a1 Artifacts a2 time Co-change commit A change coupling indicates that two artifacts changed together (co-changed) in the past, making them evolutionarily connected.
  • 5. WHY IS CHANGE COUPLING IMPORTANT? 5 Correlation with Defects [D’ambros et al. 2009b, Kirbas et. al 2014] Change Coupling Change Impact Analysis [Zimmermann et al. 2005, Zhou et al. 2008, Hassan 2009] Code Refactoring [D’ambros et al. 2009a] Understand the Sotware Architecture [Zimmermann et al. 2003] Modularity Assessment [[[Silva et al. 2014] Trace Requiriments [Ali et al. 2013] Change couplings reveal relationships not present in the code or in the documentation.
  • 6. 6 THE MAIN GOALS OF THIS PAPER ARE: 1 2 investigate the relation between strong change couplings and the number of defects (RQ1) characterize strong change couplings using historical and social metrics (RQ2) 3 predict defects associated with strong change couplings (RQ3)
  • 7. 7 METHODOLOGY Data Collection Identify the strong change couplings Compute the set of Metrics Build the Classifiers RQ1: Are strong change couplings related to defects? RQ2: Can historical and social metrics identify if a change coupling is strong? RQ3: Can we predict defects associated with strong change couplings?
  • 8. 8 IDENTIFYING STRONG CHANGE COUPLINGS Support Number of transactions when both artifacts were changed together (support = 3). Confidence The confidence for the change coupling from A to B would be 3/9 = 0.33. In turn, the confidence for the change coupling from B to A would be 3/4 = 0.75. Strong change coupling After ranking all change couplings by support value, we used a quartile analysis to determine the relevant change couplings. All couplings with support value higher than the third quartile were labeled as “strong”. Artifact A Artifact B time Co-change commit
  • 9. 9 APACHE ARIES The Aries project delivers a set of pluggable Java components enabling an enterprise OSGi application programming model ..has had 3,901 commits made by 35 contributors representing 189,241 lines of code 397 Commits Up +77 (24%) from previous 12 months 11 Contributors Up +1 (10%) from previous 12 months We collected data from six releases, 556 issue reports, five years of software developments history
  • 10. 10 RQ1: ARE STRONG CHANGE COUPLINGS RELATED TO DEFECTS? The majority of the strong change couplings could be associated with at least one defect. strong change couplings are moderately correlated (rho 0.46, p < 0.001)
  • 11. 11 RQ2: CAN HISTORICAL AND SOCIAL METRICS IDENTIFY IF A CHANGE COUPLING IS STRONG? To answer this question, we conducted two studies: •  Cross-validation for each release; •  Using one release to train, the consecutive release to test Cross Validation Results: •  Models with high Accuracy •  AUC ranging 88% - 99% •  F-measure from 70%-99% Inter-Release Train/Test •  Models can correctly predicted strong change couplings range from 56% to 100%. •  Problems when the train release have few instances
  • 12. 12 RQ2: WHICH ARE THE BEST METRICS TO IDENTIFY STRONG CHANGE COUPLINGS? •  In average 5 metrics were used (models were built with 16 metrics) •  The best subset selected: •  The length of discussion (number of words) – 4 releases •  The number of distinct committers – 3 releases •  The experience of committers – 2 releases •  The number of defect issues associated with – 2 releases •  The number of weeks between the first and last commit – 2 releases
  • 13. 13 RQ3: CAN WE PREDICT DEFECTS ASSOCIATED WITH STRONG CHANGE COUPLINGS? We found 81 strong change couplings in the post-release to fix defects. We correctly predicted 37 strong change couplings (45.67%).
  • 14. CONCLUSIONS •  We studied the relationship between strong change couplings and defects •  We developed models with high accuracy to identify strong change couplings •  In some cases, it is necessary to group the data. •  In average, 5 metrics were necessary to get better results •  6 metrics were selected at least in two releases. •  We correctly predict 45.67% of strong change couplings changed •  We want to go deeper and investigate the ways in which strong change couplings can influence code quality •  Build tools to monitor and track the “damage” caused by these couplings; •  Change propagation analysis using contextual information; 14