SlideShare a Scribd company logo
An Evaluation of Effort-
Aware Fine-Grained Just-in-
Time Defect Prediction
Methods
Sousuke Amasaki1, Hirohisa Aman2, Tomoyuki Yokogawa1
1Okayama Prefectural University
2Ehime University
Software defect prediction target gets
finer for saving resource
• Traditional Software Defect Prediction
• Focus on modules (classes, files, methods)
• Just-in-Time Software Defect Prediction
• Focus on commits (a set of file changes)
• Fine-grained JIT Software Defect Prediction
• Focus on file changes in a commit
SEAA 2022 in Gran Canaria 2
Buggy!
commit
Buggy!
commit
Buggy!
Projects still have little resource for SQA
• Traditional SDP is a classification problem for all targets
• Effort-aware SDP is a ranking problem under limited budget
SEAA 2022 in Gran Canaria 3
Buggy! Buggy!
100%
Buggy!
Buggy!
90%! 80%! 70%! 60%! 50%!
Check all ones
predicted as
buggy
Actually
check ones fit
in a budget
Buggy! Buggy! Buggy!
Controversy over effort-aware JIT SDP
SEAA 2022 in Gran Canaria 4
• Supervised vs. Unsupervised
• Supervised needs training data
to make a prediction
• e.g., EALR[10] , CBS+[23]
• Unsupervised needs no data
and utilizes heuristics
• e.g., LT[20], CCUM[21]
Question:
Which is better for Fine-grained JIT SDP?
Commits in 1 months
Experiment Design
SEAA 2022 in Gran Canaria 5
Commits in 3 months
Futur
e
Pa
st
Sliding Window for
training data
Time-sensitive approach
Sliding Window for
test data
Datasets
• A replication package by Trautsch et al.[25]
• 38 Java OSS projects of Apache
• 35 out of 38 were used
• Features from a fine-grained software defect prediction study[15]
• e.g., Lines added, committer’s experience
• Linkages between commits and bug reports are recovered by 2 ways
• Ad-hoc
• Keyword search such as ”fix” in a commit message
• ITS
• Links from commits to BRs and manually validated
SEAA 2022 in Gran Canaria 6
Used for comparisons
JIT SDPs applied to Fine-grained data
Unsupervised sorts commits
according to risk scores
• LT, AGE
• CCUM
Supervised predicts a risk score
with features then sorts them
• EALR
• Train linear model to predict D
• CBS+
• Train logistic model to classify
commits
• Group defective/non-defective
• Sort commits in a group
SEAA 2022 in Gran Canaria 7
Research Questions
RQ1: What are the differences between using manually
validated links and using ad-hoc links for evaluation
in terms of effort-aware performance measures?
RQ2: Which supervised or unsupervised methods are better than
the others for fine-grained JIT defect prediction?
RQ3: Does using non-linear learning models improve
the performance of supervised methods?
SEAA 2022 in Gran Canaria 8
Comparison Results for RQ2
SEAA 2022 in Gran Canaria 9
No clear differences
Best but negligible and
low-performance
Best but negligible and
low-performance
Ranking measure
(no resource limit)
Best
Answer to RQ2: CBS+ was the best but negligible and low-performance
Non-linear JIT SDPs for evaluation
• Assumption
• Using non-linear models for EALR and CBS+ may capture complex
relations
• NL JIT SDPs
• EARF
• Train Random Forests to predict D
• CBS+RF
• Train Random Forests to classify commits
• Group defective/non-defective
• Sort commits in a group
SEAA 2022 in Gran Canaria 10
Comparison Results for RQ3
SEAA 2022 in Gran Canaria 11
Degradation
Stay low-performance
Stay low-performance
Effective only on EALR
Answer to RQ3: Non-linear model was a little help
Effective only on EALR
Conclusion
• Examined JIT SDP methods on Fine-grained JIT SDP with
effort-aware measures
• CBS+ (supervised) was the best one
• All methods were low-performance
• Non-linear model was a little help
• Future Work
• Additional features for prediction
• Investigation of other methods
SEAA 2022 in Gran Canaria 12

More Related Content

PPTX
An Exploration of Challenges Limiting Pragmatic Software Defect Prediction
PPTX
Survey on Software Defect Prediction
PDF
Survey on Software Defect Prediction (PhD Qualifying Examination Presentation)
PDF
Survey on Software Defect Prediction
PDF
A novel approach to enhancing software quality assurance through early detect...
PDF
Software Defect Prediction Using Local and Global Analysis
PDF
F017652530
PDF
A Review on Software Fault Detection and Prevention Mechanism in Software Dev...
An Exploration of Challenges Limiting Pragmatic Software Defect Prediction
Survey on Software Defect Prediction
Survey on Software Defect Prediction (PhD Qualifying Examination Presentation)
Survey on Software Defect Prediction
A novel approach to enhancing software quality assurance through early detect...
Software Defect Prediction Using Local and Global Analysis
F017652530
A Review on Software Fault Detection and Prevention Mechanism in Software Dev...

Similar to An Evaluation of Effort-Aware Fine-Grained Just-in-Time Defect Prediction Methods (20)

PDF
A tale of bug prediction in software development
PDF
A simplified predictive framework for cost evaluation to fault assessment usi...
PDF
O0181397100
PDF
Insights of effectivity analysis of learning-based approaches towards softwar...
PDF
IRJET - A Novel Approach for Software Defect Prediction based on Dimensio...
PPT
A GENERAL SOFTWARE DEFECT-PRONENESS PREDICTION FRAMEWORK.ppt
PDF
Predicting Defective Lines Using a Model-Agnostic Technique
PDF
Defect Prediction: Accomplishments and Future Challenges
PDF
A survey of fault prediction using machine learning algorithms
PDF
TOWARDS PREDICTING SOFTWARE DEFECTS WITH CLUSTERING TECHNIQUES
PDF
DEFECT PREDICTION USING ORDER STATISTICS
PPTX
software Engineering ppt containing the research work relevant in the field o...
PDF
Development of software defect prediction system using artificial neural network
PDF
Software Defect Prediction Using Radial Basis and Probabilistic Neural Networks
PDF
50120140501001
PPTX
PDF
A Tale of Experiments on Bug Prediction
PPT
A Regression Analysis Approach for Building a Prediction Model for System Tes...
PDF
Personalized Defect Prediction
DOCX
A Novel Approach to Improve Software Defect Prediction Accuracy Using Machine...
A tale of bug prediction in software development
A simplified predictive framework for cost evaluation to fault assessment usi...
O0181397100
Insights of effectivity analysis of learning-based approaches towards softwar...
IRJET - A Novel Approach for Software Defect Prediction based on Dimensio...
A GENERAL SOFTWARE DEFECT-PRONENESS PREDICTION FRAMEWORK.ppt
Predicting Defective Lines Using a Model-Agnostic Technique
Defect Prediction: Accomplishments and Future Challenges
A survey of fault prediction using machine learning algorithms
TOWARDS PREDICTING SOFTWARE DEFECTS WITH CLUSTERING TECHNIQUES
DEFECT PREDICTION USING ORDER STATISTICS
software Engineering ppt containing the research work relevant in the field o...
Development of software defect prediction system using artificial neural network
Software Defect Prediction Using Radial Basis and Probabilistic Neural Networks
50120140501001
A Tale of Experiments on Bug Prediction
A Regression Analysis Approach for Building a Prediction Model for System Tes...
Personalized Defect Prediction
A Novel Approach to Improve Software Defect Prediction Accuracy Using Machine...
Ad

More from SEAA 2022 (18)

PDF
Risk and Engineering Knowledge Integration in Cyber-physical Production Syste...
PDF
Bad Smells in Industrial Automation: Sniffing out Feature Envy
PDF
Software Architecture Challenges in Process Automation - From Code Generation...
PDF
From Traditional to Digital: How software, data and AI are transforming the e...
PDF
Exploiting dynamic analysis for architectural smell detection: a preliminary ...
PDF
On the Role of Personality Traits in Implementation Tasks: A Preliminary Inve...
PDF
An Empirical Analysis of Microservices Systems Using Consumer-Driven Contract...
PDF
Have Java Production Methods Co-Evolved With Test Methods Properly?: A Fine-G...
PDF
A Preliminary Conceptualization and Analysis on Automated Static Analysis Too...
PDF
The Impact of Forced Working-From-Home on Code Technical Debt: An Industrial ...
PDF
Service Classification through Machine Learning: Aiding in the Efficient Ide...
PDF
Maintainability Challenges inML:ASLR
PDF
Model-Driven Optimization: Generating Smart Mutation Operators for Multi-Obj...
PDF
An Industrial Experience Report about Challenges from Continuous Monitoring, ...
PDF
API Deprecation: A Systematic Mapping Study
PDF
MDEML_UMLsec4Edge Extending UMLsec to model data-protection-compliant edge co...
PDF
EMMM: A Unified Meta-Model for Tracking Machine Learning Experiments
PDF
Easing the Reuse of ML Solutions by Interactive Clustering-based Autotuning i...
Risk and Engineering Knowledge Integration in Cyber-physical Production Syste...
Bad Smells in Industrial Automation: Sniffing out Feature Envy
Software Architecture Challenges in Process Automation - From Code Generation...
From Traditional to Digital: How software, data and AI are transforming the e...
Exploiting dynamic analysis for architectural smell detection: a preliminary ...
On the Role of Personality Traits in Implementation Tasks: A Preliminary Inve...
An Empirical Analysis of Microservices Systems Using Consumer-Driven Contract...
Have Java Production Methods Co-Evolved With Test Methods Properly?: A Fine-G...
A Preliminary Conceptualization and Analysis on Automated Static Analysis Too...
The Impact of Forced Working-From-Home on Code Technical Debt: An Industrial ...
Service Classification through Machine Learning: Aiding in the Efficient Ide...
Maintainability Challenges inML:ASLR
Model-Driven Optimization: Generating Smart Mutation Operators for Multi-Obj...
An Industrial Experience Report about Challenges from Continuous Monitoring, ...
API Deprecation: A Systematic Mapping Study
MDEML_UMLsec4Edge Extending UMLsec to model data-protection-compliant edge co...
EMMM: A Unified Meta-Model for Tracking Machine Learning Experiments
Easing the Reuse of ML Solutions by Interactive Clustering-based Autotuning i...
Ad

Recently uploaded (20)

PPT
protein biochemistry.ppt for university classes
PPTX
2. Earth - The Living Planet earth and life
PDF
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PPTX
microscope-Lecturecjchchchchcuvuvhc.pptx
PPTX
7. General Toxicologyfor clinical phrmacy.pptx
PDF
HPLC-PPT.docx high performance liquid chromatography
PDF
Phytochemical Investigation of Miliusa longipes.pdf
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
PDF
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
PDF
An interstellar mission to test astrophysical black holes
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PDF
AlphaEarth Foundations and the Satellite Embedding dataset
PPTX
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
PDF
MIRIDeepImagingSurvey(MIDIS)oftheHubbleUltraDeepField
PPTX
2. Earth - The Living Planet Module 2ELS
PPTX
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
PPTX
famous lake in india and its disturibution and importance
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
PPTX
Microbiology with diagram medical studies .pptx
protein biochemistry.ppt for university classes
2. Earth - The Living Planet earth and life
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
Biophysics 2.pdffffffffffffffffffffffffff
microscope-Lecturecjchchchchcuvuvhc.pptx
7. General Toxicologyfor clinical phrmacy.pptx
HPLC-PPT.docx high performance liquid chromatography
Phytochemical Investigation of Miliusa longipes.pdf
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
An interstellar mission to test astrophysical black holes
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
AlphaEarth Foundations and the Satellite Embedding dataset
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
MIRIDeepImagingSurvey(MIDIS)oftheHubbleUltraDeepField
2. Earth - The Living Planet Module 2ELS
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
famous lake in india and its disturibution and importance
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
Microbiology with diagram medical studies .pptx

An Evaluation of Effort-Aware Fine-Grained Just-in-Time Defect Prediction Methods

  • 1. An Evaluation of Effort- Aware Fine-Grained Just-in- Time Defect Prediction Methods Sousuke Amasaki1, Hirohisa Aman2, Tomoyuki Yokogawa1 1Okayama Prefectural University 2Ehime University
  • 2. Software defect prediction target gets finer for saving resource • Traditional Software Defect Prediction • Focus on modules (classes, files, methods) • Just-in-Time Software Defect Prediction • Focus on commits (a set of file changes) • Fine-grained JIT Software Defect Prediction • Focus on file changes in a commit SEAA 2022 in Gran Canaria 2 Buggy! commit Buggy! commit Buggy!
  • 3. Projects still have little resource for SQA • Traditional SDP is a classification problem for all targets • Effort-aware SDP is a ranking problem under limited budget SEAA 2022 in Gran Canaria 3 Buggy! Buggy! 100% Buggy! Buggy! 90%! 80%! 70%! 60%! 50%! Check all ones predicted as buggy Actually check ones fit in a budget Buggy! Buggy! Buggy!
  • 4. Controversy over effort-aware JIT SDP SEAA 2022 in Gran Canaria 4 • Supervised vs. Unsupervised • Supervised needs training data to make a prediction • e.g., EALR[10] , CBS+[23] • Unsupervised needs no data and utilizes heuristics • e.g., LT[20], CCUM[21] Question: Which is better for Fine-grained JIT SDP?
  • 5. Commits in 1 months Experiment Design SEAA 2022 in Gran Canaria 5 Commits in 3 months Futur e Pa st Sliding Window for training data Time-sensitive approach Sliding Window for test data
  • 6. Datasets • A replication package by Trautsch et al.[25] • 38 Java OSS projects of Apache • 35 out of 38 were used • Features from a fine-grained software defect prediction study[15] • e.g., Lines added, committer’s experience • Linkages between commits and bug reports are recovered by 2 ways • Ad-hoc • Keyword search such as ”fix” in a commit message • ITS • Links from commits to BRs and manually validated SEAA 2022 in Gran Canaria 6 Used for comparisons
  • 7. JIT SDPs applied to Fine-grained data Unsupervised sorts commits according to risk scores • LT, AGE • CCUM Supervised predicts a risk score with features then sorts them • EALR • Train linear model to predict D • CBS+ • Train logistic model to classify commits • Group defective/non-defective • Sort commits in a group SEAA 2022 in Gran Canaria 7
  • 8. Research Questions RQ1: What are the differences between using manually validated links and using ad-hoc links for evaluation in terms of effort-aware performance measures? RQ2: Which supervised or unsupervised methods are better than the others for fine-grained JIT defect prediction? RQ3: Does using non-linear learning models improve the performance of supervised methods? SEAA 2022 in Gran Canaria 8
  • 9. Comparison Results for RQ2 SEAA 2022 in Gran Canaria 9 No clear differences Best but negligible and low-performance Best but negligible and low-performance Ranking measure (no resource limit) Best Answer to RQ2: CBS+ was the best but negligible and low-performance
  • 10. Non-linear JIT SDPs for evaluation • Assumption • Using non-linear models for EALR and CBS+ may capture complex relations • NL JIT SDPs • EARF • Train Random Forests to predict D • CBS+RF • Train Random Forests to classify commits • Group defective/non-defective • Sort commits in a group SEAA 2022 in Gran Canaria 10
  • 11. Comparison Results for RQ3 SEAA 2022 in Gran Canaria 11 Degradation Stay low-performance Stay low-performance Effective only on EALR Answer to RQ3: Non-linear model was a little help Effective only on EALR
  • 12. Conclusion • Examined JIT SDP methods on Fine-grained JIT SDP with effort-aware measures • CBS+ (supervised) was the best one • All methods were low-performance • Non-linear model was a little help • Future Work • Additional features for prediction • Investigation of other methods SEAA 2022 in Gran Canaria 12

Editor's Notes

  • #3: Software defect prediction, or SDP, is an active research topic for decades. Traditional SDP utilizes static code metrics such as cyclomatic number to characterize software modules. Therefore, the unit of prediction is file, class, or methods. Nowadays, software projects are formed around a repository that records software development activities. Just-in-Time software defect prediction, or JIT SDP, utilizes those records to characterize code changes. JIT SDP builds a defect prediction model utilizing all code changes of a project. The diversity of developers may make the code changes heterogeneous. More homogeneous sets of code changes may contribute to better defect prediction. Version control systems enabled us to attribute code changes to individual developers. Personalized defect prediction can be implemented now.
  • #9: We thus set these research questions. RQ1. Then, we focused on two types developers, namely, active developers and the rest of the developers. The second research question is. RQ3.