Can we predict the quality of
spectrum-based fault
localization?
Mojdeh Golagha
fortiss
Lionel C. Briand
University of Luxembourg, University of Ottawa
Alexander Pretschner
Technical University of Munich
Problem
Spectrum-based Fault localization
25.10.2020
Why is the effectiveness of spectrum based fault localization techniques so unpredictable?
2
Source code Test Suite
Block Test 1 Test 2 Test 3 Tarantula Ochiai DStar
1 ∞ ∞
2 ∞ ∞ ∞
3 ∞
„ Not widely applied in practice yet.
„ Effectiveness varies greatly from case to case.
„ New algorithms and ideas as well as adjustments to the
test suites to improve effectiveness.
„ Why is the effectiveness of these techniques so
unpredictable?
„ What are the factors that influence the effectiveness of
fault localization?
„ Can we accurately predict fault localization effectiveness?
Solution
25.10.2020
Based on precise hypotheses, we define metrics and assess their influence on fault localization
effectiveness.
3
I. Define metrics.
II. Generate a data set.
III. Apply classification analysis to assess the influence of metrics.
IV. Build the most accurate prediction model for effectiveness.
Define Metrics
Metrics
25.10.2020
Metrics capture static aspects of the source code, dynamic aspects of test executions, properties of the
test suite, and fault properties.
5
Generate Data Set
Data Set
25.10.2020
Our final data set has 341 instances and 70 variables. The “effective” label has been assigned to
193 of these instances.
7
Each observation is a faulty version extracted from
Defects4J
Method-level SBFL.
• Rank of the faulty
method
• 1 to 10 -> “effective”
• Otherwise “ineffective”
Variables Class Labels
Data Analysis
Analysis of Significance
25.10.2020
To assess the significance and impact of each metric individually.
9
Pre-analysis: Pearson’s Correlation to remove correlated metrics
I. Odds ratio analysis in univariate logistic regression
II. Average information gain
Multivariate Analysis
25.10.2020
To generate prediction models. One model for each group and one model for all groups combined.
10
I. Tree
II. Logistic Regression
III. SVM
IV. Random Forest
V. Adaboost
Multivariate Analysis
25.10.2020
Considering the StatDynaTest metrics, random forest and logistic regression yield the best, almost
identical, results.
11
Logistic Regression Model Random Forest Model
Multivariate Analysis
25.10.2020
There are eight metrics in common among them:
Static - % Methods with Nesting Depth>5, Mean # of Fields per Type,
Dynamic - Mean Node Degree, Max Node Out Degree, Graph Diameter, Response for Class,
Test - % Method Coverage, % Methods Covered in Failing Tests.
12
Results
25.10.202013
• A combination of only a few static, dynamic, and test metrics enables the construction of a prediction model with
excellent discrimination power between levels of effectiveness:
- eight metrics yielding an AUC of .86 (Logistic Regression on Selected-SDT metrics)
- fifteen metrics yielding an AUC of .88 (Random Forest on StatDynaTest metrics)
• A confidence factor that can be used to assess the potential effectiveness of fault localization.
• The most influential metrics can also be used as a guide for corrective actions on code and test suites leading to
more effective fault localization.
• The effectiveness of fault localization depends more on the complexity of the code and test suite than on the fault
type and location.
• Selected dynamic metrics measure the degree to which the call graph is entangled. If the dynamic call graph of
the tests is highly entangled and coupled, it is difficult to localize a fault, no matter where it happens.
©2020
Mojdeh Golagha
golagha@fortiss.org

More Related Content

PDF
Testing Machine Learning-enabled Systems: A Personal Perspective
PDF
Metamorphic Security Testing for Web Systems
PDF
A practical guide for using Statistical Tests to assess Randomized Algorithms...
PDF
Model-Driven Run-Time Enforcement of Complex Role-Based Access Control Policies
PDF
AI in SE: A 25-year Journey
PDF
Automated Testing of Autonomous Driving Assistance Systems
PDF
Evaluating Model Testing and Model Checking for Finding Requirements Violatio...
PDF
Testing of Cyber-Physical Systems: Diversity-driven Strategies
Testing Machine Learning-enabled Systems: A Personal Perspective
Metamorphic Security Testing for Web Systems
A practical guide for using Statistical Tests to assess Randomized Algorithms...
Model-Driven Run-Time Enforcement of Complex Role-Based Access Control Policies
AI in SE: A 25-year Journey
Automated Testing of Autonomous Driving Assistance Systems
Evaluating Model Testing and Model Checking for Finding Requirements Violatio...
Testing of Cyber-Physical Systems: Diversity-driven Strategies

What's hot (20)

PDF
OCLR: A More Expressive, Pattern-Based Temporal Extension of OCL
PDF
Automated Testing of Autonomous Driving Assistance Systems
PDF
SSBSE 2020 keynote
PDF
Keynote SBST 2014 - Search-Based Testing
PDF
Automating System Test Case Classification and Prioritization for Use Case-Dr...
PDF
Enabling Automated Software Testing with Artificial Intelligence
PDF
Mining Assumptions for Software Components using Machine Learning
PDF
Practical Constraint Solving for Generating System Test Data
PDF
Scalable and Cost-Effective Model-Based Software Verification and Testing
PDF
Automatic Test Suite Generation for Key-Points Detection DNNs using Many-Obje...
PDF
Supporting Change in Product Lines within the Context of Use Case-driven Deve...
PDF
Analyzing Natural-Language Requirements: The Not-too-sexy and Yet Curiously D...
PDF
Scalable Software Testing and Verification of Non-Functional Properties throu...
PDF
Functional Safety in ML-based Cyber-Physical Systems
PDF
Extracting Domain Models from Natural-Language Requirements: Approach and Ind...
PDF
Automated Inference of Access Control Policies for Web Applications
PDF
Artificial Intelligence for Automated Software Testing
PDF
Improving Fault Localization for Simulink Models using Search-Based Testing a...
PDF
Synthetic Data Generation for Statistical Testing
PDF
Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...
OCLR: A More Expressive, Pattern-Based Temporal Extension of OCL
Automated Testing of Autonomous Driving Assistance Systems
SSBSE 2020 keynote
Keynote SBST 2014 - Search-Based Testing
Automating System Test Case Classification and Prioritization for Use Case-Dr...
Enabling Automated Software Testing with Artificial Intelligence
Mining Assumptions for Software Components using Machine Learning
Practical Constraint Solving for Generating System Test Data
Scalable and Cost-Effective Model-Based Software Verification and Testing
Automatic Test Suite Generation for Key-Points Detection DNNs using Many-Obje...
Supporting Change in Product Lines within the Context of Use Case-driven Deve...
Analyzing Natural-Language Requirements: The Not-too-sexy and Yet Curiously D...
Scalable Software Testing and Verification of Non-Functional Properties throu...
Functional Safety in ML-based Cyber-Physical Systems
Extracting Domain Models from Natural-Language Requirements: Approach and Ind...
Automated Inference of Access Control Policies for Web Applications
Artificial Intelligence for Automated Software Testing
Improving Fault Localization for Simulink Models using Search-Based Testing a...
Synthetic Data Generation for Statistical Testing
Comparing Offline and Online Testing of Deep Neural Networks: An Autonomous C...
Ad

Similar to Can we predict the quality of spectrum-based fault localization? (20)

PPTX
software Engineering ppt containing the research work relevant in the field o...
PDF
Mining System Logs to Learn Error Predictors, Universität Stuttgart, Stuttgar...
PDF
Towards a Better Understanding of the Impact of Experimental Components on De...
PDF
Kaggle Higgs Boson Machine Learning Challenge
PPTX
Learning machine learning with Yellowbrick
PDF
WIA 2019 - Steering Model Selection with Visual Diagnostics
PDF
Steering Model Selection with Visual Diagnostics: Women in Analytics 2019
PDF
Steering Model Selection with Visual Diagnostics
PDF
Applied machine learning: Insurance
PDF
A simplified predictive framework for cost evaluation to fault assessment usi...
PDF
Survey on Software Defect Prediction (PhD Qualifying Examination Presentation)
PPTX
#Interactive Session by Vivek Patle and Jahnavi Umarji, "Empowering Functiona...
PDF
Model Checking-Enhanced Spectrum-Based Fault Localization
PDF
Machine Learning Performance Evaluation: Tips and Pitfalls - Jose Hernandez O...
PPT
Cukic Promise08 V3
PDF
A Test-suite Diagnosability Metric for Spectrum-based Fault Localization Appr...
PDF
Survey on Software Defect Prediction
PDF
Deep_Learning__INAF_baroncelli.pdf
PDF
Cheatsheet machine-learning-tips-and-tricks
PPT
Rob Baarda - Are Real Test Metrics Predictive for the Future?
software Engineering ppt containing the research work relevant in the field o...
Mining System Logs to Learn Error Predictors, Universität Stuttgart, Stuttgar...
Towards a Better Understanding of the Impact of Experimental Components on De...
Kaggle Higgs Boson Machine Learning Challenge
Learning machine learning with Yellowbrick
WIA 2019 - Steering Model Selection with Visual Diagnostics
Steering Model Selection with Visual Diagnostics: Women in Analytics 2019
Steering Model Selection with Visual Diagnostics
Applied machine learning: Insurance
A simplified predictive framework for cost evaluation to fault assessment usi...
Survey on Software Defect Prediction (PhD Qualifying Examination Presentation)
#Interactive Session by Vivek Patle and Jahnavi Umarji, "Empowering Functiona...
Model Checking-Enhanced Spectrum-Based Fault Localization
Machine Learning Performance Evaluation: Tips and Pitfalls - Jose Hernandez O...
Cukic Promise08 V3
A Test-suite Diagnosability Metric for Spectrum-based Fault Localization Appr...
Survey on Software Defect Prediction
Deep_Learning__INAF_baroncelli.pdf
Cheatsheet machine-learning-tips-and-tricks
Rob Baarda - Are Real Test Metrics Predictive for the Future?
Ad

More from Lionel Briand (20)

PDF
LTM: Scalable and Black-box Similarity-based Test Suite Minimization based on...
PDF
TEASMA: A Practical Methodology for Test Adequacy Assessment of Deep Neural N...
PDF
Automated Test Case Repair Using Language Models
PDF
Automated Testing and Safety Analysis of Deep Neural Networks
PDF
FlakyFix: Using Large Language Models for Predicting Flaky Test Fix Categorie...
PDF
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
PDF
Precise and Complete Requirements? An Elusive Goal
PDF
Large Language Models for Test Case Evolution and Repair
PDF
Metamorphic Testing for Web System Security
PDF
Simulator-based Explanation and Debugging of Hazard-triggering Events in DNN-...
PDF
Fuzzing for CPS Mutation Testing
PDF
Data-driven Mutation Analysis for Cyber-Physical Systems
PDF
Many-Objective Reinforcement Learning for Online Testing of DNN-Enabled Systems
PDF
ATM: Black-box Test Case Minimization based on Test Code Similarity and Evolu...
PDF
Black-box Safety Analysis and Retraining of DNNs based on Feature Extraction ...
PDF
PRINS: Scalable Model Inference for Component-based System Logs
PDF
Revisiting the Notion of Diversity in Software Testing
PDF
Applications of Search-based Software Testing to Trustworthy Artificial Intel...
PDF
Autonomous Systems: How to Address the Dilemma between Autonomy and Safety
PDF
Mathematicians, Social Scientists, or Engineers? The Split Minds of Software ...
LTM: Scalable and Black-box Similarity-based Test Suite Minimization based on...
TEASMA: A Practical Methodology for Test Adequacy Assessment of Deep Neural N...
Automated Test Case Repair Using Language Models
Automated Testing and Safety Analysis of Deep Neural Networks
FlakyFix: Using Large Language Models for Predicting Flaky Test Fix Categorie...
Requirements in Engineering AI- Enabled Systems: Open Problems and Safe AI Sy...
Precise and Complete Requirements? An Elusive Goal
Large Language Models for Test Case Evolution and Repair
Metamorphic Testing for Web System Security
Simulator-based Explanation and Debugging of Hazard-triggering Events in DNN-...
Fuzzing for CPS Mutation Testing
Data-driven Mutation Analysis for Cyber-Physical Systems
Many-Objective Reinforcement Learning for Online Testing of DNN-Enabled Systems
ATM: Black-box Test Case Minimization based on Test Code Similarity and Evolu...
Black-box Safety Analysis and Retraining of DNNs based on Feature Extraction ...
PRINS: Scalable Model Inference for Component-based System Logs
Revisiting the Notion of Diversity in Software Testing
Applications of Search-based Software Testing to Trustworthy Artificial Intel...
Autonomous Systems: How to Address the Dilemma between Autonomy and Safety
Mathematicians, Social Scientists, or Engineers? The Split Minds of Software ...

Recently uploaded (20)

PDF
Product Update: Alluxio AI 3.7 Now with Sub-Millisecond Latency
PPTX
AMADEUS TRAVEL AGENT SOFTWARE | AMADEUS TICKETING SYSTEM
PPTX
Oracle Fusion HCM Cloud Demo for Beginners
PDF
EaseUS PDF Editor Pro 6.2.0.2 Crack with License Key 2025
PDF
Salesforce Agentforce AI Implementation.pdf
PDF
Types of Token_ From Utility to Security.pdf
PPTX
Weekly report ppt - harsh dattuprasad patel.pptx
PPTX
WiFi Honeypot Detecscfddssdffsedfseztor.pptx
DOCX
Modern SharePoint Intranet Templates That Boost Employee Engagement in 2025.docx
PDF
Autodesk AutoCAD Crack Free Download 2025
PPTX
Trending Python Topics for Data Visualization in 2025
PPTX
Cybersecurity: Protecting the Digital World
PDF
AI/ML Infra Meetup | Beyond S3's Basics: Architecting for AI-Native Data Access
PDF
Ableton Live Suite for MacOS Crack Full Download (Latest 2025)
PDF
Microsoft Office 365 Crack Download Free
PDF
DuckDuckGo Private Browser Premium APK for Android Crack Latest 2025
PDF
Website Design Services for Small Businesses.pdf
PPTX
"Secure File Sharing Solutions on AWS".pptx
DOCX
How to Use SharePoint as an ISO-Compliant Document Management System
PPTX
Advanced SystemCare Ultimate Crack + Portable (2025)
Product Update: Alluxio AI 3.7 Now with Sub-Millisecond Latency
AMADEUS TRAVEL AGENT SOFTWARE | AMADEUS TICKETING SYSTEM
Oracle Fusion HCM Cloud Demo for Beginners
EaseUS PDF Editor Pro 6.2.0.2 Crack with License Key 2025
Salesforce Agentforce AI Implementation.pdf
Types of Token_ From Utility to Security.pdf
Weekly report ppt - harsh dattuprasad patel.pptx
WiFi Honeypot Detecscfddssdffsedfseztor.pptx
Modern SharePoint Intranet Templates That Boost Employee Engagement in 2025.docx
Autodesk AutoCAD Crack Free Download 2025
Trending Python Topics for Data Visualization in 2025
Cybersecurity: Protecting the Digital World
AI/ML Infra Meetup | Beyond S3's Basics: Architecting for AI-Native Data Access
Ableton Live Suite for MacOS Crack Full Download (Latest 2025)
Microsoft Office 365 Crack Download Free
DuckDuckGo Private Browser Premium APK for Android Crack Latest 2025
Website Design Services for Small Businesses.pdf
"Secure File Sharing Solutions on AWS".pptx
How to Use SharePoint as an ISO-Compliant Document Management System
Advanced SystemCare Ultimate Crack + Portable (2025)

Can we predict the quality of spectrum-based fault localization?

  • 1. Can we predict the quality of spectrum-based fault localization? Mojdeh Golagha fortiss Lionel C. Briand University of Luxembourg, University of Ottawa Alexander Pretschner Technical University of Munich
  • 2. Problem Spectrum-based Fault localization 25.10.2020 Why is the effectiveness of spectrum based fault localization techniques so unpredictable? 2 Source code Test Suite Block Test 1 Test 2 Test 3 Tarantula Ochiai DStar 1 ∞ ∞ 2 ∞ ∞ ∞ 3 ∞ „ Not widely applied in practice yet. „ Effectiveness varies greatly from case to case. „ New algorithms and ideas as well as adjustments to the test suites to improve effectiveness. „ Why is the effectiveness of these techniques so unpredictable? „ What are the factors that influence the effectiveness of fault localization? „ Can we accurately predict fault localization effectiveness?
  • 3. Solution 25.10.2020 Based on precise hypotheses, we define metrics and assess their influence on fault localization effectiveness. 3 I. Define metrics. II. Generate a data set. III. Apply classification analysis to assess the influence of metrics. IV. Build the most accurate prediction model for effectiveness.
  • 5. Metrics 25.10.2020 Metrics capture static aspects of the source code, dynamic aspects of test executions, properties of the test suite, and fault properties. 5
  • 7. Data Set 25.10.2020 Our final data set has 341 instances and 70 variables. The “effective” label has been assigned to 193 of these instances. 7 Each observation is a faulty version extracted from Defects4J Method-level SBFL. • Rank of the faulty method • 1 to 10 -> “effective” • Otherwise “ineffective” Variables Class Labels
  • 9. Analysis of Significance 25.10.2020 To assess the significance and impact of each metric individually. 9 Pre-analysis: Pearson’s Correlation to remove correlated metrics I. Odds ratio analysis in univariate logistic regression II. Average information gain
  • 10. Multivariate Analysis 25.10.2020 To generate prediction models. One model for each group and one model for all groups combined. 10 I. Tree II. Logistic Regression III. SVM IV. Random Forest V. Adaboost
  • 11. Multivariate Analysis 25.10.2020 Considering the StatDynaTest metrics, random forest and logistic regression yield the best, almost identical, results. 11 Logistic Regression Model Random Forest Model
  • 12. Multivariate Analysis 25.10.2020 There are eight metrics in common among them: Static - % Methods with Nesting Depth>5, Mean # of Fields per Type, Dynamic - Mean Node Degree, Max Node Out Degree, Graph Diameter, Response for Class, Test - % Method Coverage, % Methods Covered in Failing Tests. 12
  • 13. Results 25.10.202013 • A combination of only a few static, dynamic, and test metrics enables the construction of a prediction model with excellent discrimination power between levels of effectiveness: - eight metrics yielding an AUC of .86 (Logistic Regression on Selected-SDT metrics) - fifteen metrics yielding an AUC of .88 (Random Forest on StatDynaTest metrics) • A confidence factor that can be used to assess the potential effectiveness of fault localization. • The most influential metrics can also be used as a guide for corrective actions on code and test suites leading to more effective fault localization. • The effectiveness of fault localization depends more on the complexity of the code and test suite than on the fault type and location. • Selected dynamic metrics measure the degree to which the call graph is entangled. If the dynamic call graph of the tests is highly entangled and coupled, it is difficult to localize a fault, no matter where it happens.