SlideShare a Scribd company logo
Proceedings of the 16th European Conference on Software Maintenance and
 Reengineering




Filtering Bug Reports for Fix-Time
             Analysis
                        Ahmed Lamkanfi, Serge Demeyer



 Antwerp Systems and Software Modelling
 Ansymo                                   1 /13
Bug Report Fix-Time Prediction
Predicting Eclipse Bug Lifetimes   A Comparative Exploration of FreeBSD Bug Lifetimes   Predicting the Fix Time of Bugs
           Panjer et al.                             Bougie et al.                                 Giger et al.




                                                        2 /13
Bug Report Fix-Time Prediction
Predicting Eclipse Bug Lifetimes   A Comparative Exploration of FreeBSD Bug Lifetimes   Predicting the Fix Time of Bugs
           Panjer et al.                             Bougie et al.                                 Giger et al.




                                                        2 /13
Bug Report Fix-Time Prediction
Predicting Eclipse Bug Lifetimes   A Comparative Exploration of FreeBSD Bug Lifetimes   Predicting the Fix Time of Bugs
           Panjer et al.                             Bougie et al.                                 Giger et al.




                                                        2 /13
History of all reported bugs

Bug Database   Uncover facts about history

               Make predictions about future




                  3 /13
History of all reported bugs

Bug Database        Uncover facts about history

                    Make predictions about future



  Fix-time of a bug?
   ✓ Time between opening and resolving a bug.


                        3 /13
Cases under Study: Eclipse and Mozilla

           Project      Nr. of Bugs            Period
           Platform             76.456   Oct. 2001 - Oct. 2007

             PDE                11.117   Oct. 2001 - Oct. 2007

             JDT                41.691   Oct. 2001 - Oct. 2007

             CDT                11.468   Oct. 2001 - Oct. 2007

             GEF                1.587    Oct. 2001 - Oct. 2007

             Core           143.542      Mar. 1997 - Jul. 2008

           Bugzilla             19.135   Mar. 2003 - Jul. 2008

            Firefox             79.272   Jul. 1999 - Jul. 2008

          Thunderbird           23.408   Jan. 2000 - Jul. 2008

          SeaMonkey             85.143   Nov. 1995 - Jul. 2008



                        4 /13
3000
                               1000
                               300
                               100
                                 30
Fix−Time (logarithmic scale)




                                 10
                                  3
                                  1
                                0.3
                                0.1
                               0.03
                               0.01
                         0.003
                         0.001
         0.0003
         0.0001


                                      Platform   PDE    JDT       CDT   GEF

                                                       Projects
                                                        5 /13
3000
                               1000
                               300
                               100
                                 30
Fix−Time (logarithmic scale)




                                 10
                                  3
                                  1
                                0.3
                                0.1
                               0.03
                               0.01
                         0.003
                         0.001
         0.0003
         0.0001



                                      Core   Bugzilla   Firefox    Thunderbird   Seamonkey

                                                        Projects
                                                        6 /13
Summary of the Box-Plots

       Project              Smallest Fix-time
       Platform                10 seconds

         PDE                   12 seconds

         JDT                   10 seconds

         CDT                    9 seconds

         GEF                    8 seconds

         Core                   11 seconds

       Bugzilla                 3 seconds

        Firefox                13 seconds

      Thunderbird              18 seconds

      SeaMonkey                14 seconds



                    7 /13
Ask a
developer!




      8 /13
Ask a
            developer!
➡“the developer has already the
necessary code changes ready to fix a
bug, then files a bug to make sure it's
getting tracked in the system”




                    8 /13
Filtering out unreliable reports?
✓ How does this impact the accuracy?




                     9 /13
Filtering out unreliable reports?
✓ How does this impact the accuracy?



Small experiment
✓ Based on experiment from “Predicting the Fix
   Time of Bugs” from Giger et al. (2010)




                       9 /13
Train from the history of bug reports
    ✓ Fields are extracted from the reports
          day opened, month opened, platform, reporter,
        ➡
          severity,...

    ✓ Naïve Bayes classifiers learns the
       characteristics from the reports

    ✓ 10-fold cross validation




                        10/13
Train from the history of bug reports
    ✓ Fields are extracted from the reports
          day opened, month opened, platform, reporter,
        ➡
          severity,...

    ✓ Naïve Bayes classifiers learns the
       characteristics from the reports

    ✓ 10-fold cross validation

  Bugs are grouped in two sets
          ✓ Fast: fixtime ≤ median
          ✓ Slow: fixtime > median
                        10/13
Evaluation:
✓ Receiver Operating Characteristic(ROC) curve

✓ Area Under Curve(AUC): 0.5 is random
  prediction; 1.0 perfect classification




                     11/13
Evaluation:
 ✓ Receiver Operating Characteristic(ROC) curve

 ✓ Area Under Curve(AUC): 0.5 is random
   prediction; 1.0 perfect classification


Two-fold experiment
✓ With and without the filtering of bug reports
✓ Threshold for filtering set to 1/2 of the first
   quartile


                        11/13
Accuracy Results

 Project      AUC Before      AUC After
 Platform             0.692     0.700

   PDE                0.641     0.661

   JDT                0.646     0.649

   CDT                0.693     0.708

   GEF                0.663     0.732

   Core               0.663     0.686

 Bugzilla             0.722     0.733

  Firefox             0.623     0.653

Thunderbird           0.657     0.645

SeaMonkey             0.698     0.706



              12/13
Conclusions
✓ More investigation needed when dealing
  with real-world data

✓ Some bugs are fixed conspicuously fast!
✓ More preprocessing/filtering may lead to
  improved results


                     13/13

More Related Content

PDF
Comparing Text Mining Algorithms for Predicting the Severity of a Reported Bug
PDF
Mining Software Archives 2010 Presentation
PDF
MIning Software Repositories (MSR) 2010 presentation
PDF
The Eclipse and Mozilla Defect Tracking Dataset: a Genuine Dataset for Mining...
PDF
Predicting Reassignments of Bug Reports — an Exploratory Investigation
PPTX
Tuning Java Servers
PPTX
Evolving Prometheus for the Cloud Native World (FOSDEM 2018)
PDF
Microservices with Micronaut
Comparing Text Mining Algorithms for Predicting the Severity of a Reported Bug
Mining Software Archives 2010 Presentation
MIning Software Repositories (MSR) 2010 presentation
The Eclipse and Mozilla Defect Tracking Dataset: a Genuine Dataset for Mining...
Predicting Reassignments of Bug Reports — an Exploratory Investigation
Tuning Java Servers
Evolving Prometheus for the Cloud Native World (FOSDEM 2018)
Microservices with Micronaut

Similar to Filtering Bug Reports for Fix-Time Analysis (20)

PDF
Enabling Java in Latency Sensitive Applications by Gil Tene, CTO, Azul Systems
PDF
Microservices with Micronaut
PPTX
Evolution of Monitoring and Prometheus (Dublin 2018)
PDF
Continuous testing - GUERLAIS ARGOT - Air France KLM Sogeti- Soirée du Test L...
DOC
Software Bugs A Software Architect Point Of View
PDF
[xp2013] Narrow Down What to Test
PDF
Fighting Fragmentation with Fragments
PDF
How to Design a Program Repair Bot? Insights from the Repairnator Project
PPTX
How's relevant JMeter to me - DevConf (Letterkenny)
PPTX
An Introduction to Prometheus (GrafanaCon 2016)
PDF
DotDotPwn v3.0 [GuadalajaraCON 2012]
PDF
Guadalajara con 2012
PDF
An Extensible Architecture for Avionics Sensor Health Assessment Using DDS
PDF
05 019 web-3d_service
PDF
Software Testing: Test Design and the Project Life Cycle
PPTX
Solve the colocation conundrum: Performance and density at scale with Kubernetes
PDF
PDF
A tale of bug prediction in software development
DOC
Manual testing interview questions
PDF
AngularJS - Overcoming performance issues. Limits.
Enabling Java in Latency Sensitive Applications by Gil Tene, CTO, Azul Systems
Microservices with Micronaut
Evolution of Monitoring and Prometheus (Dublin 2018)
Continuous testing - GUERLAIS ARGOT - Air France KLM Sogeti- Soirée du Test L...
Software Bugs A Software Architect Point Of View
[xp2013] Narrow Down What to Test
Fighting Fragmentation with Fragments
How to Design a Program Repair Bot? Insights from the Repairnator Project
How's relevant JMeter to me - DevConf (Letterkenny)
An Introduction to Prometheus (GrafanaCon 2016)
DotDotPwn v3.0 [GuadalajaraCON 2012]
Guadalajara con 2012
An Extensible Architecture for Avionics Sensor Health Assessment Using DDS
05 019 web-3d_service
Software Testing: Test Design and the Project Life Cycle
Solve the colocation conundrum: Performance and density at scale with Kubernetes
A tale of bug prediction in software development
Manual testing interview questions
AngularJS - Overcoming performance issues. Limits.
Ad

Recently uploaded (20)

PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PPTX
Machine Learning_overview_presentation.pptx
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
A comparative analysis of optical character recognition models for extracting...
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
A Presentation on Artificial Intelligence
PDF
cuic standard and advanced reporting.pdf
PPTX
Spectroscopy.pptx food analysis technology
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPT
Teaching material agriculture food technology
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Approach and Philosophy of On baking technology
PPTX
Cloud computing and distributed systems.
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Machine Learning_overview_presentation.pptx
Programs and apps: productivity, graphics, security and other tools
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
A comparative analysis of optical character recognition models for extracting...
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
Reach Out and Touch Someone: Haptics and Empathic Computing
Per capita expenditure prediction using model stacking based on satellite ima...
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
A Presentation on Artificial Intelligence
cuic standard and advanced reporting.pdf
Spectroscopy.pptx food analysis technology
The Rise and Fall of 3GPP – Time for a Sabbatical?
Teaching material agriculture food technology
Assigned Numbers - 2025 - Bluetooth® Document
Agricultural_Statistics_at_a_Glance_2022_0.pdf
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Approach and Philosophy of On baking technology
Cloud computing and distributed systems.
Ad

Filtering Bug Reports for Fix-Time Analysis

  • 1. Proceedings of the 16th European Conference on Software Maintenance and Reengineering Filtering Bug Reports for Fix-Time Analysis Ahmed Lamkanfi, Serge Demeyer Antwerp Systems and Software Modelling Ansymo 1 /13
  • 2. Bug Report Fix-Time Prediction Predicting Eclipse Bug Lifetimes A Comparative Exploration of FreeBSD Bug Lifetimes Predicting the Fix Time of Bugs Panjer et al. Bougie et al. Giger et al. 2 /13
  • 3. Bug Report Fix-Time Prediction Predicting Eclipse Bug Lifetimes A Comparative Exploration of FreeBSD Bug Lifetimes Predicting the Fix Time of Bugs Panjer et al. Bougie et al. Giger et al. 2 /13
  • 4. Bug Report Fix-Time Prediction Predicting Eclipse Bug Lifetimes A Comparative Exploration of FreeBSD Bug Lifetimes Predicting the Fix Time of Bugs Panjer et al. Bougie et al. Giger et al. 2 /13
  • 5. History of all reported bugs Bug Database Uncover facts about history Make predictions about future 3 /13
  • 6. History of all reported bugs Bug Database Uncover facts about history Make predictions about future Fix-time of a bug? ✓ Time between opening and resolving a bug. 3 /13
  • 7. Cases under Study: Eclipse and Mozilla Project Nr. of Bugs Period Platform 76.456 Oct. 2001 - Oct. 2007 PDE 11.117 Oct. 2001 - Oct. 2007 JDT 41.691 Oct. 2001 - Oct. 2007 CDT 11.468 Oct. 2001 - Oct. 2007 GEF 1.587 Oct. 2001 - Oct. 2007 Core 143.542 Mar. 1997 - Jul. 2008 Bugzilla 19.135 Mar. 2003 - Jul. 2008 Firefox 79.272 Jul. 1999 - Jul. 2008 Thunderbird 23.408 Jan. 2000 - Jul. 2008 SeaMonkey 85.143 Nov. 1995 - Jul. 2008 4 /13
  • 8. 3000 1000 300 100 30 Fix−Time (logarithmic scale) 10 3 1 0.3 0.1 0.03 0.01 0.003 0.001 0.0003 0.0001 Platform PDE JDT CDT GEF Projects 5 /13
  • 9. 3000 1000 300 100 30 Fix−Time (logarithmic scale) 10 3 1 0.3 0.1 0.03 0.01 0.003 0.001 0.0003 0.0001 Core Bugzilla Firefox Thunderbird Seamonkey Projects 6 /13
  • 10. Summary of the Box-Plots Project Smallest Fix-time Platform 10 seconds PDE 12 seconds JDT 10 seconds CDT 9 seconds GEF 8 seconds Core 11 seconds Bugzilla 3 seconds Firefox 13 seconds Thunderbird 18 seconds SeaMonkey 14 seconds 7 /13
  • 12. Ask a developer! ➡“the developer has already the necessary code changes ready to fix a bug, then files a bug to make sure it's getting tracked in the system” 8 /13
  • 13. Filtering out unreliable reports? ✓ How does this impact the accuracy? 9 /13
  • 14. Filtering out unreliable reports? ✓ How does this impact the accuracy? Small experiment ✓ Based on experiment from “Predicting the Fix Time of Bugs” from Giger et al. (2010) 9 /13
  • 15. Train from the history of bug reports ✓ Fields are extracted from the reports day opened, month opened, platform, reporter, ➡ severity,... ✓ Naïve Bayes classifiers learns the characteristics from the reports ✓ 10-fold cross validation 10/13
  • 16. Train from the history of bug reports ✓ Fields are extracted from the reports day opened, month opened, platform, reporter, ➡ severity,... ✓ Naïve Bayes classifiers learns the characteristics from the reports ✓ 10-fold cross validation Bugs are grouped in two sets ✓ Fast: fixtime ≤ median ✓ Slow: fixtime > median 10/13
  • 17. Evaluation: ✓ Receiver Operating Characteristic(ROC) curve ✓ Area Under Curve(AUC): 0.5 is random prediction; 1.0 perfect classification 11/13
  • 18. Evaluation: ✓ Receiver Operating Characteristic(ROC) curve ✓ Area Under Curve(AUC): 0.5 is random prediction; 1.0 perfect classification Two-fold experiment ✓ With and without the filtering of bug reports ✓ Threshold for filtering set to 1/2 of the first quartile 11/13
  • 19. Accuracy Results Project AUC Before AUC After Platform 0.692 0.700 PDE 0.641 0.661 JDT 0.646 0.649 CDT 0.693 0.708 GEF 0.663 0.732 Core 0.663 0.686 Bugzilla 0.722 0.733 Firefox 0.623 0.653 Thunderbird 0.657 0.645 SeaMonkey 0.698 0.706 12/13
  • 20. Conclusions ✓ More investigation needed when dealing with real-world data ✓ Some bugs are fixed conspicuously fast! ✓ More preprocessing/filtering may lead to improved results 13/13

Editor's Notes