SlideShare a Scribd company logo
Are Change Metrics Good Predictors for an Evolving Software Product Line?Sandeep Krishnan, ISUChris Strasburg, ISU & Ames Laboratory Robyn R. Lutz, ISU & JPL, California Institute of TechnologyKaterina Goseva-Popstojanova, WVU1This research is supported by NSF grants 0916275 and 0916284Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
BackgroundProduct line – “A family of products designed to take advantage of their common aspects and predicted variabilities” [Weiss and Lai 1999]e.g., Nokia cellphones, HP printers, etc.Products - Commonalities – Shared by all products. e.g., PlatformVariabilities – Differentiate the productsHigh-reuse variation
JDT, PDE, Mylyn, Webtools, etc.
Reused in more than three products and for more than six years.
Low-reuse variation
CDT, Datatools, Java EE tools.
Reused in three or fewer products and for more than four years.2Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
Related WorkEclipse as a product line. [Chastek, McGregor, and Northrop, 2007], [Linden, 2009], [Krishnan et al., 2011]. Summary of previous work3Failure-prone file  -A file with one or more non-trivial post-releasebugs recorded in the Eclipse Bugzilla database.Important/Good predictor – Predictor providing high information gain for classification of failure-prone filesDept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
Product Line EvolutionProduct line evolution in two dimensions4New ReleasesP1R1P1R2P1R3P1RnP2R1P2R2P2R3P2RnNew ProductsPnR1PnR2PnR3PnRnDept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
MotivationCan we leverage the reducing amount of change in product lines to better predict failure-prone files?5Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
Eclipse case study6BlockerEclipse ClassicCriticalEclipse JavaMajorEclipse C/C++NormalEclipse JavaEEMinor
Research QuestionsAs a product evolves, do any change metrics serve as good predictors of failure-prone files?Is there a subset of change metrics which are good predictors across all product line members?Does our ability to predict failure-prone files improve as product line evolves?7Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
FindingsThe change metrics provide good classification of the failure-prone files in the Eclipse product line.As each product evolves, there is a stable set of change metrics that are prominent predictors of failure-prone files across its releases.There is a subset of change metrics that is among the prominent predictors of all the products across most of the releases.As the product line matures, prediction performance improves for each of the four Eclipse products.8Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
Data Source9Data TimelineData TimelineSource of failure reports-Source of change reports – CVS repository of Eclipse.Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
Approach10Get prediction results + best predictorsWeka      J48 decision tree learnerDept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
Replication Results11Learner performance compared to previous resultsClassification performance comparison for Eclipse Classic 2.0, 2.1, and 3.0PC- Percentage of correctly classified instancesTPR- True positive rateFPR- False positive rateDept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
12Top five predictors for earlier releases of Eclipse ClassicReplication ResultsTop predictors from this studyRevisions, Weighted_ageTop predictors from previous studyMax_changeset, Bugfixes, RevisionsDept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
13Learner performance improves as single product evolvesExtension ResultsDept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
14Top five predictors for later releases of Eclipse ClassicExtension ResultsRevisions is good predictor for later releases also.Max_changeset is a good predictor also.Dept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011
15Learner performance improves as product line evolvesExtension ResultsPercentage of correctly classified instances increases across releases for each productDept. of Computer Science,  Iowa State University,  PROMISE, September 20, 2011

More Related Content

PPT
Chemistry in the hand: The delivery of structure databases and spectroscopy g...
PPT
Chemistry apps in the wild and at the royal society of chemistry
PPT
Chemistry in your hand. Using mobile devices to access public chemistry compo...
PDF
Industrializing Software Product Line Development for Small Companies (short ...
PPTX
Evolving Industrial Software Architectures into a Software Product Line: A Ca...
PPTX
Software Product Line
PPTX
Software product line
PDF
7 - Architetture Software - Software product line
Chemistry in the hand: The delivery of structure databases and spectroscopy g...
Chemistry apps in the wild and at the royal society of chemistry
Chemistry in your hand. Using mobile devices to access public chemistry compo...
Industrializing Software Product Line Development for Small Companies (short ...
Evolving Industrial Software Architectures into a Software Product Line: A Ca...
Software Product Line
Software product line
7 - Architetture Software - Software product line

Similar to Promise 2011: "Are Change Metrics Good Predictors for an Evolving Software Product Line?" (20)

PPTX
Software Sustainability Institute
PPTX
Kuchinsky_Cytoscape_BOSC2009
PDF
Defect Prediction: Accomplishments and Future Challenges
PDF
Lecture notes on empirical software engineering 1st Edition Natalia Juristo
PDF
Intensive metrics software evolution
PDF
From Bugs to Decision Support - Selected Research Highlights
PDF
Past, Present, and Future of Analyzing Software Data
PDF
ANALYZABILITY METRIC FOR MAINTAINABILITY OF OBJECT ORIENTED SOFTWARE SYSTEM
PDF
[SAC2014]Splitting Approaches for Context-Aware Recommendation: An Empirical ...
ODP
Why Do Computational Scientists Trust Their So
PPT
Resource Description Framework Approach to Data Publication and Federation
PDF
Software Engineering Of Fault Tolerant Systems Software Engineering And Knowl...
PDF
Software Defect Trend Forecasting In Open Source Projects using A Univariate ...
PDF
Performance Analysis of Leading Application Lifecycle Management Systems for...
PDF
Agile methods cost of quality
PDF
Agile Methods Cost of Quality: Benefits of Testing Early & Often
PPT
lecture24.ppt
PDF
02 stein intro_4th-pv_modeling_workshop_2015-10-22_sand2015-8571_c
PDF
Journal Club - Best Practices for Scientific Computing
DOCX
Exercise 3 Worksheet Create a Peer Reviewed ReferenceSave this .docx
Software Sustainability Institute
Kuchinsky_Cytoscape_BOSC2009
Defect Prediction: Accomplishments and Future Challenges
Lecture notes on empirical software engineering 1st Edition Natalia Juristo
Intensive metrics software evolution
From Bugs to Decision Support - Selected Research Highlights
Past, Present, and Future of Analyzing Software Data
ANALYZABILITY METRIC FOR MAINTAINABILITY OF OBJECT ORIENTED SOFTWARE SYSTEM
[SAC2014]Splitting Approaches for Context-Aware Recommendation: An Empirical ...
Why Do Computational Scientists Trust Their So
Resource Description Framework Approach to Data Publication and Federation
Software Engineering Of Fault Tolerant Systems Software Engineering And Knowl...
Software Defect Trend Forecasting In Open Source Projects using A Univariate ...
Performance Analysis of Leading Application Lifecycle Management Systems for...
Agile methods cost of quality
Agile Methods Cost of Quality: Benefits of Testing Early & Often
lecture24.ppt
02 stein intro_4th-pv_modeling_workshop_2015-10-22_sand2015-8571_c
Journal Club - Best Practices for Scientific Computing
Exercise 3 Worksheet Create a Peer Reviewed ReferenceSave this .docx
Ad

More from CS, NcState (20)

PPTX
Talks2015 novdec
PPTX
Future se oct15
PPTX
GALE: Geometric active learning for Search-Based Software Engineering
PPTX
Big Data: the weakest link
PPTX
Three Laws of Trusted Data Sharing: (Building a Better Business Case for Dat...
PPTX
Lexisnexis june9
PPTX
Welcome to ICSE NIER’15 (new ideas and emerging results).
PPTX
Icse15 Tech-briefing Data Science
PPTX
Kits to Find the Bits that Fits
PPTX
Ai4se lab template
PPTX
Automated Software Enging, Fall 2015, NCSU
PPT
Requirements Engineering
PPT
172529main ken and_tim_software_assurance_research_at_west_virginia
PPTX
Automated Software Engineering
PDF
Next Generation “Treatment Learning” (finding the diamonds in the dust)
PPTX
Tim Menzies, directions in Data Science
PPTX
Goldrush
PPTX
Dagstuhl14 intro-v1
PPTX
Know thy tools
PPTX
The Art and Science of Analyzing Software Data
Talks2015 novdec
Future se oct15
GALE: Geometric active learning for Search-Based Software Engineering
Big Data: the weakest link
Three Laws of Trusted Data Sharing: (Building a Better Business Case for Dat...
Lexisnexis june9
Welcome to ICSE NIER’15 (new ideas and emerging results).
Icse15 Tech-briefing Data Science
Kits to Find the Bits that Fits
Ai4se lab template
Automated Software Enging, Fall 2015, NCSU
Requirements Engineering
172529main ken and_tim_software_assurance_research_at_west_virginia
Automated Software Engineering
Next Generation “Treatment Learning” (finding the diamonds in the dust)
Tim Menzies, directions in Data Science
Goldrush
Dagstuhl14 intro-v1
Know thy tools
The Art and Science of Analyzing Software Data
Ad

Recently uploaded (20)

PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
KodekX | Application Modernization Development
PPTX
Cloud computing and distributed systems.
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Empathic Computing: Creating Shared Understanding
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Unlocking AI with Model Context Protocol (MCP)
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Understanding_Digital_Forensics_Presentation.pptx
Programs and apps: productivity, graphics, security and other tools
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Diabetes mellitus diagnosis method based random forest with bat algorithm
NewMind AI Weekly Chronicles - August'25 Week I
Encapsulation_ Review paper, used for researhc scholars
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
KodekX | Application Modernization Development
Cloud computing and distributed systems.
Mobile App Security Testing_ A Comprehensive Guide.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Network Security Unit 5.pdf for BCA BBA.
Digital-Transformation-Roadmap-for-Companies.pptx
Empathic Computing: Creating Shared Understanding
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
Reach Out and Touch Someone: Haptics and Empathic Computing
Unlocking AI with Model Context Protocol (MCP)

Promise 2011: "Are Change Metrics Good Predictors for an Evolving Software Product Line?"

  • 1. Are Change Metrics Good Predictors for an Evolving Software Product Line?Sandeep Krishnan, ISUChris Strasburg, ISU & Ames Laboratory Robyn R. Lutz, ISU & JPL, California Institute of TechnologyKaterina Goseva-Popstojanova, WVU1This research is supported by NSF grants 0916275 and 0916284Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 2. BackgroundProduct line – “A family of products designed to take advantage of their common aspects and predicted variabilities” [Weiss and Lai 1999]e.g., Nokia cellphones, HP printers, etc.Products - Commonalities – Shared by all products. e.g., PlatformVariabilities – Differentiate the productsHigh-reuse variation
  • 3. JDT, PDE, Mylyn, Webtools, etc.
  • 4. Reused in more than three products and for more than six years.
  • 7. Reused in three or fewer products and for more than four years.2Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 8. Related WorkEclipse as a product line. [Chastek, McGregor, and Northrop, 2007], [Linden, 2009], [Krishnan et al., 2011]. Summary of previous work3Failure-prone file -A file with one or more non-trivial post-releasebugs recorded in the Eclipse Bugzilla database.Important/Good predictor – Predictor providing high information gain for classification of failure-prone filesDept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 9. Product Line EvolutionProduct line evolution in two dimensions4New ReleasesP1R1P1R2P1R3P1RnP2R1P2R2P2R3P2RnNew ProductsPnR1PnR2PnR3PnRnDept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 10. MotivationCan we leverage the reducing amount of change in product lines to better predict failure-prone files?5Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 11. Eclipse case study6BlockerEclipse ClassicCriticalEclipse JavaMajorEclipse C/C++NormalEclipse JavaEEMinor
  • 12. Research QuestionsAs a product evolves, do any change metrics serve as good predictors of failure-prone files?Is there a subset of change metrics which are good predictors across all product line members?Does our ability to predict failure-prone files improve as product line evolves?7Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 13. FindingsThe change metrics provide good classification of the failure-prone files in the Eclipse product line.As each product evolves, there is a stable set of change metrics that are prominent predictors of failure-prone files across its releases.There is a subset of change metrics that is among the prominent predictors of all the products across most of the releases.As the product line matures, prediction performance improves for each of the four Eclipse products.8Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 14. Data Source9Data TimelineData TimelineSource of failure reports-Source of change reports – CVS repository of Eclipse.Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 15. Approach10Get prediction results + best predictorsWeka J48 decision tree learnerDept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 16. Replication Results11Learner performance compared to previous resultsClassification performance comparison for Eclipse Classic 2.0, 2.1, and 3.0PC- Percentage of correctly classified instancesTPR- True positive rateFPR- False positive rateDept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 17. 12Top five predictors for earlier releases of Eclipse ClassicReplication ResultsTop predictors from this studyRevisions, Weighted_ageTop predictors from previous studyMax_changeset, Bugfixes, RevisionsDept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 18. 13Learner performance improves as single product evolvesExtension ResultsDept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 19. 14Top five predictors for later releases of Eclipse ClassicExtension ResultsRevisions is good predictor for later releases also.Max_changeset is a good predictor also.Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 20. 15Learner performance improves as product line evolvesExtension ResultsPercentage of correctly classified instances increases across releases for each productDept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 21. 16Learner performance improves as product line evolvesExtension ResultsPercentage of true positives shows improvement across releases for each productDept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 22. 17Learner performance improves as product line evolvesExtension ResultsPercentage of false positives shows reduces across releases for each productDept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 23. 18Top five predictors for four products of Eclipse Product LineExtension ResultsNo common set of predictors across each product and each release.Max_changeset, Revisions and Authors are prominent predictors for all products.Some predictors are prominent for only one product.Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 24. 19Dept. of Computer Science, Iowa State University, PROMISE, September 20, 2011
  • 25. Thank You!20Our data is available at http://www.cs.iastate.edu/~lss/PROMISE11Data.tar.gzDept. of Computer Science, Iowa State University, PROMISE, September 20, 2011