SlideShare a Scribd company logo
ISBSG variables most frequently used for software effort estimation: A mapping review 
Fernando González-Ladrón-de-Guevara 
Marta Fernández-Diego
Introduction 
•The International Software Benchmarking Standards Group (ISBSG) is a non-profit organization that designed and maintains an international public repository. 
•This dataset makes it possible to estimate a project’s size, effort, duration, and cost. 
•It is important that ISBSG users have a sound knowledge of the ISBSG data prior to analyzing or using it. 
2
Aim 
•Analyze the ISBSG variables that have been used by researchers for software effort estimation 
–from 2000, when the first papers were published 
–until the end of 2013 
3
Research questions 
•RQ1: What are the most used variables to generate effort estimation models? 
•RQ2: What are the most relevant features of these variables? 
4
Search process 
5
Filtering process 
6
Data collection 
•A systematic mapping study was applied to over 167 papers obtained after the filtering process. 
•133 out of 167 papers produce effort estimates. 
•Only 107 list the independent variables used in the model. 
7
RQ1 
•RQ1: What are the most used variables to generate effort estimation models? 
–ISBSG includes 118 variables but only 71 (60.2%) of them have been used in the set of papers analyzed. 
8
RQ1: Frequency distribution of the ISBSG variables 
9
RQ1: ISBSG variables most frequently used in the papers 
10
RQ1 
•RQ1: What are the most used variables to generate effort estimation models? 
–All ISBSG variables form part of a group of variables that include related data fields according to the ISBSG criteria. 
11
RQ1: The ISBSG groups of variables 
12
RQ2 
•RQ2: What are the most relevant features of these variables? 
–This work has also synthesized and described the most used ISBSG variables, the concept they represent and their relationships. 
–Also this paper discusses the nature of the variables and has placed particular emphasis on their properties, specially the issue of missing data. 
13
RQ2: Most frequently used nominal variables 
14
RQ2 (nominal variables) 
•RQ2: What are the most relevant features of these variables? 
–The more missing values a variable has, the less used this variable is when compared to the rest of variables within its own group. 
–Usually LT is most used than PPL except for cases where more information about the specific used programming language is required. 
–The variables OT, AT, and BAT present many different discrete values. 
15
RQ2: Most frequently used continuous variables 
16
RQ2 (continuous variables) 
•RQ2: What are the most relevant features of these variables? 
–FS is preferred to AFP. 
–The group Size attributes includes five fields that breakdown the FS into inputs, outputs, enquiries, files, and interfaces. The percentage of missing values is around 65% in all cases. 
–The reason ATS may not be used more is because it has a high number of missing values (88%). 
17
Results 
•A collection of 107 selected references 
•A matrix that shows a mapping of the 20 top- ranked most used variables in the estimation models of such references 
•The 71 ISBSG variables that have been used to construct effort estimation models 
•The 20 most used variables have been described arranged by groups along with their relationships and some underlying dependencies 
18
Conclusions 
•This work presents the results of a systematic mapping study about the usage of ISBSG variables until 2013. 
•The analysis is a first approximation to how and to what extent ISBSG variables and groups of variables have been used in software engineering to build effort estimation models. 
•New RQ: How the level of usage of the most frequent variables is influenced by the type of estimation methods that have been used in the papers? 
–The paper under review even suggests a prospective guide for selecting the variables to be used in effort estimation models. 
19
Thanks for your attention! 
Any question? 
20

More Related Content

PDF
Introduction to Computational Statistics
PPTX
Using Excel in project management
PPT
Business Research Methods Chap008
PPT
Business Research Methods Chap017
PPTX
Multi-method Evaluation in Scientific Paper Recommender Systems
PPTX
Fundamental of Quality Data - Anthony Ndungu
PDF
Selective Gradient Boosting for Effective Learning to Rank - SIGIR 2018
Introduction to Computational Statistics
Using Excel in project management
Business Research Methods Chap008
Business Research Methods Chap017
Multi-method Evaluation in Scientific Paper Recommender Systems
Fundamental of Quality Data - Anthony Ndungu
Selective Gradient Boosting for Effective Learning to Rank - SIGIR 2018

Viewers also liked (9)

PDF
Analysis of Effort Estimation Model in Traditional and Agile (USING METRICS ...
PPT
Improving Analogy Software Effort Estimation using Fuzzy Feature Subset Selec...
PPT
Cuckoo search final
PPTX
Bat algorithm explained. slides ppt pptx
PPTX
BAT Algorithm
PPTX
Bat algorithm and applications
PDF
Nature-Inspired Optimization Algorithms
PPTX
Bat algorithm
PPTX
Leaky Bucket & Tocken Bucket - Traffic shaping
Analysis of Effort Estimation Model in Traditional and Agile (USING METRICS ...
Improving Analogy Software Effort Estimation using Fuzzy Feature Subset Selec...
Cuckoo search final
Bat algorithm explained. slides ppt pptx
BAT Algorithm
Bat algorithm and applications
Nature-Inspired Optimization Algorithms
Bat algorithm
Leaky Bucket & Tocken Bucket - Traffic shaping
Ad

Similar to 166 - ISBSG variables most frequently used for software effort estimation: A mapping review (20)

PPTX
Pareto-Optimal Search-Based Software Engineering (POSBSE): A Literature Survey
PPT
Decision table
PPTX
Software engineering
PPTX
An Empirical Study of Reliability Growth of Open versus Closed Source Softwar...
PPTX
Multi variate presentation
PPTX
Fsd_216490316162_216490316168_216490316176.pptx.pptx
PPT
Pressman ch-22-process-and-project-metrics
PPTX
UNIT 4.pptx
PDF
Ignacio panach ormeño et-al_caise2013
PDF
A Guide to SPSS Statistics
PPT
RRC Requirements and Use Cases
PPTX
Effect of Temporal Collaboration Network, Maintenance Activity, and Experienc...
PDF
A Review of Socialized Word Embeddings (Zeng+, 2017)
PPTX
Measuring Scientific Productivity
PDF
boughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifij
PPT
Case tools and modern process of system development
PPTX
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
PPTX
Global Mutable State Analysis in Spring MVC Applications
PPTX
It Takes Two to Tango: an Exploration of Domain Pairs for Cross-Domain Collab...
PDF
Sida LEAP Training Lectures #7 and #8: Linking LEAP and WEAP and other advanc...
Pareto-Optimal Search-Based Software Engineering (POSBSE): A Literature Survey
Decision table
Software engineering
An Empirical Study of Reliability Growth of Open versus Closed Source Softwar...
Multi variate presentation
Fsd_216490316162_216490316168_216490316176.pptx.pptx
Pressman ch-22-process-and-project-metrics
UNIT 4.pptx
Ignacio panach ormeño et-al_caise2013
A Guide to SPSS Statistics
RRC Requirements and Use Cases
Effect of Temporal Collaboration Network, Maintenance Activity, and Experienc...
A Review of Socialized Word Embeddings (Zeng+, 2017)
Measuring Scientific Productivity
boughtonalexand jdjdjfjjfjfjfjnfjfjjjfkdifij
Case tools and modern process of system development
Measuring the usefulness of Knowledge Organization Systems in Information Ret...
Global Mutable State Analysis in Spring MVC Applications
It Takes Two to Tango: an Exploration of Domain Pairs for Cross-Domain Collab...
Sida LEAP Training Lectures #7 and #8: Linking LEAP and WEAP and other advanc...
Ad

More from ESEM 2014 (20)

PDF
Keynote 2 - The 20% of software engineering practices that contribute to 80% ...
PDF
Keynote 1 - Engineering Software Analytics Studies
PDF
33 - On Knowledge Transfer Skill in Pair Programming
PPTX
222 - Design Pattern Decay: The Case for Class Grime
PDF
210 - Software Population Pyramids: The Current and the Future of OSS Develop...
PDF
169 - Bridging the Gap: SE Technology Transfer into Practice - Study Design a...
PDF
196 - Evaluation in Practice: Artifact-based Requirements Engineering and Sc...
PDF
42- Using Templates to Elicit Implied Security Requirements from Functional R...
PDF
112 - The Role of Mentoring and Project Characteristics for Onboarding in Ope...
PDF
224 - Factors Impacting Rapid Releases: An Industrial Case Study
PPTX
215 Towards a Framework to Support Large Scale Sampling in Software Engineeri...
PPTX
214 - Sampling Improvement in Software Engineering Surveys
PDF
201 - Using Qualitative Metasummary to Synthesize Empirical Findings in Liter...
PPTX
130 - Motivated software engineers are engaged and focused, while satisfied o...
PDF
178 - A replicated study on duplicate detection: Using Apache Lucene to searc...
PDF
124 - Impact of Developer Reputation on Code Review Outcomes in OSS Projects:...
PDF
18 - Impact of Process Conformance on the Effects of Test-driven Development
PPTX
65 - An Empirical Simulation-based Study of Real-Time Speech Translation for ...
PPTX
52 - The Impact of Test Ownership and Team Structure on the Reliability and E...
PDF
167 - Productivity for proof engineering
Keynote 2 - The 20% of software engineering practices that contribute to 80% ...
Keynote 1 - Engineering Software Analytics Studies
33 - On Knowledge Transfer Skill in Pair Programming
222 - Design Pattern Decay: The Case for Class Grime
210 - Software Population Pyramids: The Current and the Future of OSS Develop...
169 - Bridging the Gap: SE Technology Transfer into Practice - Study Design a...
196 - Evaluation in Practice: Artifact-based Requirements Engineering and Sc...
42- Using Templates to Elicit Implied Security Requirements from Functional R...
112 - The Role of Mentoring and Project Characteristics for Onboarding in Ope...
224 - Factors Impacting Rapid Releases: An Industrial Case Study
215 Towards a Framework to Support Large Scale Sampling in Software Engineeri...
214 - Sampling Improvement in Software Engineering Surveys
201 - Using Qualitative Metasummary to Synthesize Empirical Findings in Liter...
130 - Motivated software engineers are engaged and focused, while satisfied o...
178 - A replicated study on duplicate detection: Using Apache Lucene to searc...
124 - Impact of Developer Reputation on Code Review Outcomes in OSS Projects:...
18 - Impact of Process Conformance on the Effects of Test-driven Development
65 - An Empirical Simulation-based Study of Real-Time Speech Translation for ...
52 - The Impact of Test Ownership and Team Structure on the Reliability and E...
167 - Productivity for proof engineering

Recently uploaded (20)

PPTX
VVF-Customer-Presentation2025-Ver1.9.pptx
PDF
Designing Intelligence for the Shop Floor.pdf
PDF
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
PPTX
Operating system designcfffgfgggggggvggggggggg
PDF
System and Network Administraation Chapter 3
PDF
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
PDF
medical staffing services at VALiNTRY
PDF
Odoo Companies in India – Driving Business Transformation.pdf
PPTX
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
PPTX
history of c programming in notes for students .pptx
PDF
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
PDF
wealthsignaloriginal-com-DS-text-... (1).pdf
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PDF
Design an Analysis of Algorithms I-SECS-1021-03
PPTX
Transform Your Business with a Software ERP System
PDF
PTS Company Brochure 2025 (1).pdf.......
PDF
Softaken Excel to vCard Converter Software.pdf
PDF
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
VVF-Customer-Presentation2025-Ver1.9.pptx
Designing Intelligence for the Shop Floor.pdf
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
Operating system designcfffgfgggggggvggggggggg
System and Network Administraation Chapter 3
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
medical staffing services at VALiNTRY
Odoo Companies in India – Driving Business Transformation.pdf
Oracle E-Business Suite: A Comprehensive Guide for Modern Enterprises
history of c programming in notes for students .pptx
Adobe Premiere Pro 2025 (v24.5.0.057) Crack free
wealthsignaloriginal-com-DS-text-... (1).pdf
Design an Analysis of Algorithms II-SECS-1021-03
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
Design an Analysis of Algorithms I-SECS-1021-03
Transform Your Business with a Software ERP System
PTS Company Brochure 2025 (1).pdf.......
Softaken Excel to vCard Converter Software.pdf
Addressing The Cult of Project Management Tools-Why Disconnected Work is Hold...
Navsoft: AI-Powered Business Solutions & Custom Software Development

166 - ISBSG variables most frequently used for software effort estimation: A mapping review

  • 1. ISBSG variables most frequently used for software effort estimation: A mapping review Fernando González-Ladrón-de-Guevara Marta Fernández-Diego
  • 2. Introduction •The International Software Benchmarking Standards Group (ISBSG) is a non-profit organization that designed and maintains an international public repository. •This dataset makes it possible to estimate a project’s size, effort, duration, and cost. •It is important that ISBSG users have a sound knowledge of the ISBSG data prior to analyzing or using it. 2
  • 3. Aim •Analyze the ISBSG variables that have been used by researchers for software effort estimation –from 2000, when the first papers were published –until the end of 2013 3
  • 4. Research questions •RQ1: What are the most used variables to generate effort estimation models? •RQ2: What are the most relevant features of these variables? 4
  • 7. Data collection •A systematic mapping study was applied to over 167 papers obtained after the filtering process. •133 out of 167 papers produce effort estimates. •Only 107 list the independent variables used in the model. 7
  • 8. RQ1 •RQ1: What are the most used variables to generate effort estimation models? –ISBSG includes 118 variables but only 71 (60.2%) of them have been used in the set of papers analyzed. 8
  • 9. RQ1: Frequency distribution of the ISBSG variables 9
  • 10. RQ1: ISBSG variables most frequently used in the papers 10
  • 11. RQ1 •RQ1: What are the most used variables to generate effort estimation models? –All ISBSG variables form part of a group of variables that include related data fields according to the ISBSG criteria. 11
  • 12. RQ1: The ISBSG groups of variables 12
  • 13. RQ2 •RQ2: What are the most relevant features of these variables? –This work has also synthesized and described the most used ISBSG variables, the concept they represent and their relationships. –Also this paper discusses the nature of the variables and has placed particular emphasis on their properties, specially the issue of missing data. 13
  • 14. RQ2: Most frequently used nominal variables 14
  • 15. RQ2 (nominal variables) •RQ2: What are the most relevant features of these variables? –The more missing values a variable has, the less used this variable is when compared to the rest of variables within its own group. –Usually LT is most used than PPL except for cases where more information about the specific used programming language is required. –The variables OT, AT, and BAT present many different discrete values. 15
  • 16. RQ2: Most frequently used continuous variables 16
  • 17. RQ2 (continuous variables) •RQ2: What are the most relevant features of these variables? –FS is preferred to AFP. –The group Size attributes includes five fields that breakdown the FS into inputs, outputs, enquiries, files, and interfaces. The percentage of missing values is around 65% in all cases. –The reason ATS may not be used more is because it has a high number of missing values (88%). 17
  • 18. Results •A collection of 107 selected references •A matrix that shows a mapping of the 20 top- ranked most used variables in the estimation models of such references •The 71 ISBSG variables that have been used to construct effort estimation models •The 20 most used variables have been described arranged by groups along with their relationships and some underlying dependencies 18
  • 19. Conclusions •This work presents the results of a systematic mapping study about the usage of ISBSG variables until 2013. •The analysis is a first approximation to how and to what extent ISBSG variables and groups of variables have been used in software engineering to build effort estimation models. •New RQ: How the level of usage of the most frequent variables is influenced by the type of estimation methods that have been used in the papers? –The paper under review even suggests a prospective guide for selecting the variables to be used in effort estimation models. 19
  • 20. Thanks for your attention! Any question? 20