SlideShare a Scribd company logo
On the Diversity of Software Package
Popularity Metrics: An Empirical
Study of npm
Ahmed Zerouali, Tom Mens, Gregorio Robles, Jesus Gonzalez
Barahona
The IEEE International Conference on Software Analysis, Evolution
and Reengineering (SANER)
Hangzhou, China - February 24-27, 2019
/Motivation
/Motivation
/Goal
How are metrics of package popularity related to each other?
/Method/ Dataset
175,774 packages
/Method/ 9 popularity metrics
# runtime dependent repositories (libraries.io)
# transitive runtime dependents (libraries.io)
# direct runtime dependents (npm and libraries.io)
# downloads (npm)
# npm stars (npm)
# github stars (GitHub)
# forks (GitHub)
# pull requests (GitHub)
# subscribers (GitHub)
/Metrics Emanating From the Same Source
/Metrics Emanating From the Same Source
/Metrics Emanating From the Same Source
/Metrics Emanating from Different Sources/ 6 metrics
# runtime dependent repositories (libraries.io)
# direct runtime dependents (npm and libraries.io)
# downloads (npm)
# npm stars (npm)
Aggarwal-Popularity = #forks + #stars + #pull requests² (GitHub)
# subscribers (GitHub)
/Metrics Emanating from Different Sources/ 6 metrics
/Metrics Emanating from Different Sources/ 6 metrics
/Limitations
- There are other metrics not considered in the study (e.g. development
dependents, SoureRank (from libraries.io), etc)
- We focused only on packages that are hosted on GitHub.
- We focused only on one package repository, npm.
/Conclusion
- Conducted analysis on the diversity of software popularity metrics.
- Popularity metrics are not strongly correlated, implying that the use of
different metrics may produce different outcomes.
- There is a need to define a measurement framework that takes into account
the diversity and context- dependence of software popularity.
/Future Work
- Do a more fine grained survey on software popularity metrics.
- Consider concrete examples of studies and reproduce them using other
popularity metrics.
- Extend the analysis on other ecosystems.
https://media.giphy.com/media/DUrdT2xEmJWbS/giphy.gif
Questions

More Related Content

PPTX
On the diversity of software popularity metrics: An empirical study of npm
PDF
FOOPS!: An Ontology Pitfall Scanner for the FAIR principles
PDF
Software as a Well-Formed Research Object
PDF
Deep Software Variability and Frictionless Reproducibility
PPTX
ACM Chicago March 2019 meeting: Software Engineering and AI - Prof. Tao Xie, ...
PPTX
Intelligent Software Engineering: Synergy between AI and Software Engineering
PPT
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
PPT
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
On the diversity of software popularity metrics: An empirical study of npm
FOOPS!: An Ontology Pitfall Scanner for the FAIR principles
Software as a Well-Formed Research Object
Deep Software Variability and Frictionless Reproducibility
ACM Chicago March 2019 meeting: Software Engineering and AI - Prof. Tao Xie, ...
Intelligent Software Engineering: Synergy between AI and Software Engineering
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...
Is Text Search an Effective Approach for Fault Localization: A Practitioners ...

Similar to On the Diversity of Software Package Popularity Metrics: An Empirical Study of npm (20)

PPTX
Big Data: the weakest link
PPTX
ExSchema - ICSM'13
PDF
Meetup SF - Amundsen
PPTX
Software Citation in Theory and Practice
PPTX
Software Security Assurance for Devops
PPTX
Software Security Assurance for DevOps
PDF
Search Solutions 2011: Successful Enterprise Search By Design
PDF
Analyzing Big Data's Weakest Link (hint: it might be you)
PPTX
Crediting informatics and data folks in life science teams
PDF
Software citation
PPTX
Strata sf - Amundsen presentation
PPTX
Software Security Assurance for DevOps - Hewlett Packard Enterprise + Black Duck
PDF
How academic research on GitHub has evolved in the last several years
PDF
Software Analytics: Data Analytics for Software Engineering
PDF
Citation and reproducibility in software
PDF
ICONIQ Analytics: The Modern Developer Technology Stack
PDF
Koss 1605 machine_learning_mariocho_t10
PPTX
Software Repositories for Research -- An Environmental Scan
PPTX
Towards Reusable Research Software
Big Data: the weakest link
ExSchema - ICSM'13
Meetup SF - Amundsen
Software Citation in Theory and Practice
Software Security Assurance for Devops
Software Security Assurance for DevOps
Search Solutions 2011: Successful Enterprise Search By Design
Analyzing Big Data's Weakest Link (hint: it might be you)
Crediting informatics and data folks in life science teams
Software citation
Strata sf - Amundsen presentation
Software Security Assurance for DevOps - Hewlett Packard Enterprise + Black Duck
How academic research on GitHub has evolved in the last several years
Software Analytics: Data Analytics for Software Engineering
Citation and reproducibility in software
ICONIQ Analytics: The Modern Developer Technology Stack
Koss 1605 machine_learning_mariocho_t10
Software Repositories for Research -- An Environmental Scan
Towards Reusable Research Software
Ad

More from Ahmed Zerouali (16)

PDF
Prevalence and Evolution of License Violations in npm and RubyGems Dependency...
PPTX
Analysis And Observations Of The Evolution Of Testing Library Usage
PPTX
On Popularity and Quality Metrics of npm Packages
PDF
On the Impact of Security Vulnerabilities in the npm and RubyGems Dependency ...
PDF
A multi-dimensional analysis of technical lag in Debian-based Docker images
PDF
Evolution of Technical Lag in DockerHub images - Benevol20
PDF
PhD public defense: A Measurement Framework for Analyzing Technical Lag in ...
PDF
Technical Lag in Software Ecosystems
PDF
Technical lag in npm and docker ecosystems
PPTX
Analyzing Packages in Docker images hosted On DockerHub
PPTX
ConPan: A Tool to Analyze Packages in Software Containers
PDF
Technical Lag in Docker Containers
PPTX
Analyzing the Evolution of Testing Library Usage in Open Source Java Projects
PPTX
An Empirical Comparison of the Development History of CloudStack and Eucalyptus
PPTX
Analyzing the Evolution of Testing Library Usage in Open Source Java Projects
PDF
An Empirical Analysis of Technical Lag in npm Package Dependencies
Prevalence and Evolution of License Violations in npm and RubyGems Dependency...
Analysis And Observations Of The Evolution Of Testing Library Usage
On Popularity and Quality Metrics of npm Packages
On the Impact of Security Vulnerabilities in the npm and RubyGems Dependency ...
A multi-dimensional analysis of technical lag in Debian-based Docker images
Evolution of Technical Lag in DockerHub images - Benevol20
PhD public defense: A Measurement Framework for Analyzing Technical Lag in ...
Technical Lag in Software Ecosystems
Technical lag in npm and docker ecosystems
Analyzing Packages in Docker images hosted On DockerHub
ConPan: A Tool to Analyze Packages in Software Containers
Technical Lag in Docker Containers
Analyzing the Evolution of Testing Library Usage in Open Source Java Projects
An Empirical Comparison of the Development History of CloudStack and Eucalyptus
Analyzing the Evolution of Testing Library Usage in Open Source Java Projects
An Empirical Analysis of Technical Lag in npm Package Dependencies
Ad

Recently uploaded (20)

PPT
Presentation of a Romanian Institutee 2.
PDF
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
PPTX
Substance Disorders- part different drugs change body
PPTX
Lesson-1-Introduction-to-the-Study-of-Chemistry.pptx
PPTX
Biomechanics of the Hip - Basic Science.pptx
PPTX
Understanding the Circulatory System……..
PPTX
INTRODUCTION TO PAEDIATRICS AND PAEDIATRIC HISTORY TAKING-1.pptx
PDF
Looking into the jet cone of the neutrino-associated very high-energy blazar ...
PPT
veterinary parasitology ````````````.ppt
PPT
LEC Synthetic Biology and its application.ppt
PDF
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
PPTX
Seminar Hypertension and Kidney diseases.pptx
PPTX
Fluid dynamics vivavoce presentation of prakash
PDF
Placing the Near-Earth Object Impact Probability in Context
PPT
Mutation in dna of bacteria and repairss
PDF
CHAPTER 3 Cell Structures and Their Functions Lecture Outline.pdf
PPT
THE CELL THEORY AND ITS FUNDAMENTALS AND USE
PDF
Warm, water-depleted rocky exoplanets with surfaceionic liquids: A proposed c...
PPTX
Introcution to Microbes Burton's Biology for the Health
PDF
S2 SOIL BY TR. OKION.pdf based on the new lower secondary curriculum
Presentation of a Romanian Institutee 2.
BET Eukaryotic signal Transduction BET Eukaryotic signal Transduction.pdf
Substance Disorders- part different drugs change body
Lesson-1-Introduction-to-the-Study-of-Chemistry.pptx
Biomechanics of the Hip - Basic Science.pptx
Understanding the Circulatory System……..
INTRODUCTION TO PAEDIATRICS AND PAEDIATRIC HISTORY TAKING-1.pptx
Looking into the jet cone of the neutrino-associated very high-energy blazar ...
veterinary parasitology ````````````.ppt
LEC Synthetic Biology and its application.ppt
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
Seminar Hypertension and Kidney diseases.pptx
Fluid dynamics vivavoce presentation of prakash
Placing the Near-Earth Object Impact Probability in Context
Mutation in dna of bacteria and repairss
CHAPTER 3 Cell Structures and Their Functions Lecture Outline.pdf
THE CELL THEORY AND ITS FUNDAMENTALS AND USE
Warm, water-depleted rocky exoplanets with surfaceionic liquids: A proposed c...
Introcution to Microbes Burton's Biology for the Health
S2 SOIL BY TR. OKION.pdf based on the new lower secondary curriculum

On the Diversity of Software Package Popularity Metrics: An Empirical Study of npm

  • 1. On the Diversity of Software Package Popularity Metrics: An Empirical Study of npm Ahmed Zerouali, Tom Mens, Gregorio Robles, Jesus Gonzalez Barahona The IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER) Hangzhou, China - February 24-27, 2019
  • 4. /Goal How are metrics of package popularity related to each other?
  • 6. /Method/ 9 popularity metrics # runtime dependent repositories (libraries.io) # transitive runtime dependents (libraries.io) # direct runtime dependents (npm and libraries.io) # downloads (npm) # npm stars (npm) # github stars (GitHub) # forks (GitHub) # pull requests (GitHub) # subscribers (GitHub)
  • 7. /Metrics Emanating From the Same Source
  • 8. /Metrics Emanating From the Same Source
  • 9. /Metrics Emanating From the Same Source
  • 10. /Metrics Emanating from Different Sources/ 6 metrics # runtime dependent repositories (libraries.io) # direct runtime dependents (npm and libraries.io) # downloads (npm) # npm stars (npm) Aggarwal-Popularity = #forks + #stars + #pull requests² (GitHub) # subscribers (GitHub)
  • 11. /Metrics Emanating from Different Sources/ 6 metrics
  • 12. /Metrics Emanating from Different Sources/ 6 metrics
  • 13. /Limitations - There are other metrics not considered in the study (e.g. development dependents, SoureRank (from libraries.io), etc) - We focused only on packages that are hosted on GitHub. - We focused only on one package repository, npm.
  • 14. /Conclusion - Conducted analysis on the diversity of software popularity metrics. - Popularity metrics are not strongly correlated, implying that the use of different metrics may produce different outcomes. - There is a need to define a measurement framework that takes into account the diversity and context- dependence of software popularity.
  • 15. /Future Work - Do a more fine grained survey on software popularity metrics. - Consider concrete examples of studies and reproduce them using other popularity metrics. - Extend the analysis on other ecosystems.