SlideShare a Scribd company logo
Beware of the Interactions of Variability Layers
when Reasoning about the Evolution of MongoDB
Luc Lesoil, Mathieu Acher, Arnaud Blouin & jean-marc Jézéquel
2022/04/12
Beijing, China
Data Challenge
≠ Thread Levels
≠ Perf Evolutions
Joint evolution of mongoDB change points (top) and performance values (bottom)
Code
User #1 User #2
Thread Level = 512
Perf ↘ Perf ↗
Dataset: Expanded Metrics, Project: sys-perf, Task: industry_benchmark_wmajority,
Hardware: linux-3-node-replSet, Test: csb_50_read_50_update_w_majority
Thread Level = 1
Dev
?
Impact of runtime environments on software evolution
2/5
Interactions between
the runtime environment &
the evolution of the software
[1] The Use of Change Point Detection to Identify Software Performance Regressions in a Continuous Integration System, Daly et al., ICPE 2020, https://dl.acm.org/doi/abs/10.1145/3358960.3375791
[1]
Experiment - Compute the DTW
for all combinations of hardware platforms
Impact of hardware platforms on software evolution
Heatmap of DTW between times series
related to different variants of hardware
ⓑ DTW = 0.38
ⓓ DTW = 5.39
What is the Dynamic Time Warping?
Similar
Different
Result - Identify hardware platforms
having similar evolutions
to reduce the cost of benchmarking 3/5
ⓑ DRPC = 1.61%
ⓒ DRPC = 25.07%
Impact of workloads on software evolution
Experiment - Compute the DRPC
distribution for each workload
Result - Identify stable workloads
to use in benchmarks
Daily Relative Percentage Change
● p(t) the performance value at the time t
● d(t, t+1) the number of days between t and t+1 4/5
Takeaway Message
Runtime environments matter (when quantifying software evolution)!
@David and MongoDB performance team
Need feedback & domain knowledge to draw actionable conclusions
Thanks for this Data Challenge !
5/5
Back-Up Slides
Pre-processing of Time Series
Only consider the period of definition
common to the two Time Series
Linear interpolation if a point is
present only in one TS
1
Time
Performance
TS #1
TS #2
These high values can be due to:
- the standardisation if the standard
deviation of the distribution is too
low
- outliers in the TS
We have to standardise because TS
have different scales
High DTW values for couple of hardware platforms
2

More Related Content

PDF
Complexity metrics and models
PDF
Complexity metrics and models
PDF
Using cyclomatic complexity to measure code complexity
PDF
Software effort estimation through clustering techniques of RBFN network
PPTX
software cost factor
PPTX
Measuring the Combinatorial Coverage of Software in Real Time
PPTX
COMP589 (1)
PDF
How Do Developers Toggle Breakpoints?
Complexity metrics and models
Complexity metrics and models
Using cyclomatic complexity to measure code complexity
Software effort estimation through clustering techniques of RBFN network
software cost factor
Measuring the Combinatorial Coverage of Software in Real Time
COMP589 (1)
How Do Developers Toggle Breakpoints?

What's hot (10)

PDF
Spm software effort estimation
PDF
130705 zephyrin soh - how developers spend their effort during maintenance ...
PPTX
Partitioned Based Regression Verification
PPTX
Mpi collective communication operations
DOC
Defect Age
PDF
An Overview of Distributed Debugging
PPTX
Replication in Distributed Systems
DOCX
Bt0070, operating systems
PDF
A New Approach for Job Scheduling Using Hybrid GA-ST Optimization-Crimson Pub...
PPT
Chapter 5 pc
Spm software effort estimation
130705 zephyrin soh - how developers spend their effort during maintenance ...
Partitioned Based Regression Verification
Mpi collective communication operations
Defect Age
An Overview of Distributed Debugging
Replication in Distributed Systems
Bt0070, operating systems
A New Approach for Job Scheduling Using Hybrid GA-ST Optimization-Crimson Pub...
Chapter 5 pc
Ad

Similar to ICPE 2022 - Data Challenge (20)

KEY
Using Basho Bench to Load Test Distributed Applications
PDF
Future Research Challenges in Software Evolution
PDF
ShaREing Is Caring
PDF
DEF CON 27 - CHRISTOPHER ROBERTS - firmware slap
PDF
System Structure for Dependable Software Systems
PDF
Dependability Benchmarking by Injecting Software Bugs
PDF
Changes and Bugs: Mining and Predicting Development Activities
PDF
ERA - Measuring Disruption from Software Evolution Activities Using Graph-Bas...
PDF
A defect prediction model based on the relationships between developers and c...
PDF
Tackling Deep Software Variability Together
PDF
Populating a Release History Database (ICSM 2013 MIP)
PDF
EVALUATION OF SOFTWARE DEGRADATION AND FORECASTING FUTURE DEVELOPMENT NEEDS I...
PDF
EVALUATION AND STUDY OF SOFTWARE DEGRADATION IN THE EVOLUTION OF SIX VERSIONS...
PDF
VaMoS 2021 - Deep Software Variability: Towards Handling Cross-Layer Configur...
PDF
Introduction to Software Evolution: The Software Volcano
PDF
DEF CON 27- JACK BAKER - web assembly games
PPT
Software Change in Software Engineering SE27
PDF
Software evolution evangelisation
PPTX
The Business Economics and Opportunity of Open Source Data Science
PDF
Software Analytics: Data Analytics for Software Engineering
Using Basho Bench to Load Test Distributed Applications
Future Research Challenges in Software Evolution
ShaREing Is Caring
DEF CON 27 - CHRISTOPHER ROBERTS - firmware slap
System Structure for Dependable Software Systems
Dependability Benchmarking by Injecting Software Bugs
Changes and Bugs: Mining and Predicting Development Activities
ERA - Measuring Disruption from Software Evolution Activities Using Graph-Bas...
A defect prediction model based on the relationships between developers and c...
Tackling Deep Software Variability Together
Populating a Release History Database (ICSM 2013 MIP)
EVALUATION OF SOFTWARE DEGRADATION AND FORECASTING FUTURE DEVELOPMENT NEEDS I...
EVALUATION AND STUDY OF SOFTWARE DEGRADATION IN THE EVOLUTION OF SIX VERSIONS...
VaMoS 2021 - Deep Software Variability: Towards Handling Cross-Layer Configur...
Introduction to Software Evolution: The Software Volcano
DEF CON 27- JACK BAKER - web assembly games
Software Change in Software Engineering SE27
Software evolution evangelisation
The Business Economics and Opportunity of Open Source Data Science
Software Analytics: Data Analytics for Software Engineering
Ad

Recently uploaded (20)

PDF
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
PDF
. Radiology Case Scenariosssssssssssssss
PPTX
2. Earth - The Living Planet Module 2ELS
PPTX
Derivatives of integument scales, beaks, horns,.pptx
PPT
protein biochemistry.ppt for university classes
DOCX
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
PPTX
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
PPTX
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
PPTX
2. Earth - The Living Planet earth and life
PDF
An interstellar mission to test astrophysical black holes
PDF
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
PPTX
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
PDF
lecture 2026 of Sjogren's syndrome l .pdf
PPTX
BIOMOLECULES PPT........................
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PPTX
INTRODUCTION TO EVS | Concept of sustainability
PPTX
Introduction to Cardiovascular system_structure and functions-1
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
. Radiology Case Scenariosssssssssssssss
2. Earth - The Living Planet Module 2ELS
Derivatives of integument scales, beaks, horns,.pptx
protein biochemistry.ppt for university classes
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
2. Earth - The Living Planet earth and life
An interstellar mission to test astrophysical black holes
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
lecture 2026 of Sjogren's syndrome l .pdf
BIOMOLECULES PPT........................
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
Taita Taveta Laboratory Technician Workshop Presentation.pptx
INTRODUCTION TO EVS | Concept of sustainability
Introduction to Cardiovascular system_structure and functions-1
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...

ICPE 2022 - Data Challenge

  • 1. Beware of the Interactions of Variability Layers when Reasoning about the Evolution of MongoDB Luc Lesoil, Mathieu Acher, Arnaud Blouin & jean-marc Jézéquel 2022/04/12 Beijing, China Data Challenge
  • 2. ≠ Thread Levels ≠ Perf Evolutions Joint evolution of mongoDB change points (top) and performance values (bottom) Code User #1 User #2 Thread Level = 512 Perf ↘ Perf ↗ Dataset: Expanded Metrics, Project: sys-perf, Task: industry_benchmark_wmajority, Hardware: linux-3-node-replSet, Test: csb_50_read_50_update_w_majority Thread Level = 1 Dev ? Impact of runtime environments on software evolution 2/5 Interactions between the runtime environment & the evolution of the software [1] The Use of Change Point Detection to Identify Software Performance Regressions in a Continuous Integration System, Daly et al., ICPE 2020, https://dl.acm.org/doi/abs/10.1145/3358960.3375791 [1]
  • 3. Experiment - Compute the DTW for all combinations of hardware platforms Impact of hardware platforms on software evolution Heatmap of DTW between times series related to different variants of hardware ⓑ DTW = 0.38 ⓓ DTW = 5.39 What is the Dynamic Time Warping? Similar Different Result - Identify hardware platforms having similar evolutions to reduce the cost of benchmarking 3/5
  • 4. ⓑ DRPC = 1.61% ⓒ DRPC = 25.07% Impact of workloads on software evolution Experiment - Compute the DRPC distribution for each workload Result - Identify stable workloads to use in benchmarks Daily Relative Percentage Change ● p(t) the performance value at the time t ● d(t, t+1) the number of days between t and t+1 4/5
  • 5. Takeaway Message Runtime environments matter (when quantifying software evolution)! @David and MongoDB performance team Need feedback & domain knowledge to draw actionable conclusions Thanks for this Data Challenge ! 5/5
  • 7. Pre-processing of Time Series Only consider the period of definition common to the two Time Series Linear interpolation if a point is present only in one TS 1 Time Performance TS #1 TS #2
  • 8. These high values can be due to: - the standardisation if the standard deviation of the distribution is too low - outliers in the TS We have to standardise because TS have different scales High DTW values for couple of hardware platforms 2