SlideShare a Scribd company logo
Modeling the Performance of
Ultra-Large-Scale Systems
1
Using
Layered Simulations
[Slide courtesy of Randy Shoup, eBay Distinguished Architect] 2
ULS are Everywhere, and they’re HUGE!
1. Performance Testing should happen …
4
development
process
here?
here!
2. One Model to Rule Them All?
1. How do response time differ
between these two mechanism?
5
3. Different Stakeholders, Different Interests
End User Developer
System
Engineer
System
Integrator
Sales
Rep
6
An Analogy: The “4+1” View Model
Scenario
Logical View
Process
View
Development
View
Physical
View
8
The “3+1” View Model for
Performance Modeling
Scenario
World
View
Physical
View
Component
View
9
Component
Layer
A Layered Simulation Model!
10
World
View
Layer
Physical
Layer
Case Study 1
Bottleneck Detection in RSS Cloud
• RSS blogging service providers (e.g., Wordpress)
• Evaluation of two notification delivery mechanisms:
– Periodic Pull
– Real-time Push
Q1. How does response time differ?
Q2. How does resource usage
(e.g., CPU, memory) look like?
11
Component
Layer
The RSS Cloud Model
12
World
View
Layer
Physical
Layer
CPU is the Bottleneck!
13
Case Study 2
Evaluating a Distributed Monitor
1. How do response time differ
between these two mechanism?
14
Configuration Score
16
DISCRETIZE
fuzzy values
LESS is MORE
Distributed vs. Centralized
17
0.1Hz
0.2Hz
0.3Hz
0.1Hz
0.2Hz
0.3Hz
faster “cheaper”
lower resource usage
Keeping the Model Up-to-Date
• (What-if) Experimenting with different
configuration/deployment scenarios
• (Digging deeper) Adding new “layers”, as system is
understood better
• (Design evolution) Updating the simulation models as
the system evolves
18
GH'@&+/) +A 12(&'I &-829'-3) *#%'3144&2'J '
6!
1) +) 789: ) 02!
9(8; ) <<!
3&+&K'
3&+&F'
LH'7$M&+&2,'",1N&3) #%&+-5'7$M&+&2,'O2,&+&-,-'
?2%'= -&+' 7&B&#) 4&+'
". -,&A ''
?29$2&&+'
". -,&A ''
O2,&9+1,) +'
"1#&-''
0&4'
=!
Component
Layer'
: 'Z1. &+&%'"$A *#18) 2'P ) %&#F'
"I !
World
View
Layer'
Physical
Layer'
7$-,+$S*,&%'B-H'&2,+1#$d&%!
"F!
eHG<d'
eHL<d'
eHQ<d'
TABLE IX
SIMULATION RESULTS FOR THE ORIGINAL DESIGN OF THE PERFORMANCE MONITOR.
DataCollection Response time(s) Cost per Central Monitor Central Monitor Central Monitor
Frequency (Hz) transmission ($) Thread Util. (%) CPU Util. (%) RAM Util. (%)
0.1 0.2 2.2 31.0 42.6 37.2
0.2 0.3 2.7 43.7 68.2 47.6
0.3 0.4 3.1 60.2 86.6 59.2
the CPU of the central monitor is close to running at its full
capacity, which will likely result in system instability. More-
over, whiletheoriginal designhaslower cost per transmission,
be validated as information becomes available throug
development. Due to the lack of access to production
we could only estimate a list of resources and proce
eHG<d'
eHL<d'
eHQ<d'
/1-,&+! U(3&14&+X!
#) D&+'+&-) *+(&'*-19&!
QUESTIONS!
20
Backup Slides
21
FOUR CHALLENGES
The Current Practice of Performance Verification
22
1. Too Late in the Development Lifecycle
• Design changes are not evaluated until after
code is written
– Happens at the last stage of a delayed schedule
23
Requirements
Design
Implementation
Verification
Maintenance
2. Lots of Data
• Industrial case studies
have 2000> counters
• Time consuming to
analyze
• Hard to compare more
than 2 tests at once
24
3. No Documented Behavior
25
• Analysts have different perceptions of
performance regressions
• Analysis may be influenced by
– Analyst’s knowledge
– Deadline
4. Heterogeneous Environments
• Multiple labs to parallelize test executions
– Hardware and software may differ
– Tests from one lab may not be used to analyze
tests from another lab
26
Categorize Each Challenge
27
AT THE DESIGN LEVEL
Performance Verification
28
Evaluate Design Changes through
Performance Modeling
• Analytical models are often not suitable for all
stakeholders
– Abstract mathematical and statistical concepts
• Simulation models can be implemented with
support of existing framework
– Visualization
– No systematic approach to construct models that
can be used by different stakeholders
29
Layered Simulation Model
30
Physical layerComponent layer
World view layer
Can the current
infrastructure support
the projected growth of
users?
Investigate
threading model
Hardware resource
utilization
Case Studies
• We conducted two case studies
– RSS Cloud
• Show the process of constructed the model
• Derive the bottleneck of the application
– Performance monitor for ULS systems
• Evaluate whether or not an organization should re-
architect the software
• Our model can be used to extract important
information and aid in decision making
31

More Related Content

PPT
Automated Discovery of Performance Regressions in Enterprise Applications
PPT
Animated Visualization of Software History Using Software Evolution Storyboards
PPT
Detecting Interaction Coupling from Task Interaction Histories
PPTX
Understanding the Rationale for Updating a Function's Comment
PPTX
Log Engineering: Towards Systematic Log Mining to Support the Development of ...
PPTX
Empircal Studies of Performance Bugs & Performance Analysis Approaches for La...
PPT
Mining Performance Regression Testing Repositories for Automated Performance ...
PPTX
Supporting Software Evolution Using Adaptive Change Propagation
Automated Discovery of Performance Regressions in Enterprise Applications
Animated Visualization of Software History Using Software Evolution Storyboards
Detecting Interaction Coupling from Task Interaction Histories
Understanding the Rationale for Updating a Function's Comment
Log Engineering: Towards Systematic Log Mining to Support the Development of ...
Empircal Studies of Performance Bugs & Performance Analysis Approaches for La...
Mining Performance Regression Testing Repositories for Automated Performance ...
Supporting Software Evolution Using Adaptive Change Propagation

What's hot (19)

PPTX
Software engineering- system testing
PPTX
Testing Technique
PDF
Documented Requirements are not Useless After All!
PPTX
Testing Plan
PPTX
Testing throughout the software life cycle
PDF
Automated Change Impact Analysis between SysML Models of Requirements and Design
PDF
Defect Prediction Over Software Life Cycle in Automotive Domain
PPTX
Survey on Software Defect Prediction
PDF
naveed-kamran-software-architecture-agile
PPTX
System development phases
PPTX
Chaper 1 sdlc
PPT
Fundamentals of Software Engineering
PDF
Survey on Software Defect Prediction (PhD Qualifying Examination Presentation)
PDF
Survey on Software Defect Prediction
PPTX
Maintenance, Re-engineering &Reverse Engineering in Software Engineering
PPTX
Software engineering 21 integration testing
PPTX
Neotys PAC - Bruno Da Silva
PPT
Architectural design
PPTX
Towards Software Sustainability Guides for Industrial Software Systems
Software engineering- system testing
Testing Technique
Documented Requirements are not Useless After All!
Testing Plan
Testing throughout the software life cycle
Automated Change Impact Analysis between SysML Models of Requirements and Design
Defect Prediction Over Software Life Cycle in Automotive Domain
Survey on Software Defect Prediction
naveed-kamran-software-architecture-agile
System development phases
Chaper 1 sdlc
Fundamentals of Software Engineering
Survey on Software Defect Prediction (PhD Qualifying Examination Presentation)
Survey on Software Defect Prediction
Maintenance, Re-engineering &Reverse Engineering in Software Engineering
Software engineering 21 integration testing
Neotys PAC - Bruno Da Silva
Architectural design
Towards Software Sustainability Guides for Industrial Software Systems
Ad

Viewers also liked (13)

PPTX
Large-Scale Empirical Studies of Mobile Apps
PPTX
Medios publicitarios
PDF
Iniciativa del Senador Luis Humberto Fernández sobre coaliciones entre indepe...
DOCX
Proyecto de sociologia
PDF
Case History Carlo Cairoli
PDF
C de marchi_tirocinio
DOCX
비아그라 판매 =&lt;7cc.kr>=비아그라 정품 판매~비아그라판매±비아그라 정품판매∏비아그라 50mg판매,프릴리지 50mg판매,흥분제 ...
PPTX
Importance of Eye Contact in Hospitality
PDF
Singapore startup ecosystem and entrepreneur toolbox - Aug 2015
PPTX
Powering of bangladesh- Vision 2021
PPTX
An Industrial Case Study on the Automated Detection of Performance Regression...
PDF
Sustainability Day Leeds 2017
PPTX
Lean Canvas evolved - FTE Canvas
Large-Scale Empirical Studies of Mobile Apps
Medios publicitarios
Iniciativa del Senador Luis Humberto Fernández sobre coaliciones entre indepe...
Proyecto de sociologia
Case History Carlo Cairoli
C de marchi_tirocinio
비아그라 판매 =&lt;7cc.kr>=비아그라 정품 판매~비아그라판매±비아그라 정품판매∏비아그라 50mg판매,프릴리지 50mg판매,흥분제 ...
Importance of Eye Contact in Hospitality
Singapore startup ecosystem and entrepreneur toolbox - Aug 2015
Powering of bangladesh- Vision 2021
An Industrial Case Study on the Automated Detection of Performance Regression...
Sustainability Day Leeds 2017
Lean Canvas evolved - FTE Canvas
Ad

Similar to Modeling the Performance of Ultra-Large-Scale Systems Using Layered Simulations (20)

PPTX
2 approaches to system development
DOC
Chapter 1,2,3,4 notes
PPTX
Week 2 - System Development Approaches (1).pptx
PPTX
Lecture 3 software_engineering
PPTX
Monitoring Containerized Micro-Services In Azure
PPTX
Comp8 unit5 lecture_slides
PPTX
Lecture 3 software_engineering
PDF
Gartner Infrastructure and Operations Summit Berlin 2015 - DevOps Journey
PPTX
Chapter 3.pptx
PPTX
Exactpro Systems for KSTU Students in Kostroma
PPTX
Mastering the DevOps Certification: CI/CD, Governance & Monitoring Made Simple
PPTX
The art of project estimation
PPTX
The art of architecture
PPTX
EMBEDDED AND REAL TIME SYSTEMS-Unit-4_6703.pptx
PPTX
OOP 2014 - Lifecycle By Design
PPTX
PDF
SE_Unit 2.pdf it is a process model of it student
PDF
The Evolution of Testing Methodology at AWS: From Status Quo to Formal Method...
PPTX
Best Practices for Enterprise Continuous Delivery of Oracle Fusion Middlewa...
2 approaches to system development
Chapter 1,2,3,4 notes
Week 2 - System Development Approaches (1).pptx
Lecture 3 software_engineering
Monitoring Containerized Micro-Services In Azure
Comp8 unit5 lecture_slides
Lecture 3 software_engineering
Gartner Infrastructure and Operations Summit Berlin 2015 - DevOps Journey
Chapter 3.pptx
Exactpro Systems for KSTU Students in Kostroma
Mastering the DevOps Certification: CI/CD, Governance & Monitoring Made Simple
The art of project estimation
The art of architecture
EMBEDDED AND REAL TIME SYSTEMS-Unit-4_6703.pptx
OOP 2014 - Lifecycle By Design
SE_Unit 2.pdf it is a process model of it student
The Evolution of Testing Methodology at AWS: From Status Quo to Formal Method...
Best Practices for Enterprise Continuous Delivery of Oracle Fusion Middlewa...

More from SAIL_QU (20)

PDF
Studying the Integration Practices and the Evolution of Ad Libraries in the G...
PDF
Studying the Dialogue Between Users and Developers of Free Apps in the Google...
PPTX
Improving the testing efficiency of selenium-based load tests
PDF
Studying User-Developer Interactions Through the Distribution and Reviewing M...
PDF
Studying online distribution platforms for games through the mining of data f...
PPTX
Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...
PDF
Investigating the Challenges in Selenium Usage and Improving the Testing Effi...
PDF
Mining Development Knowledge to Understand and Support Software Logging Pract...
PPTX
Which Log Level Should Developers Choose For a New Logging Statement?
PPTX
Towards Just-in-Time Suggestions for Log Changes
PDF
The Impact of Task Granularity on Co-evolution Analyses
PPTX
A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...
PPTX
How are Discussions Associated with Bug Reworking? An Empirical Study on Open...
PPTX
A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...
PDF
A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...
PPTX
Studying the Dialogue Between Users and Developers of Free Apps in the Google...
PDF
What Do Programmers Know about Software Energy Consumption?
PPTX
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...
PDF
Revisiting the Experimental Design Choices for Approaches for the Automated R...
PPTX
Measuring Program Comprehension: A Large-Scale Field Study with Professionals
Studying the Integration Practices and the Evolution of Ad Libraries in the G...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...
Improving the testing efficiency of selenium-based load tests
Studying User-Developer Interactions Through the Distribution and Reviewing M...
Studying online distribution platforms for games through the mining of data f...
Understanding the Factors for Fast Answers in Technical Q&A Websites: An Empi...
Investigating the Challenges in Selenium Usage and Improving the Testing Effi...
Mining Development Knowledge to Understand and Support Software Logging Pract...
Which Log Level Should Developers Choose For a New Logging Statement?
Towards Just-in-Time Suggestions for Log Changes
The Impact of Task Granularity on Co-evolution Analyses
A Framework for Evaluating the Results of the SZZ Approach for Identifying Bu...
How are Discussions Associated with Bug Reworking? An Empirical Study on Open...
A Study of the Relation of Mobile Device Attributes with the User-Perceived Q...
A Large-Scale Study of the Impact of Feature Selection Techniques on Defect C...
Studying the Dialogue Between Users and Developers of Free Apps in the Google...
What Do Programmers Know about Software Energy Consumption?
Threshold for Size and Complexity Metrics: A Case Study from the Perspective ...
Revisiting the Experimental Design Choices for Approaches for the Automated R...
Measuring Program Comprehension: A Large-Scale Field Study with Professionals

Recently uploaded (20)

PPTX
ManageIQ - Sprint 268 Review - Slide Deck
PDF
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
PDF
Nekopoi APK 2025 free lastest update
PPTX
Introduction to Artificial Intelligence
PPTX
ai tools demonstartion for schools and inter college
PDF
How to Choose the Right IT Partner for Your Business in Malaysia
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PDF
PTS Company Brochure 2025 (1).pdf.......
PPTX
Operating system designcfffgfgggggggvggggggggg
PPTX
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
PDF
Adobe Illustrator 28.6 Crack My Vision of Vector Design
PPTX
ISO 45001 Occupational Health and Safety Management System
PDF
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
PDF
AI in Product Development-omnex systems
PPTX
history of c programming in notes for students .pptx
PPTX
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
PDF
Digital Strategies for Manufacturing Companies
PDF
top salesforce developer skills in 2025.pdf
PDF
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
PDF
Design an Analysis of Algorithms I-SECS-1021-03
ManageIQ - Sprint 268 Review - Slide Deck
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
Nekopoi APK 2025 free lastest update
Introduction to Artificial Intelligence
ai tools demonstartion for schools and inter college
How to Choose the Right IT Partner for Your Business in Malaysia
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PTS Company Brochure 2025 (1).pdf.......
Operating system designcfffgfgggggggvggggggggg
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
Adobe Illustrator 28.6 Crack My Vision of Vector Design
ISO 45001 Occupational Health and Safety Management System
Why TechBuilder is the Future of Pickup and Delivery App Development (1).pdf
AI in Product Development-omnex systems
history of c programming in notes for students .pptx
Lecture 3: Operating Systems Introduction to Computer Hardware Systems
Digital Strategies for Manufacturing Companies
top salesforce developer skills in 2025.pdf
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
Design an Analysis of Algorithms I-SECS-1021-03

Modeling the Performance of Ultra-Large-Scale Systems Using Layered Simulations

  • 1. Modeling the Performance of Ultra-Large-Scale Systems 1 Using Layered Simulations
  • 2. [Slide courtesy of Randy Shoup, eBay Distinguished Architect] 2 ULS are Everywhere, and they’re HUGE!
  • 3. 1. Performance Testing should happen … 4 development process here? here!
  • 4. 2. One Model to Rule Them All? 1. How do response time differ between these two mechanism? 5
  • 5. 3. Different Stakeholders, Different Interests End User Developer System Engineer System Integrator Sales Rep 6
  • 6. An Analogy: The “4+1” View Model Scenario Logical View Process View Development View Physical View 8
  • 7. The “3+1” View Model for Performance Modeling Scenario World View Physical View Component View 9
  • 8. Component Layer A Layered Simulation Model! 10 World View Layer Physical Layer
  • 9. Case Study 1 Bottleneck Detection in RSS Cloud • RSS blogging service providers (e.g., Wordpress) • Evaluation of two notification delivery mechanisms: – Periodic Pull – Real-time Push Q1. How does response time differ? Q2. How does resource usage (e.g., CPU, memory) look like? 11
  • 10. Component Layer The RSS Cloud Model 12 World View Layer Physical Layer
  • 11. CPU is the Bottleneck! 13
  • 12. Case Study 2 Evaluating a Distributed Monitor 1. How do response time differ between these two mechanism? 14
  • 15. Keeping the Model Up-to-Date • (What-if) Experimenting with different configuration/deployment scenarios • (Digging deeper) Adding new “layers”, as system is understood better • (Design evolution) Updating the simulation models as the system evolves 18
  • 16. GH'@&+/) +A 12(&'I &-829'-3) *#%'3144&2'J ' 6! 1) +) 789: ) 02! 9(8; ) <<! 3&+&K' 3&+&F' LH'7$M&+&2,'",1N&3) #%&+-5'7$M&+&2,'O2,&+&-,-' ?2%'= -&+' 7&B&#) 4&+' ". -,&A '' ?29$2&&+' ". -,&A '' O2,&9+1,) +' "1#&-'' 0&4' =! Component Layer' : 'Z1. &+&%'"$A *#18) 2'P ) %&#F' "I ! World View Layer' Physical Layer' 7$-,+$S*,&%'B-H'&2,+1#$d&%! "F! eHG<d' eHL<d' eHQ<d' TABLE IX SIMULATION RESULTS FOR THE ORIGINAL DESIGN OF THE PERFORMANCE MONITOR. DataCollection Response time(s) Cost per Central Monitor Central Monitor Central Monitor Frequency (Hz) transmission ($) Thread Util. (%) CPU Util. (%) RAM Util. (%) 0.1 0.2 2.2 31.0 42.6 37.2 0.2 0.3 2.7 43.7 68.2 47.6 0.3 0.4 3.1 60.2 86.6 59.2 the CPU of the central monitor is close to running at its full capacity, which will likely result in system instability. More- over, whiletheoriginal designhaslower cost per transmission, be validated as information becomes available throug development. Due to the lack of access to production we could only estimate a list of resources and proce eHG<d' eHL<d' eHQ<d' /1-,&+! U(3&14&+X! #) D&+'+&-) *+(&'*-19&! QUESTIONS!
  • 17. 20
  • 19. FOUR CHALLENGES The Current Practice of Performance Verification 22
  • 20. 1. Too Late in the Development Lifecycle • Design changes are not evaluated until after code is written – Happens at the last stage of a delayed schedule 23 Requirements Design Implementation Verification Maintenance
  • 21. 2. Lots of Data • Industrial case studies have 2000> counters • Time consuming to analyze • Hard to compare more than 2 tests at once 24
  • 22. 3. No Documented Behavior 25 • Analysts have different perceptions of performance regressions • Analysis may be influenced by – Analyst’s knowledge – Deadline
  • 23. 4. Heterogeneous Environments • Multiple labs to parallelize test executions – Hardware and software may differ – Tests from one lab may not be used to analyze tests from another lab 26
  • 25. AT THE DESIGN LEVEL Performance Verification 28
  • 26. Evaluate Design Changes through Performance Modeling • Analytical models are often not suitable for all stakeholders – Abstract mathematical and statistical concepts • Simulation models can be implemented with support of existing framework – Visualization – No systematic approach to construct models that can be used by different stakeholders 29
  • 27. Layered Simulation Model 30 Physical layerComponent layer World view layer Can the current infrastructure support the projected growth of users? Investigate threading model Hardware resource utilization
  • 28. Case Studies • We conducted two case studies – RSS Cloud • Show the process of constructed the model • Derive the bottleneck of the application – Performance monitor for ULS systems • Evaluate whether or not an organization should re- architect the software • Our model can be used to extract important information and aid in decision making 31

Editor's Notes

  • #3: Slides from eBay to give another taste of scale
  • #7: Spend a bit of time about performance concerns. E.g. End user: overall system performance for various configurations Developer: Organization and performance of system modules System Engineer: Hardware resource util, hardware capacity planning System Integrator: Performance after integration of different components Sales Representative: Reliability, costs
  • #8: Analytical Modeling: use math equations and statistical concepts to model the performance of a software system. This requires substantial level of expertise. Simulation models (Discrete-event simulation) can created to answer specific performance questions from different stakeholders.
  • #10: Explain a bit on what the models are and who those stake holders are
  • #11: Discuss the contents in Table III
  • #13: Discuss the contents in Table III