SlideShare a Scribd company logo
Gabriele D’Angelo joint work with  Luciano Bononi Michele Bracuto Lorenzo Donatiello University of Bologna Department of Computer Science WOMP 2006 – Sorrento (NA) An Adaptive Load Balancing Middleware for Distributed Simulation
Presentation outline Basic assumptions and goals Parallel and Distributed Simulation (PADS) main problems A migration-based approach to improve the simulation speed The proposed adaptive load-balancing mechanism Testbed: Ad Hoc network model Execution architecture definition Simulation results: dynamic reallocation and simulation speed Conclusions and future work
Basic assumptions and goal The simulation is a useful technique to support the design and the performance evaluation of complex systems The systems now considered of interest are composed by many highly dynamic entities, with unpredictable communication patterns The simulation of such kind of systems is possible only aggregating together many execution units:  Parallel And Distributed Simulation (PADS) Our goal is to  increase the simulation speed , that is reduce the  Wall Clock Time (WCT)  required to complete the simulation runs
Parallel and Distributed Simulation (PADS) PROS : Clustering together many Physical Execution Units (PEUs) it is possible to reduce the WCT required to complete a simulation Aggregating resources (i.e. RAM) it is possible to represent very complex systems CONS : A distributed simulation requires a large amount of  communication  and  synchronization  to obtain correct results The simulation speed depends on may factors, in example: heterogeneous CPUs and resources, background load
The GAIA migration-based framework A distributed simulation can be seen as a set of Logical Processes (LPs). Each LP is run by a possibly different PEU and takes care to manage the evolution of a set of Simulated Model Entities (SMEs) What is a good scheme to allocate the SMEs on the LPs/PEUs? We have demonstrated that the  static allocation schemes are in most cases sub-optimal , and that a  migration-based approach of the simulated model entities  (GAIA) can Reduce  the amount of communication and synchronization, clustering together the highly interacting simulated entities within the same LP/PEU Reduce  the WCT, therefore increase the simulation speed-up
PADS execution architectures Dedicated clusters  composed by homogeneous units are  costly  and often  underutilized For the same reason,  shared  clusters composed by available Commercial-Off-the-Shelf (COTS) hardware are preferable to dedicated systems The simulation performance is highly influenced by the CPU load in background In a shared cluster it is impossible to predict a good  static   allocation scheme  for the simulated entities: the background load is  unpredictable  and the CPU can usually highly  heterogeneous An  adaptive load balancing mechanism  could improve the resources utilization and therefore the simulation speed
The GAIA+ adaptive load balancing middleware GAIA+ is an evolution of the migration-based mechanism GAIA It is composed by two cooperative parts, both based on the reallocation of simulated entities: The heuristic migration policy : to adapt and reduce the communication and synchronization needs The heuristic load-balancing policy : the overloaded  (and therefore “slow”)  PEUs can migrate some of the managed model entities to unloaded PEUs The simulation is totally distributed and therefore there is no point of centralization.  Slow  and  Fast  are attributes that have to be observed locally, with only a partial knowledge of the whole system and influenced by the networks delays
Performance evaluation: Ad-Hoc wireless network model The GAIA mechanism outperforms the static distributed simulation approach. This performance evaluation will compare the GAIA and the GAIA+ mechanisms Ad-Hoc wireless network model   definition : 9000  Simulated Mobile Hosts  (SMHs) over a flat topology Mobility model:  Random Waypoint Model (RWP) uncorrelated SMHs’ mobility Traffic model: ping messages (CBR) by every SMH to all neighbors within the wireless communication range (250 m) Propagation model open space (neighbor-SMHs within detection range)
Ad-Hoc network model characterization Computation and communication issues: The  computation  required for each SMH per time-step is in the order of  O(#SMH ^ 2) : to determine the neighbor set The communication required among SMHs is in the order of  O(K*#SMH)  per time-step, with K defined as a constant value based on SMHs density (assumed as constant)
Testbed execution architecture Distributed simulation, execution architecture: 3 heterogeneous PEU: 2  - Intel  Dual  Xeon Pentium IV 2800 MHz, with  3 GB RAM  and  4 GB RAM , Debian GNU/Linux OS with kernel version 2.6.x 1  - Intel  Quad  Xeon Pentium IV 1500 MHz, with  1 GB RAM , Debian GNU/Linux OS with kernel version 2.6.x PEUs are interconnected by a Gigabit Ethernet LAN 3 LPs = 1 LP for each PEU Conservative time-stepped simulation: 2000 time-steps
Performance evaluation: three different scenarios The performance evaluation of the GAIA+ mechanism has involved three different scenarios: The CPU are considered as homogeneous:  initially  each PEU allocates  the same number  of SMHs The  initial  allocation is based on the  nominal performance of the CPUs  (as expressed in MHz) A  synthetic background load  (in form of a sinusoidal wave) is injected in part of the simulation cluster
Performance evaluation:  Equal Distribution (scenario a)
Performance evaluation:  MHz-based Distribution (scenario b)
Performance evaluation:  Variable Background Load (scenario c)
Performance evaluation: Wall-Clock-Time (WCT) WCT (seconds) to complete a simulation run of 1000 timesteps -21.10 % 4128 5232 c -10.41 % 3568 3989 b -4.38 % 3442 3600 a diff (%) GAIA+ GAIA scenario
Conclusions and future work The  communication  and the  load balancing  are two main problems of PADS, they should be  managed together An approach based on the migration of simulated entities can:  reduce the communication overhead  of a distributed simulation, induce a  good level of load balancing  and therefore  increase the simulation speed In many cases the very costly High-Performance-Computing (HPC) clusters can be replaced by  shared clusters of Commercial-Off-the-Shelf (COTS)  computers The Computational Grid architecture could benefit of similar approaches, enhancing the use of the Grid for simulation tasks
Gabriele D’Angelo   [email_address] http://www.cs.unibo.it/~gdangelo University of Bologna Department of Computer Science WOMP 2006, Sorrento (NA) An Adaptive Load Balancing Middleware for Distributed Simulation

More Related Content

ODP
From Simulation to Online Gaming: the need for adaptive solutions
DOCX
ORCHESTRATING BULK DATA TRANSFERS ACROSS GEO-DISTRIBUTED DATACENTERS
DOCX
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS A stochastic model to investigate dat...
PPT
Proximity Detection in Distributed Simulation of Wireless Mobile Systems
DOC
Performance of a speculative transmission scheme for scheduling latency reduc...
PDF
A0270107
PDF
ACES_Journal_February_2012_Paper_07
PDF
Networking Articles Overview
From Simulation to Online Gaming: the need for adaptive solutions
ORCHESTRATING BULK DATA TRANSFERS ACROSS GEO-DISTRIBUTED DATACENTERS
IEEE 2014 JAVA CLOUD COMPUTING PROJECTS A stochastic model to investigate dat...
Proximity Detection in Distributed Simulation of Wireless Mobile Systems
Performance of a speculative transmission scheme for scheduling latency reduc...
A0270107
ACES_Journal_February_2012_Paper_07
Networking Articles Overview

What's hot (17)

PDF
An Alternative Genetic Algorithm to Optimize OSPF Weights
DOC
Cloud data management
PDF
IMPACT OF PARTIAL DEMAND INCREASE ON THE PERFORMANCE OF IP NETWORKS AND RE-OP...
PDF
IRJET-Multiple Object Detection using Deep Neural Networks
PDF
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...
PDF
Conference Paper: Cross-platform estimation of Network Function Performance
PPT
UIC Panella Thesis
PDF
Hardback solution to accelerate multimedia computation through mgp in cmp
PDF
Design and analysis of a model predictive controller for active queue management
DOCX
JPJ1403 A Stochastic Model To Investigate Data Center Performance And QoS I...
PDF
(Im2col)accelerating deep neural networks on low power heterogeneous architec...
PDF
Lecture 4 principles of parallel algorithm design updated
PDF
Conference Paper: Towards High Performance Packet Processing for 5G
PDF
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENT
PDF
M3AT: Monitoring Agents Assignment Model for the Data-Intensive Applications
DOCX
Load rebalancing
PPTX
Limitations of memory system performance
An Alternative Genetic Algorithm to Optimize OSPF Weights
Cloud data management
IMPACT OF PARTIAL DEMAND INCREASE ON THE PERFORMANCE OF IP NETWORKS AND RE-OP...
IRJET-Multiple Object Detection using Deep Neural Networks
NETWORK-AWARE DATA PREFETCHING OPTIMIZATION OF COMPUTATIONS IN A HETEROGENEOU...
Conference Paper: Cross-platform estimation of Network Function Performance
UIC Panella Thesis
Hardback solution to accelerate multimedia computation through mgp in cmp
Design and analysis of a model predictive controller for active queue management
JPJ1403 A Stochastic Model To Investigate Data Center Performance And QoS I...
(Im2col)accelerating deep neural networks on low power heterogeneous architec...
Lecture 4 principles of parallel algorithm design updated
Conference Paper: Towards High Performance Packet Processing for 5G
DYNAMIC TASK SCHEDULING BASED ON BURST TIME REQUIREMENT FOR CLOUD ENVIRONMENT
M3AT: Monitoring Agents Assignment Model for the Data-Intensive Applications
Load rebalancing
Limitations of memory system performance
Ad

Viewers also liked (6)

PPTX
A load balancing model based on cloud partitioning for the public cloud. ppt
PPTX
Load Balancing Server
PPTX
Load Balancing In Distributed Computing
DOCX
Load balancing in Distributed Systems
PPT
Load Balancing
PPTX
Load balancing
A load balancing model based on cloud partitioning for the public cloud. ppt
Load Balancing Server
Load Balancing In Distributed Computing
Load balancing in Distributed Systems
Load Balancing
Load balancing
Ad

Similar to An Adaptive Load Balancing Middleware for Distributed Simulation (20)

PPT
Detailed Simulation of Large-Scale Wireless Networks
PPT
Concurrent Replication of Parallel and Distributed Simulations
PDF
A Survey on Wireless Network Simulators
PDF
Power system and communication network co simulation for smart grid applications
PDF
A New Approach for Dynamic Load Balancing Using Simulation In Grid Computing
PPTX
Integrated Simulation Environments
PDF
Calibration of Deployment Simulation Models - A Multi-Paradigm Modelling Appr...
PDF
Open modeling and simulation framework for evolutive analysis
PDF
DGBSA : A BATCH JOB SCHEDULINGALGORITHM WITH GA WITH REGARD TO THE THRESHOLD ...
PPTX
CloudLightning Simulator
PDF
How to Develop True Distributed Real Time Simulations? Mixing IEEE HLA and OM...
PDF
Integrated modeling and simulation framework for wireless sensor networks
PDF
Giddings
PDF
The Internet of Simulations and the agile development of Cyber-physical systems
PPTX
Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...
PDF
Use of genetic algorithm for
PDF
Report_Internships
PPTX
Simulation-System Modeling and Simulation
PDF
Cs556 section2
PDF
An efficient scheduling policy for load balancing model for computational gri...
Detailed Simulation of Large-Scale Wireless Networks
Concurrent Replication of Parallel and Distributed Simulations
A Survey on Wireless Network Simulators
Power system and communication network co simulation for smart grid applications
A New Approach for Dynamic Load Balancing Using Simulation In Grid Computing
Integrated Simulation Environments
Calibration of Deployment Simulation Models - A Multi-Paradigm Modelling Appr...
Open modeling and simulation framework for evolutive analysis
DGBSA : A BATCH JOB SCHEDULINGALGORITHM WITH GA WITH REGARD TO THE THRESHOLD ...
CloudLightning Simulator
How to Develop True Distributed Real Time Simulations? Mixing IEEE HLA and OM...
Integrated modeling and simulation framework for wireless sensor networks
Giddings
The Internet of Simulations and the agile development of Cyber-physical systems
Model-Simulation-and-Measurement-Based Systems Engineering of Power System Sy...
Use of genetic algorithm for
Report_Internships
Simulation-System Modeling and Simulation
Cs556 section2
An efficient scheduling policy for load balancing model for computational gri...

More from Gabriele D'Angelo (6)

ODP
A Parallel Data Distribution Management Algorithm
ODP
Time Warp on the Go
PDF
Parallel and Distributed Simulation from Many Cores to the Public Cloud
PDF
Multiplayer Online Games over Scale-Free Networks: a Viable Solution?
PPT
Simulation of Scale-Free Networks
PPT
Parallel and Distributed Simulation of Coalition Structure Generation in Coop...
A Parallel Data Distribution Management Algorithm
Time Warp on the Go
Parallel and Distributed Simulation from Many Cores to the Public Cloud
Multiplayer Online Games over Scale-Free Networks: a Viable Solution?
Simulation of Scale-Free Networks
Parallel and Distributed Simulation of Coalition Structure Generation in Coop...

Recently uploaded (20)

PDF
discourse-2025-02-building-a-trillion-dollar-dream.pdf
PDF
Corporate Finance Fundamentals - Course Presentation.pdf
PDF
Dialnet-DynamicHedgingOfPricesOfNaturalGasInMexico-8788871.pdf
PDF
financing insitute rbi nabard adb imf world bank insurance and credit gurantee
PPTX
Introduction to Essence of Indian traditional knowledge.pptx
PDF
Is Retirement Income a Three Dimensional (3-D) problem_ What is the differenc...
PDF
NAPF_RESPONSE_TO_THE_PENSIONS_COMMISSION_8 _2_.pdf
PDF
Bladex Earnings Call Presentation 2Q2025
PDF
Bitcoin Layer August 2025: Power Laws of Bitcoin: The Core and Bubbles
PDF
Mathematical Economics 23lec03slides.pdf
PDF
Q2 2025 :Lundin Gold Conference Call Presentation_Final.pdf
PPTX
Session 11-13. Working Capital Management and Cash Budget.pptx
DOCX
marketing plan Elkhabiry............docx
PPTX
kyc aml guideline a detailed pt onthat.pptx
PDF
Spending, Allocation Choices, and Aging THROUGH Retirement. Are all of these ...
PDF
Why Ignoring Passive Income for Retirees Could Cost You Big.pdf
PPTX
What is next for the Fractional CFO - August 2025
PPTX
Introduction to Customs (June 2025) v1.pptx
PPTX
social-studies-subject-for-high-school-globalization.pptx
PPTX
Who’s winning the race to be the world’s first trillionaire.pptx
discourse-2025-02-building-a-trillion-dollar-dream.pdf
Corporate Finance Fundamentals - Course Presentation.pdf
Dialnet-DynamicHedgingOfPricesOfNaturalGasInMexico-8788871.pdf
financing insitute rbi nabard adb imf world bank insurance and credit gurantee
Introduction to Essence of Indian traditional knowledge.pptx
Is Retirement Income a Three Dimensional (3-D) problem_ What is the differenc...
NAPF_RESPONSE_TO_THE_PENSIONS_COMMISSION_8 _2_.pdf
Bladex Earnings Call Presentation 2Q2025
Bitcoin Layer August 2025: Power Laws of Bitcoin: The Core and Bubbles
Mathematical Economics 23lec03slides.pdf
Q2 2025 :Lundin Gold Conference Call Presentation_Final.pdf
Session 11-13. Working Capital Management and Cash Budget.pptx
marketing plan Elkhabiry............docx
kyc aml guideline a detailed pt onthat.pptx
Spending, Allocation Choices, and Aging THROUGH Retirement. Are all of these ...
Why Ignoring Passive Income for Retirees Could Cost You Big.pdf
What is next for the Fractional CFO - August 2025
Introduction to Customs (June 2025) v1.pptx
social-studies-subject-for-high-school-globalization.pptx
Who’s winning the race to be the world’s first trillionaire.pptx

An Adaptive Load Balancing Middleware for Distributed Simulation

  • 1. Gabriele D’Angelo joint work with Luciano Bononi Michele Bracuto Lorenzo Donatiello University of Bologna Department of Computer Science WOMP 2006 – Sorrento (NA) An Adaptive Load Balancing Middleware for Distributed Simulation
  • 2. Presentation outline Basic assumptions and goals Parallel and Distributed Simulation (PADS) main problems A migration-based approach to improve the simulation speed The proposed adaptive load-balancing mechanism Testbed: Ad Hoc network model Execution architecture definition Simulation results: dynamic reallocation and simulation speed Conclusions and future work
  • 3. Basic assumptions and goal The simulation is a useful technique to support the design and the performance evaluation of complex systems The systems now considered of interest are composed by many highly dynamic entities, with unpredictable communication patterns The simulation of such kind of systems is possible only aggregating together many execution units: Parallel And Distributed Simulation (PADS) Our goal is to increase the simulation speed , that is reduce the Wall Clock Time (WCT) required to complete the simulation runs
  • 4. Parallel and Distributed Simulation (PADS) PROS : Clustering together many Physical Execution Units (PEUs) it is possible to reduce the WCT required to complete a simulation Aggregating resources (i.e. RAM) it is possible to represent very complex systems CONS : A distributed simulation requires a large amount of communication and synchronization to obtain correct results The simulation speed depends on may factors, in example: heterogeneous CPUs and resources, background load
  • 5. The GAIA migration-based framework A distributed simulation can be seen as a set of Logical Processes (LPs). Each LP is run by a possibly different PEU and takes care to manage the evolution of a set of Simulated Model Entities (SMEs) What is a good scheme to allocate the SMEs on the LPs/PEUs? We have demonstrated that the static allocation schemes are in most cases sub-optimal , and that a migration-based approach of the simulated model entities (GAIA) can Reduce the amount of communication and synchronization, clustering together the highly interacting simulated entities within the same LP/PEU Reduce the WCT, therefore increase the simulation speed-up
  • 6. PADS execution architectures Dedicated clusters composed by homogeneous units are costly and often underutilized For the same reason, shared clusters composed by available Commercial-Off-the-Shelf (COTS) hardware are preferable to dedicated systems The simulation performance is highly influenced by the CPU load in background In a shared cluster it is impossible to predict a good static allocation scheme for the simulated entities: the background load is unpredictable and the CPU can usually highly heterogeneous An adaptive load balancing mechanism could improve the resources utilization and therefore the simulation speed
  • 7. The GAIA+ adaptive load balancing middleware GAIA+ is an evolution of the migration-based mechanism GAIA It is composed by two cooperative parts, both based on the reallocation of simulated entities: The heuristic migration policy : to adapt and reduce the communication and synchronization needs The heuristic load-balancing policy : the overloaded (and therefore “slow”) PEUs can migrate some of the managed model entities to unloaded PEUs The simulation is totally distributed and therefore there is no point of centralization. Slow and Fast are attributes that have to be observed locally, with only a partial knowledge of the whole system and influenced by the networks delays
  • 8. Performance evaluation: Ad-Hoc wireless network model The GAIA mechanism outperforms the static distributed simulation approach. This performance evaluation will compare the GAIA and the GAIA+ mechanisms Ad-Hoc wireless network model definition : 9000 Simulated Mobile Hosts (SMHs) over a flat topology Mobility model: Random Waypoint Model (RWP) uncorrelated SMHs’ mobility Traffic model: ping messages (CBR) by every SMH to all neighbors within the wireless communication range (250 m) Propagation model open space (neighbor-SMHs within detection range)
  • 9. Ad-Hoc network model characterization Computation and communication issues: The computation required for each SMH per time-step is in the order of O(#SMH ^ 2) : to determine the neighbor set The communication required among SMHs is in the order of O(K*#SMH) per time-step, with K defined as a constant value based on SMHs density (assumed as constant)
  • 10. Testbed execution architecture Distributed simulation, execution architecture: 3 heterogeneous PEU: 2 - Intel Dual Xeon Pentium IV 2800 MHz, with 3 GB RAM and 4 GB RAM , Debian GNU/Linux OS with kernel version 2.6.x 1 - Intel Quad Xeon Pentium IV 1500 MHz, with 1 GB RAM , Debian GNU/Linux OS with kernel version 2.6.x PEUs are interconnected by a Gigabit Ethernet LAN 3 LPs = 1 LP for each PEU Conservative time-stepped simulation: 2000 time-steps
  • 11. Performance evaluation: three different scenarios The performance evaluation of the GAIA+ mechanism has involved three different scenarios: The CPU are considered as homogeneous: initially each PEU allocates the same number of SMHs The initial allocation is based on the nominal performance of the CPUs (as expressed in MHz) A synthetic background load (in form of a sinusoidal wave) is injected in part of the simulation cluster
  • 12. Performance evaluation: Equal Distribution (scenario a)
  • 13. Performance evaluation: MHz-based Distribution (scenario b)
  • 14. Performance evaluation: Variable Background Load (scenario c)
  • 15. Performance evaluation: Wall-Clock-Time (WCT) WCT (seconds) to complete a simulation run of 1000 timesteps -21.10 % 4128 5232 c -10.41 % 3568 3989 b -4.38 % 3442 3600 a diff (%) GAIA+ GAIA scenario
  • 16. Conclusions and future work The communication and the load balancing are two main problems of PADS, they should be managed together An approach based on the migration of simulated entities can: reduce the communication overhead of a distributed simulation, induce a good level of load balancing and therefore increase the simulation speed In many cases the very costly High-Performance-Computing (HPC) clusters can be replaced by shared clusters of Commercial-Off-the-Shelf (COTS) computers The Computational Grid architecture could benefit of similar approaches, enhancing the use of the Grid for simulation tasks
  • 17. Gabriele D’Angelo [email_address] http://www.cs.unibo.it/~gdangelo University of Bologna Department of Computer Science WOMP 2006, Sorrento (NA) An Adaptive Load Balancing Middleware for Distributed Simulation