SlideShare a Scribd company logo
www.bsc.es
UCC 2016
Shanghai, 6th November
Mauro Canuto, Raimon Bosch, Mario Macías, Jordi Guitart
A Methodology for Full-System
Power Modeling in
Heterogeneous Data Centers
www.bsc.es
DEPARTMENT PRESENTATION
3
Objectives
• A new methodology to model power consumption that is
platform and application agnostic.
• Overcome existing limitations: platform and application
dependencies, inaccuracy, complexity, …
• Approximate energy with resource usage indicators and its
correlations.
• Present our final results.
4
Introduction
• Why do we need power modellers?
– Energy awareness.
– Increasing use of energy in datacenters.
– Not access to power consumption meters.
• What are the limitations of current energy modellers?
– Considering only the CPU.
– Inadequate selection of indicators. Correlations and nonlinear behaviors.
– Use a training that relies on architecture-specific indicators.
5
Scope
• How is our methodology? (1 / 3)
- Derives full-system power models by considering the impact on energy consumption
of all the resources in the system, namely processor, cache, memory, disk, and
network, and can be easily extended to include other resources (i.e. GPUs) as
needed.
- Derives a single model per platform that works with high accuracy for heterogeneous
applications with different patterns of resource usage and energy consumption.
- Derives models by using a minimum set of resource usage indicators, which are
selected for each platform according to their correlation with the power
consumption.
6
Scope
• How is our methodology? (2 / 3)
- Derives models that capture non-linear relations between the resource usage and
the power consumption.
- Exploits machine learning techniques to drive the selection of resource usage
indicators and the capture of non-linear relations without human intervention.
- Gets full-system information needed to generate the models by means of a
comprehensive monitoring framework that seamlessly integrates several data
sources, including system information, performance counters, and overall power
consumption measures.
7
Scope
• How is our methodology? (3 / 3)
- Uses the same training set for each platform, which is independent of the
applications used to validate the models and comprises various micro-benchmarks
that selectively stress each resource at varying levels of utilization.
- Validates the models with real applications, which are commonly found in Cloud data
centers, with different patterns of resource usage and energy consumption.
- Achieves platform- and application-agnosticism by using the same tools and
systematically following the same steps to derive power models in heterogeneous
platforms.
COLLECTION, TRAINING, MODEL
GENERATION & VALIDATION
9
Data collection
10
Training micro-benchmarks
• Stress different components of the target platform (CPU, cache, main memory,
network, and disk) at different intensity levels
• CPU: Stress-ng, Ibench, Sysbench, Prime95 and Linpack-neon. Increase intensity
until you hit 100% of resources.
• Memory: Stress-ng, Pmbw, STREAM.
• Disk: Stress-ng, Fio
• Network: iperf increasing bandwidth by steps of 10%.
11
Model generation
12
Model generation
- An initial selection of features is obtained by excluding those not showing significant
variance in the training experiments.
- For each resource usage indicator on each micro-benchmark, we perform a
regression analysis to find the best the relation METRIC vs. CONSUMPTION.
- Apply logarithmic and polynomial transformations:
● If a logarithmic function returns a best correlation than the original function, we
apply the log(metric) transformation to the data corresponding to that metric.
● If the relation fits better with a polynomial function, we apply the pow(metric, α)
transformation to the data. The α parameter is obtained through the nls function
from stats package for R language, which determines non-linear least-squares
estimates of non-linear models.
13
Model generation
l CPU cycles for the Prime95
micro-benchmark in srv-xeon-2
platform is best approximated
by using a logarithmic function.
=>
l training dataset =
l Filter the selected features.
l Join datasets of all
microbenchmarks.
l Perform the polynomial and
logarithmic transformations.
l Use linear regression in order to
fit our model.
14
Model validation
l Cloudsuite: Benchmarks based on real cloud applications + HPC validation with NAS Parallel
Benchmarks.
RESULTS
16
Results
17
Results: Intel Xeon
● Validate against Lasso regression
to automatically incorporate
features
○ linear models (LNLS)
○ nonlinear polynomial models
(PLLS)
○ Our model (OURS)
● The average MAPE of our models
is 4.3% for srv-xeon-1 and 5.7% for
srv-xeon-2.
● Very accurate considering the
range of applications.
18
Results: Intel Xeon
19
Results: AMD Opteron
● High range of power consumption
that can be generated by the two
platforms (up to 400 W for srv-opt-1
and 360 W for srv-opt2)
● The average error of our models is
still very low: 5.4% for srv-opt-1
and 5.7% for srv-opt-2.
20
Results: AMD Opteron
21
Results: Low power
● Intel Atom (srv-atom) & ARM
Cortex-A (srv-arm).
● Very promising (as shown in Table
4). The average MAPE for srv-atom
is 2.6% and for srv-arm is 5.2%.
● Lasso models provide good
accuracy for NAS HPC jobs, but
they show important errors for
CloudSuite benchmarks.
22
Results: Low power
23
Conclusions
- A platform and application-agnostic methodology for full-system power
modeling in heterogenous data centers.
- Demonstrated with different power consumption profiles ranging from high-
performance to low-power architectures.
- Models provide high accuracy (around 5% of average estimation error).
www.bsc.es
Thank you!
For further information please contact
raimon.bosch@bsc.es
24

More Related Content

PPTX
New developments in Aimsun and future trends
PPTX
Fruct14 sholokhova
PDF
A Uniform Implementation Scheme for Evolutionary Optimization Algorithms and ...
PPT
An Adaptive Load Balancing Middleware for Distributed Simulation
PPT
Network Performance Forecasting System
PDF
M phil-computer-science-artifical-neural-networks-projects
PPTX
Adaptive Execution Support for Malleable Computation
PDF
Image Processing IEEE 2015 Projects
New developments in Aimsun and future trends
Fruct14 sholokhova
A Uniform Implementation Scheme for Evolutionary Optimization Algorithms and ...
An Adaptive Load Balancing Middleware for Distributed Simulation
Network Performance Forecasting System
M phil-computer-science-artifical-neural-networks-projects
Adaptive Execution Support for Malleable Computation
Image Processing IEEE 2015 Projects

What's hot (11)

PDF
Hardware Architecture for Calculating LBP-Based Image Region Descriptors
PPTX
Principles of Computing Resources Planning in Cloud-Based Problem Solving Env...
PPTX
Bioenergy prototype for the Global Atlas
PDF
Image Processing IEEE 2015 Projects
PDF
Network tomography to enhance the performance of software defined network mon...
DOCX
Naidu_Sumit_Resume
PDF
Erickson_FYP_Poster
PDF
Integrated Model Discovery and Self-Adaptation of Robots
PDF
Error Permissive Computing
PDF
Performance analysis and optimal cooperative cluster size for randomly distri...
PPTX
排隊理論_An Exploration of The Optimization of Executive Scheduling in The Cloud ...
Hardware Architecture for Calculating LBP-Based Image Region Descriptors
Principles of Computing Resources Planning in Cloud-Based Problem Solving Env...
Bioenergy prototype for the Global Atlas
Image Processing IEEE 2015 Projects
Network tomography to enhance the performance of software defined network mon...
Naidu_Sumit_Resume
Erickson_FYP_Poster
Integrated Model Discovery and Self-Adaptation of Robots
Error Permissive Computing
Performance analysis and optimal cooperative cluster size for randomly distri...
排隊理論_An Exploration of The Optimization of Executive Scheduling in The Cloud ...
Ad

Similar to A methodology for full system power modeling in heterogeneous data centers (20)

PDF
[February 2017 - Ph.D. Final Dissertation] Enabling Power-awareness For Multi...
PDF
Enabling a hardware accelerated deep learning data science experience for Apa...
PPT
Enabling a hardware accelerated deep learning data science experience for Apa...
PDF
Run-time power management in cloud and containerized environments
PPTX
Webinar: Cutting Time, Complexity and Cost from Data Science to Production
PDF
[EWiLi2016] Enabling power-awareness for the Xen Hypervisor
PPTX
AI Sustainability Mascots 23-f.pptx
PDF
LCU14-410: How to build an Energy Model for your SoC
PPTX
Optimizing High Performance Computing Applications for Energy
PDF
HPC Facility Designing for next generation HPC systems Ram Nagappan Intel Final
PDF
Intel: How to Use Alluxio to Accelerate BigData Analytics on the Cloud and Ne...
PDF
⭐⭐⭐⭐⭐ Learning-based Energy Consumption Prediction
PDF
Deep Learning Training at Scale: Spring Crest Deep Learning Accelerator
PDF
Lessons Learned from Deploying Apache Spark as a Service on IBM Power Systems...
PPTX
Statistical power consumption analysis and modeling
PDF
DEEP-mon: Dynamic and Energy Efficient Power monitoring for container-based i...
PPTX
2018 03 25 system ml ai and openpower meetup
PDF
Open power ddl and lms
PPTX
OpenPOWER and IBM AI overview
PDF
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark with Ma...
[February 2017 - Ph.D. Final Dissertation] Enabling Power-awareness For Multi...
Enabling a hardware accelerated deep learning data science experience for Apa...
Enabling a hardware accelerated deep learning data science experience for Apa...
Run-time power management in cloud and containerized environments
Webinar: Cutting Time, Complexity and Cost from Data Science to Production
[EWiLi2016] Enabling power-awareness for the Xen Hypervisor
AI Sustainability Mascots 23-f.pptx
LCU14-410: How to build an Energy Model for your SoC
Optimizing High Performance Computing Applications for Energy
HPC Facility Designing for next generation HPC systems Ram Nagappan Intel Final
Intel: How to Use Alluxio to Accelerate BigData Analytics on the Cloud and Ne...
⭐⭐⭐⭐⭐ Learning-based Energy Consumption Prediction
Deep Learning Training at Scale: Spring Crest Deep Learning Accelerator
Lessons Learned from Deploying Apache Spark as a Service on IBM Power Systems...
Statistical power consumption analysis and modeling
DEEP-mon: Dynamic and Energy Efficient Power monitoring for container-based i...
2018 03 25 system ml ai and openpower meetup
Open power ddl and lms
OpenPOWER and IBM AI overview
No More Cumbersomeness: Automatic Predictive Modeling on Apache Spark with Ma...
Ad

Recently uploaded (20)

PDF
Smarter Security: How Door Access Control Works with Alarms & CCTV
PPTX
Prograce_Present.....ggation_Simple.pptx
PPTX
Nanokeyer nano keyekr kano ketkker nano keyer
PPTX
PLC ANALOGUE DONE BY KISMEC KULIM TD 5 .0
PPTX
Computers and mobile device: Evaluating options for home and work
PPTX
title _yeOPC_Poisoning_Presentation.pptx
PPT
FABRICATION OF MOS FET BJT DEVICES IN NANOMETER
PPTX
material for studying about lift elevators escalation
PPTX
Fundamentals of Computer.pptx Computer BSC
PPTX
sdn_based_controller_for_mobile_network_traffic_management1.pptx
PPTX
ERP good ERP good ERP good ERP good good ERP good ERP good
PPTX
Embedded for Artificial Intelligence 1.pptx
PPTX
STEEL- intro-1.pptxhejwjenwnwnenemwmwmwm
PPTX
making presentation that do no stick.pptx
PDF
Dynamic Checkweighers and Automatic Weighing Machine Solutions
PDF
PPT Determiners.pdf.......................
PDF
-DIGITAL-INDIA.pdf one of the most prominent
PDF
How NGOs Save Costs with Affordable IT Rentals
PPTX
Lecture-3-Computer-programming for BS InfoTech
PPTX
Wireless and Mobile Backhaul Market.pptx
Smarter Security: How Door Access Control Works with Alarms & CCTV
Prograce_Present.....ggation_Simple.pptx
Nanokeyer nano keyekr kano ketkker nano keyer
PLC ANALOGUE DONE BY KISMEC KULIM TD 5 .0
Computers and mobile device: Evaluating options for home and work
title _yeOPC_Poisoning_Presentation.pptx
FABRICATION OF MOS FET BJT DEVICES IN NANOMETER
material for studying about lift elevators escalation
Fundamentals of Computer.pptx Computer BSC
sdn_based_controller_for_mobile_network_traffic_management1.pptx
ERP good ERP good ERP good ERP good good ERP good ERP good
Embedded for Artificial Intelligence 1.pptx
STEEL- intro-1.pptxhejwjenwnwnenemwmwmwm
making presentation that do no stick.pptx
Dynamic Checkweighers and Automatic Weighing Machine Solutions
PPT Determiners.pdf.......................
-DIGITAL-INDIA.pdf one of the most prominent
How NGOs Save Costs with Affordable IT Rentals
Lecture-3-Computer-programming for BS InfoTech
Wireless and Mobile Backhaul Market.pptx

A methodology for full system power modeling in heterogeneous data centers

  • 1. www.bsc.es UCC 2016 Shanghai, 6th November Mauro Canuto, Raimon Bosch, Mario Macías, Jordi Guitart A Methodology for Full-System Power Modeling in Heterogeneous Data Centers
  • 3. 3 Objectives • A new methodology to model power consumption that is platform and application agnostic. • Overcome existing limitations: platform and application dependencies, inaccuracy, complexity, … • Approximate energy with resource usage indicators and its correlations. • Present our final results.
  • 4. 4 Introduction • Why do we need power modellers? – Energy awareness. – Increasing use of energy in datacenters. – Not access to power consumption meters. • What are the limitations of current energy modellers? – Considering only the CPU. – Inadequate selection of indicators. Correlations and nonlinear behaviors. – Use a training that relies on architecture-specific indicators.
  • 5. 5 Scope • How is our methodology? (1 / 3) - Derives full-system power models by considering the impact on energy consumption of all the resources in the system, namely processor, cache, memory, disk, and network, and can be easily extended to include other resources (i.e. GPUs) as needed. - Derives a single model per platform that works with high accuracy for heterogeneous applications with different patterns of resource usage and energy consumption. - Derives models by using a minimum set of resource usage indicators, which are selected for each platform according to their correlation with the power consumption.
  • 6. 6 Scope • How is our methodology? (2 / 3) - Derives models that capture non-linear relations between the resource usage and the power consumption. - Exploits machine learning techniques to drive the selection of resource usage indicators and the capture of non-linear relations without human intervention. - Gets full-system information needed to generate the models by means of a comprehensive monitoring framework that seamlessly integrates several data sources, including system information, performance counters, and overall power consumption measures.
  • 7. 7 Scope • How is our methodology? (3 / 3) - Uses the same training set for each platform, which is independent of the applications used to validate the models and comprises various micro-benchmarks that selectively stress each resource at varying levels of utilization. - Validates the models with real applications, which are commonly found in Cloud data centers, with different patterns of resource usage and energy consumption. - Achieves platform- and application-agnosticism by using the same tools and systematically following the same steps to derive power models in heterogeneous platforms.
  • 10. 10 Training micro-benchmarks • Stress different components of the target platform (CPU, cache, main memory, network, and disk) at different intensity levels • CPU: Stress-ng, Ibench, Sysbench, Prime95 and Linpack-neon. Increase intensity until you hit 100% of resources. • Memory: Stress-ng, Pmbw, STREAM. • Disk: Stress-ng, Fio • Network: iperf increasing bandwidth by steps of 10%.
  • 12. 12 Model generation - An initial selection of features is obtained by excluding those not showing significant variance in the training experiments. - For each resource usage indicator on each micro-benchmark, we perform a regression analysis to find the best the relation METRIC vs. CONSUMPTION. - Apply logarithmic and polynomial transformations: ● If a logarithmic function returns a best correlation than the original function, we apply the log(metric) transformation to the data corresponding to that metric. ● If the relation fits better with a polynomial function, we apply the pow(metric, α) transformation to the data. The α parameter is obtained through the nls function from stats package for R language, which determines non-linear least-squares estimates of non-linear models.
  • 13. 13 Model generation l CPU cycles for the Prime95 micro-benchmark in srv-xeon-2 platform is best approximated by using a logarithmic function. => l training dataset = l Filter the selected features. l Join datasets of all microbenchmarks. l Perform the polynomial and logarithmic transformations. l Use linear regression in order to fit our model.
  • 14. 14 Model validation l Cloudsuite: Benchmarks based on real cloud applications + HPC validation with NAS Parallel Benchmarks.
  • 17. 17 Results: Intel Xeon ● Validate against Lasso regression to automatically incorporate features ○ linear models (LNLS) ○ nonlinear polynomial models (PLLS) ○ Our model (OURS) ● The average MAPE of our models is 4.3% for srv-xeon-1 and 5.7% for srv-xeon-2. ● Very accurate considering the range of applications.
  • 19. 19 Results: AMD Opteron ● High range of power consumption that can be generated by the two platforms (up to 400 W for srv-opt-1 and 360 W for srv-opt2) ● The average error of our models is still very low: 5.4% for srv-opt-1 and 5.7% for srv-opt-2.
  • 21. 21 Results: Low power ● Intel Atom (srv-atom) & ARM Cortex-A (srv-arm). ● Very promising (as shown in Table 4). The average MAPE for srv-atom is 2.6% and for srv-arm is 5.2%. ● Lasso models provide good accuracy for NAS HPC jobs, but they show important errors for CloudSuite benchmarks.
  • 23. 23 Conclusions - A platform and application-agnostic methodology for full-system power modeling in heterogenous data centers. - Demonstrated with different power consumption profiles ranging from high- performance to low-power architectures. - Models provide high accuracy (around 5% of average estimation error).
  • 24. www.bsc.es Thank you! For further information please contact raimon.bosch@bsc.es 24