SlideShare a Scribd company logo
Performance Metrics and Ontology for
 Describing Performance Data of Grid
             Workflows

Hong-Linh Truong, Thomas Fahringer, Francesco Nerieri
             Distributed and Parallel Systems Group
      Institute for Computer Science, University of Innsbruck
                 {truong,tf,nero}@dps.uibk.ac.at
                      Schahram Dustdar
  Information Systems Institute, Vienna University of Technology
                  dustdar@infosys.tuwien.ac.at

                http://dps.uibk.ac.at/projects/pma

   1st Performability Workshop, CCGrid05, Cardiff 09 May, 2005
Outline
     Motivation
     Grid workflows and workflow execution model
     Performance metrics of Grid workflows
   WfPerfOnto: Ontology for describing performance
 data of Grid workflows
     Utilizing WfPerfOnto
     Conclusion and Future work



Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05   2
Motivation
  Lack of comprehensive study of useful performance metrics for
 Grid workflows
       A few metrics are studied and supported
       Most of metrics are being limited to the activity (task) level.
     study performance metrics at multiple levels of abstraction

    Describing and sharing performance data of Grid workflows
      Highly heterogeneous, inter-related and dynamic
      Inter-organizational
      Multiple types of performance and monitoring data provided by various
      tools
    an ontology for performance data
       • Can be used to describe concepts associated with workflow
         executions
       • Will facilitate the performance data sharing

Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05   3
Hierarchical Structure View of a Workflow
                                        <parallel>
             Workflow
                                          <activity name="mProject2">
                                                 <executable name="/home/truong/mProject2"/>
                                          </activity>
     Workflow Construct n
                                          <activity name="mProject1">
                                                 <executable name="/home/truong/mProject1"/>
                                          </activity>
             Activity m
                                        </parallel>


                                                           mProject1.c
      Invoked Application m
                                                                  int main() {

                                                                              A();
                                                                              while () {
  Code             Code           Code                                        ...
 Region 1        Region …        Region q                                      }

                                                                  }
Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05   4
Workflow Execution Model (Simplified)




                                                                    Local scheduler

       Workflow execution
        Spanning multiple Grid sites
        Highly inter-organizational, inter-related and dynamic
       Multiple levels of job scheduling
        At workflow execution engine (part of WfMS)
        At Grid sites
Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05   5
Performance Metrics of Grid Workflows
      Interesting performance metrics associated with multiple
    levels of abstraction
         Metrics can be used in workflow composition, for
         comparing different invoked applications of a single activity,
         etc.
      Five levels of abstraction
         Code region, Invoked application
         Activity,Workflow construct, Workflow
      Performance metrics of a lower level can be used to construct
    similar metrics for the immediate higher-level
         By using aggregate operator
         Based on metric definition and structure of workflows

Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05   6
Performance Metrics at Code Region Level
         Category                                               Metric
 Execution time            ElapsedTIme, UserCPUTime, SystemCPUTime, SerialTime,
                           EncodingTime
 Counter                   L2_TCM, L2_TCA, etc., (hardware counters)
                           NCalls, NSubs, RecvMsgCount, SendMsgCount
 Synchronization           CondSynTime, ExclSynTime
 Data Movement             TotalCommTime, TotalTransSize
 Ratio                     MeanElapsedTime, CommPerComp, MeanTransRate, MeanTranSize
                           CachMissRatio, MFLOPS, etc.
 Temporal overhead          temporal overhead of parallel code regions


     Most existing conventional performance tools provide these metrics
     Existing workflow monitoring and analysis tools normally do not
     Challenging issues
       Integrate conventional performance monitoring tools into workflow
       monitoring tools


Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05   7
Performance Metrics at Invoked Application
                 Level
      Most metrics can be constructed from metrics at code region
    level



                   Category                                           Metric
      Execution time                          ElapsedTime
                                              FailedTime
      Counter                                 NCallFailed
                                              NCalls
      Ratio                                   FailedFreq
      Performance Improvement                 SpeedupFactor




Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05   8
Performance Metrics at Activity Level
            Category                                            Metric
    Execution time            ElapsedTime, ProcessingTime, QueuingTime, SuspendingTime
                              FailedTime, SharedResTime
    Counter                   RedandantActivity, NIteration, PathSelectionRatio, ResUtilization
    Ratio                     Throughput, MeanTimePerState, TransRate
    Synchronization           SynDelay, ExecDelay
    Performance               SlowdownFactor
    Improvement

       Metrics can be defined for both activity and activity instance
      Aggregate metrics of an activity can be defined based on its
    instances and the execution of instances at runtime
      Challenging problems
        How to monitor and correlate metrics when a resource is
        shared among applications
Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05   9
Performance Metrics at Workflow Construct
                 Level
                 Category                                            Metric
    Execution time                          ElapsedTime, ProcessingTime
    Counter                                 RedandantActivity,
                                            NIteration, PathSelectionRatio, ResUtilization
    Load balancing                          LoadIm (Load imbalance)
    Performance Improvement                 SpeedupFactor
       Generic and construct-specific metrics
    Resource                                RedundantProcessing


      Aggregate metrics of a workflow construct/workflow construct
    instance are defined based on the structure of the construct. E.g.,
         LoadIm (load imbalance) is for parallel construct
         ElapsedTime/ProcessingTime is defined based on critical path




Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05   10
Performance Metrics at Workflow Level


                   Category                                            Metric
     Execution time                           ElapsedTime,ProcesingTime
                                              ParTime,SeqTime
     Ratio                                    QueuingRatio, MeanProcessingTime,
                                              MeanQueuingTime, ResUtilization
     Correlation                              NAPerRes,ProcInRes,LoadImRes
     Performance Improvement                  Speedup




Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05   11
Performance Metrics Ontology

       WfMetricOnto
        OWL-based performance metrics ontology




       Metrics ontology
        Specifies which performance metrics a tool can provide
        Simplifies the access to performance metrics provided by
        various tools

Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05   12
Monitoring and Measuring Performance
                    Metrics
       Performance monitoring and analysis tools
         Operate at multiple levels
         Correlate performance metrics from multiple levels
      Middleware and application instrumentation
        Instrument execution engine of WfMS
         • Execution engine can be distributed or centralized
        Instrument applications
         • Distributed, spanning multiple Grid sites
       Challenging problems: Performance tool and data complexity
         Integrate multiple performance monitoring tools executed
         on multiple Grid sites
         Integrate performance data produced by various tools
Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05   13
Ontology Describing Performance Data of
               Grid Workflows
      Objectives
           Understanding basic concepts associated with performance data of
           Grid workflows
           Performance data integration for Grid workflows
           Towards distributed/intelligent performance analysis

     WfPerfOnto (Ontology describing Performance data of Grid
    Workflows)
           Basic concepts
            • Concepts reflects the hierarchical view of a workflow
            • Static and dynamic performance and monitoring data of workflow
           Relationships
            • Static and dynamic relationships among concepts


Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05   14
Ontology for Describing Performance Data
              of Grid Workflows
       WfPerfOnto




Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05   15
Utilizing WfPerfOnto
      Describing Performance Data and Data Integration
        Different monitoring and analysis tools can store/export
        performance data in/to ontological representation
        High-level search and retrieval of performance data
      Knowledge base performance data of Grid workflows
        Utilized by high-level tools such as schedulers, workflow
        composition tools, etc.
        Used to re(discover) workflow patterns, interactions in
        workflows, to check correct execution, etc.
      Distributed Performance Analysis
         Performance analysis requests can be built based on
         WfPerfOnto
Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05   16
Utilizing WfPerfOnto: Describing
                      Performance Data
     <rdf:Description rdf:about="http://dps.uibk.ac.at/wfperfonto#mImgtbl21">
       <wfperfonto:hasPerfMetric rdf:resource="http://dps.uibk.ac.at/wfperfonto#ElapsedTime78"/>
       <rdf:type rdf:resource="http://dps.uibk.ac.at/wfperfonto#ActivityInstance"/>
       <wfperfonto:instanceName>mImgtbl21</wfperfonto:instanceName>
       <wfperfonto:hasPerfMetric rdf:resource="http://dps.uibk.ac.at/wfperfonto#QueuingTime80"/>
       <wfperfonto:ofActivity rdf:resource="http://dps.uibk.ac.at/wfperfonto#mImgtbl2"/>
       <wfperfonto:hasPerfMetric rdf:resource="http://dps.uibk.ac.at/wfperfonto#ProcessingTime79"/>
      </rdf:Description>




Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05         17
Utilizing WfPerfOnto: Checking
                                        Correct Execution




                                <rdf:Description rdf:about="http://dps.uibk.ac.at/wfperfonto#Seq4ForkJoin5">
                                   <wfperfonto:hasActivityInstance rdf:resource="http://dps.uibk.ac.at/wfperfonto#tRawImage4"/>
                                   <wfperfonto:hasActivityInstance rdf:resource="http://dps.uibk.ac.at/wfperfonto#tProjectedImage4"/>
                                   <wfperfonto:hasPerfMetric rdf:resource="http://dps.uibk.ac.at/wfperfonto#ElapsedTime57"/>
                                   <wfperfonto:hasPerfMetric rdf:resource="http://dps.uibk.ac.at/wfperfonto#QueuingTime59"/>
                                   <wfperfonto:hasActivityInstance rdf:resource="http://dps.uibk.ac.at/wfperfonto#mProject14"/>
                                   <wfperfonto:hasActivityInstance rdf:resource="http://dps.uibk.ac.at/wfperfonto#mImgtbl14"/>
                                   <rdf:type rdf:resource="http://dps.uibk.ac.at/wfperfonto#WorkflowConstructInstance"/>
                                   <wfperfonto:ofWorkflowConstruct rdf:resource="http://dps.uibk.ac.at/wfperfonto#SeqForkJoin"/>
                                   <wfperfonto:hasPerfMetric rdf:resource="http://dps.uibk.ac.at/wfperfonto#ProcessingTime58"/>
                                   <wfperfonto:instanceName>Seq4ForkJoin5</wfperfonto:instanceName>
                                 </rdf:Description>


Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05                              18
Utilizing WfPerfOnto: Distributed Performance
                       Analysis
                        Clients                  External               Knowledge
                                                  Tools                Builder Agent


               DIPAS                   Grid analysis
                    Grid analysis         agent
                       agent
                                               Grid analysis
                                                  agent
                                                                               GOM
                    Grid analysis
                       agent
                                                       Grid analysis
                                                          agent




                                       Monitoring Service


                          Resources                              Applications
Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05   19
Utilizing WfPerfOnto: Analysis Request


                                     Requests based on           Analysis
                                     WfPerfOnto                   agent
                                                                                   Ontological
                                                                                      data
                                                                Monitoring
                                                                  agent
                                                                         Grid analysis agent




                                                            To the Monitoring
                                                            Service




Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05    20
Conclusion and Future Work
      Performance metrics of Grid workflows that characterize
    the performance and dependability of Grid workflows; metrics
    associated with multiple levels of abstraction
      Ontology describing performance data of Grid workflows
      Current implementation
           OWL-based ontologies, Jena toolkit for processing ontology-related
           task
           Store and export performance data in/to WfPerfOnto representation

      Future work
           Extend and revise performance metrics and WfPerfOnto
           Distributed performance analysis
           Reasoning performance data
     Shared conceptualization                         community work?
Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05   21

More Related Content

PDF
Performance Analysis of Grid Workflows in K-WfGrid and ASKALON
PDF
An Integrated Framework for Parameter-based Optimization of Scientific Workflows
PDF
Scheduling and Allocation Algorithm for an Elliptic Filter
PDF
Service Request Scheduling in Cloud Computing using Meta-Heuristic Technique:...
PDF
Optimized Assignment of Independent Task for Improving Resources Performance ...
PDF
Scheduling of Heterogeneous Tasks in Cloud Computing using Multi Queue (MQ) A...
PDF
A novel methodology for task distribution
PDF
Using the black-box approach with machine learning methods in ...
Performance Analysis of Grid Workflows in K-WfGrid and ASKALON
An Integrated Framework for Parameter-based Optimization of Scientific Workflows
Scheduling and Allocation Algorithm for an Elliptic Filter
Service Request Scheduling in Cloud Computing using Meta-Heuristic Technique:...
Optimized Assignment of Independent Task for Improving Resources Performance ...
Scheduling of Heterogeneous Tasks in Cloud Computing using Multi Queue (MQ) A...
A novel methodology for task distribution
Using the black-box approach with machine learning methods in ...

What's hot (13)

PDF
J0210053057
PPTX
Operating system 23 process synchronization
PDF
In-Memory Compute Grids… Explained
PDF
Intelligent Workload Management in Virtualized Cloud Environment
PDF
IRJET- Application of Artificial Neural Networking Technique for the Lifecycl...
DOCX
Job shop scheduling problem using genetic algorithm
PDF
PDF
Improved Max-Min Scheduling Algorithm
PDF
IRJET- Time and Resource Efficient Task Scheduling in Cloud Computing Environ...
PDF
A New Approach for Dynamic Load Balancing Using Simulation In Grid Computing
PDF
IRJET- Enhance Dynamic Heterogeneous Shortest Job first (DHSJF): A Task Schedu...
PDF
IRJET-Hardware Co-Simulation of Classical Edge Detection Algorithms using Xil...
PDF
A Framework and Methods for Dynamic Scheduling of a Directed Acyclic Graph on...
J0210053057
Operating system 23 process synchronization
In-Memory Compute Grids… Explained
Intelligent Workload Management in Virtualized Cloud Environment
IRJET- Application of Artificial Neural Networking Technique for the Lifecycl...
Job shop scheduling problem using genetic algorithm
Improved Max-Min Scheduling Algorithm
IRJET- Time and Resource Efficient Task Scheduling in Cloud Computing Environ...
A New Approach for Dynamic Load Balancing Using Simulation In Grid Computing
IRJET- Enhance Dynamic Heterogeneous Shortest Job first (DHSJF): A Task Schedu...
IRJET-Hardware Co-Simulation of Classical Edge Detection Algorithms using Xil...
A Framework and Methods for Dynamic Scheduling of a Directed Acyclic Graph on...
Ad

Viewers also liked (19)

PPTX
DataEngConf: Apache Kafka at Rocana: a scalable, distributed log for machine ...
PDF
Ceph Day San Jose - From Zero to Ceph in One Minute
PPTX
Ceph Day San Jose - Ceph at Salesforce
PPTX
Ceph Day San Jose - Red Hat Storage Acceleration Utlizing Flash Technology
PDF
Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server
PDF
Ceph Day San Jose - HA NAS with CephFS
PPTX
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
PPTX
Ceph Day San Jose - Ceph in a Post-Cloud World
PDF
Ceph Day Tokyo - Bit-Isle's 3 years footprint with Ceph
PPTX
Ceph Day Tokyo - High Performance Layered Architecture
PPTX
Ceph Day Tokyo - Bring Ceph to Enterprise
PPTX
Ceph Day Tokyo - Delivering cost effective, high performance Ceph cluster
PPTX
Ceph Day Tokyo - Ceph Community Update
PDF
Ceph Day Tokyo -- Ceph on All-Flash Storage
PDF
Ceph Day Tokyo - Ceph on ARM: Scaleable and Efficient
PPTX
Connected Vehicle Data Platform
PDF
Ceph Day San Jose - Object Storage for Big Data
PPTX
Double Your Hadoop Hardware Performance with SmartSense
DataEngConf: Apache Kafka at Rocana: a scalable, distributed log for machine ...
Ceph Day San Jose - From Zero to Ceph in One Minute
Ceph Day San Jose - Ceph at Salesforce
Ceph Day San Jose - Red Hat Storage Acceleration Utlizing Flash Technology
Ceph Day San Jose - All-Flahs Ceph on NUMA-Balanced Server
Ceph Day San Jose - HA NAS with CephFS
Ceph Day San Jose - Enable Fast Big Data Analytics on Ceph with Alluxio
Ceph Day San Jose - Ceph in a Post-Cloud World
Ceph Day Tokyo - Bit-Isle's 3 years footprint with Ceph
Ceph Day Tokyo - High Performance Layered Architecture
Ceph Day Tokyo - Bring Ceph to Enterprise
Ceph Day Tokyo - Delivering cost effective, high performance Ceph cluster
Ceph Day Tokyo - Ceph Community Update
Ceph Day Tokyo -- Ceph on All-Flash Storage
Ceph Day Tokyo - Ceph on ARM: Scaleable and Efficient
Connected Vehicle Data Platform
Ceph Day San Jose - Object Storage for Big Data
Double Your Hadoop Hardware Performance with SmartSense
Ad

Similar to Performance Metrics and Ontology for Describing Performance Data of Grid Workflows (20)

PPT
Instrumentation and measurement
PPTX
Evolution of netflix conductor
PDF
Queuing model based load testing of large enterprise applications
PDF
IRJET - Hardware Benchmarking Application
PDF
2006 mm,ks,jb (miami, florida bpm summit) xpdl tutorial
PDF
IRJET- Scheduling of Independent Tasks over Virtual Machines on Computati...
PPTX
Software Defined Service Networking (SDSN) - by Dr. Indika Kumara
PDF
Exploring Neo4j Graph Database as a Fast Data Access Layer
PPT
Software Engineering
PDF
Best Practices: How to Analyze IoT Sensor Data with InfluxDB
PPTX
Von neumann workers
PPTX
Webinar september 2013
PDF
Score based deadline constrained workflow scheduling algorithm for cloud systems
PPTX
The differing ways to monitor and instrument
PDF
Introduction to enterprise applications capacity planning
PDF
PID2143641
DOC
Qtp interview questions
DOC
Qtp interview questions
PPT
Probe Debugging
Instrumentation and measurement
Evolution of netflix conductor
Queuing model based load testing of large enterprise applications
IRJET - Hardware Benchmarking Application
2006 mm,ks,jb (miami, florida bpm summit) xpdl tutorial
IRJET- Scheduling of Independent Tasks over Virtual Machines on Computati...
Software Defined Service Networking (SDSN) - by Dr. Indika Kumara
Exploring Neo4j Graph Database as a Fast Data Access Layer
Software Engineering
Best Practices: How to Analyze IoT Sensor Data with InfluxDB
Von neumann workers
Webinar september 2013
Score based deadline constrained workflow scheduling algorithm for cloud systems
The differing ways to monitor and instrument
Introduction to enterprise applications capacity planning
PID2143641
Qtp interview questions
Qtp interview questions
Probe Debugging

More from Hong-Linh Truong (20)

PDF
QoA4ML – A Framework for Supporting Contracts in Machine Learning Services
PDF
Sharing Blockchain Performance Knowledge for Edge Service Development
PDF
Measuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
PDF
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
PDF
Dynamic IoT data, protocol, and middleware interoperability with resource sli...
PDF
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
PDF
Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
PDF
Characterizing Incidents in Cloud-based IoT Data Analytics
PDF
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
PDF
Analytics of Performance and Data Quality for Mobile Edge Cloud Applications
PDF
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
PDF
Deep Context-Awareness: Context Coupling and New Types of Context Information...
PDF
Managing and Testing Ensembles of IoT, Network functions, and Clouds
PDF
Towards a Resource Slice Interoperability Hub for IoT
PDF
On Supporting Contract-aware IoT Dataspace Services
PDF
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
PDF
On Engineering Analytics of Elastic IoT Cloud Systems
PDF
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
PDF
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
PDF
Governing Elastic IoT Cloud Systems under Uncertainties
QoA4ML – A Framework for Supporting Contracts in Machine Learning Services
Sharing Blockchain Performance Knowledge for Edge Service Development
Measuring, Quantifying, & Predicting the Cost-Accuracy Tradeoff
DevOps for Dynamic Interoperability of IoT, Edge and Cloud Systems
Dynamic IoT data, protocol, and middleware interoperability with resource sli...
Integrated Analytics for IIoT Predictive Maintenance using IoT Big Data Cloud...
Modeling and Provisioning IoT Cloud Systems for Testing Uncertainties
Characterizing Incidents in Cloud-based IoT Data Analytics
Enabling Edge Analytics of IoT Data: The Case of LoRaWAN
Analytics of Performance and Data Quality for Mobile Edge Cloud Applications
Testing Uncertainty of Cyber-Physical Systems in IoT Cloud Infrastructures: C...
Deep Context-Awareness: Context Coupling and New Types of Context Information...
Managing and Testing Ensembles of IoT, Network functions, and Clouds
Towards a Resource Slice Interoperability Hub for IoT
On Supporting Contract-aware IoT Dataspace Services
Towards the Realization of Multi-dimensional Elasticity for Distributed Cloud...
On Engineering Analytics of Elastic IoT Cloud Systems
HINC – Harmonizing Diverse Resource Information Across IoT, Network Functions...
SINC – An Information-Centric Approach for End-to-End IoT Cloud Resource Prov...
Governing Elastic IoT Cloud Systems under Uncertainties

Recently uploaded (20)

PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
01-Introduction-to-Information-Management.pdf
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PPTX
Pharma ospi slides which help in ospi learning
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
Pre independence Education in Inndia.pdf
PDF
RMMM.pdf make it easy to upload and study
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
Complications of Minimal Access Surgery at WLH
PDF
Classroom Observation Tools for Teachers
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PPTX
master seminar digital applications in india
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PDF
Insiders guide to clinical Medicine.pdf
PPTX
Microbial diseases, their pathogenesis and prophylaxis
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
01-Introduction-to-Information-Management.pdf
Renaissance Architecture: A Journey from Faith to Humanism
Pharma ospi slides which help in ospi learning
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Pre independence Education in Inndia.pdf
RMMM.pdf make it easy to upload and study
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Complications of Minimal Access Surgery at WLH
Classroom Observation Tools for Teachers
Supply Chain Operations Speaking Notes -ICLT Program
Pharmacology of Heart Failure /Pharmacotherapy of CHF
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
master seminar digital applications in india
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
Insiders guide to clinical Medicine.pdf
Microbial diseases, their pathogenesis and prophylaxis

Performance Metrics and Ontology for Describing Performance Data of Grid Workflows

  • 1. Performance Metrics and Ontology for Describing Performance Data of Grid Workflows Hong-Linh Truong, Thomas Fahringer, Francesco Nerieri Distributed and Parallel Systems Group Institute for Computer Science, University of Innsbruck {truong,tf,nero}@dps.uibk.ac.at Schahram Dustdar Information Systems Institute, Vienna University of Technology dustdar@infosys.tuwien.ac.at http://dps.uibk.ac.at/projects/pma 1st Performability Workshop, CCGrid05, Cardiff 09 May, 2005
  • 2. Outline Motivation Grid workflows and workflow execution model Performance metrics of Grid workflows WfPerfOnto: Ontology for describing performance data of Grid workflows Utilizing WfPerfOnto Conclusion and Future work Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 2
  • 3. Motivation Lack of comprehensive study of useful performance metrics for Grid workflows A few metrics are studied and supported Most of metrics are being limited to the activity (task) level. study performance metrics at multiple levels of abstraction Describing and sharing performance data of Grid workflows Highly heterogeneous, inter-related and dynamic Inter-organizational Multiple types of performance and monitoring data provided by various tools an ontology for performance data • Can be used to describe concepts associated with workflow executions • Will facilitate the performance data sharing Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 3
  • 4. Hierarchical Structure View of a Workflow <parallel> Workflow <activity name="mProject2"> <executable name="/home/truong/mProject2"/> </activity> Workflow Construct n <activity name="mProject1"> <executable name="/home/truong/mProject1"/> </activity> Activity m </parallel> mProject1.c Invoked Application m int main() { A(); while () { Code Code Code ... Region 1 Region … Region q } } Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 4
  • 5. Workflow Execution Model (Simplified) Local scheduler Workflow execution Spanning multiple Grid sites Highly inter-organizational, inter-related and dynamic Multiple levels of job scheduling At workflow execution engine (part of WfMS) At Grid sites Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 5
  • 6. Performance Metrics of Grid Workflows Interesting performance metrics associated with multiple levels of abstraction Metrics can be used in workflow composition, for comparing different invoked applications of a single activity, etc. Five levels of abstraction Code region, Invoked application Activity,Workflow construct, Workflow Performance metrics of a lower level can be used to construct similar metrics for the immediate higher-level By using aggregate operator Based on metric definition and structure of workflows Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 6
  • 7. Performance Metrics at Code Region Level Category Metric Execution time ElapsedTIme, UserCPUTime, SystemCPUTime, SerialTime, EncodingTime Counter L2_TCM, L2_TCA, etc., (hardware counters) NCalls, NSubs, RecvMsgCount, SendMsgCount Synchronization CondSynTime, ExclSynTime Data Movement TotalCommTime, TotalTransSize Ratio MeanElapsedTime, CommPerComp, MeanTransRate, MeanTranSize CachMissRatio, MFLOPS, etc. Temporal overhead temporal overhead of parallel code regions Most existing conventional performance tools provide these metrics Existing workflow monitoring and analysis tools normally do not Challenging issues Integrate conventional performance monitoring tools into workflow monitoring tools Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 7
  • 8. Performance Metrics at Invoked Application Level Most metrics can be constructed from metrics at code region level Category Metric Execution time ElapsedTime FailedTime Counter NCallFailed NCalls Ratio FailedFreq Performance Improvement SpeedupFactor Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 8
  • 9. Performance Metrics at Activity Level Category Metric Execution time ElapsedTime, ProcessingTime, QueuingTime, SuspendingTime FailedTime, SharedResTime Counter RedandantActivity, NIteration, PathSelectionRatio, ResUtilization Ratio Throughput, MeanTimePerState, TransRate Synchronization SynDelay, ExecDelay Performance SlowdownFactor Improvement Metrics can be defined for both activity and activity instance Aggregate metrics of an activity can be defined based on its instances and the execution of instances at runtime Challenging problems How to monitor and correlate metrics when a resource is shared among applications Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 9
  • 10. Performance Metrics at Workflow Construct Level Category Metric Execution time ElapsedTime, ProcessingTime Counter RedandantActivity, NIteration, PathSelectionRatio, ResUtilization Load balancing LoadIm (Load imbalance) Performance Improvement SpeedupFactor Generic and construct-specific metrics Resource RedundantProcessing Aggregate metrics of a workflow construct/workflow construct instance are defined based on the structure of the construct. E.g., LoadIm (load imbalance) is for parallel construct ElapsedTime/ProcessingTime is defined based on critical path Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 10
  • 11. Performance Metrics at Workflow Level Category Metric Execution time ElapsedTime,ProcesingTime ParTime,SeqTime Ratio QueuingRatio, MeanProcessingTime, MeanQueuingTime, ResUtilization Correlation NAPerRes,ProcInRes,LoadImRes Performance Improvement Speedup Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 11
  • 12. Performance Metrics Ontology WfMetricOnto OWL-based performance metrics ontology Metrics ontology Specifies which performance metrics a tool can provide Simplifies the access to performance metrics provided by various tools Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 12
  • 13. Monitoring and Measuring Performance Metrics Performance monitoring and analysis tools Operate at multiple levels Correlate performance metrics from multiple levels Middleware and application instrumentation Instrument execution engine of WfMS • Execution engine can be distributed or centralized Instrument applications • Distributed, spanning multiple Grid sites Challenging problems: Performance tool and data complexity Integrate multiple performance monitoring tools executed on multiple Grid sites Integrate performance data produced by various tools Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 13
  • 14. Ontology Describing Performance Data of Grid Workflows Objectives Understanding basic concepts associated with performance data of Grid workflows Performance data integration for Grid workflows Towards distributed/intelligent performance analysis WfPerfOnto (Ontology describing Performance data of Grid Workflows) Basic concepts • Concepts reflects the hierarchical view of a workflow • Static and dynamic performance and monitoring data of workflow Relationships • Static and dynamic relationships among concepts Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 14
  • 15. Ontology for Describing Performance Data of Grid Workflows WfPerfOnto Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 15
  • 16. Utilizing WfPerfOnto Describing Performance Data and Data Integration Different monitoring and analysis tools can store/export performance data in/to ontological representation High-level search and retrieval of performance data Knowledge base performance data of Grid workflows Utilized by high-level tools such as schedulers, workflow composition tools, etc. Used to re(discover) workflow patterns, interactions in workflows, to check correct execution, etc. Distributed Performance Analysis Performance analysis requests can be built based on WfPerfOnto Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 16
  • 17. Utilizing WfPerfOnto: Describing Performance Data <rdf:Description rdf:about="http://dps.uibk.ac.at/wfperfonto#mImgtbl21"> <wfperfonto:hasPerfMetric rdf:resource="http://dps.uibk.ac.at/wfperfonto#ElapsedTime78"/> <rdf:type rdf:resource="http://dps.uibk.ac.at/wfperfonto#ActivityInstance"/> <wfperfonto:instanceName>mImgtbl21</wfperfonto:instanceName> <wfperfonto:hasPerfMetric rdf:resource="http://dps.uibk.ac.at/wfperfonto#QueuingTime80"/> <wfperfonto:ofActivity rdf:resource="http://dps.uibk.ac.at/wfperfonto#mImgtbl2"/> <wfperfonto:hasPerfMetric rdf:resource="http://dps.uibk.ac.at/wfperfonto#ProcessingTime79"/> </rdf:Description> Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 17
  • 18. Utilizing WfPerfOnto: Checking Correct Execution <rdf:Description rdf:about="http://dps.uibk.ac.at/wfperfonto#Seq4ForkJoin5"> <wfperfonto:hasActivityInstance rdf:resource="http://dps.uibk.ac.at/wfperfonto#tRawImage4"/> <wfperfonto:hasActivityInstance rdf:resource="http://dps.uibk.ac.at/wfperfonto#tProjectedImage4"/> <wfperfonto:hasPerfMetric rdf:resource="http://dps.uibk.ac.at/wfperfonto#ElapsedTime57"/> <wfperfonto:hasPerfMetric rdf:resource="http://dps.uibk.ac.at/wfperfonto#QueuingTime59"/> <wfperfonto:hasActivityInstance rdf:resource="http://dps.uibk.ac.at/wfperfonto#mProject14"/> <wfperfonto:hasActivityInstance rdf:resource="http://dps.uibk.ac.at/wfperfonto#mImgtbl14"/> <rdf:type rdf:resource="http://dps.uibk.ac.at/wfperfonto#WorkflowConstructInstance"/> <wfperfonto:ofWorkflowConstruct rdf:resource="http://dps.uibk.ac.at/wfperfonto#SeqForkJoin"/> <wfperfonto:hasPerfMetric rdf:resource="http://dps.uibk.ac.at/wfperfonto#ProcessingTime58"/> <wfperfonto:instanceName>Seq4ForkJoin5</wfperfonto:instanceName> </rdf:Description> Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 18
  • 19. Utilizing WfPerfOnto: Distributed Performance Analysis Clients External Knowledge Tools Builder Agent DIPAS Grid analysis Grid analysis agent agent Grid analysis agent GOM Grid analysis agent Grid analysis agent Monitoring Service Resources Applications Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 19
  • 20. Utilizing WfPerfOnto: Analysis Request Requests based on Analysis WfPerfOnto agent Ontological data Monitoring agent Grid analysis agent To the Monitoring Service Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 20
  • 21. Conclusion and Future Work Performance metrics of Grid workflows that characterize the performance and dependability of Grid workflows; metrics associated with multiple levels of abstraction Ontology describing performance data of Grid workflows Current implementation OWL-based ontologies, Jena toolkit for processing ontology-related task Store and export performance data in/to WfPerfOnto representation Future work Extend and revise performance metrics and WfPerfOnto Distributed performance analysis Reasoning performance data Shared conceptualization community work? Performance Metrics and Ontology for Describing Performance Data of Grid Workflows, CCGrid 05 21