© 2005 IBM Corporation | October 18, 2005 |
Eric Labadie
IBM Rational Software
Ashish Patel
IBM Rational Software
Using and Extending the Eclipse Test and
Performance Tools Platform (TPTP) for Data
Collection in Self-Healing Systems.
2 Using and Extending the Eclipse TPTP for Data Collection in Self-Healing Systems. | © 2005 IBM Corporation
Agenda
 Problem Determination
 Eclipse Project
 Test and Performance Tools Platform (TPTP)
 Architecture
 Eclipse Modeling Framework (EMF) Data Models
 Common Base Events (CBEs)
 Agent Controller
 Autonomic Computing and Self-Healing
 Demonstration
 Conclusion
Problem Determination - Yesterday
 Disparate pieces and parts
 Tools focused on individual products
 No common interfaces among tools
 No synergies in building tools OR in creating log entries
Problem Determination - Today
 Generic log adapter
 Common format for log files
 Common set of tools
 Common interfaces among tools
[Diagram: Database, Networks, Application Server, Servers, Storage devices and Applications feed Adapters, which emit Common Base Events. Common Base Event submitted to OASIS.]
Eclipse and TPTP
 An open-source project dedicated to providing a universal, robust,
full-featured and commercial-quality platform, written in Java, for
developing Integrated Development Environments (IDEs).
 Founded by IBM and other industry leaders in November 2001.
 TPTP – Test and Performance Tools Platform (2002)
(Formerly called Hyades)
 Extensible framework and infrastructure that embraces automated
testing, trace, profiling, monitoring, and asset management.
 Includes a unified data model, a normative user experience and
workflow, and a united set of APIs and reference tools that work
consistently across a range of targets.
 Computer Associates, Compuware, FOKUS, IBM, Intel, OC Systems,
SAP, Scapa Technologies
TPTP Project - Architecture
[Architecture diagram. Eclipse Platform: Presentation System (UI for Test Creation and Execution, Trace Analysis and Profiling, Runtime Monitor/Log, and Artifact Management; Standard Widgets and Core Plug-ins; Reference Perspectives and Workflow); EMF Data Models (Test, Trace, Log, Statistical), with the labels XMI, Real Time, Export and Data Loader; Agent Control Interface. Target System: Application and Execution Environment; Data Collection (Test Engine, Testability Interface, Injection, Correlation, Log Collection, Trace Collection, System Performance Monitor, JVMPI Monitor, JVMTI Monitor); Agent Controller (Distributed Control Framework, Distributed Data Collection Framework); Agent Control Interface and Data Collection Interface.]
Data Collection – Trace Agent
 Java Virtual Machine Profiling Interface (JVMPI) agent implementation for
profiling local or remote Java applications from a local TPTP workbench to
capture and record an application's behavior.
 Used to find performance details, such as the classes or methods responsible
for poor execution performance, and to analyze the application heap and
locate memory leaks.
 Emits trace XML fragments based on a structured Event Specification for
Java Profiling (trace.xsd).
 May be launched from the TPTP workbench or invoked using the -Xrun JVM
command line option.
 Used to persist stand-alone application and distributed system trace data.
 A trace XML loader at the client side receives and parses the trace XML
fragments returned from any agent.
 Hierarchical data model capturing interaction at various levels of the execution
stacks and heap (e.g. method X in class Y on thread Z).
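The loader step on this slide can be pictured with a small sketch. It is an illustration only: the `methodEntry` element and its `thread`, `class` and `method` attributes are simplified, hypothetical names, not the actual trace.xsd vocabulary, and the nested maps merely stand in for the hierarchical trace model (thread, then class, then method).

```java
import java.io.StringReader;
import java.util.Map;
import java.util.TreeMap;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

final class TraceLoaderSketch {

    /** Returns thread -> class -> method -> number of method entries seen. */
    static Map<String, Map<String, Map<String, Integer>>> load(String fragments) {
        // Fragments arrive without a common root, so wrap them before parsing.
        String xml = "<trace>" + fragments + "</trace>";
        NodeList entries;
        try {
            entries = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)))
                    .getElementsByTagName("methodEntry"); // hypothetical element name
        } catch (Exception e) {
            throw new IllegalArgumentException("malformed trace fragments", e);
        }

        Map<String, Map<String, Map<String, Integer>>> model = new TreeMap<>();
        for (int i = 0; i < entries.getLength(); i++) {
            Element e = (Element) entries.item(i);
            // Walk down the hierarchy, creating levels on demand, then count the entry.
            model.computeIfAbsent(e.getAttribute("thread"), k -> new TreeMap<>())
                 .computeIfAbsent(e.getAttribute("class"), k -> new TreeMap<>())
                 .merge(e.getAttribute("method"), 1, Integer::sum);
        }
        return model;
    }

    public static void main(String[] args) {
        String fragments =
              "<methodEntry thread=\"Z\" class=\"Y\" method=\"X\"/>"
            + "<methodEntry thread=\"Z\" class=\"Y\" method=\"X\"/>"
            + "<methodEntry thread=\"main\" class=\"App\" method=\"run\"/>";
        System.out.println(load(fragments));
    }
}
```

Wrapping the fragments in a synthetic root keeps the sketch short; a streaming parser would be the more natural choice for data that arrives in real time.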
Data Collection – Logging Agents
 XML message-based agent for real-time monitoring of message-generating
facilities (e.g. loggers).
 Extensible architecture for crafting proprietary Logging Agents:
 Defined APIs and programming model.
 C implementation with a Java wrapper for supporting C/C++ and Java
runtimes.
 TPTP provides Common Base Event and Logging Agent standalone and
plug-in support for the following popular logging facilities:
 Jakarta Apache Commons
 Java Logging (JSR-047)
 Jakarta Apache Log4J
Common Base Event - Details
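As a rough illustration of what a Common Base Event carries, the sketch below turns a `java.util.logging` record into a CBE-like XML fragment. It is not the TPTP logging agent API: the class name `CbeLikeHandler` is hypothetical, the fragment is a simplified subset that would not validate against the real schema, and the numeric severity mapping is an assumption made for the example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.logging.Handler;
import java.util.logging.Level;
import java.util.logging.LogRecord;
import java.util.logging.Logger;

final class CbeLikeHandler extends Handler {
    /** Fragments produced so far; a real agent would forward these to a client. */
    final List<String> emitted = new ArrayList<>();

    @Override public void publish(LogRecord r) {
        if (isLoggable(r)) emitted.add(toXml(r));
    }
    @Override public void flush() { }
    @Override public void close() { }

    /** Builds a simplified, CBE-like fragment: message plus metadata. */
    static String toXml(LogRecord r) {
        return "<CommonBaseEvent creationTime=\"" + r.getInstant() + "\""
            + " severity=\"" + severity(r.getLevel()) + "\""
            + " msg=\"" + escape(r.getMessage()) + "\">"
            + "<sourceComponentId component=\"" + escape(r.getLoggerName()) + "\"/>"
            + "</CommonBaseEvent>";
    }

    /** Assumed mapping for illustration: higher numbers mean more severe. */
    static int severity(Level level) {
        if (level.intValue() >= Level.SEVERE.intValue()) return 50;
        if (level.intValue() >= Level.WARNING.intValue()) return 30;
        return 10;
    }

    static String escape(String s) {
        if (s == null) return "";
        return s.replace("&", "&amp;").replace("<", "&lt;")
                .replace(">", "&gt;").replace("\"", "&quot;");
    }

    public static void main(String[] args) {
        Logger log = Logger.getLogger("db-server");
        log.setUseParentHandlers(false); // keep console output to our fragments only
        CbeLikeHandler handler = new CbeLikeHandler();
        log.addHandler(handler);
        log.warning("Connection pool exhausted");
        handler.emitted.forEach(System.out::println);
    }
}
```

The point of the sketch is the shape of the event: a textual message plus metadata (timestamp, severity, source component) in one common format, whatever logger produced it.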
Autonomic Computing
Level 1, Basic: Manual analysis and problem solving
Level 2, Managed: Centralized tools, manual actions
Level 3, Predictive: Cross-resource correlation and guidance
Level 4, Adaptive: System monitors, correlates and takes action
Level 5, Autonomic: Dynamic business policy based management
Evolution not revolution
Autonomic Computing – Self-Healing
[Diagram: Database, Networks, Application Server, Servers, Storage devices and Applications connect through Adapters to an Enterprise Service Bus carrying Common Base Events. Other labels: Symptom, Knowledge, Feedback, Configuration.]
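The knowledge-and-feedback idea in this diagram can be sketched in a few lines. This is a toy model under stated assumptions, not an autonomic manager: `SymptomCatalogSketch`, `learn` and `diagnose` are hypothetical names, and a symptom is reduced to a message fragment paired with a corrective action.

```java
import java.util.LinkedHashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Optional;

final class SymptomCatalogSketch {
    // Knowledge: message fragment from a past human diagnosis -> corrective action.
    private final Map<String, String> knowledge = new LinkedHashMap<>();

    /** Records a diagnosis so that later events can be matched against it. */
    void learn(String msgFragment, String action) {
        knowledge.put(msgFragment.toLowerCase(Locale.ROOT), action);
    }

    /** Returns the action of the first known symptom found in the event message. */
    Optional<String> diagnose(String eventMsg) {
        String msg = eventMsg.toLowerCase(Locale.ROOT);
        return knowledge.entrySet().stream()
                .filter(e -> msg.contains(e.getKey()))
                .map(Map.Entry::getValue)
                .findFirst();
    }

    public static void main(String[] args) {
        SymptomCatalogSketch catalog = new SymptomCatalogSketch();
        catalog.learn("pool exhausted", "restart connection pool");

        // Known symptom: the feedback path can act without a human.
        System.out.println(catalog.diagnose("Connection pool exhausted on db-server"));
        // Unknown symptom: empty result, so the event goes to a human for diagnosis.
        System.out.println(catalog.diagnose("Disk quota exceeded"));
    }
}
```

An empty result is the signal to fall back to manual analysis; once a person records the cause and fix with `learn`, the same event can be handled automatically next time.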
Demonstration
Conclusion
 TPTP Logging and Monitoring Tools provide a centralized, open-source and
extensible mechanism for end-users, administrators, field service engineers
and developers to decrease problem determination costs and improve
system quality.
 Collects and consolidates log and trace data from disparate systems into a
single management tool.
 Consumes a common event format for viewing, navigating, sorting, filtering
and searching large amounts of log and trace data.
 Correlation determines a set of related events, thereby visualizing control
flow across distributed systems.
 Analysis provides explanations and possible solutions for known problems.
 Future extensions include Self-Healing capabilities using the analysis from
known problems.
Thank you.
Eric Labadie
labadie@ca.ibm.com
Ashish Patel
ashishp@ca.ibm.com
Data Collection – Statistical Agents
 XML message-based agent for real-time monitoring of operating systems,
web servers, database servers and application servers:
 Windows, Linux, JBoss and JOnAS Application Server, Apache/IHS, MySQL
 Emits statistical XML fragments based on a structured schema
(statistical.xsd).
 Examples of operating system metrics include:
 CPU usage
 Network activity
 Disk activity
 Memory usage
 Views to effectively visualize the large amount and varying types of statistical
data in real-time.
 Extensible architecture for crafting proprietary Statistical Agents:
 Defined APIs and programming model.
TPTP EMF Data Models – Statistical
 Used to persist arbitrary numerical data over time.
 Hierarchical data model capturing structured statistical data defined by
descriptors.
 Descriptors may contain child counter descriptors.
 Descriptors contain snapshot observations that contain actual typed
statistical data:
 Discrete (integer/time)
 Contiguous (float/time)
 String (String/time)

Editor's Notes

  • #3: Our focus: how to use and extend the logging and monitoring tooling in the Eclipse Test and Performance Tools Platform (TPTP) for data collection in self-healing systems. We begin by exploring the motivation and history of the TPTP project, then explain TPTP's architecture (with a focus on the project's extensibility and data collection components) and how these tools are used in the vision of Autonomic Computing. A demo illustrates the collection of log and statistical data from a sample distributed two-tier application in real time.
  • #4: Today’s complexity: computing systems now entail dozens of subsystems (e.g. servers and databases) and hundreds of components (e.g. clients and applications), all connected across internal and external networks, together creating distributed systems. Outages of business-critical systems cost up to $2.8M US per year. A system-wide outage requires a set of skilled workers to determine the root cause of the failure; this is where Problem Determination begins. Problem Determination is essentially that: investigating and determining the root cause of the failure of a system (or a set of systems). Goal of Problem Determination: lower costs and greater stability. Yesterday (before Autonomic Computing): each subsystem produces its own Problem Determination data (PD data), such as traces and logs, which is viewed and analyzed by different people who have different tools. Think of all the systems as speaking different languages (German, English, Japanese, etc.). To facilitate PD, a centralized tool that understands all these systems is required, and to achieve this tool there must be a common language format. Therefore, PD data is transformed into CBEs to create a common format across all the disparate systems. Today (with Autonomic Computing): centralized tooling collects and consolidates PD data from disparate and distributed systems into a single management tool. The common event format enables (1) viewing, (2) navigating, (3) sorting, (4) filtering, (5) correlating and (6) searching large amounts of log and trace data across distributed systems. CBE: originated from IBM Research as a mechanism to cheaply enable Common Base Event compliance in legacy computing systems. Based on open standards to be useful and to promote widespread adoption in a heterogeneous environment.
  • #5: Eclipse: an open-source, universal tools platform. It began as a Java development environment and was extended to various IDEs. Founded in 2001 by IBM and other industry leaders. TPTP – Test and Performance Tools Platform (2002): a project formerly known as Hyades. It is an extensible framework for automated testing, trace, profiling, monitoring, and asset management, composed of a unified data model and a united set of APIs and reference tools. TPTP provides the manual tools created for self-managing systems.
  • #6: Organized around a client-server model in which client and server may or may not be on the same physical machine. Client: a local or remote application (e.g. the Eclipse Workbench) that is the terminal destination of host process data. Server: comprises the host process, the agent and the Agent Controller. Host process: the application under test. Agent: a component that provides services to the host process for transferring application data to attached clients. Agent Controller: a standalone component used to launch new processes and attach to agents that coexist within processes. It provides an extensible agent architecture for control capabilities and native data collection. Agents hosted within the Agent Controller may communicate data back to monitoring clients (e.g. the TPTP client). Consider how a user uses TPTP: from the UI, the user opens the profiling wizard and attaches to a remote system. Commands are sent using the Control Framework, via the Agent Control Interface, from the client to the target system. The appropriate subsystem, say trace collection, begins its data collection in real time. As events are generated, they are sent back to the client using the Agent Controller’s Data Collection Framework. These events (XML messages) are persisted into the appropriate Eclipse Modeling Framework model (i.e. trace events in the Trace model). The UI layer then renders the data model for the user to analyze. EMF is a modeling framework and code generation facility for building tools and other applications based on a structured data model. Data models may be designed using annotated Java, XML, or UML class diagrams and converted to an EMF model specification defined in XML Metadata Interchange (XMI). Data models are populated by model loaders consuming structured XML fragments originating from a variety of sources. TPTP contains four EMF data models: Trace, Log, Statistical and Test.
  • #8: Trace Agent: a Java Virtual Machine Profiling Interface (JVMPI) agent for profiling local or remote Java application behavior. Used to find the classes or methods responsible for poor execution performance and to analyze and locate memory leaks. Emits trace XML fragments based on a structured Event Specification for Java Profiling (trace.xsd), a defined schema. A trace XML loader at the client side receives and parses the trace XML fragments returned from any agent. Hierarchical data model capturing interaction within the execution stacks and heap (e.g. method X in class Y on thread Z). Statistical Agent: for real-time monitoring of operating systems, web servers, database servers and application servers. Emits statistical XML fragments based on a structured schema (statistical.xsd), a defined schema. Metrics: (1) CPU usage, (2) network activity, (3) disk activity, (4) memory usage.
  • #10: XML message-based agent for real-time monitoring of message generating facilities (e.g. loggers). Extensible architecture for crafting proprietary Logging Agents: Defined APIs and programming model. TPTP provides CBE and Logging Agent support for popular logging types.
  • #11: Common Base Event: an open-source specification (OASIS) providing a common, standardized taxonomy for events occurring in hardware and software. It is a unified format and terminology for the consistent interpretation of problem determination data, circumventing varying vendor, product and version representations. Hierarchical data model capturing textual message(s) and metadata (such as timestamp, source component, situation, and severity). Events consist of message(s) and metadata resulting from an occurrence or situation. Situation data: the properties describing the situation, including correlation information. Reporting Component: the component observing a situation. Source Component: the component experiencing the problem. GLA (Generic Log Adapter): tooling that transforms log and trace data in proprietary formats into the Common Base Event format for use as a problem determination resource. It assists developers in adapting log and trace data to the Common Base Event format by using mapping rules, without rewriting existing applications. Originated from IBM Research as a mechanism to cheaply enable Common Base Event compliance in legacy computing systems.
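A mapping rule of the kind described for the Generic Log Adapter can be approximated with a regular expression. The sketch below is an assumption-laden illustration: the log line layout, the `LogAdapterSketch` class and the `Event` fields are invented for the example and do not reflect the real adapter's rule format.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

final class LogAdapterSketch {

    /** Simplified stand-in for a Common Base Event: message plus metadata. */
    static final class Event {
        final String creationTime, severity, sourceComponent, msg;
        Event(String creationTime, String severity, String sourceComponent, String msg) {
            this.creationTime = creationTime;
            this.severity = severity;
            this.sourceComponent = sourceComponent;
            this.msg = msg;
        }
    }

    // Hypothetical proprietary format: "<date> <time> <LEVEL> [<component>] <message>"
    private static final Pattern RULE = Pattern.compile(
        "^(\\d{4}-\\d{2}-\\d{2}) (\\d{2}:\\d{2}:\\d{2}) (\\w+) \\[([^\\]]+)\\] (.*)$");

    /** Applies the mapping rule; lines that do not match yield no event. */
    static Optional<Event> adapt(String line) {
        Matcher m = RULE.matcher(line);
        if (!m.matches()) return Optional.empty();
        return Optional.of(new Event(
            m.group(1) + "T" + m.group(2), // join date and time into one timestamp
            m.group(3), m.group(4), m.group(5)));
    }

    public static void main(String[] args) {
        adapt("2005-10-18 14:03:22 ERROR [db-server] Connection pool exhausted")
            .ifPresent(e -> System.out.println(
                e.creationTime + " " + e.severity + " " + e.sourceComponent + ": " + e.msg));
    }
}
```

Because the rule lives outside the application that wrote the log, the existing application does not have to change, which is the property the note attributes to the adapter.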
  • #12: Here are the 5 levels on the way to autonomic computing. We are currently between levels 1 and 2: we have built tools to help with manual analysis and problem solving. The goal is to reach level 5, where autonomic systems learn from historical events and take a proactive approach to healing themselves in the event of a failure.
  • #13: We have already seen how we transform problem determination data from disparate systems into the CBE format. This PD data is then sent to a centralized system using the ESB (i.e. a transport mechanism, such as the Agent Controller). Over time, a human diagnoses these events, and the root cause and diagnosis are persisted. Eventually, an autonomic management system can be configured to use this knowledge and AUTONOMICALLY heal the failing system using the feedback mechanism.
  • #15: We have seen how TPTP provides a centralized, open-source and extensible mechanism for problem determination; how it collects and consolidates log and trace data from disparate and distributed systems into a single management tool; and how it uses a common event format for (1) viewing, (2) navigating, (3) sorting, (4) filtering and (5) searching large amounts of log and trace data. This leads into future extensions for Self-Healing capabilities using the analysis from known problems. IBM offers commercial products that EXTEND and EXPAND the Eclipse platform and TPTP framework.
  • #18: XML message-based agent for real-time monitoring of operating systems, web servers, database servers and application servers: Windows, Linux, JBoss and JOnAS Application Server, Apache/IHS, MySQL. Emits statistical XML fragments based on a structured schema (statistical.xsd). Metrics: CPU usage, network activity, disk activity, memory usage. Extensible architecture for crafting proprietary Statistical Agents: defined APIs and programming model.
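To make the idea of a statistical agent concrete, here is a minimal sketch that samples a few values from the local JVM and formats them as XML fragments. It does not use the TPTP agent APIs: `StatAgentSketch`, the `counter` element and its attributes are hypothetical names, not the statistical.xsd vocabulary, and the metrics come from standard `java.lang` and `java.lang.management` calls.

```java
import java.lang.management.ManagementFactory;

final class StatAgentSketch {

    /** Formats one observation as a fragment; element and attribute names are made up. */
    static String counter(String name, long time, String value) {
        return "<counter name=\"" + name + "\" time=\"" + time + "\" value=\"" + value + "\"/>";
    }

    /** Takes one snapshot of memory and CPU related values for the given timestamp. */
    static String sample(long time) {
        Runtime rt = Runtime.getRuntime();
        long usedBytes = rt.totalMemory() - rt.freeMemory();
        // Load average is negative on platforms where it is not available.
        double load = ManagementFactory.getOperatingSystemMXBean().getSystemLoadAverage();
        return counter("memory.usedBytes", time, Long.toString(usedBytes))
             + counter("cpu.availableProcessors", time, Integer.toString(rt.availableProcessors()))
             + counter("cpu.loadAverage", time, Double.toString(load));
    }

    public static void main(String[] args) throws InterruptedException {
        // Emit three snapshots, one second apart, as a stand-in for real-time monitoring.
        for (int i = 0; i < 3; i++) {
            System.out.println(sample(System.currentTimeMillis()));
            Thread.sleep(1000);
        }
    }
}
```

A real agent would send these fragments to an attached client rather than print them, and would cover network and disk activity through platform-specific collectors.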
  • #19: Used to persist arbitrary numerical data over time. Hierarchical data model capturing structured statistical data defined by descriptors (such as metric counters). Descriptors contain snapshot observations that contain actual typed statistical data: Discrete (integer/time), Contiguous (float/time), String (String/time).
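The descriptor hierarchy described in the last note can be sketched as a small tree of plain Java objects. This is only an illustration of the shape of the data: `StatModelSketch`, `Descriptor` and `Observation` are hypothetical names, not the generated EMF model classes, and the value type is kept as `Object` for brevity.

```java
import java.util.ArrayList;
import java.util.List;

final class StatModelSketch {

    /** One timestamped sample: Long (discrete), Double (contiguous) or String. */
    static final class Observation {
        final long time;
        final Object value;
        Observation(long time, Object value) {
            this.time = time;
            this.value = value;
        }
    }

    /** A node in the hierarchy; it may hold child descriptors and its own observations. */
    static final class Descriptor {
        final String name;
        final List<Descriptor> children = new ArrayList<>();
        final List<Observation> observations = new ArrayList<>();

        Descriptor(String name) { this.name = name; }

        Descriptor child(String childName) {
            Descriptor d = new Descriptor(childName);
            children.add(d);
            return d;
        }

        void discrete(long time, long value)     { observations.add(new Observation(time, value)); }
        void contiguous(long time, double value) { observations.add(new Observation(time, value)); }
        void text(long time, String value)       { observations.add(new Observation(time, value)); }

        /** Counts observations on this descriptor and everything below it. */
        int totalObservations() {
            int n = observations.size();
            for (Descriptor c : children) n += c.totalObservations();
            return n;
        }
    }

    public static void main(String[] args) {
        Descriptor host = new Descriptor("host");
        Descriptor cpu = host.child("CPU usage");
        Descriptor disk = host.child("Disk activity");
        cpu.contiguous(1000L, 42.5);   // float/time
        disk.discrete(1000L, 128L);    // integer/time
        host.text(1000L, "running");   // String/time
        System.out.println(host.totalObservations());
    }
}
```

Keeping observations on the descriptors themselves mirrors the note's wording that descriptors contain snapshot observations, while the child list gives the hierarchy.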