SlideShare a Scribd company logo
High Scale Low Touch and
Other Bedtime Stories
Mr. White has fifteen years of experience designing and managing the
deployment of Systems Monitoring and Event Management software. Prior
to joining IBM, Mr. White held various positions including the leader of the
Monitoring and Event Management organization of a Fortune 100 company
and developing solutions as a consultant for a wide variety of organizations,
including the Mexican Secretaría de Hacienda y Crédito Público, Telmex,
Wal-Mart of Mexico, JP Morgan Chase, Nationwide Insurance and the US
Navy Facilities and Engineering Command.
Andrew White
Cloud and Smarter Infrastructure Solution Specialist
IBM Corporation
http://weheartit.com/entry/12433848!
Ground rules for this
session…
•  If you can’t tell if I am trying to be funny…
–  
GO AHEAD AND LAUGH!
•  Feel free to text, tweet, yammer, or whatever
during this talk. I appreciate the real time
feedback from you.
•  If you have a question, no need to wait until
the end. Just interrupt me. Seriously… I
don’t mind.
Anti-patterns and
I am here today to share some of what I have learned about
Orchestration
What Is a System?
It is a set of interconnected actors that change
over time when they are influenced by other
elements of the system.
Actor
Actor
Actor
Actor
Actor
Actor
Actor
Actor
Systems are Volatile
This change makes it difficult to control the
behavior of the system. The good news is that
systems are perfect. They always deliver the
optimum result given a specific stimuli.
Rare Events
“one chance in a million” will undoubtedly occur,
with no less and no more than it’s appropriate
frequency, however surprised we may be that it
should occur to us.


 
 
 
 



 
 
 
 
Sir Ronald A. Fisher
©	
  Aquire	
  Inc.	
  2012	
  
The Gaussian Bell Curve
Mean	
  
-1σ
 +1σ
-2σ
 +2σ
-3σ
 +3σ
67%
95%
99.5%
Two Important Properties
•  The causal effect between two actors will
always impact the entire system
•  Correlation != Causation
How Fire Works
Time
Oxygen
Heat
Fuel
Fire
MatchStrike
Action
Conditions
Fire
Oxygen
Heat
Fuel
Match
Strike
-AND-
• Actions are momentary and act as a
catalyst to bring about change
• Conditions are stable and exist over
time
A Real World Example
Customers
Complaining
Web Server returning
500 errors
The application
server was timing
out
SQL Server was not
processing queries
Transaction log was
unable to grow
T: Drive at 0 Bytes
free
Logs were not
truncated
DBA on honeymoon
vacation in Fiji
Logs are truncated
manually
Company has only 1
DBA
“Backup” DBA was
not aware the logs
require truncation
Space allocations
are fixed
 Lack of Control
Only one database
cluster in use
DR SQL Cluster
DR Cluster being
used for UAT testing
More Information
Needed
One one application
server exists
More Information
Needed
Trying to do business
on the website
 Desired Condition
-AND-
-AND-
-AND-
-AND-
-AND-
-AND-
-AND-
Hidden Factors
Hidden
Factor
Smoking
Lung
Cancer
A failure is not always a mistake, it
may simply be the best one can do
under the circumstances.
-- B.F. Skinner, American
Psychologist & Professor, Harvard
University
Uh oh. Two independent thought
alarms in one day. The students are
over-stimulated. Willie! Remove all the
colored chalk from the classrooms.
--B.F. Skinner, Principal, Springfield
Elementary School
You can't judge my choices without
understanding my reasons.
-Unknown
Feedback Loops
Unfortunately feedback has taken on both positive and negative
indications. In reality, positive feedback is not “praise” and
negative feedback is not “criticism.” Positive feedback
reinforces while negative feedback balances.
Profits
Productivity
Cost Cutting
Reinforcing
Balancing
The Agile Value
Proposition
Availability
Change
Frequency
Change
Size
Change
Capability Change
Risk
(-)
(+)
(+)
(-) (-)
Adapted From: http://www.lean4it.com/2013/05/devops-cld-part-2.html
Customer Satisfaction
Availability
Change
Frequency
Change
Size
Change
Capability Change
Risk
(-)
(+)
(+)
(-) (-)
Business
Value
Business
Demand
Change
Backlog (+)
(+)
(+)
(+)
(-)
(+)
Adapted From: http://www.lean4it.com/2013/05/devops-cld-part-2.html
Be Careful of Good
Intentions
Availability
Change
Frequency
Change
Size
Change
Capability Change
Risk
(-)
(+)
(+)
(-) (-)
Business
Value
Business
Demand
Change
Backlog (+)
(+)
(+)
(+)
(-)
(+)
Change
Process
Release
Process
(+)
(+)
(-)
(+)
(+)
Adapted From: http://www.lean4it.com/2013/05/devops-cld-part-2.html
Be Careful of Good
Intentions
Availability
Change
Frequency
Change
Size
Change
Capability Change
Risk
(-)
(+)
(+)
(-) (-)
Business
Value
Business
Demand
Change
Backlog (+)
(+)
(+)
(+)
(-)
(+)
Change
Process
Release
Process
(+)
(+)
(-)
(+)
(+)
Change
Automation
Adapted From: http://www.lean4it.com/2013/05/devops-cld-part-2.html
(+) (-)(-)
The trick is not to spend our
time trying to get better at
predicting this world, or
making it more predictable, for
both of these strategies are
bound to fail.
- Nassim Nicholas Taleb, Author and Philosopher
http://www.lamomparis.com/2012/08/french-handball-fantasies-whatever.html
Woteva you say, man!
Increasing
Complexity
§ Heterogeneous environments
§ Organizational silos
§ Skill gaps
Massive
Scale
§ Users, transactions, data
§ Rapid demand cycles
§ Unpredictable
Rapid
Pace 
§ Evolving ecosystem
§ Minimize time to value
§ Accelerating business needs
Today’s IT infrastructures are too complex, provide poor scalability, and are
slow to keep up with today’s rapid rate of change
A new set of challenges
V1 V2 V3 V4 V5 V5 ... …. Vn
C
C
W1 W2 W3 W4
R1 R2 R3
Traditional
(Systems of Record)
Emerging
(Systems of Interaction)
Workload View
Future
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
§  Rapidly changing workloads,
dynamic patterns
§  Dynamic automatic composition
of heterogeneous system
§  Autonomic and proactive
management
Current
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
§  Diverse workload, limited
patterns
§  Homogeneous resource
pooling
§  Expert configuration and
mapping of workload
Traditional
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  
	
  

§  Few, stable, and well known
workloads
§  Fixed System hardware,
manual scaling
§  Hardwired workload, minimal
configuration
W1 W2 W3 W4
R1 R2 R3
Volatile workload characteristics result from changing business requirements
V1 V2 V3 V4 V5 … Vn
V1 V2 V3 V4 V5 V5 ... …. Vn
C
C
Workloads are volatile
A new approach to
infrastructures
Simplified
Reduce infrastructure complexity,
specialization and support
Adaptive
Ensure service qualities through
policy based, scalable
infrastructure automation
Responsive
Accelerate application lifecycle
and workload deployment through
rapid change
Resource Smart
Application Aware
Definitions
 Patterns
 Analytics
Compute
 Network
 Storage
Software Defined Environments are optimized to deliver the agility, efficiency and
performance needed for today’s workloads
Software Defined Environment
What are Software
Defined Environments
Resource Smart
Application Aware
Definitions
 Patterns
 Analytics
Compute
 Network
 Storage
With a Software Defined Environment, business
users can describe their requirements of the IT
environment in a systematic way that in turn
drives automation of the infrastructure
Software
Defined
Environments
Abstracted and virtualized IT infrastructure resources
managed by software
Applications that define infrastructure requirements
and configuration
IT infrastructure that extends multiple environments to
go beyond the data center
Fully virtualized, integrated & programmable
infrastructure
Elastically scalable resources available on-demand
Intelligent resource scheduling
Infrastructure that captures workload requirements
and deployment best practices
Policy-based automation across infrastructure
Analytics to optimize the environment in real-time
The next-gen
infrastructure for HSLT
Resource Smart
Application Aware
Definitions
 Patterns
 Analytics
Compute
 Network
 Storage
Software Defined Environment
Application Aware that understands the unique workload requirements
Resource Smart that dynamically allocates infrastructure based on policies
What the business wants:
• Define business needs 
• Identify service opportunities and
requirements
• Quickly experiment and test new
services
What Software Defined
Environments provides:
• Patterns of Expertise to link solution to
infrastructure based on business rules
• Automated orchestration of workloads
• Analytics-based optimization of workload
to maximize outcomes
ContinuousOptimization
P r io r it y
P o lic y
S e r v ic e
P r io r it y
P o lic y
S e r v ic e
Solution Definition
Leverage best practices with
patterns of expertise
What s the best infrastructure for my cloud?
How do I manage & secure my hybrid environment?
How do I maintain choice and flexibility?
How do I rapidly deploy & operate my cloud?
Are you building the right cloud?
Anti-Pattern: a commonly used
process structure or pattern of action
that, despite initially appearing to be
an appropriate and effective response
to a problem, typically has more bad
consequences than beneficial results.
-Wikipedia
Benefits
 Consequences
Related Patterns
How to recognize anti-patterns
Problem
Solution
Context and Forces
Design Pattern
Benefits
Anti-Pattern Solution
Refactored Solution
Consequences
Symptoms and Consequences
Related Patterns
Design Anti-Pattern
Contextual Causes
Are Today’s Good
Practices Tomorrow’s
Anti-Patterns?
Architecture by Accident
The Humble Start…
Meeting Demand…
The First Bottleneck…
The Second Bottleneck…
Becoming Mission
Critical…
Enabling SOA…
The Fun Begins…
How Did We Get Here?
Everything fails, all the time.
- Werner Vogels, CTO Amazon
Brighttalk   high scale low touch and other bedtime stories - final
Six Types of Socratic Questions
•  What does that mean?
•  Can you explain that?
Questions that
Clarify
•  Is that always the case?
•  How can you verify or disprove?
Questions that
Challenge
•  What causes that to happen?
•  What would be an example?
Questions that
Examine
•  Why is that view best?
•  What is another way to look at it?
Questions that
Expand
•  What are the consequences?
•  What are you implying?
Questions that
Asses
•  What was the point of that question?
•  How does that question apply to us?
Questioning the
Question
Questions we need to ask
•  What kind of scenarios do I need to plan for?
•  Where are my single-points-of-failure?
•  If I use a master/slave architecture, what happens
when I lose the master?
•  What happens when my load balancer fails?
•  How will I replace a node when it fails?
•  How will I recognize a failure when it occurs?
•  What if the cache grows beyond the memory limit
of the instance?
•  What if a downstream service times out or returns
an exception?
Requirements for cloud scalability
•  Build processes threads that resume on reboot
•  Allow for state to re-sync using message queues
•  Prepare pre-configured and pre-optimized virtual
images for quick launches
•  Avoid in-memory data stores
•  Have a good DR strategy and then automate it
•  Create application as loosely coupled systems to
enable better fault tolerance and larger scale
Don’t forget get
about the cloud-as-a-
system view
Where Does Over-
Subscription Occur?
Corporate!
LANs & VPNs!
Load Balancer!
Load Balancer!
Firewall!
Switch!
VM Server Farm!
Database!
NAS !
Appliances!
Storage!
Frame!
Web Servers!
Load Balancer!
1.  Hypervisor
2.  CPU Cycles
3.  Memory
4.  Blade Backplane I/O
5.  SAN Fabric
6.  Network Interfaces
7.  Host Bus Adapters
8.  Backup Device
9.  WAN Circuits
10. Storage Processors
Common Locations
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
!
Here
Here
Here Here
Brighttalk   high scale low touch and other bedtime stories - final
The scalability of an
application is defined by
whether it can
accommodate changes in
traffic without requiring
changes in architecture.
Service Orientation
1
2
3
4
5
6
Goals of
Service
Orientation
Abstraction
Loose Coupling
Autonomy
Standard Services
Composability
Reusability
OrŸchesŸtraŸtion [AWR-kuh-strey-shun]
•  A central process controls everything and coordinates the
execution of different operations involved in the operation 
•  The services do not "know” that they are involved in a
composite process
•  Only the central coordinator of the orchestration is aware of
the desired outcome, 
•  The orchestration leverages explicit process definitions to
operate the services in the correct order of invocation
	
  
1.  the act of arranging a piece of music
2.  the planning or execution of events in order to achieve a desired effect
3.  The technique of arranging or manipulating, especially by means of
clever or thorough planning or maneuvering
Orchestration Illustration
Orchestrator
Web Service
1
Web Service
4
Web Service
3
Web Service
2
ChoŸreŸogŸraŸphy [kawr-ee-OG-ruh-fee]	
  
1.  the art of composing ballets and other dances
2.  the method of representing the various movements in dancing by a
system of notation
3.  The arrangement or manipulation of actions leading up to an event
•  Choreography does not rely on a central coordinator. 
•  Each service knows exactly who and when to execute 
•  Focuses on the exchange of messages and information
•  All services need to be aware of the business process,
operations to execute, messages to exchange, timing, etc.
Orchestration Illustration
Web Service
1
Web Service
4
Web Service
3
Web Service
2
Send
Receive
Invoke
Invoke
Invoke
Choreography vs. Orchestration
•  From the perspective of composing services to
execute business processes, orchestration is a more
flexible paradigm and has the following advantages
over choreography: 
–  The coordination of component processes is centrally
managed by a known coordinator. 
–  Web services can be incorporated without their being
aware that they are taking part in a larger business
process. 
–  Alternative scenarios can be put in place in case faults
occur. 
Page	
  49	
  
Orchestration Requirements
•  Event-based processing
•  Coordinate asynchronously between services
•  Correlate messages being exchanged
•  Provide for parallel processing
•  Allow for transaction roll-back
•  Manipulate and transform data between messaging
partners
•  Be able to manage long running business
transactions and activities
•  Have a robust mechanism for fault and error
handling
The Management Eco-System
Capacity ManagementCompute Storage Network Facilities
Event Management
(Netcool OMNIbus)
CMDB
(SmartCloud ControlDesk)
Billing
(Cost Mgr)
Software
Tracking
(TEM SUA)
SmartCloud
Monitoring
Storage
Productivity
Center
Network
Performance
Manager
Data Center
Infrastructure
Manager
Capacity Management
Predictive Insights Capacity Analyzer
Automated Reporting Engine
(Tivoli Common Reporter)
Cloud Automation
(SmartCloud Orchestrator)
Interface for
Capacity Planners
Interface for
Business Users
Policies (EGO)
Data Warehouse
One Integrated Environment
Distributed DatabaseMainframe Network Middleware Storage
Event Pool
Operational!
Data Warehouse!
Predictive
Enrichment & Correlation
Service DeskPaging
CMDB
Knowledge
Asset Mgmt
Event Catalog
Event API
Business Telemetry
3rd Party Providers
Presentation Framework
Processing Streams
Situational
Awareness
Engine
Adapted from http://www.slideshare.net/TimBassCEP/getting-started-in-cep-
how-to-build-an-event-processing-application-presentation-717795
Real-Time
Event Streams
Detected and
Predicted Situations
Patterns from
Historical Data
Causal Relationship
from Past RCAs
Complex Event Processing
Event Pipeline
Event Queries
Time Window
Data Events
Control Event
Other Events
Event Filter
Scenarios
A
B
C
Feedback Loop
Event Intelligence
Action Events
AutomatedAction
Notificationand
Escalation
BusinessImpact
Analysis
RootCauseAnalysis
Correlationand
EventSuppression
Enrichment
Meta-Data Integration Bus
DistributedCollectors
DistributedCollectors
LOB Managed
Monitoring System
Service Provider
Monitoring System
Vendor Managed
Monitoring System
Element
Manager
Element
Manager
Element
Manager
Other
Enterprise
Data
Document
Sharing
Service Desk
 CMDB
Batch
Scheduling
Knowledge
Database
Online Run
Book
PBX/Call
Manager
Visualization Framework
CommonEvent
Format
Topology And
Relationship
Database
Automated Action
Tools
DistributedCollectors
Automated
Provisioning
System
Predictive
Analysis
Automated
Change
Reconciliation
Security
Management
ArchiveandReport
Business
Telemetry Data
Service Center
and Enterprise
Notification Tool
Event Processing
Palette of library
assets enable easy
workflow composition
through drag and drop
Access to rich libraries
(toolkits) of reusable
automation assets that
enable to speed
automation creation
Rich set of actions types,
flow control, data handling
primitives that simplify
creation of complex
automations
Easy workflow action editing
for managing: data mapping,
error recovery options,
implementation details , etc.
Graphical editor for
composing and
connecting
workflows
Rich tooling
functions to edit,
version, debug,
optimize workflows
Automating Responses
Custom"
Data
Core pattern
processing

 
BPM"
Process
SCO
BPM"
Human"
Service

 


 
pre-provision "
event operation
 post-provision "
event operation
Operation
context
Operation
context
 Operation
context
Custom"
Data
SCO REST API
Operation
context
1
2
4
 5
3
SCORESTAPI
A
B
A
B
Once per registered
event operation
Extending Workload Deployment
with Custom Automation
Pre-process
event
Post-process
event
Completing the journey
Define
•  Review the existing architecture
•  Review the business outcomes
•  Define the end state
Prioritize
•  Consolidations
•  Technologies to virtualize
•  Business processes to model and workflows to automate
Execute
•  Look for early wins
•  Evolve incrementally
•  Organize the teams effectively
You have to be realistic about how fast
you can mature. Iterating helps form a
cultural of continuous improvements
Iterative development
Let’s keep the
conversation going…
Andrew.P.White@Gmail.com!
ReverendDrew!
SystemsManagementZen.Wordpress.com!
systemsmanagementzen.wordpress.com/feed/!
@SystemsMgmtZen!
ReverendDrew!
APWhite@us.ibm.com!
614-306-3434!
Brighttalk   high scale low touch and other bedtime stories - final

More Related Content

PDF
Brighttalk converged infrastructure and it operations management - final
PDF
Brighttalk brining it all together - final
PDF
How to improve your system monitoring
PDF
Brighttalk understanding the promise of sde - final
PDF
Bright talk running a cloud - final
PPTX
Webinar: Machine learning analytics for immediate resolution to the most chal...
PPTX
2011 09 18 United "Platitudes, reality and promise"
PPTX
CSI Content: Who Killed ECM?
Brighttalk converged infrastructure and it operations management - final
Brighttalk brining it all together - final
How to improve your system monitoring
Brighttalk understanding the promise of sde - final
Bright talk running a cloud - final
Webinar: Machine learning analytics for immediate resolution to the most chal...
2011 09 18 United "Platitudes, reality and promise"
CSI Content: Who Killed ECM?

What's hot (20)

PPTX
Aureus disaster planning
PDF
Cisco Connect 2018 Thailand - Introducing cisco dna assurance the future of n...
PPTX
Australian IT Pros Survey: Network Complexity
PDF
Beyond the Knowledge Base: Turning Data into Wisdom - an ITSM Academy Webinar
PPTX
Harry Regan - Disaster Recovery and Business Continuity - "It's never so bad ...
PPTX
Top ECM Trends in Digital Enterprise
PPTX
Survey: Maximizing the ROI of Network Management Solutions
PPTX
SITS15: Swarming - A radical new way to deliver service
PPTX
CISO's first 100 days
PPTX
New IT Survey: Australia
PPTX
IT Survey: UK and Germany SMEs
PDF
Abb e guide3
PPTX
New IT Survey: Germany
PPTX
CIO 101 for Entrepreneurs (2016)
PDF
E guide weathering the storm at your business
PDF
2011 Jun 7 IT Best Practices Audit Overview
PPTX
New IT Survey: UK
PPTX
Acquiforce H4D Stanford 2018 final presentation
PDF
7 key problems Water Industry need to solve
Aureus disaster planning
Cisco Connect 2018 Thailand - Introducing cisco dna assurance the future of n...
Australian IT Pros Survey: Network Complexity
Beyond the Knowledge Base: Turning Data into Wisdom - an ITSM Academy Webinar
Harry Regan - Disaster Recovery and Business Continuity - "It's never so bad ...
Top ECM Trends in Digital Enterprise
Survey: Maximizing the ROI of Network Management Solutions
SITS15: Swarming - A radical new way to deliver service
CISO's first 100 days
New IT Survey: Australia
IT Survey: UK and Germany SMEs
Abb e guide3
New IT Survey: Germany
CIO 101 for Entrepreneurs (2016)
E guide weathering the storm at your business
2011 Jun 7 IT Best Practices Audit Overview
New IT Survey: UK
Acquiforce H4D Stanford 2018 final presentation
7 key problems Water Industry need to solve
Ad

Similar to Brighttalk high scale low touch and other bedtime stories - final (20)

PPTX
Observability – the good, the bad, and the ugly
PDF
Visualizing Your Network Health - Know your Network
PDF
Its Not You Its Me MSSP Couples Counseling
PDF
If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...
PDF
Drive Smarter Decisions with Big Data Using Complex Event Processing
PPTX
NCET Tech Bite | Ron Husey, Moving Your Business to the Cloud | Mar 2016
PDF
Platforming the Major Analytic Use Cases for Modern Engineering
PDF
Building data intensive applications
PPTX
IT HealthCheck
PPTX
Top 5 Java Performance Metrics, Tips & Tricks
PPTX
Scaling Systems: Architectures that grow
PDF
CQRS + Event Sourcing
PDF
Stream Processing and Complex Event Processing together with Kafka, Flink and...
PPTX
DCIM Software Five Years Later: What I Wish I Had Known When I Started (Case ...
PDF
Scaling unstable systems velocity 2015
PPTX
MGT3342BUS - Architecting Data Protection with Rubrik - VMworld 2017
PPTX
What to expect from your IT People
PPTX
Justifying Capacity Management Efforts
PDF
Cloud Service Management: Why Machine Learning is Now Essential
PPTX
Cloud Services helping in cloud service to be fully knowledgably .pptx
Observability – the good, the bad, and the ugly
Visualizing Your Network Health - Know your Network
Its Not You Its Me MSSP Couples Counseling
If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...
Drive Smarter Decisions with Big Data Using Complex Event Processing
NCET Tech Bite | Ron Husey, Moving Your Business to the Cloud | Mar 2016
Platforming the Major Analytic Use Cases for Modern Engineering
Building data intensive applications
IT HealthCheck
Top 5 Java Performance Metrics, Tips & Tricks
Scaling Systems: Architectures that grow
CQRS + Event Sourcing
Stream Processing and Complex Event Processing together with Kafka, Flink and...
DCIM Software Five Years Later: What I Wish I Had Known When I Started (Case ...
Scaling unstable systems velocity 2015
MGT3342BUS - Architecting Data Protection with Rubrik - VMworld 2017
What to expect from your IT People
Justifying Capacity Management Efforts
Cloud Service Management: Why Machine Learning is Now Essential
Cloud Services helping in cloud service to be fully knowledgably .pptx
Ad

More from Andrew White (7)

PDF
Brighttalk learning to cook- network management recipes - final
PDF
Brighttalk outage insurance- what you need to know - final
PDF
Brighttalk reason 114 for learning math - final
PDF
Brighttalk what should we be monitoring - final
PDF
Brighttalk getting back on track - final
PDF
Bright talk if they cant use it, it doesnt work - final
PDF
Bright talk bringing back the love - final
Brighttalk learning to cook- network management recipes - final
Brighttalk outage insurance- what you need to know - final
Brighttalk reason 114 for learning math - final
Brighttalk what should we be monitoring - final
Brighttalk getting back on track - final
Bright talk if they cant use it, it doesnt work - final
Bright talk bringing back the love - final

Recently uploaded (20)

PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
A Presentation on Artificial Intelligence
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
MYSQL Presentation for SQL database connectivity
PDF
KodekX | Application Modernization Development
PPTX
Cloud computing and distributed systems.
PDF
Empathic Computing: Creating Shared Understanding
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
NewMind AI Weekly Chronicles - August'25 Week I
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Reach Out and Touch Someone: Haptics and Empathic Computing
A Presentation on Artificial Intelligence
Diabetes mellitus diagnosis method based random forest with bat algorithm
Advanced methodologies resolving dimensionality complications for autism neur...
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
MYSQL Presentation for SQL database connectivity
KodekX | Application Modernization Development
Cloud computing and distributed systems.
Empathic Computing: Creating Shared Understanding
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Mobile App Security Testing_ A Comprehensive Guide.pdf
Unlocking AI with Model Context Protocol (MCP)
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Digital-Transformation-Roadmap-for-Companies.pptx
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Dropbox Q2 2025 Financial Results & Investor Presentation
Per capita expenditure prediction using model stacking based on satellite ima...
NewMind AI Weekly Chronicles - August'25 Week I

Brighttalk high scale low touch and other bedtime stories - final

  • 1. High Scale Low Touch and Other Bedtime Stories
  • 2. Mr. White has fifteen years of experience designing and managing the deployment of Systems Monitoring and Event Management software. Prior to joining IBM, Mr. White held various positions including the leader of the Monitoring and Event Management organization of a Fortune 100 company and developing solutions as a consultant for a wide variety of organizations, including the Mexican Secretaría de Hacienda y Crédito Público, Telmex, Wal-Mart of Mexico, JP Morgan Chase, Nationwide Insurance and the US Navy Facilities and Engineering Command. Andrew White Cloud and Smarter Infrastructure Solution Specialist IBM Corporation
  • 4. Ground rules for this session… •  If you can’t tell if I am trying to be funny… –  GO AHEAD AND LAUGH! •  Feel free to text, tweet, yammer, or whatever during this talk. I appreciate the real time feedback from you. •  If you have a question, no need to wait until the end. Just interrupt me. Seriously… I don’t mind.
  • 5. Anti-patterns and I am here today to share some of what I have learned about Orchestration
  • 6. What Is a System? It is a set of interconnected actors that change over time when they are influenced by other elements of the system. Actor Actor Actor Actor Actor Actor Actor Actor
  • 7. Systems are Volatile This change makes it difficult to control the behavior of the system. The good news is that systems are perfect. They always deliver the optimum result given a specific stimuli.
  • 8. Rare Events “one chance in a million” will undoubtedly occur, with no less and no more than it’s appropriate frequency, however surprised we may be that it should occur to us. Sir Ronald A. Fisher ©  Aquire  Inc.  2012  
  • 9. The Gaussian Bell Curve Mean   -1σ +1σ -2σ +2σ -3σ +3σ 67% 95% 99.5%
  • 10. Two Important Properties •  The causal effect between two actors will always impact the entire system •  Correlation != Causation
  • 11. How Fire Works Time Oxygen Heat Fuel Fire MatchStrike Action Conditions Fire Oxygen Heat Fuel Match Strike -AND- • Actions are momentary and act as a catalyst to bring about change • Conditions are stable and exist over time
  • 12. A Real World Example Customers Complaining Web Server returning 500 errors The application server was timing out SQL Server was not processing queries Transaction log was unable to grow T: Drive at 0 Bytes free Logs were not truncated DBA on honeymoon vacation in Fiji Logs are truncated manually Company has only 1 DBA “Backup” DBA was not aware the logs require truncation Space allocations are fixed Lack of Control Only one database cluster in use DR SQL Cluster DR Cluster being used for UAT testing More Information Needed One one application server exists More Information Needed Trying to do business on the website Desired Condition -AND- -AND- -AND- -AND- -AND- -AND- -AND-
  • 14. A failure is not always a mistake, it may simply be the best one can do under the circumstances. -- B.F. Skinner, American Psychologist & Professor, Harvard University
  • 15. Uh oh. Two independent thought alarms in one day. The students are over-stimulated. Willie! Remove all the colored chalk from the classrooms. --B.F. Skinner, Principal, Springfield Elementary School
  • 16. You can't judge my choices without understanding my reasons. -Unknown
  • 17. Feedback Loops Unfortunately feedback has taken on both positive and negative indications. In reality, positive feedback is not “praise” and negative feedback is not “criticism.” Positive feedback reinforces while negative feedback balances. Profits Productivity Cost Cutting Reinforcing Balancing
  • 18. The Agile Value Proposition Availability Change Frequency Change Size Change Capability Change Risk (-) (+) (+) (-) (-) Adapted From: http://www.lean4it.com/2013/05/devops-cld-part-2.html
  • 19. Customer Satisfaction Availability Change Frequency Change Size Change Capability Change Risk (-) (+) (+) (-) (-) Business Value Business Demand Change Backlog (+) (+) (+) (+) (-) (+) Adapted From: http://www.lean4it.com/2013/05/devops-cld-part-2.html
  • 20. Be Careful of Good Intentions Availability Change Frequency Change Size Change Capability Change Risk (-) (+) (+) (-) (-) Business Value Business Demand Change Backlog (+) (+) (+) (+) (-) (+) Change Process Release Process (+) (+) (-) (+) (+) Adapted From: http://www.lean4it.com/2013/05/devops-cld-part-2.html
  • 21. Be Careful of Good Intentions Availability Change Frequency Change Size Change Capability Change Risk (-) (+) (+) (-) (-) Business Value Business Demand Change Backlog (+) (+) (+) (+) (-) (+) Change Process Release Process (+) (+) (-) (+) (+) Change Automation Adapted From: http://www.lean4it.com/2013/05/devops-cld-part-2.html (+) (-)(-)
  • 22. The trick is not to spend our time trying to get better at predicting this world, or making it more predictable, for both of these strategies are bound to fail. - Nassim Nicholas Taleb, Author and Philosopher
  • 24. Increasing Complexity § Heterogeneous environments § Organizational silos § Skill gaps Massive Scale § Users, transactions, data § Rapid demand cycles § Unpredictable Rapid Pace § Evolving ecosystem § Minimize time to value § Accelerating business needs Today’s IT infrastructures are too complex, provide poor scalability, and are slow to keep up with today’s rapid rate of change A new set of challenges V1 V2 V3 V4 V5 V5 ... …. Vn C C W1 W2 W3 W4 R1 R2 R3 Traditional (Systems of Record) Emerging (Systems of Interaction) Workload View
  • 25. Future                           §  Rapidly changing workloads, dynamic patterns §  Dynamic automatic composition of heterogeneous system §  Autonomic and proactive management Current                           §  Diverse workload, limited patterns §  Homogeneous resource pooling §  Expert configuration and mapping of workload Traditional                         §  Few, stable, and well known workloads §  Fixed System hardware, manual scaling §  Hardwired workload, minimal configuration W1 W2 W3 W4 R1 R2 R3 Volatile workload characteristics result from changing business requirements V1 V2 V3 V4 V5 … Vn V1 V2 V3 V4 V5 V5 ... …. Vn C C Workloads are volatile
  • 26. A new approach to infrastructures Simplified Reduce infrastructure complexity, specialization and support Adaptive Ensure service qualities through policy based, scalable infrastructure automation Responsive Accelerate application lifecycle and workload deployment through rapid change Resource Smart Application Aware Definitions Patterns Analytics Compute Network Storage Software Defined Environments are optimized to deliver the agility, efficiency and performance needed for today’s workloads Software Defined Environment
  • 27. What are Software Defined Environments Resource Smart Application Aware Definitions Patterns Analytics Compute Network Storage With a Software Defined Environment, business users can describe their requirements of the IT environment in a systematic way that in turn drives automation of the infrastructure Software Defined Environments Abstracted and virtualized IT infrastructure resources managed by software Applications that define infrastructure requirements and configuration IT infrastructure that extends multiple environments to go beyond the data center
  • 28. Fully virtualized, integrated & programmable infrastructure Elastically scalable resources available on-demand Intelligent resource scheduling Infrastructure that captures workload requirements and deployment best practices Policy-based automation across infrastructure Analytics to optimize the environment in real-time The next-gen infrastructure for HSLT Resource Smart Application Aware Definitions Patterns Analytics Compute Network Storage Software Defined Environment Application Aware that understands the unique workload requirements Resource Smart that dynamically allocates infrastructure based on policies
  • 29. What the business wants: • Define business needs • Identify service opportunities and requirements • Quickly experiment and test new services What Software Defined Environments provides: • Patterns of Expertise to link solution to infrastructure based on business rules • Automated orchestration of workloads • Analytics-based optimization of workload to maximize outcomes ContinuousOptimization P r io r it y P o lic y S e r v ic e P r io r it y P o lic y S e r v ic e Solution Definition Leverage best practices with patterns of expertise
  • 30. What s the best infrastructure for my cloud? How do I manage & secure my hybrid environment? How do I maintain choice and flexibility? How do I rapidly deploy & operate my cloud? Are you building the right cloud?
  • 31. Anti-Pattern: a commonly used process structure or pattern of action that, despite initially appearing to be an appropriate and effective response to a problem, typically has more bad consequences than beneficial results. -Wikipedia
  • 32. Benefits Consequences Related Patterns How to recognize anti-patterns Problem Solution Context and Forces Design Pattern Benefits Anti-Pattern Solution Refactored Solution Consequences Symptoms and Consequences Related Patterns Design Anti-Pattern Contextual Causes
  • 33. Are Today’s Good Practices Tomorrow’s Anti-Patterns?
  • 34. Architecture by Accident The Humble Start… Meeting Demand… The First Bottleneck… The Second Bottleneck… Becoming Mission Critical… Enabling SOA… The Fun Begins… How Did We Get Here?
  • 35. Everything fails, all the time. - Werner Vogels, CTO Amazon
  • 37. Six Types of Socratic Questions •  What does that mean? •  Can you explain that? Questions that Clarify •  Is that always the case? •  How can you verify or disprove? Questions that Challenge •  What causes that to happen? •  What would be an example? Questions that Examine •  Why is that view best? •  What is another way to look at it? Questions that Expand •  What are the consequences? •  What are you implying? Questions that Asses •  What was the point of that question? •  How does that question apply to us? Questioning the Question
  • 38. Questions we need to ask •  What kind of scenarios do I need to plan for? •  Where are my single-points-of-failure? •  If I use a master/slave architecture, what happens when I lose the master? •  What happens when my load balancer fails? •  How will I replace a node when it fails? •  How will I recognize a failure when it occurs? •  What if the cache grows beyond the memory limit of the instance? •  What if a downstream service times out or returns an exception?
  • 39. Requirements for cloud scalability •  Build processes threads that resume on reboot •  Allow for state to re-sync using message queues •  Prepare pre-configured and pre-optimized virtual images for quick launches •  Avoid in-memory data stores •  Have a good DR strategy and then automate it •  Create application as loosely coupled systems to enable better fault tolerance and larger scale
  • 40. Don’t forget get about the cloud-as-a- system view
  • 41. Where Does Over- Subscription Occur? Corporate! LANs & VPNs! Load Balancer! Load Balancer! Firewall! Switch! VM Server Farm! Database! NAS ! Appliances! Storage! Frame! Web Servers! Load Balancer! 1.  Hypervisor 2.  CPU Cycles 3.  Memory 4.  Blade Backplane I/O 5.  SAN Fabric 6.  Network Interfaces 7.  Host Bus Adapters 8.  Backup Device 9.  WAN Circuits 10. Storage Processors Common Locations ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! ! Here Here Here Here
  • 43. The scalability of an application is defined by whether it can accommodate changes in traffic without requiring changes in architecture.
  • 44. Service Orientation 1 2 3 4 5 6 Goals of Service Orientation Abstraction Loose Coupling Autonomy Standard Services Composability Reusability
  • 45. OrŸchesŸtraŸtion [AWR-kuh-strey-shun] •  A central process controls everything and coordinates the execution of different operations involved in the operation •  The services do not "know” that they are involved in a composite process •  Only the central coordinator of the orchestration is aware of the desired outcome, •  The orchestration leverages explicit process definitions to operate the services in the correct order of invocation   1.  the act of arranging a piece of music 2.  the planning or execution of events in order to achieve a desired effect 3.  The technique of arranging or manipulating, especially by means of clever or thorough planning or maneuvering
  • 46. Orchestration Illustration Orchestrator Web Service 1 Web Service 4 Web Service 3 Web Service 2
  • 47. ChoŸreŸogŸraŸphy [kawr-ee-OG-ruh-fee]   1.  the art of composing ballets and other dances 2.  the method of representing the various movements in dancing by a system of notation 3.  The arrangement or manipulation of actions leading up to an event •  Choreography does not rely on a central coordinator. •  Each service knows exactly who and when to execute •  Focuses on the exchange of messages and information •  All services need to be aware of the business process, operations to execute, messages to exchange, timing, etc.
  • 48. Orchestration Illustration Web Service 1 Web Service 4 Web Service 3 Web Service 2 Send Receive Invoke Invoke Invoke
  • 49. Choreography vs. Orchestration •  From the perspective of composing services to execute business processes, orchestration is a more flexible paradigm and has the following advantages over choreography: –  The coordination of component processes is centrally managed by a known coordinator. –  Web services can be incorporated without their being aware that they are taking part in a larger business process. –  Alternative scenarios can be put in place in case faults occur. Page  49  
  • 50. Orchestration Requirements •  Event-based processing •  Coordinate asynchronously between services •  Correlate messages being exchanged •  Provide for parallel processing •  Allow for transaction roll-back •  Manipulate and transform data between messaging partners •  Be able to manage long running business transactions and activities •  Have a robust mechanism for fault and error handling
  • 51. The Management Eco-System Capacity ManagementCompute Storage Network Facilities Event Management (Netcool OMNIbus) CMDB (SmartCloud ControlDesk) Billing (Cost Mgr) Software Tracking (TEM SUA) SmartCloud Monitoring Storage Productivity Center Network Performance Manager Data Center Infrastructure Manager Capacity Management Predictive Insights Capacity Analyzer Automated Reporting Engine (Tivoli Common Reporter) Cloud Automation (SmartCloud Orchestrator) Interface for Capacity Planners Interface for Business Users Policies (EGO) Data Warehouse
  • 52. One Integrated Environment Distributed DatabaseMainframe Network Middleware Storage Event Pool Operational! Data Warehouse! Predictive Enrichment & Correlation Service DeskPaging CMDB Knowledge Asset Mgmt Event Catalog Event API Business Telemetry 3rd Party Providers Presentation Framework
  • 53. Processing Streams Situational Awareness Engine Adapted from http://www.slideshare.net/TimBassCEP/getting-started-in-cep- how-to-build-an-event-processing-application-presentation-717795 Real-Time Event Streams Detected and Predicted Situations Patterns from Historical Data Causal Relationship from Past RCAs
  • 54. Complex Event Processing Event Pipeline Event Queries Time Window Data Events Control Event Other Events Event Filter Scenarios A B C Feedback Loop Event Intelligence Action Events
  • 55. AutomatedAction Notificationand Escalation BusinessImpact Analysis RootCauseAnalysis Correlationand EventSuppression Enrichment Meta-Data Integration Bus DistributedCollectors DistributedCollectors LOB Managed Monitoring System Service Provider Monitoring System Vendor Managed Monitoring System Element Manager Element Manager Element Manager Other Enterprise Data Document Sharing Service Desk CMDB Batch Scheduling Knowledge Database Online Run Book PBX/Call Manager Visualization Framework CommonEvent Format Topology And Relationship Database Automated Action Tools DistributedCollectors Automated Provisioning System Predictive Analysis Automated Change Reconciliation Security Management ArchiveandReport Business Telemetry Data Service Center and Enterprise Notification Tool Event Processing
  • 56. Palette of library assets enable easy workflow composition through drag and drop Access to rich libraries (toolkits) of reusable automation assets that enable to speed automation creation Rich set of actions types, flow control, data handling primitives that simplify creation of complex automations Easy workflow action editing for managing: data mapping, error recovery options, implementation details , etc. Graphical editor for composing and connecting workflows Rich tooling functions to edit, version, debug, optimize workflows Automating Responses
  • 57. Custom" Data Core pattern processing BPM" Process SCO BPM" Human" Service pre-provision " event operation post-provision " event operation Operation context Operation context Operation context Custom" Data SCO REST API Operation context 1 2 4 5 3 SCORESTAPI A B A B Once per registered event operation Extending Workload Deployment with Custom Automation Pre-process event Post-process event
  • 58. Completing the journey Define •  Review the existing architecture •  Review the business outcomes •  Define the end state Prioritize •  Consolidations •  Technologies to virtualize •  Business processes to model and workflows to automate Execute •  Look for early wins •  Evolve incrementally •  Organize the teams effectively
  • 59. You have to be realistic about how fast you can mature. Iterating helps form a cultural of continuous improvements Iterative development
  • 60. Let’s keep the conversation going… Andrew.P.White@Gmail.com! ReverendDrew! SystemsManagementZen.Wordpress.com! systemsmanagementzen.wordpress.com/feed/! @SystemsMgmtZen! ReverendDrew! APWhite@us.ibm.com! 614-306-3434!