SlideShare a Scribd company logo
Precise information
Alert the right person
Automation
Service is alive
•
Is my application alive on the minimum
number required by my SLA?
•
2 out of 5 instances of my-app are not
responding to isAlive
•
my-app requires a minimum of 3
instances to meet the SLA
Alert
Sensu
Queries Nginx
Alert & SLA
ZooKeeper
Planned
Configuration
Service owner
Nginx
Service Load
Balancer
Is-alive
Alert
Sensu
Queries Nginx
Alert & SLA
ZooKeeper
Planned
Configuration
Service owner
Nginx
Service Load
Balancer
Is-alive
Alert the right
person
Precise
information
Automation
Service anomalies
BECKEND
•
Identify unhealthy KPIs per end-points
•
Abnormal increase in error rate for
class.method.get
•
Abnormal increase in performance or
decrease in RPM
Anomaly Alert
Anodot
Time series
anomaly
detection
Alerts & graphs
statsd
Stats
aggregation
Forwarding
metrics
JVM servers
Metrics library
metrics / 1m
Graphs
Anomaly Alert
Anodot
Time series
anomaly
detection
Alerts & graphs
statsd
Stats
aggregation
Forwarding
metrics
JVM servers
Metrics library
metrics / 1m
Graphs
Precise
information
Alert the right
person
Automation
Service anomalies
FRONTEND
•
Users effected or not? How and where?
•
Success event count is low
•
Client error increased in Canada from
Chrome
Anomaly
Alert
STORM &
ESPER
Realtime
streaming
processing
Metrics / 1m
Client
flume
events Graphs
Client
flume
events
Anodot
Time series
anomaly
detection
Alerts &
graphs
Anomaly
Alert
STORM &
ESPER
Realtime
streaming
processing
Metrics / 1m
Client
flume
events Graphs
Client
flume
events
Precise
information
Alert the right
person
Automation
Anodot
Time series
anomaly
detection
Alerts &
graphs
Alerts management
•
Which active alerts do I have?
•
What changes could cause the problem?
•
I have 2 active alerts on MySql and 2
deployments in the last hour
Alert
BigPanda
Central alerts &
changes
Alerts &
Changes
Changes
Integrations
Deployments
Chef uploads
Alerts
integrations
NewRelic
Sensu
Nagios
PingDom Web UI
Alert
BigPanda
Central alerts &
changes
Alerts &
Changes
Changes
Integrations
Deployments
Chef uploads
Alerts
integrations
NewRelic
Sensu
Nagios
PingDom
Precise
information
Alert the right
person
Automation
Precise
information
Web UI
Questions?

More Related Content

PPTX
StatsCraft 2015: Introduction to monitoring - Yoav Abrahami and Mark Sonis
PPTX
Server and application monitoring webinars [Applications Manager] - Part 3
PPTX
Free Netflow analyzer training - diagnosing_and_troubleshooting
PPTX
NetFlow Analyzer Training Part I: Getting the initial settings right
PPTX
It's What's Inside that Counts!
PPTX
PPTX
Leading American Entertainment Company implements OpManager
PPTX
Server and application monitoring webinars [Applications Manager] - Part 2
StatsCraft 2015: Introduction to monitoring - Yoav Abrahami and Mark Sonis
Server and application monitoring webinars [Applications Manager] - Part 3
Free Netflow analyzer training - diagnosing_and_troubleshooting
NetFlow Analyzer Training Part I: Getting the initial settings right
It's What's Inside that Counts!
Leading American Entertainment Company implements OpManager
Server and application monitoring webinars [Applications Manager] - Part 2

What's hot (20)

PPTX
Network Configuration Management - Mumbai Seminar
PPTX
NetFlow Analyzer Training Part II : Diagnosing and troubleshooting traffic is...
PPTX
Free NetFlow Analyzer training - Getting the initial settings right
PPTX
Opmanager technical overview
PPTX
Server and application monitoring webinars [Applications Manager]: Part 1
PPTX
Distributed Tracing in Serverless Systems - DevOpsDays Edinburgh
PPTX
Dashboards, widgets, business views & 3D-data centre
PPTX
Using ai and automation to build resiliency into azure dev ops
PPTX
OpManager training - Device discovery and classification.
PPTX
Global Airline giant's application performance monitoring solution!
PDF
Net Rounds Product Sheet
PPTX
The cloud moved your monitoring cheese
PPTX
From web interface to the database:Monitor all that matters
PDF
Making awesome apps
PPTX
Network fault management and IT automation training
PDF
Monitoring at Facebook - Ran Leibman, Facebook - DevOpsDays Tel Aviv 2015
PPTX
SAP License Audit Report
PPTX
Finding application problems before they impact users
PPTX
Server and application monitoring webinars [Applications Manager] - Part 4
PDF
Microservices: Patterns & Practices
Network Configuration Management - Mumbai Seminar
NetFlow Analyzer Training Part II : Diagnosing and troubleshooting traffic is...
Free NetFlow Analyzer training - Getting the initial settings right
Opmanager technical overview
Server and application monitoring webinars [Applications Manager]: Part 1
Distributed Tracing in Serverless Systems - DevOpsDays Edinburgh
Dashboards, widgets, business views & 3D-data centre
Using ai and automation to build resiliency into azure dev ops
OpManager training - Device discovery and classification.
Global Airline giant's application performance monitoring solution!
Net Rounds Product Sheet
The cloud moved your monitoring cheese
From web interface to the database:Monitor all that matters
Making awesome apps
Network fault management and IT automation training
Monitoring at Facebook - Ran Leibman, Facebook - DevOpsDays Tel Aviv 2015
SAP License Audit Report
Finding application problems before they impact users
Server and application monitoring webinars [Applications Manager] - Part 4
Microservices: Patterns & Practices
Ad

Similar to Monitoring HOWTOs (20)

PPTX
SRECon-Europe-2017: Reducing MTTR and False Escalations: Event Correlation at...
PDF
Cogent Consutlting Case Study
PPTX
Netcetera Proactive Management Service
PDF
Need for Speed: How to Performance Test the right way by Annie Bhaumik
PDF
How the Big Data of APM can Supercharge DevOps
PPTX
Apinizer - Full API Lifecycle and Integration Platform
PPT
PPTX
Application Performance Monitoring with code level diagnostics
PDF
Ibm tealeaf on_cloud
PDF
How to improve your system monitoring
PPTX
Growing into a proactive Data Platform
PPTX
Vibration monitoring adriano engineering
PPTX
Adapting to evolving user, security, and business needs with aruba clear pass
PDF
Autoscaling Confluent Cloud: Should We? How Would We?
PPTX
Reduce SRE Stress: Minimizing Service Downtime with Grafana, InfluxDB and Tel...
PPTX
Export flows, group traffic, map application traffic and more: NetFlow Analyz...
PDF
Managing Quality of Service for Containerized Microservice Applications
PDF
In-production Application Quality Monitoring
PDF
In Production Application Quality Monitoring
PPTX
Performance analysis in eCommerce (retail and Websphere Commerce)
 
SRECon-Europe-2017: Reducing MTTR and False Escalations: Event Correlation at...
Cogent Consutlting Case Study
Netcetera Proactive Management Service
Need for Speed: How to Performance Test the right way by Annie Bhaumik
How the Big Data of APM can Supercharge DevOps
Apinizer - Full API Lifecycle and Integration Platform
Application Performance Monitoring with code level diagnostics
Ibm tealeaf on_cloud
How to improve your system monitoring
Growing into a proactive Data Platform
Vibration monitoring adriano engineering
Adapting to evolving user, security, and business needs with aruba clear pass
Autoscaling Confluent Cloud: Should We? How Would We?
Reduce SRE Stress: Minimizing Service Downtime with Grafana, InfluxDB and Tel...
Export flows, group traffic, map application traffic and more: NetFlow Analyz...
Managing Quality of Service for Containerized Microservice Applications
In-production Application Quality Monitoring
In Production Application Quality Monitoring
Performance analysis in eCommerce (retail and Websphere Commerce)
 
Ad

Recently uploaded (20)

PPT
Occupational Health and Safety Management System
PDF
distributed database system" (DDBS) is often used to refer to both the distri...
PPTX
Fundamentals of Mechanical Engineering.pptx
PPTX
communication and presentation skills 01
PPTX
Feature types and data preprocessing steps
PPTX
AUTOMOTIVE ENGINE MANAGEMENT (MECHATRONICS).pptx
PDF
Exploratory_Data_Analysis_Fundamentals.pdf
PDF
Abrasive, erosive and cavitation wear.pdf
PPTX
Sorting and Hashing in Data Structures with Algorithms, Techniques, Implement...
PDF
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
PPTX
Fundamentals of safety and accident prevention -final (1).pptx
PPTX
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
PPTX
Current and future trends in Computer Vision.pptx
PPTX
Management Information system : MIS-e-Business Systems.pptx
PDF
Level 2 – IBM Data and AI Fundamentals (1)_v1.1.PDF
PPTX
CURRICULAM DESIGN engineering FOR CSE 2025.pptx
PDF
PREDICTION OF DIABETES FROM ELECTRONIC HEALTH RECORDS
PDF
22EC502-MICROCONTROLLER AND INTERFACING-8051 MICROCONTROLLER.pdf
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PDF
null (2) bgfbg bfgb bfgb fbfg bfbgf b.pdf
Occupational Health and Safety Management System
distributed database system" (DDBS) is often used to refer to both the distri...
Fundamentals of Mechanical Engineering.pptx
communication and presentation skills 01
Feature types and data preprocessing steps
AUTOMOTIVE ENGINE MANAGEMENT (MECHATRONICS).pptx
Exploratory_Data_Analysis_Fundamentals.pdf
Abrasive, erosive and cavitation wear.pdf
Sorting and Hashing in Data Structures with Algorithms, Techniques, Implement...
BIO-INSPIRED HORMONAL MODULATION AND ADAPTIVE ORCHESTRATION IN S-AI-GPT
Fundamentals of safety and accident prevention -final (1).pptx
6ME3A-Unit-II-Sensors and Actuators_Handouts.pptx
Current and future trends in Computer Vision.pptx
Management Information system : MIS-e-Business Systems.pptx
Level 2 – IBM Data and AI Fundamentals (1)_v1.1.PDF
CURRICULAM DESIGN engineering FOR CSE 2025.pptx
PREDICTION OF DIABETES FROM ELECTRONIC HEALTH RECORDS
22EC502-MICROCONTROLLER AND INTERFACING-8051 MICROCONTROLLER.pdf
R24 SURVEYING LAB MANUAL for civil enggi
null (2) bgfbg bfgb bfgb fbfg bfbgf b.pdf

Monitoring HOWTOs