SlideShare a Scribd company logo
[Free OpManager training]  Part 4- Network fault-management & IT automation
Week 4
Effective fault management and IT
automation
1. How to identify the faults
quickly?
2. How to prioritize the problems?
All services are
currently UP
1. How to identify the faults
quickly?
2. How to prioritize the problems?
3. How do you get it resolved
quickly?
Agenda
• Alarm severity levels
• Threshold violation alarms
• Other alarms : VMWare; Event logs; SNMP traps and Syslogs
• Notifications
• Using an IT workflow to remediate problems
• Tips and tricks
• Questions
Alarm severity levels
Severity Color code
Attention
Trouble
Critical
Service down
Clear
Device down
Interface down Severity: predefined
Process down
Service down
URL down
Severity: predefined
Event log
Syslog
SNMP trap
Severity: configurable
Threshold-based alarms
• Configuring threshold values on an individual device
• Configuring consecutive times
• Configuring rearm value to clear fault alarms
• Using device templates to configure thresholds globally based on device type
Threshold-based alarms
VMWare alarms; Event logs; SNMP traps; Syslogs
Alarms for inventory changes
o vMotion
o Host added/removed
o Host or VMs connected/disconnected
o VMs powered on/off
o VMs orphaned
o Scheduled task removed
o Etc.
Querying more events from the Vcenter server / ESX host
VMware events
Event log alarms
Prerequisites
o Check if WMI and RPC services are enabled on the Windows servers
o Default WMI ports: 135 & 445, 5000 to 6000 (TCP)
• Configuring event logs for a Windows server in OpManager
• Ignoring a specific event log from a Windows server
• Configuring OpManager to handle event floods (http://help.opmanager.com/stopping-event-flood)
o serverparameters.conf (OpManager/conf/OpManager)
o EVENTS_PER_HOUR 1000
o EVENT_FLOOD_SEVERITY Critical
SNMP trap alarms
5things that you should know about SNMP traps in OpManager
1. Unsolicited traps
2. Varbinds
3. Failure component
4. Loading traps from MIB files
5. Forwarding trap messages to another NMS platform
OpManager
Trap-
Receiver
Router
Switch
Firewall
Server
SNMP Agent
Management
Definitions
Management
Database
Trap
(162)
#1 Unsolicited traps
I have configured a Router to forward SNMP traps to
OpManager's server. However I don't get to see an alarm?
How do I fix this?
Things to verify :
 Verify whether the Router is added to OpManager
 Verify whether the 'Trap rule' is available for the respective event
 Verify whether the trap event is listed under 'Unsolicited traps'
Solution: Identify the event from the 'Unsolicited traps' and add a new trap rule
#2 Varbinds
I have a Windows server added to OpManager. It triggers 100s of trap events with various
messages from x.x.x.x OID. However I want to filter the trap event only if the priority is
'critical' and clear the event automatically when the priority is 'low'? How do I achieve this?
Know
• What is a varbind?
• How to identify the varbinds from trap event?
Solution: Use 'match criteria' to filter and clear the trap alarms based on 'varbinds'
#3 Failure component
I have a Switch added to OpManager. It triggers a failure trap event for BGP down from .1.3.6.1.2.1.15.7.2
OID and a clear event for BGP up from .1.3.6.1.2.1.15.7.1 OID. This generates two different alarms in
OpManager. I want the clear alarm for BGP up event merged with the original alarm as it is for the same link.
How do I achieve this?
Solution: Provide a common 'failure component' in both the trap rule
It generates two different alarm because OpManager receives the
trap from two different OIDs and each one got a separate trap rule
Syslog alarms
Prerequisites
o Configure devices to forward syslog events to OpManager's server
o Default ports: 514 & 519 (UDP); configurable
• Creating a syslog rule
o Syslog receiver
• Using facility name, severity, or match text to filter and clear syslog alarms (regex format)
• Identifying the syslog flow rate from OpManager
• Forwarding syslog messages to another NMS platform
Notifications
Notification
cycle
Profile type
- Send email or SMS
- Run system
command
- Run program
- Log a ticket
- Web alarm
- Syslog
- Trap
Alarm criteria
- Device down
- Service down
- Hardware fault
- Threshold violation
- Virtual device fault
- UCS fault
Device selection
- Category
- Business view
- Devices
Schedule
- All the time
- Selected time window
- Delayed trigger
- Recurring trigger
Preview
- Verify inputs
- Add a profile
#1 Email notification
Steps :
1. Configure mail server settings
2. Create a notification profile for 'email';
- Select the required 'alarm criteria'; -
Associate the profile with 'required devices';
I want to receive an email notification for all
service down alarms. How do I configure
this?
#2 Log a ticket
Steps :
1. Setting up the integration with ServiceDesk Plus
2. Create a notification profile for 'log a ticket'; - Select
the category, group and technician; - Select the
required 'alarm criteria'; - Associate the
profile with 'required devices';
I want OpManager to create a ticket in ServiceDesk Plus whenever a
problem is detected in the interface. The ticket should have the fields like
category, group and technician filled automatically.
IT workflow automation
• Get more space on the server for better performance
• Test SNMP service
• Export/ Import available templates
https://resources.manageengine.com/resources/forum/opmanager/workflows
IT workflow automation
Create a workflow Associate devices Schedule/trigger tasks
1 2 3
Tips and tricks
Tips and tricks
• Configure device dependencies to stop polling a dependent device
when its parent device is down
• Suppress known alarms from an individual device
• Configure the downtime scheduler and stop polling devices during
maintenance windows
• Configure alarm escalation and notify the super admin when a critical
alarm is not cleared within a given amount of time
youtube.com/opmanagertechvideos
help.opmanager.com
opmanager-
support@manageengine.com
+1 (888) 720-9500 / +1 (408) 916-
9400
Need more help?
forums.manageengine.com/opmanager
Free ITOM Seminar
https://www.manageengine.com/itom/seminars/chicago-la-2018.html
[Free OpManager training]  Part 4- Network fault-management & IT automation
www.manageengine.com
THANK YOU

More Related Content

PPTX
Free OpManager training_ Part 2-server monitoring
PPTX
Export flows, group traffic, map application traffic and more: NetFlow Analyz...
PPTX
Network Maps & Reporting [Free OpManager Training - Part 5]
PPTX
Monitoring network performance- Part 3_Free OpManager training
PPTX
Free OpManager training_Part 1- Discovery & classification
PPTX
Free OpManager training Part 3 - Monitoring Network Performance and Network Maps
PPTX
Free NetFlow Analyzer training - Getting the initial settings right
PPTX
Free OpManager training Part 4 - Monitoring Network Performance and Network Maps
Free OpManager training_ Part 2-server monitoring
Export flows, group traffic, map application traffic and more: NetFlow Analyz...
Network Maps & Reporting [Free OpManager Training - Part 5]
Monitoring network performance- Part 3_Free OpManager training
Free OpManager training_Part 1- Discovery & classification
Free OpManager training Part 3 - Monitoring Network Performance and Network Maps
Free NetFlow Analyzer training - Getting the initial settings right
Free OpManager training Part 4 - Monitoring Network Performance and Network Maps

What's hot (20)

PPTX
Free OpManager training Part1- Discovery and classification season#3
PPTX
Free OpManager training Part 2 Monitoring Server Performance- season#3
PPTX
OpManager training - Device discovery and classification.
PPT
Monitor and manage everything Cisco using OpManager
PPT
OpManager Major Features
PPTX
Network fault management and IT automation training
PPTX
Network and server performance monitoring training
PPTX
[Season - 3 OpManager Training] Monitoring Network Performance
PPT
Proof of Concept Guide for ManageEngine OpManager
PPTX
[Season - 3 Free OpManager Training] Monitoring Server Performance
PPTX
Overview and features of NCM
PDF
[Season - 3] OpManager Training - Network Maps,Reports and Best Practices
PPTX
Free training on NCM - Discovery & Disaster recovery
PPTX
Understanding firewall-policies-their-effectiveness-in-defending-against-netw...
PPTX
Configlets, compliance, RBAC & reports - Network Configuration Manager
PPTX
ManageEngine OpUtils Technical Overview
PPTX
Virtual Firewall Management
PPTX
Season 3 [free OpManager training]_Part1- Discovery and classification
PPTX
Free OpManager training Part1- Discovery and classification
PPTX
Network scanner
Free OpManager training Part1- Discovery and classification season#3
Free OpManager training Part 2 Monitoring Server Performance- season#3
OpManager training - Device discovery and classification.
Monitor and manage everything Cisco using OpManager
OpManager Major Features
Network fault management and IT automation training
Network and server performance monitoring training
[Season - 3 OpManager Training] Monitoring Network Performance
Proof of Concept Guide for ManageEngine OpManager
[Season - 3 Free OpManager Training] Monitoring Server Performance
Overview and features of NCM
[Season - 3] OpManager Training - Network Maps,Reports and Best Practices
Free training on NCM - Discovery & Disaster recovery
Understanding firewall-policies-their-effectiveness-in-defending-against-netw...
Configlets, compliance, RBAC & reports - Network Configuration Manager
ManageEngine OpUtils Technical Overview
Virtual Firewall Management
Season 3 [free OpManager training]_Part1- Discovery and classification
Free OpManager training Part1- Discovery and classification
Network scanner
Ad

Similar to [Free OpManager training] Part 4- Network fault-management & IT automation (20)

PPTX
Season 4 [Free OpManager training] Part4 - Network fault management & IT auto...
PPTX
Free OpManager training Part 4 - Fault Management and IT automation
PPT
Role of OpManager in event and fault management
PPTX
Opmanager Workshop - Middle East
PPT
Krall Cis516 Opmanager
PPTX
Season 4 [Free OpManager training] Part1- Discovery and classification
PDF
Monitoring Far Beyond the Operating System - WeOp 2014
PDF
Zabbix Smart problem detection - FISL 2015 workshop
PPTX
OpManager Technical Overview
PPTX
Opmanagertechnicaloverview 160128123947
PPTX
Overview OpManager
PPT
Vincent Haynes(Cis516 Assignment Week 9)
PPTX
Opmanager technical overview
PPTX
New OpManager v12
PPT
Op Manager7
PPTX
Design Like a Pro: Alarm Management
PPTX
Design Like a Pro: Alarm Management
PDF
OSMC 2008 | Advanced Windows monitoring and NSClient++ with Nagios by Michael...
PPT
DM_AC3302_E01 EMS Maintenance and Troubleshooting 73P.ppt
PPT
Newest Family Member - IT Automation With Opalis
Season 4 [Free OpManager training] Part4 - Network fault management & IT auto...
Free OpManager training Part 4 - Fault Management and IT automation
Role of OpManager in event and fault management
Opmanager Workshop - Middle East
Krall Cis516 Opmanager
Season 4 [Free OpManager training] Part1- Discovery and classification
Monitoring Far Beyond the Operating System - WeOp 2014
Zabbix Smart problem detection - FISL 2015 workshop
OpManager Technical Overview
Opmanagertechnicaloverview 160128123947
Overview OpManager
Vincent Haynes(Cis516 Assignment Week 9)
Opmanager technical overview
New OpManager v12
Op Manager7
Design Like a Pro: Alarm Management
Design Like a Pro: Alarm Management
OSMC 2008 | Advanced Windows monitoring and NSClient++ with Nagios by Michael...
DM_AC3302_E01 EMS Maintenance and Troubleshooting 73P.ppt
Newest Family Member - IT Automation With Opalis
Ad

More from ManageEngine, Zoho Corporation (20)

PPTX
Create seamless customer experiences
PDF
From web interface to database: Monitor what matters
PDF
NetFlow Analyzer Free Training Series Part I - May 2020
PDF
Overcome real-time server and VM monitoring challenges
PPTX
Modernizing Cloud and Hyperconverged Infrastructure monitoring
PPTX
Deliver seamless digital experience
PDF
Free NetFlow Analyzer training Season 1 Part 2 - Feb 2020
PPTX
From web interface to the database:Monitor all that matters
PDF
NetFlow Analyzer Training Season 1 Part 1 - Feb 2020 - EST
PDF
NetFlow Analyzer Training Season 1 Part 1 - Feb 2020 - GMT
PDF
NetFlow Analyzer Product Overview
PPTX
Monitoring cloud applications and hyperconverged infrastructure
PPTX
Building the right website monitoring strategy
PPTX
Unlock the value of your big data infrastructure
PPTX
Key to optimal end user experience
PPTX
Monitoring cloud applications and containers
PPTX
implementing the right website monitoring strategy
PPTX
Big data and non relational database
PPTX
Visibility-from web application interface to the database
PPTX
OpUtils Free training
Create seamless customer experiences
From web interface to database: Monitor what matters
NetFlow Analyzer Free Training Series Part I - May 2020
Overcome real-time server and VM monitoring challenges
Modernizing Cloud and Hyperconverged Infrastructure monitoring
Deliver seamless digital experience
Free NetFlow Analyzer training Season 1 Part 2 - Feb 2020
From web interface to the database:Monitor all that matters
NetFlow Analyzer Training Season 1 Part 1 - Feb 2020 - EST
NetFlow Analyzer Training Season 1 Part 1 - Feb 2020 - GMT
NetFlow Analyzer Product Overview
Monitoring cloud applications and hyperconverged infrastructure
Building the right website monitoring strategy
Unlock the value of your big data infrastructure
Key to optimal end user experience
Monitoring cloud applications and containers
implementing the right website monitoring strategy
Big data and non relational database
Visibility-from web application interface to the database
OpUtils Free training

Recently uploaded (20)

PPTX
Reimagine Home Health with the Power of Agentic AI​
PPTX
Odoo POS Development Services by CandidRoot Solutions
PDF
PTS Company Brochure 2025 (1).pdf.......
PDF
Design an Analysis of Algorithms I-SECS-1021-03
PDF
Digital Strategies for Manufacturing Companies
PPTX
ai tools demonstartion for schools and inter college
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PDF
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
PDF
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
PPTX
Essential Infomation Tech presentation.pptx
PPTX
VVF-Customer-Presentation2025-Ver1.9.pptx
PDF
medical staffing services at VALiNTRY
PDF
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PPTX
Introduction to Artificial Intelligence
PDF
System and Network Administration Chapter 2
PDF
Nekopoi APK 2025 free lastest update
PDF
top salesforce developer skills in 2025.pdf
PPTX
CHAPTER 2 - PM Management and IT Context
PDF
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
Reimagine Home Health with the Power of Agentic AI​
Odoo POS Development Services by CandidRoot Solutions
PTS Company Brochure 2025 (1).pdf.......
Design an Analysis of Algorithms I-SECS-1021-03
Digital Strategies for Manufacturing Companies
ai tools demonstartion for schools and inter college
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
Audit Checklist Design Aligning with ISO, IATF, and Industry Standards — Omne...
SAP S4 Hana Brochure 3 (PTS SYSTEMS AND SOLUTIONS)
Essential Infomation Tech presentation.pptx
VVF-Customer-Presentation2025-Ver1.9.pptx
medical staffing services at VALiNTRY
Claude Code: Everyone is a 10x Developer - A Comprehensive AI-Powered CLI Tool
Design an Analysis of Algorithms II-SECS-1021-03
Introduction to Artificial Intelligence
System and Network Administration Chapter 2
Nekopoi APK 2025 free lastest update
top salesforce developer skills in 2025.pdf
CHAPTER 2 - PM Management and IT Context
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...

[Free OpManager training] Part 4- Network fault-management & IT automation

  • 2. Week 4 Effective fault management and IT automation
  • 3. 1. How to identify the faults quickly? 2. How to prioritize the problems?
  • 4. All services are currently UP 1. How to identify the faults quickly? 2. How to prioritize the problems? 3. How do you get it resolved quickly?
  • 5. Agenda • Alarm severity levels • Threshold violation alarms • Other alarms : VMWare; Event logs; SNMP traps and Syslogs • Notifications • Using an IT workflow to remediate problems • Tips and tricks • Questions
  • 8. Device down Interface down Severity: predefined
  • 9. Process down Service down URL down Severity: predefined
  • 12. • Configuring threshold values on an individual device • Configuring consecutive times • Configuring rearm value to clear fault alarms • Using device templates to configure thresholds globally based on device type Threshold-based alarms
  • 13. VMWare alarms; Event logs; SNMP traps; Syslogs
  • 14. Alarms for inventory changes o vMotion o Host added/removed o Host or VMs connected/disconnected o VMs powered on/off o VMs orphaned o Scheduled task removed o Etc. Querying more events from the Vcenter server / ESX host VMware events
  • 15. Event log alarms Prerequisites o Check if WMI and RPC services are enabled on the Windows servers o Default WMI ports: 135 & 445, 5000 to 6000 (TCP) • Configuring event logs for a Windows server in OpManager • Ignoring a specific event log from a Windows server • Configuring OpManager to handle event floods (http://help.opmanager.com/stopping-event-flood) o serverparameters.conf (OpManager/conf/OpManager) o EVENTS_PER_HOUR 1000 o EVENT_FLOOD_SEVERITY Critical
  • 16. SNMP trap alarms 5things that you should know about SNMP traps in OpManager 1. Unsolicited traps 2. Varbinds 3. Failure component 4. Loading traps from MIB files 5. Forwarding trap messages to another NMS platform OpManager Trap- Receiver Router Switch Firewall Server SNMP Agent Management Definitions Management Database Trap (162)
  • 17. #1 Unsolicited traps I have configured a Router to forward SNMP traps to OpManager's server. However I don't get to see an alarm? How do I fix this? Things to verify :  Verify whether the Router is added to OpManager  Verify whether the 'Trap rule' is available for the respective event  Verify whether the trap event is listed under 'Unsolicited traps' Solution: Identify the event from the 'Unsolicited traps' and add a new trap rule
  • 18. #2 Varbinds I have a Windows server added to OpManager. It triggers 100s of trap events with various messages from x.x.x.x OID. However I want to filter the trap event only if the priority is 'critical' and clear the event automatically when the priority is 'low'? How do I achieve this? Know • What is a varbind? • How to identify the varbinds from trap event? Solution: Use 'match criteria' to filter and clear the trap alarms based on 'varbinds'
  • 19. #3 Failure component I have a Switch added to OpManager. It triggers a failure trap event for BGP down from .1.3.6.1.2.1.15.7.2 OID and a clear event for BGP up from .1.3.6.1.2.1.15.7.1 OID. This generates two different alarms in OpManager. I want the clear alarm for BGP up event merged with the original alarm as it is for the same link. How do I achieve this? Solution: Provide a common 'failure component' in both the trap rule It generates two different alarm because OpManager receives the trap from two different OIDs and each one got a separate trap rule
  • 20. Syslog alarms Prerequisites o Configure devices to forward syslog events to OpManager's server o Default ports: 514 & 519 (UDP); configurable • Creating a syslog rule o Syslog receiver • Using facility name, severity, or match text to filter and clear syslog alarms (regex format) • Identifying the syslog flow rate from OpManager • Forwarding syslog messages to another NMS platform
  • 22. Notification cycle Profile type - Send email or SMS - Run system command - Run program - Log a ticket - Web alarm - Syslog - Trap Alarm criteria - Device down - Service down - Hardware fault - Threshold violation - Virtual device fault - UCS fault Device selection - Category - Business view - Devices Schedule - All the time - Selected time window - Delayed trigger - Recurring trigger Preview - Verify inputs - Add a profile
  • 23. #1 Email notification Steps : 1. Configure mail server settings 2. Create a notification profile for 'email'; - Select the required 'alarm criteria'; - Associate the profile with 'required devices'; I want to receive an email notification for all service down alarms. How do I configure this?
  • 24. #2 Log a ticket Steps : 1. Setting up the integration with ServiceDesk Plus 2. Create a notification profile for 'log a ticket'; - Select the category, group and technician; - Select the required 'alarm criteria'; - Associate the profile with 'required devices'; I want OpManager to create a ticket in ServiceDesk Plus whenever a problem is detected in the interface. The ticket should have the fields like category, group and technician filled automatically.
  • 26. • Get more space on the server for better performance • Test SNMP service • Export/ Import available templates https://resources.manageengine.com/resources/forum/opmanager/workflows IT workflow automation Create a workflow Associate devices Schedule/trigger tasks 1 2 3
  • 28. Tips and tricks • Configure device dependencies to stop polling a dependent device when its parent device is down • Suppress known alarms from an individual device • Configure the downtime scheduler and stop polling devices during maintenance windows • Configure alarm escalation and notify the super admin when a critical alarm is not cleared within a given amount of time
  • 29. youtube.com/opmanagertechvideos help.opmanager.com opmanager- support@manageengine.com +1 (888) 720-9500 / +1 (408) 916- 9400 Need more help? forums.manageengine.com/opmanager