SlideShare a Scribd company logo
Hardware Monitoring
with the new
IPMI Plugin v2
Werner Fischer, Technology Specialist Thomas-Krenn.AG
6. OSMC / Nuremberg / Germany
29th
November 2011
slide 2/37
who
I am not
who I am
Werner Fischer
Introduction
working for a
Server vendor
Linux user
since 2001
Kernel or H/W
developer
slide 3/37
who is
Server & accessories
"Made in Germany"
Introduction
based in Freyung,
Bavaria
serving all over Europe
slide 4/37
Some questions ...
Should I use monitoring?
slide 5/37
Should I use monitoring?
It depends on what you
want to do in your free time...
remember yesterday evening?
Some questions ...
✘✔
slide 6/37
All drives of the RAID 6 O.K.?
Some questions ...
✔✔ ✔ ✔
slide 7/37
All network connections O.K.?
Some questions ...
✔
✔
slide 8/37
All FANs O.K.?
Some questions ...
?
slide 9/37
All power supplies O.K.?
Some questions ...
?
slide 10/37
Can we monitor all these servers?
Some questions ...
any IPMI
compatible
server
slide 11/37
Can we monitor all these servers?
Easily?
Some questions ...
any IPMI
compatible
server
slide 12/37
Can we monitor all these servers?
Easily?
With one single tool?
Some questions ...
any IPMI
compatible
server
slide 13/37
Can we monitor all these servers?
Easily?
With one single tool?
Some questions ...
slide 14/37
Agenda
1) IPMI overview
2) Plugin implementation
3) Live demo
4) Common pitfalls
slide 15/37
Intelligent Platform Management Interface
• IPMI developed by Intel, HP, NEC, Dell
– 1998: IPMI v1.0
– 2001: IPMI v1.5
– 2004: IPMI v2.0
slide 16/37
IPMI main features
 Monitoring
(temp, fans, ...)
 Recovery Control
(power on/off/reset)
 Logging
(System Event Log)
 Inventory
(FRU information)
slide 17/37
IPMI overview
Chassis board
Motherboard
Processor
board
Memory
board
Baseboard
Management
Controller
(BMC)
System bus
NVS Storage
SDR
SEL
FRU
Chassis
mgmt.
(Satellite
Controller)
Sensors & Controls
Fan sensor
Temp. sensor
Power control
Reset control
…
FRU
Temp. s.
FRU
private mgmt. busses
IPMB
M/B
Serial
Controller
Serial
Port
Sharing
BMC
Serial
Controller
Serial/Modem
interface
LAN
interface
Serial
Connector
LAN
Connector
PCI mgmt. bus
Network
(LAN)
Controller
Remote Mmgt. Card
(KVM over IP, ...)
Auxillary
IPMB Connector
ICMB
ICMB
bridge
System
interface
Redundant Power
board
FRU Temp.
sensor
…
FRU
access req.
root privileges
access req.
username &
password
slide 18/37
IPMI Channel Privilege Levels (LAN access)
use privilege level 'User'
for monitoring purposes
Privilege Level Allows
User ● query sensors
Operator ● nearly all IPMI commands
● but no changing of out-of-
band interfaces
Administrator • all IPMI commands
slide 19/37
Example: remote control with ipmitool
[user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211 
                  ­U admin power status
Password:
Chassis Power is off
[user@adminpc ~]$
[user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211 
                  ­U admin power status
Password:
Chassis Power is off
[user@adminpc ~]$
[user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211 
                  ­U admin power on
Password:
Chassis Power Control: Up/On
[user@adminpc ~]$
[user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211 
                  ­U admin power on
Password:
Chassis Power Control: Up/On
[user@adminpc ~]$
[user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211 
                  ­U admin power status
Password:
Chassis Power is on
[user@adminpc ~]$
[user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211 
                  ­U admin power status
Password:
Chassis Power is on
[user@adminpc ~]$
slide 20/37
Discrete Threshold
multiple states possible:
● up to 15 states
● each state is reflected by a bit
● multiple state bits can active
changes event status on:
● analog reading compared to
threshold values
can provide:
● generic states
● sensor-specific states
provides:
● analog reading
of the sensor
● discr. threshold
comparison status bit
other class similar to discrete:
● OEM: discrete sensor where the
meaning of the states (offsets) are
OEM defined
IPMI Sensor Classes (1/2)
slide 21/37
Discrete Threshold
IPMI Sensor Classes (2/2)
[root@test ~]# ipmitool sdr get "PS2 Status"
Sensor ID              : PS2 Status (0x71)
 Entity ID             : 10.2 (Power Supply)
 Sensor Type (Discrete): Power Supply
 States Asserted       : Power Supply
                         [Presence detected]
                         [Power Supply AC lost]
 Assertion Events      : Power Supply
                         [Presence detected]
                         [Power Supply AC lost]
 Assertions Enabled    : Power Supply
                         [Presence detected]
                         [Failure detected]
                         [Predictive failure]
                         [Power Supply AC lost]
[...]
 Deassertions Enabled  : Power Supply
[...]
[root@test ~]# ipmitool sdr get "PS2 Status"
Sensor ID              : PS2 Status (0x71)
 Entity ID             : 10.2 (Power Supply)
 Sensor Type (Discrete): Power Supply
 States Asserted       : Power Supply
                         [Presence detected]
                         [Power Supply AC lost]
 Assertion Events      : Power Supply
                         [Presence detected]
                         [Power Supply AC lost]
 Assertions Enabled    : Power Supply
                         [Presence detected]
                         [Failure detected]
                         [Predictive failure]
                         [Power Supply AC lost]
[...]
 Deassertions Enabled  : Power Supply
[...]
[root@test ~]# ipmitool sdr get "Fan 1"
Sensor ID              : Fan 1 (0x50)
 Entity ID             : 29.1 (Fan Device)
 Sensor Type (Analog)  : Fan
 Sensor Reading        : 5719 (+/­ 0) RPM
 Status                : ok
 Nominal Reading       : 6708.000
 Normal Minimum        : 2451.000
 Normal Maximum        : 10965.000
 Lower critical        : 1720.000
 Lower non­critical    : 1978.000
 Positive Hysteresis   : 86.000
 Negative Hysteresis   : 86.000
 Minimum sensor range  : Unspecified
 Maximum sensor range  : Unspecified
 Event Message Control : Per­threshold
 Readable Thresholds   : lcr lnc 
 Settable Thresholds   : lcr lnc 
 Threshold Read Mask   : lcr lnc 
 Assertion Events      : 
 Assertions Enabled    : lnc­ lcr­ 
 Deassertions Enabled  : lnc­ lcr­ 
[root@test ~]# ipmitool sdr get "Fan 1"
Sensor ID              : Fan 1 (0x50)
 Entity ID             : 29.1 (Fan Device)
 Sensor Type (Analog)  : Fan
 Sensor Reading        : 5719 (+/­ 0) RPM
 Status                : ok
 Nominal Reading       : 6708.000
 Normal Minimum        : 2451.000
 Normal Maximum        : 10965.000
 Lower critical        : 1720.000
 Lower non­critical    : 1978.000
 Positive Hysteresis   : 86.000
 Negative Hysteresis   : 86.000
 Minimum sensor range  : Unspecified
 Maximum sensor range  : Unspecified
 Event Message Control : Per­threshold
 Readable Thresholds   : lcr lnc 
 Settable Thresholds   : lcr lnc 
 Threshold Read Mask   : lcr lnc 
 Assertion Events      : 
 Assertions Enabled    : lnc­ lcr­ 
 Deassertions Enabled  : lnc­ lcr­ 
slide 22/37
IPMI Sensor Types
root@test:~# ipmi­sensors ­L
Temperature
Voltage
Current
Fan
Physical_Security
Platform_Security_Violation_Attempt
Processor
Power_Supply
Power_Unit
Cooling_Device
[…]
root@test:~# ipmi­sensors ­L
Temperature
Voltage
Current
Fan
Physical_Security
Platform_Security_Violation_Attempt
Processor
Power_Supply
Power_Unit
Cooling_Device
[…]
slide 23/37
Example: query sensors with FreeIPMI
[root@testserver ~]# ipmimonitoring
Record_ID | Sensor Name | Sensor Group | Monitoring Status| 
Sensor Units | Sensor Reading
[...]
17 | Fan 5              | Fan     | Nominal | RPM | 9052.000000 
18 | Fan 6              | Fan     | Nominal | RPM | 8060.000000 
19 | PS1 AC Current     | Current | Nominal | A   | 0.124000 
20 | PS2 AC Current     | Current | Nominal | A   | 0.992000 
[...]
36 | Physical Scrty     | Physical Security | Critical | N/A | 
                                      'General Chassis Intrusion'
[root@testserver ~]# ipmimonitoring
Record_ID | Sensor Name | Sensor Group | Monitoring Status| 
Sensor Units | Sensor Reading
[...]
17 | Fan 5              | Fan     | Nominal | RPM | 9052.000000 
18 | Fan 6              | Fan     | Nominal | RPM | 8060.000000 
19 | PS1 AC Current     | Current | Nominal | A   | 0.124000 
20 | PS2 AC Current     | Current | Nominal | A   | 0.992000 
[...]
36 | Physical Scrty     | Physical Security | Critical | N/A | 
                                      'General Chassis Intrusion'
slide 24/37
Example: interpret discrete sensors( FreeIPMI)
root@test:~# cat /etc/freeipmi/freeipmi_interpret_sensor.conf
[…]
## IPMI_Physical_Security 
#
# IPMI_Physical_Security_No_Event                  Nominal
# IPMI_Physical_Security_General_Chassis_Intrusion Critical
# IPMI_Physical_Security_Drive_Bay_Intrusion       Critical
[…]
# IPMI_Power_Supply_No_Event                       Nominal
# IPMI_Power_Supply_Presence_Detected              Nominal
# IPMI_Power_Supply_Power_Supply_Failure_Detected  Critical
# IPMI_Power_Supply_Predictive_Failure             Critical
# IPMI_Power_Supply_Power_Supply_Input_Lost_AC_DC  Critical
[…]
## IPMI_Memory
#
# IPMI_Memory_No_Event                             Nominal
# IPMI_Memory_Correctable_Memory_Error             Warning
# IPMI_Memory_Uncorrectable_Memory_Error           Critical
root@test:~# cat /etc/freeipmi/freeipmi_interpret_sensor.conf
[…]
## IPMI_Physical_Security 
#
# IPMI_Physical_Security_No_Event                  Nominal
# IPMI_Physical_Security_General_Chassis_Intrusion Critical
# IPMI_Physical_Security_Drive_Bay_Intrusion       Critical
[…]
# IPMI_Power_Supply_No_Event                       Nominal
# IPMI_Power_Supply_Presence_Detected              Nominal
# IPMI_Power_Supply_Power_Supply_Failure_Detected  Critical
# IPMI_Power_Supply_Predictive_Failure             Critical
# IPMI_Power_Supply_Power_Supply_Input_Lost_AC_DC  Critical
[…]
## IPMI_Memory
#
# IPMI_Memory_No_Event                             Nominal
# IPMI_Memory_Correctable_Memory_Error             Warning
# IPMI_Memory_Uncorrectable_Memory_Error           Critical
slide 25/37
IPMI System Event Log (SEL)
• stored in non-volatile storage
[root@testserver ~]# ipmitool sel elist
  40 | 06/21/2010 | 14:29:29 | Power Supply PS1 Status | Power Supply AC lost | Asserted
  54 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Fully Redundant
  68 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Redundancy Lost
  7c | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Non­Redundant: Sufficient from Redundant
[...]
 2fc | 06/21/2010 | 15:20:32 | Physical Security Physical Scrty | General Chassis intrusion | Asserted
[root@testserver ~]# ipmitool sel elist
  40 | 06/21/2010 | 14:29:29 | Power Supply PS1 Status | Power Supply AC lost | Asserted
  54 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Fully Redundant
  68 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Redundancy Lost
  7c | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Non­Redundant: Sufficient from Redundant
[...]
 2fc | 06/21/2010 | 15:20:32 | Physical Security Physical Scrty | General Chassis intrusion | Asserted
[root@testserver ~]# ipmitool sel elist
Power Supply PS1 Status | Power Supply AC lost | Asserted
Power Unit Power Redundancy | Fully Redundant
Power Unit Power Redundancy | Redundancy Lost
Power Unit Power Redundancy | Non­Redundant: Sufficient from Redundant
[...]
Physical Security Physical Scrty | General Chassis intrusion | Asserted
[root@testserver ~]# ipmitool sel elist
Power Supply PS1 Status | Power Supply AC lost | Asserted
Power Unit Power Redundancy | Fully Redundant
Power Unit Power Redundancy | Redundancy Lost
Power Unit Power Redundancy | Non­Redundant: Sufficient from Redundant
[...]
Physical Security Physical Scrty | General Chassis intrusion | Asserted
slide 26/37
Agenda
1) IPMI overview
2) Plugin implementation
3) Live demo
4) Common pitfalls
slide 27/37
Plugin implementation
• Bash script
• uses FreeIPMI, gawk
# ./check_ipmi_sensor ­H 10.10.10.114 ­f /etc/ipmi­config/ipmi.cfg 
IPMI Status: OK | 'System Temp'=29.000000 'FAN 1'=4185.000000 'FAN 
2'=4320.000000 'FAN 3'=4590.000000 'FAN 4'=4320.000000 'FAN 
A'=4590.000000 'Vcore'=0.712000 '3.3VCC'=3.392000 '12V'=12.190000 
'VDIMM'=1.528000 '5VCC'=5.088000 '­12V'=­11.681000 'VBAT'=3.024000 
'VSB'=3.344000 'AVCC'=3.408000 
# ./check_ipmi_sensor ­H 10.10.10.114 ­f /etc/ipmi­config/ipmi.cfg 
IPMI Status: OK | 'System Temp'=29.000000 'FAN 1'=4185.000000 'FAN 
2'=4320.000000 'FAN 3'=4590.000000 'FAN 4'=4320.000000 'FAN 
A'=4590.000000 'Vcore'=0.712000 '3.3VCC'=3.392000 '12V'=12.190000 
'VDIMM'=1.528000 '5VCC'=5.088000 '­12V'=­11.681000 'VBAT'=3.024000 
'VSB'=3.344000 'AVCC'=3.408000 
slide 28/37
Plugin implementation
• Bash script
• uses FreeIPMI, gawk
# ./check_ipmi_sensor ­H 10.10.10.114 ­f /etc/ipmi­config/ipmi.cfg ­v 2
IPMI Status: OK | 'System Temp'=29.000000 'FAN 1'=4320.000000 'FAN 
[…] 
System Temp = 29.000000 (Status: Nominal)
CPU Temp = 'Low' (Status: Nominal)
FAN 1 = 4320.000000 (Status: Nominal)
FAN 2 = 4320.000000 (Status: Nominal)
FAN 3 = 4590.000000 (Status: Nominal)
[…]
AVCC = 3.408000 (Status: Nominal)
Chassis Intru = 'OK' (Status: Nominal)
PS Status = 'Presence detected' (Status: Nominal) 
# ./check_ipmi_sensor ­H 10.10.10.114 ­f /etc/ipmi­config/ipmi.cfg ­v 2
IPMI Status: OK | 'System Temp'=29.000000 'FAN 1'=4320.000000 'FAN 
[…] 
System Temp = 29.000000 (Status: Nominal)
CPU Temp = 'Low' (Status: Nominal)
FAN 1 = 4320.000000 (Status: Nominal)
FAN 2 = 4320.000000 (Status: Nominal)
FAN 3 = 4590.000000 (Status: Nominal)
[…]
AVCC = 3.408000 (Status: Nominal)
Chassis Intru = 'OK' (Status: Nominal)
PS Status = 'Presence detected' (Status: Nominal) 
slide 29/37
Plugin implementation
• clear illustration in webinterfaces
slide 30/37
Agenda
1) IPMI overview
2) Plugin implementation
3) Live demo
4) Common pitfalls
slide 31/37
Agenda
1) IPMI overview
2) Plugin implementation
3) Live demo
4) Common pitfalls
slide 32/37
Common pitfalls
• sensors with state N/A
• solution shortest-term: exclude (-x opt.)
• solution short-term: FreeIPMI update
tkwiki.cc/FreeIPMI-NA-Sensor
[…]
12 | CPU1 Temp | OEM Reserved | N/A | N/A | N/A | 'OEM Event = 0000h'
13 | CPU2 Temp | OEM Reserved | N/A | N/A | N/A | 'OEM Event = 0000h' 
[…]
[…]
12 | CPU1 Temp | OEM Reserved | N/A | N/A | N/A | 'OEM Event = 0000h'
13 | CPU2 Temp | OEM Reserved | N/A | N/A | N/A | 'OEM Event = 0000h' 
[…]
slide 33/37
Common pitfalls
• unrecognized events
• solution shortest-term: ignore unrec. e.
tkwiki.cc/FreeIPMI-Unrec-Event
[…] 
40 | Status | Cable/Interconnect | Nominal | N/A | 
'Cable/Interconnect is connected'
41 | RAC Status | Module/Board | N/A | N/A | 'Unrecognized Event = 
0001h' 'Unrecognized Event = 0002h' 'Unrecognized Event = 0004h'
42 | OS Watchdog | Watchdog 2 | Nominal | N/A | 'OK'
[…]  
[…] 
40 | Status | Cable/Interconnect | Nominal | N/A | 
'Cable/Interconnect is connected'
41 | RAC Status | Module/Board | N/A | N/A | 'Unrecognized Event = 
0001h' 'Unrecognized Event = 0002h' 'Unrecognized Event = 0004h'
42 | OS Watchdog | Watchdog 2 | Nominal | N/A | 'OK'
[…]  
slide 34/37
Agenda
1) IPMI overview
2) Plugin implementation
3) Live demo
4) Common pitfalls
some conclusions …
slide 35/37
Conclusions (1/2)
• Download:
www.thomas-krenn.com/en/oss
• Mailing List:
lists.thomas-krenn.com
• Thanks for your contribution:
Nikolaus Filus, Timme Katz, Lars
Meuser, Sebastian Mörchen, Gustav
Olsson, Holger Paschke, Andy Spiegl,
Ulrich Zehl
slide 36/37
Conclusions (2/2)
 Monitor hardware
with Icinga & IPMI
 Problems?
They will tell you!
 It'll save you
time & money
Get German article
on the plugin for free at
tkwiki.cc/ipmi-plugin
Thanks for your time!

More Related Content

PPSX
AVR I/O programming Atmega 64 uNiboard ThinkLABS maaniq
PDF
Timers and Endge-aligned PWM
PDF
AAME ARM Techcon2013 002v02 Advanced Features
PPTX
embedded system and AVR
PPT
A tmega8 basics
PPT
Developing an avr microcontroller system
PPT
AT mega8 basics
PDF
Atmel microcontrollers-a tmega328-p_datasheet
AVR I/O programming Atmega 64 uNiboard ThinkLABS maaniq
Timers and Endge-aligned PWM
AAME ARM Techcon2013 002v02 Advanced Features
embedded system and AVR
A tmega8 basics
Developing an avr microcontroller system
AT mega8 basics
Atmel microcontrollers-a tmega328-p_datasheet

What's hot (20)

PDF
Q4.11: ARM Architecture
PDF
AVR introduction
PDF
AAME ARM Techcon2013 005v02 System Startup
PDF
AAME ARM Techcon2013 003v02 Software Development
PPTX
Avr and arm
PPTX
Introduction to arm processor
PDF
Arm cm3 architecture_and_programmer_model
PPT
SIEMENS S7-300c.ppt
PDF
1 Day Arm 2007
PDF
Pic microcontroller step by step your complete guide
PPT
DCS PRESENTATION
PDF
Sitrain s7 1200 pwm pid
PDF
Signal Handling in Linux
PDF
Microcontroller part 2
PDF
Practical reverse engineering and exploit development for AVR-based Embedded ...
PDF
89c5131datasheet
PDF
ARM 7 Detailed instruction set
PPT
Programmable Rf Attenuation Project
PPTX
Arm architecture
PPT
Overview of LPC214x MCUs
Q4.11: ARM Architecture
AVR introduction
AAME ARM Techcon2013 005v02 System Startup
AAME ARM Techcon2013 003v02 Software Development
Avr and arm
Introduction to arm processor
Arm cm3 architecture_and_programmer_model
SIEMENS S7-300c.ppt
1 Day Arm 2007
Pic microcontroller step by step your complete guide
DCS PRESENTATION
Sitrain s7 1200 pwm pid
Signal Handling in Linux
Microcontroller part 2
Practical reverse engineering and exploit development for AVR-based Embedded ...
89c5131datasheet
ARM 7 Detailed instruction set
Programmable Rf Attenuation Project
Arm architecture
Overview of LPC214x MCUs
Ad

Similar to OSMC 2011 | Hardware Monitoring with the new IPMI Plugin v2 by Werner Fischer (20)

PDF
OSDC 2010 | The Power of IPMI by Werner Fischer
PPT
Ipmi Server Management
PDF
OSMC 2014: Server Hardware Monitoring done right | Werner Fischer
PDF
2015.1.5 os.server.keyterms
PDF
OSMC 2014 | Server Hardware Monitoring done right by Werner Fischer
ODP
Information Gathering 2
PDF
Icinga Camp Berlin 2017 - 10 Tips for better Hardware Monitoring
PPTX
[Wroclaw #8] IPMI appsec - an express train to hell
PDF
20111130 hardware-monitoring-with-the-new-ipmi-plugin-v2
PDF
BMCArmor: A Hardware Protection Scheme for Bare-metal Clouds
PDF
Pandora FMS: WMI Basic Monitoring
PDF
Acpi and smi handlers some limits to trusted computing
PPT
Network Implementation and Support Lesson 02 Managing Hardware Devices - Er...
PPT
Ipmi spec ch1~6_simon_20110422
PDF
How to enable AMD IOMMU in coreboot?
PPTX
Power aware operating system
PDF
物理サーバーを支える技術
PDF
OWASP Poland Day 2018 - Dani Ramirez - IPMI hacking
PDF
PMBus Specification Rev 1.2 Presentation 20100228.pdf
PPTX
Opmanager Workshop - Middle East
OSDC 2010 | The Power of IPMI by Werner Fischer
Ipmi Server Management
OSMC 2014: Server Hardware Monitoring done right | Werner Fischer
2015.1.5 os.server.keyterms
OSMC 2014 | Server Hardware Monitoring done right by Werner Fischer
Information Gathering 2
Icinga Camp Berlin 2017 - 10 Tips for better Hardware Monitoring
[Wroclaw #8] IPMI appsec - an express train to hell
20111130 hardware-monitoring-with-the-new-ipmi-plugin-v2
BMCArmor: A Hardware Protection Scheme for Bare-metal Clouds
Pandora FMS: WMI Basic Monitoring
Acpi and smi handlers some limits to trusted computing
Network Implementation and Support Lesson 02 Managing Hardware Devices - Er...
Ipmi spec ch1~6_simon_20110422
How to enable AMD IOMMU in coreboot?
Power aware operating system
物理サーバーを支える技術
OWASP Poland Day 2018 - Dani Ramirez - IPMI hacking
PMBus Specification Rev 1.2 Presentation 20100228.pdf
Opmanager Workshop - Middle East
Ad

Recently uploaded (20)

PDF
AI-Powered Threat Modeling: The Future of Cybersecurity by Arun Kumar Elengov...
PDF
Complete Guide to Website Development in Malaysia for SMEs
PDF
Wondershare Filmora 15 Crack With Activation Key [2025
PPTX
Reimagine Home Health with the Power of Agentic AI​
PDF
Digital Systems & Binary Numbers (comprehensive )
PDF
iTop VPN Free 5.6.0.5262 Crack latest version 2025
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 41
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PDF
17 Powerful Integrations Your Next-Gen MLM Software Needs
PDF
Designing Intelligence for the Shop Floor.pdf
PDF
wealthsignaloriginal-com-DS-text-... (1).pdf
PDF
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
PDF
Download FL Studio Crack Latest version 2025 ?
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PDF
iTop VPN Crack Latest Version Full Key 2025
PDF
Design an Analysis of Algorithms I-SECS-1021-03
PDF
How AI/LLM recommend to you ? GDG meetup 16 Aug by Fariman Guliev
PDF
Cost to Outsource Software Development in 2025
PPTX
CHAPTER 2 - PM Management and IT Context
PDF
Autodesk AutoCAD Crack Free Download 2025
AI-Powered Threat Modeling: The Future of Cybersecurity by Arun Kumar Elengov...
Complete Guide to Website Development in Malaysia for SMEs
Wondershare Filmora 15 Crack With Activation Key [2025
Reimagine Home Health with the Power of Agentic AI​
Digital Systems & Binary Numbers (comprehensive )
iTop VPN Free 5.6.0.5262 Crack latest version 2025
Internet Downloader Manager (IDM) Crack 6.42 Build 41
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
17 Powerful Integrations Your Next-Gen MLM Software Needs
Designing Intelligence for the Shop Floor.pdf
wealthsignaloriginal-com-DS-text-... (1).pdf
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
Download FL Studio Crack Latest version 2025 ?
Navsoft: AI-Powered Business Solutions & Custom Software Development
iTop VPN Crack Latest Version Full Key 2025
Design an Analysis of Algorithms I-SECS-1021-03
How AI/LLM recommend to you ? GDG meetup 16 Aug by Fariman Guliev
Cost to Outsource Software Development in 2025
CHAPTER 2 - PM Management and IT Context
Autodesk AutoCAD Crack Free Download 2025

OSMC 2011 | Hardware Monitoring with the new IPMI Plugin v2 by Werner Fischer

  • 1. Hardware Monitoring with the new IPMI Plugin v2 Werner Fischer, Technology Specialist Thomas-Krenn.AG 6. OSMC / Nuremberg / Germany 29th November 2011
  • 2. slide 2/37 who I am not who I am Werner Fischer Introduction working for a Server vendor Linux user since 2001 Kernel or H/W developer
  • 3. slide 3/37 who is Server & accessories "Made in Germany" Introduction based in Freyung, Bavaria serving all over Europe
  • 4. slide 4/37 Some questions ... Should I use monitoring?
  • 5. slide 5/37 Should I use monitoring? It depends on what you want to do in your free time... remember yesterday evening? Some questions ... ✘✔
  • 6. slide 6/37 All drives of the RAID 6 O.K.? Some questions ... ✔✔ ✔ ✔
  • 7. slide 7/37 All network connections O.K.? Some questions ... ✔ ✔
  • 8. slide 8/37 All FANs O.K.? Some questions ... ?
  • 9. slide 9/37 All power supplies O.K.? Some questions ... ?
  • 10. slide 10/37 Can we monitor all these servers? Some questions ... any IPMI compatible server
  • 11. slide 11/37 Can we monitor all these servers? Easily? Some questions ... any IPMI compatible server
  • 12. slide 12/37 Can we monitor all these servers? Easily? With one single tool? Some questions ... any IPMI compatible server
  • 13. slide 13/37 Can we monitor all these servers? Easily? With one single tool? Some questions ...
  • 14. slide 14/37 Agenda 1) IPMI overview 2) Plugin implementation 3) Live demo 4) Common pitfalls
  • 15. slide 15/37 Intelligent Platform Management Interface • IPMI developed by Intel, HP, NEC, Dell – 1998: IPMI v1.0 – 2001: IPMI v1.5 – 2004: IPMI v2.0
  • 16. slide 16/37 IPMI main features  Monitoring (temp, fans, ...)  Recovery Control (power on/off/reset)  Logging (System Event Log)  Inventory (FRU information)
  • 17. slide 17/37 IPMI overview Chassis board Motherboard Processor board Memory board Baseboard Management Controller (BMC) System bus NVS Storage SDR SEL FRU Chassis mgmt. (Satellite Controller) Sensors & Controls Fan sensor Temp. sensor Power control Reset control … FRU Temp. s. FRU private mgmt. busses IPMB M/B Serial Controller Serial Port Sharing BMC Serial Controller Serial/Modem interface LAN interface Serial Connector LAN Connector PCI mgmt. bus Network (LAN) Controller Remote Mmgt. Card (KVM over IP, ...) Auxillary IPMB Connector ICMB ICMB bridge System interface Redundant Power board FRU Temp. sensor … FRU access req. root privileges access req. username & password
  • 18. slide 18/37 IPMI Channel Privilege Levels (LAN access) use privilege level 'User' for monitoring purposes Privilege Level Allows User ● query sensors Operator ● nearly all IPMI commands ● but no changing of out-of- band interfaces Administrator • all IPMI commands
  • 19. slide 19/37 Example: remote control with ipmitool [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211                    ­U admin power status Password: Chassis Power is off [user@adminpc ~]$ [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211                    ­U admin power status Password: Chassis Power is off [user@adminpc ~]$ [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211                    ­U admin power on Password: Chassis Power Control: Up/On [user@adminpc ~]$ [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211                    ­U admin power on Password: Chassis Power Control: Up/On [user@adminpc ~]$ [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211                    ­U admin power status Password: Chassis Power is on [user@adminpc ~]$ [user@adminpc ~]$ ipmitool ­I lan ­H 192.168.1.211                    ­U admin power status Password: Chassis Power is on [user@adminpc ~]$
  • 20. slide 20/37 Discrete Threshold multiple states possible: ● up to 15 states ● each state is reflected by a bit ● multiple state bits can active changes event status on: ● analog reading compared to threshold values can provide: ● generic states ● sensor-specific states provides: ● analog reading of the sensor ● discr. threshold comparison status bit other class similar to discrete: ● OEM: discrete sensor where the meaning of the states (offsets) are OEM defined IPMI Sensor Classes (1/2)
  • 21. slide 21/37 Discrete Threshold IPMI Sensor Classes (2/2) [root@test ~]# ipmitool sdr get "PS2 Status" Sensor ID              : PS2 Status (0x71)  Entity ID             : 10.2 (Power Supply)  Sensor Type (Discrete): Power Supply  States Asserted       : Power Supply                          [Presence detected]                          [Power Supply AC lost]  Assertion Events      : Power Supply                          [Presence detected]                          [Power Supply AC lost]  Assertions Enabled    : Power Supply                          [Presence detected]                          [Failure detected]                          [Predictive failure]                          [Power Supply AC lost] [...]  Deassertions Enabled  : Power Supply [...] [root@test ~]# ipmitool sdr get "PS2 Status" Sensor ID              : PS2 Status (0x71)  Entity ID             : 10.2 (Power Supply)  Sensor Type (Discrete): Power Supply  States Asserted       : Power Supply                          [Presence detected]                          [Power Supply AC lost]  Assertion Events      : Power Supply                          [Presence detected]                          [Power Supply AC lost]  Assertions Enabled    : Power Supply                          [Presence detected]                          [Failure detected]                          [Predictive failure]                          [Power Supply AC lost] [...]  Deassertions Enabled  : Power Supply [...] [root@test ~]# ipmitool sdr get "Fan 1" Sensor ID              : Fan 1 (0x50)  Entity ID             : 29.1 (Fan Device)  Sensor Type (Analog)  : Fan  Sensor Reading        : 5719 (+/­ 0) RPM  Status                : ok  Nominal Reading       : 6708.000  Normal Minimum        : 2451.000  Normal Maximum        : 10965.000  Lower critical        : 1720.000  Lower non­critical    : 1978.000  Positive Hysteresis   : 86.000  Negative Hysteresis   : 86.000  Minimum sensor range  : Unspecified  Maximum sensor range  : Unspecified  Event Message Control : Per­threshold  Readable Thresholds   : lcr lnc   Settable Thresholds   : lcr lnc   Threshold Read Mask   : lcr lnc   Assertion Events      :   Assertions Enabled    : lnc­ lcr­   Deassertions Enabled  : lnc­ lcr­  [root@test ~]# ipmitool sdr get "Fan 1" Sensor ID              : Fan 1 (0x50)  Entity ID             : 29.1 (Fan Device)  Sensor Type (Analog)  : Fan  Sensor Reading        : 5719 (+/­ 0) RPM  Status                : ok  Nominal Reading       : 6708.000  Normal Minimum        : 2451.000  Normal Maximum        : 10965.000  Lower critical        : 1720.000  Lower non­critical    : 1978.000  Positive Hysteresis   : 86.000  Negative Hysteresis   : 86.000  Minimum sensor range  : Unspecified  Maximum sensor range  : Unspecified  Event Message Control : Per­threshold  Readable Thresholds   : lcr lnc   Settable Thresholds   : lcr lnc   Threshold Read Mask   : lcr lnc   Assertion Events      :   Assertions Enabled    : lnc­ lcr­   Deassertions Enabled  : lnc­ lcr­ 
  • 22. slide 22/37 IPMI Sensor Types root@test:~# ipmi­sensors ­L Temperature Voltage Current Fan Physical_Security Platform_Security_Violation_Attempt Processor Power_Supply Power_Unit Cooling_Device […] root@test:~# ipmi­sensors ­L Temperature Voltage Current Fan Physical_Security Platform_Security_Violation_Attempt Processor Power_Supply Power_Unit Cooling_Device […]
  • 23. slide 23/37 Example: query sensors with FreeIPMI [root@testserver ~]# ipmimonitoring Record_ID | Sensor Name | Sensor Group | Monitoring Status|  Sensor Units | Sensor Reading [...] 17 | Fan 5              | Fan     | Nominal | RPM | 9052.000000  18 | Fan 6              | Fan     | Nominal | RPM | 8060.000000  19 | PS1 AC Current     | Current | Nominal | A   | 0.124000  20 | PS2 AC Current     | Current | Nominal | A   | 0.992000  [...] 36 | Physical Scrty     | Physical Security | Critical | N/A |                                        'General Chassis Intrusion' [root@testserver ~]# ipmimonitoring Record_ID | Sensor Name | Sensor Group | Monitoring Status|  Sensor Units | Sensor Reading [...] 17 | Fan 5              | Fan     | Nominal | RPM | 9052.000000  18 | Fan 6              | Fan     | Nominal | RPM | 8060.000000  19 | PS1 AC Current     | Current | Nominal | A   | 0.124000  20 | PS2 AC Current     | Current | Nominal | A   | 0.992000  [...] 36 | Physical Scrty     | Physical Security | Critical | N/A |                                        'General Chassis Intrusion'
  • 24. slide 24/37 Example: interpret discrete sensors( FreeIPMI) root@test:~# cat /etc/freeipmi/freeipmi_interpret_sensor.conf […] ## IPMI_Physical_Security  # # IPMI_Physical_Security_No_Event                  Nominal # IPMI_Physical_Security_General_Chassis_Intrusion Critical # IPMI_Physical_Security_Drive_Bay_Intrusion       Critical […] # IPMI_Power_Supply_No_Event                       Nominal # IPMI_Power_Supply_Presence_Detected              Nominal # IPMI_Power_Supply_Power_Supply_Failure_Detected  Critical # IPMI_Power_Supply_Predictive_Failure             Critical # IPMI_Power_Supply_Power_Supply_Input_Lost_AC_DC  Critical […] ## IPMI_Memory # # IPMI_Memory_No_Event                             Nominal # IPMI_Memory_Correctable_Memory_Error             Warning # IPMI_Memory_Uncorrectable_Memory_Error           Critical root@test:~# cat /etc/freeipmi/freeipmi_interpret_sensor.conf […] ## IPMI_Physical_Security  # # IPMI_Physical_Security_No_Event                  Nominal # IPMI_Physical_Security_General_Chassis_Intrusion Critical # IPMI_Physical_Security_Drive_Bay_Intrusion       Critical […] # IPMI_Power_Supply_No_Event                       Nominal # IPMI_Power_Supply_Presence_Detected              Nominal # IPMI_Power_Supply_Power_Supply_Failure_Detected  Critical # IPMI_Power_Supply_Predictive_Failure             Critical # IPMI_Power_Supply_Power_Supply_Input_Lost_AC_DC  Critical […] ## IPMI_Memory # # IPMI_Memory_No_Event                             Nominal # IPMI_Memory_Correctable_Memory_Error             Warning # IPMI_Memory_Uncorrectable_Memory_Error           Critical
  • 25. slide 25/37 IPMI System Event Log (SEL) • stored in non-volatile storage [root@testserver ~]# ipmitool sel elist   40 | 06/21/2010 | 14:29:29 | Power Supply PS1 Status | Power Supply AC lost | Asserted   54 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Fully Redundant   68 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Redundancy Lost   7c | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Non­Redundant: Sufficient from Redundant [...]  2fc | 06/21/2010 | 15:20:32 | Physical Security Physical Scrty | General Chassis intrusion | Asserted [root@testserver ~]# ipmitool sel elist   40 | 06/21/2010 | 14:29:29 | Power Supply PS1 Status | Power Supply AC lost | Asserted   54 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Fully Redundant   68 | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Redundancy Lost   7c | 06/21/2010 | 14:29:29 | Power Unit Power Redundancy | Non­Redundant: Sufficient from Redundant [...]  2fc | 06/21/2010 | 15:20:32 | Physical Security Physical Scrty | General Chassis intrusion | Asserted [root@testserver ~]# ipmitool sel elist Power Supply PS1 Status | Power Supply AC lost | Asserted Power Unit Power Redundancy | Fully Redundant Power Unit Power Redundancy | Redundancy Lost Power Unit Power Redundancy | Non­Redundant: Sufficient from Redundant [...] Physical Security Physical Scrty | General Chassis intrusion | Asserted [root@testserver ~]# ipmitool sel elist Power Supply PS1 Status | Power Supply AC lost | Asserted Power Unit Power Redundancy | Fully Redundant Power Unit Power Redundancy | Redundancy Lost Power Unit Power Redundancy | Non­Redundant: Sufficient from Redundant [...] Physical Security Physical Scrty | General Chassis intrusion | Asserted
  • 26. slide 26/37 Agenda 1) IPMI overview 2) Plugin implementation 3) Live demo 4) Common pitfalls
  • 27. slide 27/37 Plugin implementation • Bash script • uses FreeIPMI, gawk # ./check_ipmi_sensor ­H 10.10.10.114 ­f /etc/ipmi­config/ipmi.cfg  IPMI Status: OK | 'System Temp'=29.000000 'FAN 1'=4185.000000 'FAN  2'=4320.000000 'FAN 3'=4590.000000 'FAN 4'=4320.000000 'FAN  A'=4590.000000 'Vcore'=0.712000 '3.3VCC'=3.392000 '12V'=12.190000  'VDIMM'=1.528000 '5VCC'=5.088000 '­12V'=­11.681000 'VBAT'=3.024000  'VSB'=3.344000 'AVCC'=3.408000  # ./check_ipmi_sensor ­H 10.10.10.114 ­f /etc/ipmi­config/ipmi.cfg  IPMI Status: OK | 'System Temp'=29.000000 'FAN 1'=4185.000000 'FAN  2'=4320.000000 'FAN 3'=4590.000000 'FAN 4'=4320.000000 'FAN  A'=4590.000000 'Vcore'=0.712000 '3.3VCC'=3.392000 '12V'=12.190000  'VDIMM'=1.528000 '5VCC'=5.088000 '­12V'=­11.681000 'VBAT'=3.024000  'VSB'=3.344000 'AVCC'=3.408000 
  • 28. slide 28/37 Plugin implementation • Bash script • uses FreeIPMI, gawk # ./check_ipmi_sensor ­H 10.10.10.114 ­f /etc/ipmi­config/ipmi.cfg ­v 2 IPMI Status: OK | 'System Temp'=29.000000 'FAN 1'=4320.000000 'FAN  […]  System Temp = 29.000000 (Status: Nominal) CPU Temp = 'Low' (Status: Nominal) FAN 1 = 4320.000000 (Status: Nominal) FAN 2 = 4320.000000 (Status: Nominal) FAN 3 = 4590.000000 (Status: Nominal) […] AVCC = 3.408000 (Status: Nominal) Chassis Intru = 'OK' (Status: Nominal) PS Status = 'Presence detected' (Status: Nominal)  # ./check_ipmi_sensor ­H 10.10.10.114 ­f /etc/ipmi­config/ipmi.cfg ­v 2 IPMI Status: OK | 'System Temp'=29.000000 'FAN 1'=4320.000000 'FAN  […]  System Temp = 29.000000 (Status: Nominal) CPU Temp = 'Low' (Status: Nominal) FAN 1 = 4320.000000 (Status: Nominal) FAN 2 = 4320.000000 (Status: Nominal) FAN 3 = 4590.000000 (Status: Nominal) […] AVCC = 3.408000 (Status: Nominal) Chassis Intru = 'OK' (Status: Nominal) PS Status = 'Presence detected' (Status: Nominal) 
  • 29. slide 29/37 Plugin implementation • clear illustration in webinterfaces
  • 30. slide 30/37 Agenda 1) IPMI overview 2) Plugin implementation 3) Live demo 4) Common pitfalls
  • 31. slide 31/37 Agenda 1) IPMI overview 2) Plugin implementation 3) Live demo 4) Common pitfalls
  • 32. slide 32/37 Common pitfalls • sensors with state N/A • solution shortest-term: exclude (-x opt.) • solution short-term: FreeIPMI update tkwiki.cc/FreeIPMI-NA-Sensor […] 12 | CPU1 Temp | OEM Reserved | N/A | N/A | N/A | 'OEM Event = 0000h' 13 | CPU2 Temp | OEM Reserved | N/A | N/A | N/A | 'OEM Event = 0000h'  […] […] 12 | CPU1 Temp | OEM Reserved | N/A | N/A | N/A | 'OEM Event = 0000h' 13 | CPU2 Temp | OEM Reserved | N/A | N/A | N/A | 'OEM Event = 0000h'  […]
  • 33. slide 33/37 Common pitfalls • unrecognized events • solution shortest-term: ignore unrec. e. tkwiki.cc/FreeIPMI-Unrec-Event […]  40 | Status | Cable/Interconnect | Nominal | N/A |  'Cable/Interconnect is connected' 41 | RAC Status | Module/Board | N/A | N/A | 'Unrecognized Event =  0001h' 'Unrecognized Event = 0002h' 'Unrecognized Event = 0004h' 42 | OS Watchdog | Watchdog 2 | Nominal | N/A | 'OK' […]   […]  40 | Status | Cable/Interconnect | Nominal | N/A |  'Cable/Interconnect is connected' 41 | RAC Status | Module/Board | N/A | N/A | 'Unrecognized Event =  0001h' 'Unrecognized Event = 0002h' 'Unrecognized Event = 0004h' 42 | OS Watchdog | Watchdog 2 | Nominal | N/A | 'OK' […]  
  • 34. slide 34/37 Agenda 1) IPMI overview 2) Plugin implementation 3) Live demo 4) Common pitfalls some conclusions …
  • 35. slide 35/37 Conclusions (1/2) • Download: www.thomas-krenn.com/en/oss • Mailing List: lists.thomas-krenn.com • Thanks for your contribution: Nikolaus Filus, Timme Katz, Lars Meuser, Sebastian Mörchen, Gustav Olsson, Holger Paschke, Andy Spiegl, Ulrich Zehl
  • 36. slide 36/37 Conclusions (2/2)  Monitor hardware with Icinga & IPMI  Problems? They will tell you!  It'll save you time & money
  • 37. Get German article on the plugin for free at tkwiki.cc/ipmi-plugin Thanks for your time!