SlideShare a Scribd company logo
vSphere Performance Monitoring and TroubleshootingOverviewWhat?CPU, Memory, Disk, NetworkHow?Use available tools and a systematic methodologyWhy?Need to build confidence in virtualizing critical and high demand applications
vSphere Performance Monitoring and TroubleshootingTop IssuesTop Issues:Storage "performance capacity" oversubscriptionMemory oversubscriptionSMP overuseFirmware & driver issues
vSphere Performance Monitoring and TroubleshootingWhat tools do we have at our disposal?Top tools for information collection:vCenter - Performance charts and alarmsGuest OS* - Task Manager/Resource Monitor and PerfMonESX Host - esxtop and vscsiStatsvSpherePowerCLI*Guest based monitoring is subject to inaccuracy
vSphere Performance Monitoring and TroubleshootingPrepare vCenter Settings
vSphere Performance Monitoring and TroubleshootingPrepare vCenter Settings
vSphere Performance Monitoring and TroubleshootingPrepare vCenter SettingsPrepare custom vCenter alerts:Host Console Swap In Rate  512KBps Warning, 1024 KBps AlertHost Console Swap Out Rate  512KBps Warning, 1024 KBps AlertVM CPU Ready  1000ms Warning, 2000ms AlertVM Disk Latency  20ms Warning, 50ms Alert
vSphere Performance Monitoring and TroubleshootingPrepare vCenter Settings
vSphere Performance Monitoring and TroubleshootingPrepare vCenter Settings
vSphere Performance Monitoring and TroubleshootingPrepare esxtopESXTOP realtime monitoring:esxtop(run command from SSH or tech-support mode)s 2 (refresh view every 2 seconds)V (View VMs only)h(for quick in-tool command reference)Batch Mode for a 5 minute capture of all stats:esxtop-b -a -d 2 -n 150 > esxtop_capture.csv
vSphere Performance Monitoring and TroubleshootingPrepare PowerCLIRun PowerCLI:Tip: Run as AdministratorSet-ExecutionPolicyremotesignedConnect-VIServer -Server <host> -Protocol https -User <user> -Password <pass>	<host> can be IP address or name of ESX server or vCenterGet-VMGet-Stat -common -realtime
vSphere Performance Monitoring and TroubleshootingWhere do we get started?
vSphere Performance Monitoring and TroubleshootingNetwork Overview
vSphere Performance Monitoring and TroubleshootingNetworkTroubleshooting Guidance:1.	Physical Issues - A bad cable, a failing switch port or NIC, or an incompatible/flawed firmware or device driver (use VMXNET3 whenever possible)2.	Configuration Issues - Inconsistent configuration of vSwitches, Port Groups, or upstream VLAN trunks3.	Capacity Issues - Too many VMs on a single NIC; inadequate switch backplane or uplink capacity; sharing “unmanaged” network infrastructure for storage and data4.	Thresholds – Bandwidth saturation, dropped packets
vSphere Performance Monitoring and TroubleshootingNetwork – What can we see?
vSphere Performance Monitoring and TroubleshootingNetworkvCenter Metrics: Receive packets droppedTransmit packets dropped
vSphere Performance Monitoring and TroubleshootingNetworkESXTOP Metrics:
vSphere Performance Monitoring and TroubleshootingNetworkESXTOP Commands:esxtops 2nf
vSphere Performance Monitoring and TroubleshootingNetworkESXTOP Example:
vSphere Performance Monitoring and TroubleshootingNetworkPowerCLI Commands:Get-Stat -net -realtimeGet-Stat -Entity <Host> -stat net.droppedRx.summationGet-Stat -Entity <Host> -stat net.droppedTx.summation
vSphere Performance Monitoring and TroubleshootingNetwork – What can’t we see?
vSphere Performance Monitoring and TroubleshootingNetworkPossible resources for external monitoring:Native Telnet/SSH/HTTP-based interface counters and statsThird-party SNMP, NetFlow and ICMP tools
vSphere Performance Monitoring and TroubleshootingCPU Overview
vSphere Performance Monitoring and TroubleshootingCPUTroubleshooting Guidance:1.	Physical Issues - Rare and always catastrophic (e.g. obvious)2.	Configuration Issues - Too many / too few vCPUs per VM; SMP/HAL mismatch; incorrect CPU affinity settings3.	Capacity Issues - CPU saturation at the guest or host level; CPU starvation due to high IO or other system level ops4.	Thresholds – Waiting for CPU cycles (due to co-scheduling, swapping, high IO)
vSphere Performance Monitoring and TroubleshootingCPU – What can we see?
vSphere Performance Monitoring and TroubleshootingCPUvCenter Metrics: Host/Guest SaturationStacked Graph (per VM)Usage
vSphere Performance Monitoring and TroubleshootingCPUvCenter Metrics:GuestReady (value/20=n%)Swap Wait
vSphere Performance Monitoring and TroubleshootingCPUESXTOP Metrics:
vSphere Performance Monitoring and TroubleshootingCPUESXTOP Commands:esxtops 2Vce GID (expand/contract a VM world)
vSphere Performance Monitoring and TroubleshootingCPUESXTOP Example:Excessive vCPUs
vSphere Performance Monitoring and TroubleshootingCPUESXTOP Example: Now with fewer vCPUs
vSphere Performance Monitoring and TroubleshootingCPUESXTOP Example:SMP impacting multiple VMs
vSphere Performance Monitoring and TroubleshootingCPUPowerCLI ExampleGet-Stat -cpuGet-Stat -Entity <VM> -stat cpu.ready.summation -realtimeVery cool script code at:http://www.peetersonline.nl/index.php/vmware/examine-vmware-cpu-ready-times-with-powershell/
vSphere Performance Monitoring and TroubleshootingCPU – Not much else to see…
vSphere Performance Monitoring and TroubleshootingCPUPossible resources for external monitoring:Vendor specific systems management tools,MS System Center, etc.http://www.peetersonline.nl/index.php/vmware/examine-vmware-cpu-ready-times-with-powershell/
vSphere Performance Monitoring and TroubleshootingMemory Overview
vSphere Performance Monitoring and TroubleshootingMemoryTroubleshooting Guidance:1.	Physical Issues - Rare and usually catastrophic2.	Configuration Issues - Memory overcommit; incorrect configuration of shares, reservations or limits3.	Capacity Issues - Physical memory exhaustion4.	Thresholds – Active memory swapping
vSphere Performance Monitoring and TroubleshootingMemory – What can we see?
vSphere Performance Monitoring and TroubleshootingMemoryvCenter MetricsSwap in rateSwap out rateSwap used
vSphere Performance Monitoring and TroubleshootingMemoryESXTOP Metrics:
vSphere Performance Monitoring and TroubleshootingMemoryESXTOP Commands:esxtops 2Vmf
vSphere Performance Monitoring and TroubleshootingMemoryESXTOP Example:m – Heavy swapping and ballooning
vSphere Performance Monitoring and TroubleshootingMemoryPowerCLI Commands:Get-Stat -memGet-Stat -Entity <VM> -stat mem.swapoutRate.average -realtimeGet-Stat -Entity <VM> -stat mem.swapinRate.average -realtimeGet-Stat -Entity <VM> -stat mem.vmmemctl.average -realtimeGet-Stat -Entity <Host> -stat mem.swapused.average -realtime
vSphere Performance Monitoring and TroubleshootingMemory – The occasional DIMM failure…
vSphere Performance Monitoring and TroubleshootingMemoryPossible external monitoring options:Vendor specific systems management tools, MS System Center, etc.Don’t forget vCenter ‘Hardware Status’ reporting
vSphere Performance Monitoring and TroubleshootingStorage Overview
vSphere Performance Monitoring and TroubleshootingStorageTroubleshooting Guidance:1.	Physical Issues - A bad cable, a failing switch port or HBA/NIC, or an incompatible/flawed firmware or device driver (use LSI Logic Parallel/SAS as appropriate)2.	Configuration Issues - Inconsistent or incorrect configuration of LUN masking, zoning, or multi-pathing; inappropriate resource provisioning; aligning queue depth with storage type3.	Capacity Issues - Too many VMs or VMDKs on a LUN; too much IO load for an array or RAID group4.	Thresholds – Latency and queuing
vSphere Performance Monitoring and TroubleshootingStorage – What can we see?
vSphere Performance Monitoring and TroubleshootingStoragevCenter Metrics:DatastoreRead latencyWrite latency
vSphere Performance Monitoring and TroubleshootingStorageESXTOP Metrics:
vSphere Performance Monitoring and TroubleshootingStorage
vSphere Performance Monitoring and TroubleshootingStorageESXTOP Commands (HBA/LUN):esxtops 2Vdfe vmhba#
vSphere Performance Monitoring and TroubleshootingStorageESXTOP Commands(LUN/Datastore):esxtops 2VuL 38fe <devname>
vSphere Performance Monitoring and TroubleshootingStorageESXTOP Commands (VM/VMDK):esxtops 2Vvfe GID
vSphere Performance Monitoring and TroubleshootingStorageESXTOP Examples: d - Multipathing / Expand adapter to view targets
vSphere Performance Monitoring and TroubleshootingStorageESXTOP Examples: u - Queuing, Disk or Kernel?
vSphere Performance Monitoring and TroubleshootingStorageESXTOP Examples:v - Identify the IO consumer
vSphere Performance Monitoring and TroubleshootingStoragevscsiStatsCommand:[root@host ~]# cd /usr/lib/vmware/bin./vscsiStats -l./vscsiStats -s -w <worldid>./vscsiStats -w <worldid> -p all -c > /path/vscsistats.csv./vscsiStats -x
vSphere Performance Monitoring and TroubleshootingStoragevscsiStatsExample:
vSphere Performance Monitoring and TroubleshootingStoragevscsiStatsExample:
vSphere Performance Monitoring and TroubleshootingStoragevscsiStatsExample:http://dunnsept.wordpress.com/2010/03/11/new-vscsistats-excel-macro/
vSphere Performance Monitoring and TroubleshootingStoragevscsiStatshistograms:
vSphere Performance Monitoring and TroubleshootingStoragePowerCLI Commands:Get-Stat -diskGet-Stat -stat disk.totalLatency.average -realtimeGet-Stat -stat disk.deviceLatency.average -realtimeGet-Stat -stat disk.kernelLatency.average -realtime
vSphere Performance Monitoring and TroubleshootingStorage – What can’t we see?
vSphere Performance Monitoring and TroubleshootingStorage – More of what we can’t see
vSphere Performance Monitoring and TroubleshootingStoragePossible external monitoring solutions:Vendor specific SAN and fabric/network tools, native Telnet/SSH/HTTP-based tools for most networks, third-party SNMP-based tools
vSphere Performance Monitoring and TroubleshootingWorking with PowerCLIPowerCLI Tips:For a complete list of stat objects:Get-StatType -Entity <Host/VM>Pipe the outputs to a file:Get-Stat -stat <stat> -realtime | ft -autosize > c:\temp\<filename>.csvImport the CSV file data to a spreadsheet with fixed width parametersBuild pretty graphs
vSphere Performance Monitoring and TroubleshootingWorking with PowerCLI
vSphere Performance Monitoring and TroubleshootingWay More InformationESXTOP / vscsiStats / PowerCLI:http://www.yellow-bricks.com/esxtop/     Special thanks to Duncan Epping!http://communities.vmware.com/docs/DOC-3930http://communities.vmware.com/docs/DOC-9279http://communities.vmware.com/docs/DOC-10095http://www.vmware.com/support/developer/PowerCLI/PowerCLI41/html/Get-Stat.htmlhttp://www.lucd.info/2009/12/30/powercli-vsphere-statistics-part-1-the-basics/http://simongreaves.co.uk/blog/esxtop-guidehttp://dunnsept.wordpress.com/2010/03/11/new-vscsistats-excel-macro/
vSphere Performance Monitoring and TroubleshootingEasy button?What is the problem with these tools?Limited alerting mechanisms, no collection automation or historical data for comparison, and no correlation of events!vCenter Operations Standard / Enterprise

More Related Content

PPTX
VMware vSphere technical presentation
PPTX
VMware Advance Troubleshooting Workshop - Day 5
PPTX
VMware Advance Troubleshooting Workshop - Day 2
PPTX
VMware Advance Troubleshooting Workshop - Day 4
PPTX
Esxi troubleshooting
PPTX
VMware ESXi 6.0 Installation Process
PPTX
Introduction to Hyper-V
PPTX
VMware vSphere 6.0 - Troubleshooting Training - Day 1
VMware vSphere technical presentation
VMware Advance Troubleshooting Workshop - Day 5
VMware Advance Troubleshooting Workshop - Day 2
VMware Advance Troubleshooting Workshop - Day 4
Esxi troubleshooting
VMware ESXi 6.0 Installation Process
Introduction to Hyper-V
VMware vSphere 6.0 - Troubleshooting Training - Day 1

What's hot (20)

PDF
vSAN Beyond The Basics
PPTX
NSX-T Architecture and Components.pptx
PDF
VMware vSphere Networking deep dive
PPTX
Microsoft Hyper-V
PDF
Managing ESXi - Tools and Techniques
PDF
Smart monitoring how does oracle rac manage resource, state ukoug19
PDF
VMware Virtual SAN Presentation
PPTX
Vce vxrail-customer-presentation new
PDF
Understanding oracle rac internals part 1 - slides
PDF
VSAN – Architettura e Design
PPT
VMWARE ESX
PDF
MySQL with DRBD/Pacemaker/Corosync on Linux
PPTX
VMware vSAN - Novosco, June 2017
PPTX
VMware Cloud Foundation - PnP presentation 8_6_18 EN.pptx
PDF
VMware - Virtual SAN - IT Changes Everything
PPTX
ZFS appliance
PDF
Percona xtrabackup - MySQL Meetup @ Mumbai
PPTX
VMware vSphere Storage Appliance (VSA) - Technical Presentation,Almacenamien...
PPTX
Hyper-Converged Infrastructure Vx Rail
PPTX
Meetup 23 - 02 - OVN - The future of networking in OpenStack
vSAN Beyond The Basics
NSX-T Architecture and Components.pptx
VMware vSphere Networking deep dive
Microsoft Hyper-V
Managing ESXi - Tools and Techniques
Smart monitoring how does oracle rac manage resource, state ukoug19
VMware Virtual SAN Presentation
Vce vxrail-customer-presentation new
Understanding oracle rac internals part 1 - slides
VSAN – Architettura e Design
VMWARE ESX
MySQL with DRBD/Pacemaker/Corosync on Linux
VMware vSAN - Novosco, June 2017
VMware Cloud Foundation - PnP presentation 8_6_18 EN.pptx
VMware - Virtual SAN - IT Changes Everything
ZFS appliance
Percona xtrabackup - MySQL Meetup @ Mumbai
VMware vSphere Storage Appliance (VSA) - Technical Presentation,Almacenamien...
Hyper-Converged Infrastructure Vx Rail
Meetup 23 - 02 - OVN - The future of networking in OpenStack
Ad

Viewers also liked (20)

PPTX
VMware Performance Troubleshooting
PPTX
VMware Advance Troubleshooting Workshop - Day 6
PDF
VMware Performance for Gurus - A Tutorial
PPTX
Denver VMUG nov 2011
PPTX
Citrix Remote Access Solution Soup
PPTX
VMware Advance Troubleshooting Workshop - Day 3
PPTX
IdP, SAML, OAuth
PDF
Inside the Hadoop Machine @ VMworld
PDF
Is your cloud ready for Big Data? Strata NY 2013
PDF
Architecting Virtualized Infrastructure for Big Data
PDF
Apachecon Euro 2012: Elastic, Multi-tenant Hadoop on Demand
PDF
Hadoop on VMware
PPTX
Making of the Burner Board
PPTX
Hadoop on Virtual Machines
PDF
Big Data/Hadoop Infrastructure Considerations
PDF
Virtualization Primer for Java Developers
PDF
Solaris Internals Preso circa 2009
PDF
VMworld 2013: Storage DRS: Deep Dive and Best Practices to Suit Your Storage ...
PDF
Building Big Data Applications
PDF
Virtualizing Oracle Databases with VMware
VMware Performance Troubleshooting
VMware Advance Troubleshooting Workshop - Day 6
VMware Performance for Gurus - A Tutorial
Denver VMUG nov 2011
Citrix Remote Access Solution Soup
VMware Advance Troubleshooting Workshop - Day 3
IdP, SAML, OAuth
Inside the Hadoop Machine @ VMworld
Is your cloud ready for Big Data? Strata NY 2013
Architecting Virtualized Infrastructure for Big Data
Apachecon Euro 2012: Elastic, Multi-tenant Hadoop on Demand
Hadoop on VMware
Making of the Burner Board
Hadoop on Virtual Machines
Big Data/Hadoop Infrastructure Considerations
Virtualization Primer for Java Developers
Solaris Internals Preso circa 2009
VMworld 2013: Storage DRS: Deep Dive and Best Practices to Suit Your Storage ...
Building Big Data Applications
Virtualizing Oracle Databases with VMware
Ad

Similar to VMware vSphere Performance Troubleshooting (20)

PDF
vBrownBag 2015-07-08 - VCP6-DCV Foundations Section 7: Monitoring
PPT
ESX performance problems 10 steps
PDF
Advanced performance troubleshooting using esxtop
PDF
vSphere APIs for performance monitoring
PPTX
Taking VMware Performance Monitoring Beyond VCOPS
PPTX
VMWare monitoring - Discover and monitor your virtual environment
PPT
VMWare Performance Tuning by Virtera (Jan 2009)
PDF
VMworld 2013: vCenter Operations Management –Troubleshooting Best Practices
PPTX
V center operations standard presentation
PPTX
Master VMware Performance and Capacity Management
PDF
Maximum Vsphere Tips Howtos And Best Practices For Working With Vmware Vspher...
PDF
The have no fear guide to virtualizing databases
ODP
Nagios Conference 2014 - Troy Lea - Monitoring VMware Virtualization Using vMA
PPTX
PDF
vRealize Operations 6.4: Supercharge your SDDC Intelligent Operations
PPTX
Troubleshooting Storage Devices Using vRealize Operations (formerly vC Ops)
PDF
Vsphere esxi-vcenter-server-55-troubleshooting-guide
PDF
Presentation cloud infrastructure launch – what’s new
PDF
Presentation cloud infrastructure launch – what’s new
PPTX
5 Things to Ask Your Virtualization Administrator
vBrownBag 2015-07-08 - VCP6-DCV Foundations Section 7: Monitoring
ESX performance problems 10 steps
Advanced performance troubleshooting using esxtop
vSphere APIs for performance monitoring
Taking VMware Performance Monitoring Beyond VCOPS
VMWare monitoring - Discover and monitor your virtual environment
VMWare Performance Tuning by Virtera (Jan 2009)
VMworld 2013: vCenter Operations Management –Troubleshooting Best Practices
V center operations standard presentation
Master VMware Performance and Capacity Management
Maximum Vsphere Tips Howtos And Best Practices For Working With Vmware Vspher...
The have no fear guide to virtualizing databases
Nagios Conference 2014 - Troy Lea - Monitoring VMware Virtualization Using vMA
vRealize Operations 6.4: Supercharge your SDDC Intelligent Operations
Troubleshooting Storage Devices Using vRealize Operations (formerly vC Ops)
Vsphere esxi-vcenter-server-55-troubleshooting-guide
Presentation cloud infrastructure launch – what’s new
Presentation cloud infrastructure launch – what’s new
5 Things to Ask Your Virtualization Administrator

Recently uploaded (20)

PDF
Machine learning based COVID-19 study performance prediction
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
Big Data Technologies - Introduction.pptx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
cuic standard and advanced reporting.pdf
PDF
Electronic commerce courselecture one. Pdf
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
KodekX | Application Modernization Development
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Machine learning based COVID-19 study performance prediction
Network Security Unit 5.pdf for BCA BBA.
The Rise and Fall of 3GPP – Time for a Sabbatical?
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Chapter 3 Spatial Domain Image Processing.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Big Data Technologies - Introduction.pptx
Review of recent advances in non-invasive hemoglobin estimation
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Advanced methodologies resolving dimensionality complications for autism neur...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
cuic standard and advanced reporting.pdf
Electronic commerce courselecture one. Pdf
The AUB Centre for AI in Media Proposal.docx
KodekX | Application Modernization Development
Digital-Transformation-Roadmap-for-Companies.pptx
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...

VMware vSphere Performance Troubleshooting

  • 1. vSphere Performance Monitoring and TroubleshootingOverviewWhat?CPU, Memory, Disk, NetworkHow?Use available tools and a systematic methodologyWhy?Need to build confidence in virtualizing critical and high demand applications
  • 2. vSphere Performance Monitoring and TroubleshootingTop IssuesTop Issues:Storage "performance capacity" oversubscriptionMemory oversubscriptionSMP overuseFirmware & driver issues
  • 3. vSphere Performance Monitoring and TroubleshootingWhat tools do we have at our disposal?Top tools for information collection:vCenter - Performance charts and alarmsGuest OS* - Task Manager/Resource Monitor and PerfMonESX Host - esxtop and vscsiStatsvSpherePowerCLI*Guest based monitoring is subject to inaccuracy
  • 4. vSphere Performance Monitoring and TroubleshootingPrepare vCenter Settings
  • 5. vSphere Performance Monitoring and TroubleshootingPrepare vCenter Settings
  • 6. vSphere Performance Monitoring and TroubleshootingPrepare vCenter SettingsPrepare custom vCenter alerts:Host Console Swap In Rate  512KBps Warning, 1024 KBps AlertHost Console Swap Out Rate  512KBps Warning, 1024 KBps AlertVM CPU Ready  1000ms Warning, 2000ms AlertVM Disk Latency  20ms Warning, 50ms Alert
  • 7. vSphere Performance Monitoring and TroubleshootingPrepare vCenter Settings
  • 8. vSphere Performance Monitoring and TroubleshootingPrepare vCenter Settings
  • 9. vSphere Performance Monitoring and TroubleshootingPrepare esxtopESXTOP realtime monitoring:esxtop(run command from SSH or tech-support mode)s 2 (refresh view every 2 seconds)V (View VMs only)h(for quick in-tool command reference)Batch Mode for a 5 minute capture of all stats:esxtop-b -a -d 2 -n 150 > esxtop_capture.csv
  • 10. vSphere Performance Monitoring and TroubleshootingPrepare PowerCLIRun PowerCLI:Tip: Run as AdministratorSet-ExecutionPolicyremotesignedConnect-VIServer -Server <host> -Protocol https -User <user> -Password <pass> <host> can be IP address or name of ESX server or vCenterGet-VMGet-Stat -common -realtime
  • 11. vSphere Performance Monitoring and TroubleshootingWhere do we get started?
  • 12. vSphere Performance Monitoring and TroubleshootingNetwork Overview
  • 13. vSphere Performance Monitoring and TroubleshootingNetworkTroubleshooting Guidance:1. Physical Issues - A bad cable, a failing switch port or NIC, or an incompatible/flawed firmware or device driver (use VMXNET3 whenever possible)2. Configuration Issues - Inconsistent configuration of vSwitches, Port Groups, or upstream VLAN trunks3. Capacity Issues - Too many VMs on a single NIC; inadequate switch backplane or uplink capacity; sharing “unmanaged” network infrastructure for storage and data4. Thresholds – Bandwidth saturation, dropped packets
  • 14. vSphere Performance Monitoring and TroubleshootingNetwork – What can we see?
  • 15. vSphere Performance Monitoring and TroubleshootingNetworkvCenter Metrics: Receive packets droppedTransmit packets dropped
  • 16. vSphere Performance Monitoring and TroubleshootingNetworkESXTOP Metrics:
  • 17. vSphere Performance Monitoring and TroubleshootingNetworkESXTOP Commands:esxtops 2nf
  • 18. vSphere Performance Monitoring and TroubleshootingNetworkESXTOP Example:
  • 19. vSphere Performance Monitoring and TroubleshootingNetworkPowerCLI Commands:Get-Stat -net -realtimeGet-Stat -Entity <Host> -stat net.droppedRx.summationGet-Stat -Entity <Host> -stat net.droppedTx.summation
  • 20. vSphere Performance Monitoring and TroubleshootingNetwork – What can’t we see?
  • 21. vSphere Performance Monitoring and TroubleshootingNetworkPossible resources for external monitoring:Native Telnet/SSH/HTTP-based interface counters and statsThird-party SNMP, NetFlow and ICMP tools
  • 22. vSphere Performance Monitoring and TroubleshootingCPU Overview
  • 23. vSphere Performance Monitoring and TroubleshootingCPUTroubleshooting Guidance:1. Physical Issues - Rare and always catastrophic (e.g. obvious)2. Configuration Issues - Too many / too few vCPUs per VM; SMP/HAL mismatch; incorrect CPU affinity settings3. Capacity Issues - CPU saturation at the guest or host level; CPU starvation due to high IO or other system level ops4. Thresholds – Waiting for CPU cycles (due to co-scheduling, swapping, high IO)
  • 24. vSphere Performance Monitoring and TroubleshootingCPU – What can we see?
  • 25. vSphere Performance Monitoring and TroubleshootingCPUvCenter Metrics: Host/Guest SaturationStacked Graph (per VM)Usage
  • 26. vSphere Performance Monitoring and TroubleshootingCPUvCenter Metrics:GuestReady (value/20=n%)Swap Wait
  • 27. vSphere Performance Monitoring and TroubleshootingCPUESXTOP Metrics:
  • 28. vSphere Performance Monitoring and TroubleshootingCPUESXTOP Commands:esxtops 2Vce GID (expand/contract a VM world)
  • 29. vSphere Performance Monitoring and TroubleshootingCPUESXTOP Example:Excessive vCPUs
  • 30. vSphere Performance Monitoring and TroubleshootingCPUESXTOP Example: Now with fewer vCPUs
  • 31. vSphere Performance Monitoring and TroubleshootingCPUESXTOP Example:SMP impacting multiple VMs
  • 32. vSphere Performance Monitoring and TroubleshootingCPUPowerCLI ExampleGet-Stat -cpuGet-Stat -Entity <VM> -stat cpu.ready.summation -realtimeVery cool script code at:http://www.peetersonline.nl/index.php/vmware/examine-vmware-cpu-ready-times-with-powershell/
  • 33. vSphere Performance Monitoring and TroubleshootingCPU – Not much else to see…
  • 34. vSphere Performance Monitoring and TroubleshootingCPUPossible resources for external monitoring:Vendor specific systems management tools,MS System Center, etc.http://www.peetersonline.nl/index.php/vmware/examine-vmware-cpu-ready-times-with-powershell/
  • 35. vSphere Performance Monitoring and TroubleshootingMemory Overview
  • 36. vSphere Performance Monitoring and TroubleshootingMemoryTroubleshooting Guidance:1. Physical Issues - Rare and usually catastrophic2. Configuration Issues - Memory overcommit; incorrect configuration of shares, reservations or limits3. Capacity Issues - Physical memory exhaustion4. Thresholds – Active memory swapping
  • 37. vSphere Performance Monitoring and TroubleshootingMemory – What can we see?
  • 38. vSphere Performance Monitoring and TroubleshootingMemoryvCenter MetricsSwap in rateSwap out rateSwap used
  • 39. vSphere Performance Monitoring and TroubleshootingMemoryESXTOP Metrics:
  • 40. vSphere Performance Monitoring and TroubleshootingMemoryESXTOP Commands:esxtops 2Vmf
  • 41. vSphere Performance Monitoring and TroubleshootingMemoryESXTOP Example:m – Heavy swapping and ballooning
  • 42. vSphere Performance Monitoring and TroubleshootingMemoryPowerCLI Commands:Get-Stat -memGet-Stat -Entity <VM> -stat mem.swapoutRate.average -realtimeGet-Stat -Entity <VM> -stat mem.swapinRate.average -realtimeGet-Stat -Entity <VM> -stat mem.vmmemctl.average -realtimeGet-Stat -Entity <Host> -stat mem.swapused.average -realtime
  • 43. vSphere Performance Monitoring and TroubleshootingMemory – The occasional DIMM failure…
  • 44. vSphere Performance Monitoring and TroubleshootingMemoryPossible external monitoring options:Vendor specific systems management tools, MS System Center, etc.Don’t forget vCenter ‘Hardware Status’ reporting
  • 45. vSphere Performance Monitoring and TroubleshootingStorage Overview
  • 46. vSphere Performance Monitoring and TroubleshootingStorageTroubleshooting Guidance:1. Physical Issues - A bad cable, a failing switch port or HBA/NIC, or an incompatible/flawed firmware or device driver (use LSI Logic Parallel/SAS as appropriate)2. Configuration Issues - Inconsistent or incorrect configuration of LUN masking, zoning, or multi-pathing; inappropriate resource provisioning; aligning queue depth with storage type3. Capacity Issues - Too many VMs or VMDKs on a LUN; too much IO load for an array or RAID group4. Thresholds – Latency and queuing
  • 47. vSphere Performance Monitoring and TroubleshootingStorage – What can we see?
  • 48. vSphere Performance Monitoring and TroubleshootingStoragevCenter Metrics:DatastoreRead latencyWrite latency
  • 49. vSphere Performance Monitoring and TroubleshootingStorageESXTOP Metrics:
  • 50. vSphere Performance Monitoring and TroubleshootingStorage
  • 51. vSphere Performance Monitoring and TroubleshootingStorageESXTOP Commands (HBA/LUN):esxtops 2Vdfe vmhba#
  • 52. vSphere Performance Monitoring and TroubleshootingStorageESXTOP Commands(LUN/Datastore):esxtops 2VuL 38fe <devname>
  • 53. vSphere Performance Monitoring and TroubleshootingStorageESXTOP Commands (VM/VMDK):esxtops 2Vvfe GID
  • 54. vSphere Performance Monitoring and TroubleshootingStorageESXTOP Examples: d - Multipathing / Expand adapter to view targets
  • 55. vSphere Performance Monitoring and TroubleshootingStorageESXTOP Examples: u - Queuing, Disk or Kernel?
  • 56. vSphere Performance Monitoring and TroubleshootingStorageESXTOP Examples:v - Identify the IO consumer
  • 57. vSphere Performance Monitoring and TroubleshootingStoragevscsiStatsCommand:[root@host ~]# cd /usr/lib/vmware/bin./vscsiStats -l./vscsiStats -s -w <worldid>./vscsiStats -w <worldid> -p all -c > /path/vscsistats.csv./vscsiStats -x
  • 58. vSphere Performance Monitoring and TroubleshootingStoragevscsiStatsExample:
  • 59. vSphere Performance Monitoring and TroubleshootingStoragevscsiStatsExample:
  • 60. vSphere Performance Monitoring and TroubleshootingStoragevscsiStatsExample:http://dunnsept.wordpress.com/2010/03/11/new-vscsistats-excel-macro/
  • 61. vSphere Performance Monitoring and TroubleshootingStoragevscsiStatshistograms:
  • 62. vSphere Performance Monitoring and TroubleshootingStoragePowerCLI Commands:Get-Stat -diskGet-Stat -stat disk.totalLatency.average -realtimeGet-Stat -stat disk.deviceLatency.average -realtimeGet-Stat -stat disk.kernelLatency.average -realtime
  • 63. vSphere Performance Monitoring and TroubleshootingStorage – What can’t we see?
  • 64. vSphere Performance Monitoring and TroubleshootingStorage – More of what we can’t see
  • 65. vSphere Performance Monitoring and TroubleshootingStoragePossible external monitoring solutions:Vendor specific SAN and fabric/network tools, native Telnet/SSH/HTTP-based tools for most networks, third-party SNMP-based tools
  • 66. vSphere Performance Monitoring and TroubleshootingWorking with PowerCLIPowerCLI Tips:For a complete list of stat objects:Get-StatType -Entity <Host/VM>Pipe the outputs to a file:Get-Stat -stat <stat> -realtime | ft -autosize > c:\temp\<filename>.csvImport the CSV file data to a spreadsheet with fixed width parametersBuild pretty graphs
  • 67. vSphere Performance Monitoring and TroubleshootingWorking with PowerCLI
  • 68. vSphere Performance Monitoring and TroubleshootingWay More InformationESXTOP / vscsiStats / PowerCLI:http://www.yellow-bricks.com/esxtop/ Special thanks to Duncan Epping!http://communities.vmware.com/docs/DOC-3930http://communities.vmware.com/docs/DOC-9279http://communities.vmware.com/docs/DOC-10095http://www.vmware.com/support/developer/PowerCLI/PowerCLI41/html/Get-Stat.htmlhttp://www.lucd.info/2009/12/30/powercli-vsphere-statistics-part-1-the-basics/http://simongreaves.co.uk/blog/esxtop-guidehttp://dunnsept.wordpress.com/2010/03/11/new-vscsistats-excel-macro/
  • 69. vSphere Performance Monitoring and TroubleshootingEasy button?What is the problem with these tools?Limited alerting mechanisms, no collection automation or historical data for comparison, and no correlation of events!vCenter Operations Standard / Enterprise

Editor's Notes

  • #2: Who uses Resource Pools? How many have reservations or limits?
  • #3: Use a Host CPU stacked (per VM) graph to quickly identify leading consumers
  • #7: Don’t necessary need CPU saturation for overcommit to have an effect on performance
  • #8: Don’t necessary need CPU saturation for overcommit to have an effect on performance
  • #9: Don’t necessary need CPU saturation for overcommit to have an effect on performance