SlideShare a Scribd company logo
Storage IO Control: Concepts, Configuration and Best
Practices to Tame Different Storage Architectures
Sachin Manpathak, VMware
Mustafa Uysal, VMware
Sunil Muralidhar, VMware
VSVC5364
#VSVC5364
2
Disclaimer
 This session may contain product features that are
currently under development.
 This session/overview of the new technology represents
no commitment from VMware to deliver these features in
any generally available product.
 Features are subject to change, and must not be included in
contracts, purchase orders, or sales agreements of any kind.
 Technical feasibility and market demand will affect final delivery.
 Pricing and packaging for any new technologies or features
discussed or presented have not been determined.
3
VMware Vision: Software Defined Storage
Software Defined Storage
Software-Defined Storage Vision
Enable new storage tiers
Enable DAS & server flash for shared
storage along with enterprise SAN/NAS
Enable tight integration with storage
ecosystem
Tighter integrations with broad storage
ecosystem through APIs
Deliver policy-based automated storage
management
Automatically enforce per-VM SLAs for all
apps across different types of storage
“Gold”
Array(s)
“Silver”
Array(s)
Distributed
Storage
Hard
disks
SSD Hard
disks
SSD
Availability = 99.99%
DR RTO = 1
“Gold” SLA
Availability = 99%
Throughput = 1000 R/s, 20 W/s
Latency = 95% under 5 ms
DR RPO = 1’, RTO = 10’
Back up = hourly
Capacity res = 100%
Web Server
Database Server
Availability =
99.99%
DR RTO = 1 hour
Max Laten
“Bronze” SLA
Availability = 99%
Throughput = 100 R/s,10 W/s
Latency = 90% under 10 ms
DR RPO = 60’, RTO = 360’
Back up = weekly
Security = encryption
ReduceStorageCostandComplexity
App Server
Roadmap
4
Software-Defined Storage: Summary Roadmap
vSphere storage
features
Storage IO Control,
Storage vMotion,
Storage DRS,
Profile Driven Storage
Enable New
Storage Tiers
Policy-based storage
management
Virtual Volumes
VM-aware data
management with
enterprise storage
arrays
Tight integration with
storage systems
Policy-based storage
management
For local storage
vSphere Storage
Appliance
Low cost, simple shared
storage for small
deployments
Virtual SAN
Policy-driven storage for
cloud-scale deployments
Virtual Flash
Virtual SAN
Data services
Virtual Flash
Write-back caching
Policy-based storage
management
For external storage
H2 2013 / H1 2014 RoadmapToday
Roadmap
5
Outline
 Storage IO Control (SIOC) Overview
 Deployment Scenarios
 Improvements in vSphere 5.1 and 5.5
 Preview from SIOC Labs
Survey: http://bit.ly/siocsdrs
6
The Problem
What you see
Database
Server Farms
Online store:
Product Catalog
Online Store:
Data Mining
(low priority)
Shared
Datastore
Online Store:
Order Processing
What you want to see
Shared
Datastore
Online store:
Product Catalog
Online Store:
Data Mining
(low priority)
Online Store:
Order Processing
7
Solution: Storage IO Control
 Detect Congestion
• SIOC monitors average IO latency for a datastore
• Latency above a threshold indicates congestion
 SIOC throttles IOs once congestion is detected
• Control IOs issued per host
• Based on VMs and their shares on each host
• Throttling adjusted dynamically based on workload
• Idleness
• Bursty behavior
8
Congestion Threshold
 Performance suffers if datastore
is overloaded
 Congestion threshold value (ms):
• Higher is better for overall throughput
• Lower is better for stronger isolation
 SIOC default setting: 90% of peak
IOPs capacity
 Changing default threshold:
 Percentage or absolute value
Throughput(IOPS)
Datastore Load
No benefit
beyond certain
load
Latency
Datastore Load
9
Distributed Storage Access
10
10
10
50
20
30
50
1005030
Shares
vol1 vol1 vol1
 VMs running on multiple hosts
 Shared storage: SAN/NFS
 VMs interfere with each other
 No centralized control
 VM shares control amount of
IO throttling
10
Control IOs Issued per Host (Based on Shares)
With SIOC: All VMs get equal queue slots
Without SIOC: VM C gets equal queue slots as VMs A+ B
VM Disk
Shares
A 1000
B 1000
C 1000
11
What Do I/O Shares Mean?
 Two main units exist in industry
• Bandwidth (MB/s)
• Throughput (IOPS)
 Both have problems
• Using bandwidth may hurt workloads with large IO sizes
• Using IOPS may hurt VMs with sequential IOs
 SIOC: carves out storage array queue among VMs
• VMs reuse queue slots faster or slower (depending on array latency)
• Sequential streams get higher IOPS even if shares identical
• Workloads with high read cache hit rates
• This is a good thing!
• Maintains high overall throughput
12
Configuring Storage IO Control
2 simple steps:
1. Enable Storage I/O Control on a datastore
2. Set virtual disk controls for VMs
13
Enabling Storage IO Control
14
Storage IO Control Configuration
15
Setting Virtual Disk Shares
16
Storage IO Control In Action
 New Datastore performance metrics
• Storage IO Control Normalized Latency
• Storage IO Control Aggregate IOPs
 Latency is normalized by I/O size
 Averaged across all ESX hosts
 SIOC invoked every 4 seconds
• Latency computation
• I/O throttling
40ms
30ms
20ms
17
Outline
 Storage IO Control (SIOC) Overview
 Deployment Scenarios
 Improvements in vSphere 5.1 and 5.5
 Preview from SIOC Labs
18
Deployment: Shared Storage Pools
 Enable SIOC on all datastores
 Use same congestion threshold
 SIOC will adjust queue depth for
all datastores based on demand
SIOC SIOC
BA
Shared Storage Pool
IO Queue
19
Deployment: Auto-tiered LUN
 Set lower congestion threshold
• Based on LUN configuration
• Based on application needs
• More SSDs -> lower value
 SIOC will adjust queue depth
and do prioritized scheduling
Capacity Tier
Fast Tier
Medium Tier
One IO queue
SIOCSIOCSIOC
20
VMs with Multiple VMDKs
 VM IO allocation on a datastore
• Sum of shares of all VMDKs
 A low priority VM with many
VMDKs may get higher priority
• Unusued shares flow across VMDKs
 VMDKs split across datastores
• No flow of unused shares
 Consider IO sum of shares per
datastore while provisioning
VMs.
800300 200200
500 200 800Allocations
21
Best Practices
 Avoid mixing vSphere LUNs and non-vSphere LUNs on the same
physical storage
• SIOC will detect this and raise an alarm
 Configure host IO queue size with highest allowed value
• Maximum flexibility for SIOC throttling
 Keep congestion threshold conservative
• Will improve overall utilization
• Set lower if latency is more important than throughput
22
VM Snapshots and Storage vMotion IOs
 VM snapshot and Storage vMotion IO charged to VM
 SIOC throttles all IOs from a VM
• IOs from Storage vMotion activity does not affect important VMs
• Storage array is not overwhelmed with IO activity burst
 SIOC’s distributed IO allocation consistent with ESXi host scheduler
• ESXi host scheduler does not differentiate Storage vMotion IOs
23
NFS Only: Shared File Permissions
 SIOC uses shared files for its distributed computation.
• Needed to compute entitled host queue size across hosts
 Likely causes
• Improper implementation of NFS storage in vSphere: no root squash
 Best practices
• Always use recommended security setting on NFS datastores
24
Outline
 Storage IO Control (SIOC) Overview
 Deployment Scenarios
 Improvements in vSphere 5.1 and 5.5
 Preview from SIOC Labs
25
Improvements in 5.1 and 5.5 releases
 Automatic congestion threshold
• Can use % of peak capacity to determine congestion threshold
 Lesser disk IO
• Reduction in SIOC IOs when LUN is idle
 Improved stats reporting
• SIOC based storage statistics available by default in vSphere 5.5
 Full interop with storage workflows and conditions in vSphere 5.5
• Unmount, Destroy, APD (all paths down) and PDL (permanent data loss)
• Fixed in 5.1: “Unable to delete datastore with SIOC enabled”
26
Using SIOC with Virtual Flash (vFlash)
 SIOC and vFlash are
complementary
 SIOC does not throttle SSD
reads/writes
 SIOC proportionally allocates
post-cache IOs
• Latency controls during warm-up
 Best Practice: Allocate shares
to VMs consistent with vFlash
allocation
vFlash Infrastructure
Cache software Cache software
I/O Queue
Storage
27
Outline
 Storage IO Control (SIOC) Overview
 Deployment Scenarios
 Improvements in vSphere 5.1 and 5.5
 Preview from SIOC Labs
28
IO Reservations
 IO reservation control
• In addition to shares and Limits
• Specified per VMDK in IOPs
 SIOC distributes capacity using
shares, limits and reservations
 Storage DRS considers IO
reservation during initial
placement and load balancing SIOC SIOC
R=100,200 IOPs R=150 R=250
Estimated
Peak: 5430
IOPs
29
Resource Controls
 Fine-grain resource controls
• Per VM latency along with R,L,S
• Latency managed by Storage DRS/SIOC
• Enforced by smart arrays (vVols/vSAN)
 IO Resource pools for VMs / VMDKs
• Reservation, Limit, Shares control for a group of VMs or VMDKs
• No need to set per VM controls
30
Summary
 Easy to use – just two steps
• Enable Storage IO Control on a datastore
• Set IO shares and limit values for virtual disks
 Performance isolation among VMs using IO shares
 Automatic detection of I/O congestion
 Protect critical applications during I/O congestion
THANK YOU
http://bit.ly/siocsdrs
VMworld 2013: Storage IO Control: Concepts, Configuration and Best Practices to Tame Different Storage Architectures
Storage IO Control: Concepts, Configuration and Best
Practices to Tame Different Storage Architectures
Sachin Manpathak, VMware
VSVC5364
#VSVC5364
34
Thanks!
Sachin Manpathak (smanpathak@vmware.com)
Mustafa Uysal (muysal@vmware.com)
Sunil Muralidhar (muralidhars@vmware.com)
http://bit.ly/siocsdrs

More Related Content

PPTX
Virtual SAN 6.2, hyper-converged infrastructure software
PPTX
A day in the life of a VSAN I/O - STO7875
PDF
VMworld Europe 2014: Top 10 Do’s / Don’ts of Data Protection For VMware vSphere
PPTX
What is coming for VMware vSphere?
PDF
VMworld Europe 2014: Virtual SAN Architecture Deep Dive
PDF
VMware Vsan vtug 2014
PDF
VMware - Virtual SAN - IT Changes Everything
PPTX
Five common customer use cases for Virtual SAN - VMworld US / 2015
Virtual SAN 6.2, hyper-converged infrastructure software
A day in the life of a VSAN I/O - STO7875
VMworld Europe 2014: Top 10 Do’s / Don’ts of Data Protection For VMware vSphere
What is coming for VMware vSphere?
VMworld Europe 2014: Virtual SAN Architecture Deep Dive
VMware Vsan vtug 2014
VMware - Virtual SAN - IT Changes Everything
Five common customer use cases for Virtual SAN - VMworld US / 2015

What's hot (20)

PPTX
Track 1 Virtualizing Critical Applications with VMWARE VISPHERE by Roshan Shetty
PDF
VMworld Europe 2014: What’s New in End User Computing: Full Desktop Automatio...
PDF
Presentazione VMware @ VMUGIT UserCon 2015
PPTX
Inf bco2891 release candidate v11 copy
PPTX
Presentation v mware virtual san 6.0
PPTX
STO7535 Virtual SAN Proof of Concept - VMworld 2016
PDF
VMworld 2014: Virtual SAN Architecture Deep Dive
PPTX
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...
PDF
VMware Virtual SAN Presentation
PPTX
Storage Changes in VMware vSphere 4.1
PDF
VSAN-VMWorld2015-Rev08
PPTX
VMware virtual SAN 6 overview
PPTX
STO7534 VSAN Day 2 Operations (VMworld 2016)
PDF
VSAN – Architettura e Design
PDF
VMware Backups That Work—Lessons Learned From VADP Performance Benchmark Testing
PPTX
2017 VMUG Storage Policy Based Management
PDF
VMworld 2014: vSphere HA Best Practices and FT Tech Preview
PDF
VMware: Enabling Software-Defined Storage Using Virtual SAN (Technical Decisi...
PDF
VMworld 2013: vSphere Data Protection (VDP) Technical Deep Dive and Troublesh...
PPT
VMware Virtual SAN slideshow
Track 1 Virtualizing Critical Applications with VMWARE VISPHERE by Roshan Shetty
VMworld Europe 2014: What’s New in End User Computing: Full Desktop Automatio...
Presentazione VMware @ VMUGIT UserCon 2015
Inf bco2891 release candidate v11 copy
Presentation v mware virtual san 6.0
STO7535 Virtual SAN Proof of Concept - VMworld 2016
VMworld 2014: Virtual SAN Architecture Deep Dive
VMworld 2015: The Future of Software- Defined Storage- What Does it Look Like...
VMware Virtual SAN Presentation
Storage Changes in VMware vSphere 4.1
VSAN-VMWorld2015-Rev08
VMware virtual SAN 6 overview
STO7534 VSAN Day 2 Operations (VMworld 2016)
VSAN – Architettura e Design
VMware Backups That Work—Lessons Learned From VADP Performance Benchmark Testing
2017 VMUG Storage Policy Based Management
VMworld 2014: vSphere HA Best Practices and FT Tech Preview
VMware: Enabling Software-Defined Storage Using Virtual SAN (Technical Decisi...
VMworld 2013: vSphere Data Protection (VDP) Technical Deep Dive and Troublesh...
VMware Virtual SAN slideshow
Ad

Viewers also liked (20)

PPT
MS Paint Collection: 04 - Astrology Series
PDF
An Efficient Backup and Replication of Storage
PDF
Sustaining Your Career
ODP
Parashar21 : What ? Why ? and How ?
PDF
Where We're Headed and Where NSX Fits In
PPTX
Indian Astrology | Astrology Services
PDF
Getting Involved in VMUG
PDF
The Future of Cloud Networking is VMware NSX (Danish VMUG edition)
PDF
VMworld Europe 2014: Storage DRS - Deep Dive and Best Practices
PDF
An Introduction to Vagrant and Docker
PPT
Astrology basic
PDF
Solving io bottleneck
PPTX
Next-Generation Best Practices for VMware and Storage
PPTX
Simple math fusion-io_v4
PDF
The Vision for the Future of Network Virtualization with VMware NSX
PDF
The Full Stack Journey (DFW)
PDF
The Vision for the Future of Network Virtualization with VMware NSX (Q2 2016)
PDF
Areas of KP research 2015
PDF
An Overview of Linux Networking Options
PPTX
Kp interpretation of 6th house
MS Paint Collection: 04 - Astrology Series
An Efficient Backup and Replication of Storage
Sustaining Your Career
Parashar21 : What ? Why ? and How ?
Where We're Headed and Where NSX Fits In
Indian Astrology | Astrology Services
Getting Involved in VMUG
The Future of Cloud Networking is VMware NSX (Danish VMUG edition)
VMworld Europe 2014: Storage DRS - Deep Dive and Best Practices
An Introduction to Vagrant and Docker
Astrology basic
Solving io bottleneck
Next-Generation Best Practices for VMware and Storage
Simple math fusion-io_v4
The Vision for the Future of Network Virtualization with VMware NSX
The Full Stack Journey (DFW)
The Vision for the Future of Network Virtualization with VMware NSX (Q2 2016)
Areas of KP research 2015
An Overview of Linux Networking Options
Kp interpretation of 6th house
Ad

Similar to VMworld 2013: Storage IO Control: Concepts, Configuration and Best Practices to Tame Different Storage Architectures (20)

PPTX
VMware Technology: Deliver Predictable Application Performance & Improve Infr...
PDF
What's New in VMware vSphere 5.0 - Storage
PPT
#VMUGMTL - Xsigo Breakout
PDF
The Unofficial VCAP / VCP VMware Study Guide
PDF
Na vsc install
PPTX
VMworld 2015: Explaining Advanced Virtual Volumes Configurations
PDF
V mware 云的构成及v sphere 虚拟网络及存储的现状及未来展望
PDF
VMworld Europe 2014: Advanced SQL Server on vSphere Techniques and Best Pract...
PDF
VMworld 2013: What's New in VMware vSphere?
PDF
Partner Presentation vSphere6-VSAN-vCloud-vRealize
PDF
Presentation v mworld 2011
PDF
Vsphere esxi-vcenter-server-50-storage-guide
PDF
Poc guide vsan
PPTX
VMware vSAN - Novosco, June 2017
PDF
VMware: Enabling Software-Defined Storage Using Virtual SAN (Business Decisio...
PDF
VMworld 2013: vSphere Flash Read Cache Technical Overview
PPTX
Storage for Virtual Environments 2011 R2
PPTX
Troubleshooting Storage Devices Using vRealize Operations (formerly vC Ops)
PDF
VMworld 2014: Extreme Performance Series
VMware Technology: Deliver Predictable Application Performance & Improve Infr...
What's New in VMware vSphere 5.0 - Storage
#VMUGMTL - Xsigo Breakout
The Unofficial VCAP / VCP VMware Study Guide
Na vsc install
VMworld 2015: Explaining Advanced Virtual Volumes Configurations
V mware 云的构成及v sphere 虚拟网络及存储的现状及未来展望
VMworld Europe 2014: Advanced SQL Server on vSphere Techniques and Best Pract...
VMworld 2013: What's New in VMware vSphere?
Partner Presentation vSphere6-VSAN-vCloud-vRealize
Presentation v mworld 2011
Vsphere esxi-vcenter-server-50-storage-guide
Poc guide vsan
VMware vSAN - Novosco, June 2017
VMware: Enabling Software-Defined Storage Using Virtual SAN (Business Decisio...
VMworld 2013: vSphere Flash Read Cache Technical Overview
Storage for Virtual Environments 2011 R2
Troubleshooting Storage Devices Using vRealize Operations (formerly vC Ops)
VMworld 2014: Extreme Performance Series

More from VMworld (20)

PPTX
VMworld 2016: vSphere 6.x Host Resource Deep Dive
PPTX
VMworld 2016: Troubleshooting 101 for Horizon
PPTX
VMworld 2016: Advanced Network Services with NSX
PPTX
VMworld 2016: How to Deploy VMware NSX with Cisco Infrastructure
PPTX
VMworld 2016: Enforcing a vSphere Cluster Design with PowerCLI Automation
PPTX
VMworld 2016: What's New with Horizon 7
PPTX
VMworld 2016: Virtual Volumes Technical Deep Dive
PPTX
VMworld 2016: Advances in Remote Display Protocol Technology with VMware Blas...
PPTX
VMworld 2016: The KISS of vRealize Operations!
PPTX
VMworld 2016: Getting Started with PowerShell and PowerCLI for Your VMware En...
PPTX
VMworld 2016: Ask the vCenter Server Exerts Panel
PPTX
VMworld 2016: Virtualize Active Directory, the Right Way!
PPTX
VMworld 2016: Migrating from a hardware based firewall to NSX to improve perf...
PPTX
VMworld 2015: Troubleshooting for vSphere 6
PPTX
VMworld 2015: Monitoring and Managing Applications with vRealize Operations 6...
PPTX
VMworld 2015: Advanced SQL Server on vSphere
PPTX
VMworld 2015: Virtualize Active Directory, the Right Way!
PPTX
VMworld 2015: Site Recovery Manager and Policy Based DR Deep Dive with Engine...
PPTX
VMworld 2015: Building a Business Case for Virtual SAN
PPTX
VMworld 2015: Virtual Volumes Technical Deep Dive
VMworld 2016: vSphere 6.x Host Resource Deep Dive
VMworld 2016: Troubleshooting 101 for Horizon
VMworld 2016: Advanced Network Services with NSX
VMworld 2016: How to Deploy VMware NSX with Cisco Infrastructure
VMworld 2016: Enforcing a vSphere Cluster Design with PowerCLI Automation
VMworld 2016: What's New with Horizon 7
VMworld 2016: Virtual Volumes Technical Deep Dive
VMworld 2016: Advances in Remote Display Protocol Technology with VMware Blas...
VMworld 2016: The KISS of vRealize Operations!
VMworld 2016: Getting Started with PowerShell and PowerCLI for Your VMware En...
VMworld 2016: Ask the vCenter Server Exerts Panel
VMworld 2016: Virtualize Active Directory, the Right Way!
VMworld 2016: Migrating from a hardware based firewall to NSX to improve perf...
VMworld 2015: Troubleshooting for vSphere 6
VMworld 2015: Monitoring and Managing Applications with vRealize Operations 6...
VMworld 2015: Advanced SQL Server on vSphere
VMworld 2015: Virtualize Active Directory, the Right Way!
VMworld 2015: Site Recovery Manager and Policy Based DR Deep Dive with Engine...
VMworld 2015: Building a Business Case for Virtual SAN
VMworld 2015: Virtual Volumes Technical Deep Dive

Recently uploaded (20)

PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
Big Data Technologies - Introduction.pptx
PPTX
Cloud computing and distributed systems.
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
Empathic Computing: Creating Shared Understanding
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Encapsulation theory and applications.pdf
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
Electronic commerce courselecture one. Pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
DOCX
The AUB Centre for AI in Media Proposal.docx
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Advanced methodologies resolving dimensionality complications for autism neur...
The Rise and Fall of 3GPP – Time for a Sabbatical?
Big Data Technologies - Introduction.pptx
Cloud computing and distributed systems.
Reach Out and Touch Someone: Haptics and Empathic Computing
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Diabetes mellitus diagnosis method based random forest with bat algorithm
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
Empathic Computing: Creating Shared Understanding
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Encapsulation theory and applications.pdf
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
Electronic commerce courselecture one. Pdf
MIND Revenue Release Quarter 2 2025 Press Release
Review of recent advances in non-invasive hemoglobin estimation
Building Integrated photovoltaic BIPV_UPV.pdf
The AUB Centre for AI in Media Proposal.docx
“AI and Expert System Decision Support & Business Intelligence Systems”
Agricultural_Statistics_at_a_Glance_2022_0.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx

VMworld 2013: Storage IO Control: Concepts, Configuration and Best Practices to Tame Different Storage Architectures

  • 1. Storage IO Control: Concepts, Configuration and Best Practices to Tame Different Storage Architectures Sachin Manpathak, VMware Mustafa Uysal, VMware Sunil Muralidhar, VMware VSVC5364 #VSVC5364
  • 2. 2 Disclaimer  This session may contain product features that are currently under development.  This session/overview of the new technology represents no commitment from VMware to deliver these features in any generally available product.  Features are subject to change, and must not be included in contracts, purchase orders, or sales agreements of any kind.  Technical feasibility and market demand will affect final delivery.  Pricing and packaging for any new technologies or features discussed or presented have not been determined.
  • 3. 3 VMware Vision: Software Defined Storage Software Defined Storage Software-Defined Storage Vision Enable new storage tiers Enable DAS & server flash for shared storage along with enterprise SAN/NAS Enable tight integration with storage ecosystem Tighter integrations with broad storage ecosystem through APIs Deliver policy-based automated storage management Automatically enforce per-VM SLAs for all apps across different types of storage “Gold” Array(s) “Silver” Array(s) Distributed Storage Hard disks SSD Hard disks SSD Availability = 99.99% DR RTO = 1 “Gold” SLA Availability = 99% Throughput = 1000 R/s, 20 W/s Latency = 95% under 5 ms DR RPO = 1’, RTO = 10’ Back up = hourly Capacity res = 100% Web Server Database Server Availability = 99.99% DR RTO = 1 hour Max Laten “Bronze” SLA Availability = 99% Throughput = 100 R/s,10 W/s Latency = 90% under 10 ms DR RPO = 60’, RTO = 360’ Back up = weekly Security = encryption ReduceStorageCostandComplexity App Server Roadmap
  • 4. 4 Software-Defined Storage: Summary Roadmap vSphere storage features Storage IO Control, Storage vMotion, Storage DRS, Profile Driven Storage Enable New Storage Tiers Policy-based storage management Virtual Volumes VM-aware data management with enterprise storage arrays Tight integration with storage systems Policy-based storage management For local storage vSphere Storage Appliance Low cost, simple shared storage for small deployments Virtual SAN Policy-driven storage for cloud-scale deployments Virtual Flash Virtual SAN Data services Virtual Flash Write-back caching Policy-based storage management For external storage H2 2013 / H1 2014 RoadmapToday Roadmap
  • 5. 5 Outline  Storage IO Control (SIOC) Overview  Deployment Scenarios  Improvements in vSphere 5.1 and 5.5  Preview from SIOC Labs Survey: http://bit.ly/siocsdrs
  • 6. 6 The Problem What you see Database Server Farms Online store: Product Catalog Online Store: Data Mining (low priority) Shared Datastore Online Store: Order Processing What you want to see Shared Datastore Online store: Product Catalog Online Store: Data Mining (low priority) Online Store: Order Processing
  • 7. 7 Solution: Storage IO Control  Detect Congestion • SIOC monitors average IO latency for a datastore • Latency above a threshold indicates congestion  SIOC throttles IOs once congestion is detected • Control IOs issued per host • Based on VMs and their shares on each host • Throttling adjusted dynamically based on workload • Idleness • Bursty behavior
  • 8. 8 Congestion Threshold  Performance suffers if datastore is overloaded  Congestion threshold value (ms): • Higher is better for overall throughput • Lower is better for stronger isolation  SIOC default setting: 90% of peak IOPs capacity  Changing default threshold:  Percentage or absolute value Throughput(IOPS) Datastore Load No benefit beyond certain load Latency Datastore Load
  • 9. 9 Distributed Storage Access 10 10 10 50 20 30 50 1005030 Shares vol1 vol1 vol1  VMs running on multiple hosts  Shared storage: SAN/NFS  VMs interfere with each other  No centralized control  VM shares control amount of IO throttling
  • 10. 10 Control IOs Issued per Host (Based on Shares) With SIOC: All VMs get equal queue slots Without SIOC: VM C gets equal queue slots as VMs A+ B VM Disk Shares A 1000 B 1000 C 1000
  • 11. 11 What Do I/O Shares Mean?  Two main units exist in industry • Bandwidth (MB/s) • Throughput (IOPS)  Both have problems • Using bandwidth may hurt workloads with large IO sizes • Using IOPS may hurt VMs with sequential IOs  SIOC: carves out storage array queue among VMs • VMs reuse queue slots faster or slower (depending on array latency) • Sequential streams get higher IOPS even if shares identical • Workloads with high read cache hit rates • This is a good thing! • Maintains high overall throughput
  • 12. 12 Configuring Storage IO Control 2 simple steps: 1. Enable Storage I/O Control on a datastore 2. Set virtual disk controls for VMs
  • 14. 14 Storage IO Control Configuration
  • 16. 16 Storage IO Control In Action  New Datastore performance metrics • Storage IO Control Normalized Latency • Storage IO Control Aggregate IOPs  Latency is normalized by I/O size  Averaged across all ESX hosts  SIOC invoked every 4 seconds • Latency computation • I/O throttling 40ms 30ms 20ms
  • 17. 17 Outline  Storage IO Control (SIOC) Overview  Deployment Scenarios  Improvements in vSphere 5.1 and 5.5  Preview from SIOC Labs
  • 18. 18 Deployment: Shared Storage Pools  Enable SIOC on all datastores  Use same congestion threshold  SIOC will adjust queue depth for all datastores based on demand SIOC SIOC BA Shared Storage Pool IO Queue
  • 19. 19 Deployment: Auto-tiered LUN  Set lower congestion threshold • Based on LUN configuration • Based on application needs • More SSDs -> lower value  SIOC will adjust queue depth and do prioritized scheduling Capacity Tier Fast Tier Medium Tier One IO queue SIOCSIOCSIOC
  • 20. 20 VMs with Multiple VMDKs  VM IO allocation on a datastore • Sum of shares of all VMDKs  A low priority VM with many VMDKs may get higher priority • Unusued shares flow across VMDKs  VMDKs split across datastores • No flow of unused shares  Consider IO sum of shares per datastore while provisioning VMs. 800300 200200 500 200 800Allocations
  • 21. 21 Best Practices  Avoid mixing vSphere LUNs and non-vSphere LUNs on the same physical storage • SIOC will detect this and raise an alarm  Configure host IO queue size with highest allowed value • Maximum flexibility for SIOC throttling  Keep congestion threshold conservative • Will improve overall utilization • Set lower if latency is more important than throughput
  • 22. 22 VM Snapshots and Storage vMotion IOs  VM snapshot and Storage vMotion IO charged to VM  SIOC throttles all IOs from a VM • IOs from Storage vMotion activity does not affect important VMs • Storage array is not overwhelmed with IO activity burst  SIOC’s distributed IO allocation consistent with ESXi host scheduler • ESXi host scheduler does not differentiate Storage vMotion IOs
  • 23. 23 NFS Only: Shared File Permissions  SIOC uses shared files for its distributed computation. • Needed to compute entitled host queue size across hosts  Likely causes • Improper implementation of NFS storage in vSphere: no root squash  Best practices • Always use recommended security setting on NFS datastores
  • 24. 24 Outline  Storage IO Control (SIOC) Overview  Deployment Scenarios  Improvements in vSphere 5.1 and 5.5  Preview from SIOC Labs
  • 25. 25 Improvements in 5.1 and 5.5 releases  Automatic congestion threshold • Can use % of peak capacity to determine congestion threshold  Lesser disk IO • Reduction in SIOC IOs when LUN is idle  Improved stats reporting • SIOC based storage statistics available by default in vSphere 5.5  Full interop with storage workflows and conditions in vSphere 5.5 • Unmount, Destroy, APD (all paths down) and PDL (permanent data loss) • Fixed in 5.1: “Unable to delete datastore with SIOC enabled”
  • 26. 26 Using SIOC with Virtual Flash (vFlash)  SIOC and vFlash are complementary  SIOC does not throttle SSD reads/writes  SIOC proportionally allocates post-cache IOs • Latency controls during warm-up  Best Practice: Allocate shares to VMs consistent with vFlash allocation vFlash Infrastructure Cache software Cache software I/O Queue Storage
  • 27. 27 Outline  Storage IO Control (SIOC) Overview  Deployment Scenarios  Improvements in vSphere 5.1 and 5.5  Preview from SIOC Labs
  • 28. 28 IO Reservations  IO reservation control • In addition to shares and Limits • Specified per VMDK in IOPs  SIOC distributes capacity using shares, limits and reservations  Storage DRS considers IO reservation during initial placement and load balancing SIOC SIOC R=100,200 IOPs R=150 R=250 Estimated Peak: 5430 IOPs
  • 29. 29 Resource Controls  Fine-grain resource controls • Per VM latency along with R,L,S • Latency managed by Storage DRS/SIOC • Enforced by smart arrays (vVols/vSAN)  IO Resource pools for VMs / VMDKs • Reservation, Limit, Shares control for a group of VMs or VMDKs • No need to set per VM controls
  • 30. 30 Summary  Easy to use – just two steps • Enable Storage IO Control on a datastore • Set IO shares and limit values for virtual disks  Performance isolation among VMs using IO shares  Automatic detection of I/O congestion  Protect critical applications during I/O congestion
  • 33. Storage IO Control: Concepts, Configuration and Best Practices to Tame Different Storage Architectures Sachin Manpathak, VMware VSVC5364 #VSVC5364
  • 34. 34 Thanks! Sachin Manpathak (smanpathak@vmware.com) Mustafa Uysal (muysal@vmware.com) Sunil Muralidhar (muralidhars@vmware.com) http://bit.ly/siocsdrs