SlideShare a Scribd company logo
Independent Analysis of Storage and Data Protection
Why is Virtualization creating
Storage Sprawl?
by George Crump, Lead Analyst
Desktop and server virtualization have brought many benefits to the data center. These two initiatives have
allowed IT to respond quickly to the needs of the organization while driving down IT costs, physical footprint
requirements and energy demands. But there is one area of the data center that has actually increased in cost
since virtualization started to make its way into production… storage. Because of virtualization, more data centers
need flash to meet the random I/O nature of the virtualized environment, which of course is more expensive, on
a dollar per GB basis, than hard disk drives. The single biggest problem however is the significant increase in the
number of discrete storage systems that service the environment. This “storage sprawl” threatens the return on
investment (ROI) of virtualization projects and makes storage more complex to manage.
Why is Storage Sprawl Worse in Virtual Environments?
Storage sprawl has become worse in organizations that have made a significant investment in virtualization
technologies. For example most virtual desktop infrastructures (VDI) have an allflash array to handle desktop
images, preventing boot storms, logout storms and maintaining acceptable performance throughout the day.
But most VDI environments also need a file store for user home directories. There is little to be gained if this data
is placed on the all-flash array, but certainly data centers need to provide storage to their users to support user
created data. As a result most organizations end up buying a separate Network Attached Storage (NAS) device
to support user home directories and other types of unstructured data.
The virtual server environment will also see multiple storage systems implemented to support its operation.
This includes a high performance system, typically an all-flash array, to make sure that critical applications
experience a performance profile similar to their bare metal days. There is also a workhorse type of system
that today is often a mix of flash and hard disk drives, a hybrid system. The role of this system is to support
the mid-tier applications and services.
Finally there is a third type of storage often used for old VMs and sometimes old user data, as well as data being
collected from big data sources like sensors. This system is often capacity focused using high capacity hard
drives to create an affordable storage area for this “at rest” data.
Does Storage Sprawl Matter?
Some vendors will claim that storage sprawl no longer matters in virtualized environments, that the hypervisor
can manage the movement of virtual machine data between types of storage, just like it moves virtual machines
between different types of compute servers. This line of thinking encourages the purchase of the above purpose
specific storage systems; an all-flash array for performance sensitive VMs, a hybrid array for more modest VMs
www.storageswiss.com ©2015, Storage Switzerland, All rights reserved
and a hard disk based system for VMs that have no use for the performance that a flash based system could
provide.
What is not explained is that these separate silos of storage still need to be managed. They often use their own
unique snapshot, replication and other data services per system. That means that an IT administrator needs to
learn each storage system interface and scripting language. It also means that when data is transferred between
these systems, that the application may see a performance slow down, and may cause an outage, while this
migration occurs. The transfer needs to copy or move data from the primary storage system to the alternative
storage system and that means, at a minimum, transferring data across the storage network and potentially
the general purpose network. Finally, there is an optimization issue. Similar to direct attached storage, silos
of shared storage can’t be used optimally because there is rarely a balance between performance focused
applications and capacity based ones.
Storage sprawl results in three issues for IT managers. First is wasted IT administrator time even though some-
thing like an all-flash or even a hybrid array was purchased to reduce time spent tuning storage performance.
The second factor is unpredictable performance deliverables, even though these systems were implemented
to resolve that issue. The need to transfer data across a network will alter performance characteristics. And
third the imbalance of performance and capacity resources among the storage silos.
Can Sprawl Be Stopped?
The solution, a high-performance mixed workload storage system that supports a wide variety of workloads and
a wide variety of storage protocols while delivering the performance and storage economics that the various
workloads need. The key to full return on storage consolidation investment is a single system that can be fully
utilized while still delivering on the specific demands required by each workload it supports.
Why Stop Virtualization’s Storage Sprawl
For a storage system to stop storage sprawl there are several key requirements which it must deliver, and if it
can successfully deliver these requirements, then the benefits of a single system are numerous. First a single
unit is simply easier to manage. No matter how much software is layered over the management of multiple
storage systems, it can’t hide the fact that there are indeed multiple storage systems there that require more
time to manage.
The second benefit is that a single storage system can be used more efficiently. A data center with a single
storage system never has to worry about storage system B running out of storage capacity while storage
system A is running out of storage performance.
The third benefit is economics. Even if the independent storage systems are priced aggressively, the
organization is still paying for multiple storage controllers, storage capacity and storage software features.
And of course, each will have its own service agreement that has to be maintained.
The Requirements for Consolidated Storage
The first requirement is for mixed media. While flash has become less expensive, it is still not as competitive as
capacity disk pricing. And a storage system that is trying to consolidate ALL storage will be required to support
data types that simply don’t belong on flash. At the same time, performance demands need to be addressed
www.storageswiss.com ©2015, Storage Switzerland, All rights reserved
by the consolidated storage system. These systems will also have to leverage flash storage to meet those
demands. Further they should than what flash can provide it is able to deliver that too.
The second requirement is for a high performance HDD tier. Too many hybrid storage systems leverage flash
and then only high capacity HDDs as their second tier. The problem is that high capacity hard drives are slower,
and because fewer of them are required to meet the capacity demands of the environment there are less of the
drives to dedicate to performance.
The consolidated system needs to leverage faster performing drives of more moderate capacity. This also
means that more drives will be used. This combination will deliver respectable HDD performance without
increasing cost. More importantly, there will be less of a drop off between flash and HDD performance in
the case of a cache miss.
The third requirement is that the consolidated storage system also support multiple protocols. This means going
beyond the classic SAN/NAS support and providing both Object and even Mainframe access. For storage
consolidation to make sense it needs to move beyond the virtualized use case and be able to provide capacity
for analytics and other workload types.
The fourth requirement is that the consolidated system be able to keep up with performance and capacity
demands as they continue to increase over time. This means that the system should have the ability to scale-out
instead of scale up. A scale-out system allows the organization to avoid costly fork-lift upgrades to their storage
systems. This is even more critical as workloads are consolidated since the used capacity of the system will be
so much higher. High capacity means longer migrations; longer migrations mean more downtime. Scale-out
eliminates the need for migration.
Finally, and most important, is the need for a highly reliable system with multiple points of redundancy. If the
storage system is truly going to be the only storage system in the environment, then it can’t fail or experience
any outages. This reliability should be adjustable by application or workload so that mission critical workloads
could survive multiple outages.
Conclusion
A single consolidated storage system should bring many benefits to the organization, but those benefits
come at a risk of variable application performance and also a greater risk of failure. To mitigate these risks the
consolidating storage system needs to leverage multiple types of storage media and have multiple points of
redundancy built into the system. If this combination can be delivered, the virtual environment should become
simpler to administrate while also being less expensive to run.
Sponsored by INFINIDAT
Storage Switzerland is the leading storage analyst firm focused on the emerging storage categories of memory
based storage (Flash), big data, virtualization, cloud computing and data protection. The firm is widely recognized
for its blogs, white papers, and videos on such current technologies like all-flash arrays, deduplication, software-
defined storage, backup appliances, and storage networking. The “Switzerland” in the firm’s name indicates our
pledge to provide neutral analysis of the storage marketplace, rather than focusing on a single vendor or approach.
INFINIDAT has brought to market InfiniBox, a new generation of highly reliable, scalable and efficient storage
systems designed specifically to support a wide variety of workload types, including those in virtual architectures.
InfiniBox allows an organization to consolidate down to a single solution that has the performance, capacity
and affordability to support most of an organization’s workloads.

More Related Content

PDF
Net App Unified Storage Architecture
PDF
Fluid Data Storage:Driving Flexibility in the Data Center
PDF
Edison IBM FlashSystem and Tributary White Paper Final
PDF
Storage Virtualization: Towards an Efficient and Scalable Framework
PDF
Managing data to improve disaster recovery preparedness » data center knowledge
PDF
Software defined storage rev. 2.0
PDF
Solution Brief HPE StoreOnce backup with Veeam
PDF
Data Lake Protection - A Technical Review
 
Net App Unified Storage Architecture
Fluid Data Storage:Driving Flexibility in the Data Center
Edison IBM FlashSystem and Tributary White Paper Final
Storage Virtualization: Towards an Efficient and Scalable Framework
Managing data to improve disaster recovery preparedness » data center knowledge
Software defined storage rev. 2.0
Solution Brief HPE StoreOnce backup with Veeam
Data Lake Protection - A Technical Review
 

What's hot (20)

PDF
Sample_Blueprint-Fault_Tolerant_NAS
PDF
Insiders Guide- Managing Storage Performance
PDF
Sun Open Storage
PDF
Configuration and Deployment Guide For Memcached on Intel® Architecture
PDF
Windows server 2012 R2 private cloud virtualization and storage
PDF
Vmware virtualization in data centers
PDF
New Features For Your Software Defined Storage
PDF
IBM Storwize V7000 and Storwize V7000 Unified Disk Systems
PDF
twp-oracledatabasebackupservice-2183633
PDF
Flashelastic
PDF
Data Warehouse Scalability Using Cisco Unified Computing System and Oracle Re...
 
DOCX
Information Storage and Management notes ssmeena
PDF
Data Domain Architecture
PDF
[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia
PDF
Connect July-Aug 2014
PDF
White Paper: Optimizing Primary Storage Through File Archiving with EMC Cloud...
 
PDF
Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...
PDF
8 Strategies For Building A Modern DataCenter
PDF
Dmg emc-avamar-optimized-backup-recovery-dedupe[1]
PDF
Generic RLM White Paper
Sample_Blueprint-Fault_Tolerant_NAS
Insiders Guide- Managing Storage Performance
Sun Open Storage
Configuration and Deployment Guide For Memcached on Intel® Architecture
Windows server 2012 R2 private cloud virtualization and storage
Vmware virtualization in data centers
New Features For Your Software Defined Storage
IBM Storwize V7000 and Storwize V7000 Unified Disk Systems
twp-oracledatabasebackupservice-2183633
Flashelastic
Data Warehouse Scalability Using Cisco Unified Computing System and Oracle Re...
 
Information Storage and Management notes ssmeena
Data Domain Architecture
[IJET-V1I6P11] Authors: A.Stenila, M. Kavitha, S.Alonshia
Connect July-Aug 2014
White Paper: Optimizing Primary Storage Through File Archiving with EMC Cloud...
 
Positioning IBM Flex System 16 Gb Fibre Channel Fabric for Storage-Intensive ...
8 Strategies For Building A Modern DataCenter
Dmg emc-avamar-optimized-backup-recovery-dedupe[1]
Generic RLM White Paper
Ad

Similar to Why is Virtualization Creating Storage Sprawl? By Storage Switzerland (20)

PDF
Storage for VMware vSphere IBM Storwize V7000 Unified Provides Enterprise-Cla...
PDF
Taneja Group: Midrange Redefined – the IBM Storwize V7000 Analyst Paper
DOCX
Storage Area Networks Unit 1 Notes
PDF
Storage Virtualization isn’t About Storage
PDF
Product Brief Storage Virtualization isn’t About Storage
PDF
The State of the Core – Engineering the Enterprise Storage Infrastructure wit...
PDF
Engineering the Enterprise Storage Infrastructure with the IBM DS8000
PDF
Real-time Compression Advances Storage Optimization: A white paper by IDC
PPTX
Webinar: End NAS Sprawl - Gain Control Over Unstructured Data
PDF
IRJET- Open Source Solution for Centralized Storage System using Network ...
PPTX
London VMUG Presentation 19th July 2012
PDF
CloudByte_CureForNoisyNeighbors
PPTX
A brief introduction to data storage
PPT
lec-7.ppt It Infrastructure: Storage
PDF
White Paper: EMC Isilon OneFS Operating System
 
PDF
Using ibm total storage productivity center for disk to monitor the svc redp3961
PDF
A New Era in Midrange Storage IDC Analyst paper
PDF
IBM Storwize V7000 and Storwize V7000 Unified Disk Systems
PDF
IBM Storwize V7000 and Storwize V7000 Unified Disk Systems
PDF
Today's Need To Manage The Storage Polymorphism
Storage for VMware vSphere IBM Storwize V7000 Unified Provides Enterprise-Cla...
Taneja Group: Midrange Redefined – the IBM Storwize V7000 Analyst Paper
Storage Area Networks Unit 1 Notes
Storage Virtualization isn’t About Storage
Product Brief Storage Virtualization isn’t About Storage
The State of the Core – Engineering the Enterprise Storage Infrastructure wit...
Engineering the Enterprise Storage Infrastructure with the IBM DS8000
Real-time Compression Advances Storage Optimization: A white paper by IDC
Webinar: End NAS Sprawl - Gain Control Over Unstructured Data
IRJET- Open Source Solution for Centralized Storage System using Network ...
London VMUG Presentation 19th July 2012
CloudByte_CureForNoisyNeighbors
A brief introduction to data storage
lec-7.ppt It Infrastructure: Storage
White Paper: EMC Isilon OneFS Operating System
 
Using ibm total storage productivity center for disk to monitor the svc redp3961
A New Era in Midrange Storage IDC Analyst paper
IBM Storwize V7000 and Storwize V7000 Unified Disk Systems
IBM Storwize V7000 and Storwize V7000 Unified Disk Systems
Today's Need To Manage The Storage Polymorphism
Ad

Recently uploaded (20)

PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Machine learning based COVID-19 study performance prediction
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
cuic standard and advanced reporting.pdf
PDF
Approach and Philosophy of On baking technology
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
DOCX
The AUB Centre for AI in Media Proposal.docx
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
NewMind AI Monthly Chronicles - July 2025
Machine learning based COVID-19 study performance prediction
Per capita expenditure prediction using model stacking based on satellite ima...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Advanced methodologies resolving dimensionality complications for autism neur...
cuic standard and advanced reporting.pdf
Approach and Philosophy of On baking technology
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Mobile App Security Testing_ A Comprehensive Guide.pdf
NewMind AI Weekly Chronicles - August'25 Week I
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Review of recent advances in non-invasive hemoglobin estimation
MYSQL Presentation for SQL database connectivity
Diabetes mellitus diagnosis method based random forest with bat algorithm
The AUB Centre for AI in Media Proposal.docx

Why is Virtualization Creating Storage Sprawl? By Storage Switzerland

  • 1. Independent Analysis of Storage and Data Protection Why is Virtualization creating Storage Sprawl? by George Crump, Lead Analyst Desktop and server virtualization have brought many benefits to the data center. These two initiatives have allowed IT to respond quickly to the needs of the organization while driving down IT costs, physical footprint requirements and energy demands. But there is one area of the data center that has actually increased in cost since virtualization started to make its way into production… storage. Because of virtualization, more data centers need flash to meet the random I/O nature of the virtualized environment, which of course is more expensive, on a dollar per GB basis, than hard disk drives. The single biggest problem however is the significant increase in the number of discrete storage systems that service the environment. This “storage sprawl” threatens the return on investment (ROI) of virtualization projects and makes storage more complex to manage. Why is Storage Sprawl Worse in Virtual Environments? Storage sprawl has become worse in organizations that have made a significant investment in virtualization technologies. For example most virtual desktop infrastructures (VDI) have an allflash array to handle desktop images, preventing boot storms, logout storms and maintaining acceptable performance throughout the day. But most VDI environments also need a file store for user home directories. There is little to be gained if this data is placed on the all-flash array, but certainly data centers need to provide storage to their users to support user created data. As a result most organizations end up buying a separate Network Attached Storage (NAS) device to support user home directories and other types of unstructured data. The virtual server environment will also see multiple storage systems implemented to support its operation. This includes a high performance system, typically an all-flash array, to make sure that critical applications experience a performance profile similar to their bare metal days. There is also a workhorse type of system that today is often a mix of flash and hard disk drives, a hybrid system. The role of this system is to support the mid-tier applications and services. Finally there is a third type of storage often used for old VMs and sometimes old user data, as well as data being collected from big data sources like sensors. This system is often capacity focused using high capacity hard drives to create an affordable storage area for this “at rest” data. Does Storage Sprawl Matter? Some vendors will claim that storage sprawl no longer matters in virtualized environments, that the hypervisor can manage the movement of virtual machine data between types of storage, just like it moves virtual machines between different types of compute servers. This line of thinking encourages the purchase of the above purpose specific storage systems; an all-flash array for performance sensitive VMs, a hybrid array for more modest VMs www.storageswiss.com ©2015, Storage Switzerland, All rights reserved
  • 2. and a hard disk based system for VMs that have no use for the performance that a flash based system could provide. What is not explained is that these separate silos of storage still need to be managed. They often use their own unique snapshot, replication and other data services per system. That means that an IT administrator needs to learn each storage system interface and scripting language. It also means that when data is transferred between these systems, that the application may see a performance slow down, and may cause an outage, while this migration occurs. The transfer needs to copy or move data from the primary storage system to the alternative storage system and that means, at a minimum, transferring data across the storage network and potentially the general purpose network. Finally, there is an optimization issue. Similar to direct attached storage, silos of shared storage can’t be used optimally because there is rarely a balance between performance focused applications and capacity based ones. Storage sprawl results in three issues for IT managers. First is wasted IT administrator time even though some- thing like an all-flash or even a hybrid array was purchased to reduce time spent tuning storage performance. The second factor is unpredictable performance deliverables, even though these systems were implemented to resolve that issue. The need to transfer data across a network will alter performance characteristics. And third the imbalance of performance and capacity resources among the storage silos. Can Sprawl Be Stopped? The solution, a high-performance mixed workload storage system that supports a wide variety of workloads and a wide variety of storage protocols while delivering the performance and storage economics that the various workloads need. The key to full return on storage consolidation investment is a single system that can be fully utilized while still delivering on the specific demands required by each workload it supports. Why Stop Virtualization’s Storage Sprawl For a storage system to stop storage sprawl there are several key requirements which it must deliver, and if it can successfully deliver these requirements, then the benefits of a single system are numerous. First a single unit is simply easier to manage. No matter how much software is layered over the management of multiple storage systems, it can’t hide the fact that there are indeed multiple storage systems there that require more time to manage. The second benefit is that a single storage system can be used more efficiently. A data center with a single storage system never has to worry about storage system B running out of storage capacity while storage system A is running out of storage performance. The third benefit is economics. Even if the independent storage systems are priced aggressively, the organization is still paying for multiple storage controllers, storage capacity and storage software features. And of course, each will have its own service agreement that has to be maintained. The Requirements for Consolidated Storage The first requirement is for mixed media. While flash has become less expensive, it is still not as competitive as capacity disk pricing. And a storage system that is trying to consolidate ALL storage will be required to support data types that simply don’t belong on flash. At the same time, performance demands need to be addressed www.storageswiss.com ©2015, Storage Switzerland, All rights reserved
  • 3. by the consolidated storage system. These systems will also have to leverage flash storage to meet those demands. Further they should than what flash can provide it is able to deliver that too. The second requirement is for a high performance HDD tier. Too many hybrid storage systems leverage flash and then only high capacity HDDs as their second tier. The problem is that high capacity hard drives are slower, and because fewer of them are required to meet the capacity demands of the environment there are less of the drives to dedicate to performance. The consolidated system needs to leverage faster performing drives of more moderate capacity. This also means that more drives will be used. This combination will deliver respectable HDD performance without increasing cost. More importantly, there will be less of a drop off between flash and HDD performance in the case of a cache miss. The third requirement is that the consolidated storage system also support multiple protocols. This means going beyond the classic SAN/NAS support and providing both Object and even Mainframe access. For storage consolidation to make sense it needs to move beyond the virtualized use case and be able to provide capacity for analytics and other workload types. The fourth requirement is that the consolidated system be able to keep up with performance and capacity demands as they continue to increase over time. This means that the system should have the ability to scale-out instead of scale up. A scale-out system allows the organization to avoid costly fork-lift upgrades to their storage systems. This is even more critical as workloads are consolidated since the used capacity of the system will be so much higher. High capacity means longer migrations; longer migrations mean more downtime. Scale-out eliminates the need for migration. Finally, and most important, is the need for a highly reliable system with multiple points of redundancy. If the storage system is truly going to be the only storage system in the environment, then it can’t fail or experience any outages. This reliability should be adjustable by application or workload so that mission critical workloads could survive multiple outages. Conclusion A single consolidated storage system should bring many benefits to the organization, but those benefits come at a risk of variable application performance and also a greater risk of failure. To mitigate these risks the consolidating storage system needs to leverage multiple types of storage media and have multiple points of redundancy built into the system. If this combination can be delivered, the virtual environment should become simpler to administrate while also being less expensive to run. Sponsored by INFINIDAT Storage Switzerland is the leading storage analyst firm focused on the emerging storage categories of memory based storage (Flash), big data, virtualization, cloud computing and data protection. The firm is widely recognized for its blogs, white papers, and videos on such current technologies like all-flash arrays, deduplication, software- defined storage, backup appliances, and storage networking. The “Switzerland” in the firm’s name indicates our pledge to provide neutral analysis of the storage marketplace, rather than focusing on a single vendor or approach. INFINIDAT has brought to market InfiniBox, a new generation of highly reliable, scalable and efficient storage systems designed specifically to support a wide variety of workload types, including those in virtual architectures. InfiniBox allows an organization to consolidate down to a single solution that has the performance, capacity and affordability to support most of an organization’s workloads.