Implementing Affordable Disaster Recovery with Hyper-V and Multi-Site Clustering
Greg Shields, MVP
Partner and Principal Technologist
www.ConcentratedTech.com
This slide deck was used in one of our many conference presentations. We hope you enjoy it, and invite you to use it within your own organization however you like.
For more information on our company, including information on private classes and upcoming conference appearances, please visit our Web site, www.ConcentratedTech.com.
For links to newly-posted decks, follow us on Twitter: @concentrateddon or @concentratdgreg
This work is copyright © Concentrated Technology, LLC

What Makes a Disaster?
Which of the following would you consider a disaster?
  • A naturally occurring event, such as a tornado, flood, or hurricane, damages your datacenter, and that damage halts all processing in the datacenter.
  • A widespread incident, such as a water leak or long-term power outage, interrupts the functionality of your datacenter for an extended period of time.
  • A problem with a virtual host causes a “blue screen of death”, immediately halting all processing on that server.
  • An administrator installs a piece of code that causes problems with a service, shutting down that service and preventing some action from occurring on the server.
  • An issue with power connections causes a server or an entire rack of servers to inadvertently and rapidly power down.
Each scenario is either a DISASTER! or JUST A BAD DAY!

What Makes a Disaster?
  • Your decision to “declare a disaster” and move to “disaster ops” is a major one.
  • The technologies used for disaster protection are different from those used for high availability.
      • More complex.
      • More expensive.
  • Failover and failback processes involve more thought.
      • You might not be able to just “fail back” with the click of a button.

A Disastrous Poll
Where are we? Who here is…
  • Planning a DR environment?
  • In the process of implementing one?
  • Already enjoying one?
  • What’s a “DR environment”???

Multi-Site Hyper-V == Single-Site Hyper-V
  • DON’T PANIC: Multi-site Hyper-V looks very much the same as single-site Hyper-V.
      • Microsoft has not done a good job of explaining this fact!
      • Some Hyper-V hosts.
      • Some networking and storage.
      • Virtual machines that Live Migrate around.
  • But there are some major differences too…
      • VMs can Live Migrate across sites.
      • Sites typically have different subnet arrangements.
      • Data in the primary site must be replicated to the DR site.
      • Clients need to know where your servers go!

Constructing Site-Proof Hyper-V: Three Things You Need
At a very high level, Hyper-V disaster recovery is three things:
  • A storage mechanism
  • A replication mechanism
  • A set of target servers and a cluster to receive virtual machines and their data
Once you have these three things, layering Hyper-V atop is easy.
[Diagram labels: Storage Device(s), Replication Mechanism, Target Servers]

Thing 1: A Storage Mechanism
  • Typically, two SANs in two different locations.
      • Fibre Channel, iSCSI, FCoE, heck, even JBOD.
      • Often of a similar model or manufacturer. Some replication mechanisms need this similarity to function properly, although it is not always required.
  • The backup SAN doesn’t necessarily need to be the same size or speed as the primary SAN.
      • Replicated data isn’t always the full set of data.
      • You may not need disaster recovery for everything.
      • DR Environments: Where Old SANs Go To Die.

Thing 2: A Replication Mechanism
Replication between SANs must occur. There are two commonly accepted ways to accomplish this…
  • Synchronously
      • Changes are made on one node at a time. Subsequent changes on the primary SAN must wait for an ACK from the backup SAN.
  • Asynchronously
      • Changes on the backup SAN will eventually be written. Changes are queued at the primary SAN to be transferred at intervals.
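
The difference between the two modes can be sketched with a toy model. This is only an illustration, not how any particular SAN implements replication: the class `ReplicatedVolume` and its method names are hypothetical, and the "backup SAN" is just a Python list. It shows that a synchronous write reaches both sites before it completes, while an asynchronous write sits in a queue until the next transfer interval.

```python
from collections import deque


class ReplicatedVolume:
    """Toy model of a primary SAN volume replicating to a backup SAN volume."""

    def __init__(self, synchronous):
        self.synchronous = synchronous
        self.primary = []        # writes committed at the primary site
        self.backup = []         # writes committed at the backup site
        self.pending = deque()   # asynchronous only: writes queued for transfer

    def write(self, block):
        self.primary.append(block)
        if self.synchronous:
            # Synchronous: the write is not complete until the backup has it too.
            self.backup.append(block)
        else:
            # Asynchronous: complete the write now, ship the change later.
            self.pending.append(block)

    def transfer_interval(self):
        """Ship everything queued at the primary to the backup."""
        while self.pending:
            self.backup.append(self.pending.popleft())

    def lost_if_primary_fails_now(self):
        """Number of writes that exist only at the primary site."""
        return len(self.primary) - len(self.backup)


sync_vol = ReplicatedVolume(synchronous=True)
async_vol = ReplicatedVolume(synchronous=False)
for block in ("a", "b", "c"):
    sync_vol.write(block)
    async_vol.write(block)

print(sync_vol.lost_if_primary_fails_now())    # 0
print(async_vol.lost_if_primary_fails_now())   # 3
async_vol.transfer_interval()
print(async_vol.lost_if_primary_fails_now())   # 0
```

In this sketch the cost of synchronous mode is hidden inside `write` (a real array would block there waiting for the ACK), while the cost of asynchronous mode shows up as the non-zero count between transfer intervals.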

Class Discussion
Which would you choose? Why?
Synchronous
  • Assures no loss of data.
  • Requires a high-bandwidth and low-latency connection.
  • Write and acknowledgement latencies impact performance.
  • Requires shorter distances between storage devices.
Asynchronous
  • Potential for loss of data during a failure.
  • Leverages smaller-bandwidth connections, more tolerant of latency.
  • No write-latency penalty, since the primary SAN does not wait for the backup SAN.
  • Potential to stretch across longer distances.
Your Recovery Point Objective makes this decision…
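
A back-of-the-envelope calculation can make the trade-off above concrete. The sketch below rests on stated assumptions: light in optical fiber covers roughly 200 km per millisecond, only propagation delay is counted (real links add switching and storage latency), and the function names and sample numbers are hypothetical. It estimates the minimum latency each synchronous write pays for distance, and the worst-case data-loss window of an interval-based asynchronous scheme to compare against a Recovery Point Objective.

```python
FIBER_KM_PER_MS = 200.0  # assumption: approximate speed of light in optical fiber


def sync_round_trip_ms(distance_km):
    """Minimum added latency per synchronous write: data out, ACK back."""
    return 2 * distance_km / FIBER_KM_PER_MS


def async_worst_case_loss_s(interval_s, transfer_s):
    """Worst-case age of unreplicated data for interval-based replication.

    A write made just after one transfer starts is not safe at the backup
    until the next transfer finishes: one full interval plus one transfer.
    """
    return interval_s + transfer_s


def async_meets_rpo(interval_s, transfer_s, rpo_s):
    """True if the worst-case loss window fits inside the RPO."""
    return async_worst_case_loss_s(interval_s, transfer_s) <= rpo_s


print(sync_round_trip_ms(100))            # 1.0
print(sync_round_trip_ms(1000))           # 10.0
print(async_worst_case_loss_s(300, 60))   # 360
print(async_meets_rpo(300, 60, 900))      # True
print(async_meets_rpo(300, 60, 300))      # False
```

Under these assumptions, a site 1000 km away adds at least 10 ms to every synchronous write, while a five-minute asynchronous interval with a one-minute transfer cannot satisfy a five-minute RPO.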

Thing 2½: Replication Processing Location
There are also two locations for replication processing…
  • Storage Layer
      • Replication processing is handled by the SAN itself.
      • Agents are often installed on virtual hosts or machines to ensure crash consistency.
      • Easier to set up, fewer moving parts. More scalable.
      • Concerns about crash consistency.
  • OS / Application Layer
      • Replication processing is handled by software in the VM OS.
      • This software also operates as the agent.
      • More challenging to set up, more moving parts. More installations to manage/monitor. Scalability and cost are linear.
      • Fewer concerns about crash consistency.

Thing 3: Target Servers and a Cluster
Finally, you need target servers and a cluster in the backup site.

Clustering’s Sordid History
  • Windows NT 4.0
      • Microsoft Cluster Service “Wolfpack”.
      • “As the corporate expert in Windows clustering, I recommend you don’t use Windows clustering.”
  • Windows 2000
      • Greater availability, scalability. Still painful.
  • Windows 2003
      • Added iSCSI storage to traditional Fibre Channel.
      • SCSI Resets still used as method of last resort (painful).
  • Windows 2008
      • Eliminated use of SCSI Resets.
      • Eliminated full-solution HCL requirement.
      • Added Cluster Validation Wizard and pre-cluster tests.
      • Clusters can now span subnets (ta-da!)
  • Windows 2008 R2
      • Improvements to Cluster Validation Wizard and Migration Wizard.
      • Additional cluster services.
      • Cluster Shared Volumes (!) and Live Migration (!)

So, What IS a Cluster?
[Diagram label: Quorum Drive & Storage for Hyper-V VMs]

So, What IS a Multi-Site Cluster?

Quorum: Windows Clustering’s Most Confusing Configuration
  • Ever been to a Kiwanis meeting…?
  • A cluster “exists” because it has quorum between its members. That quorum is achieved through a voting process.
      • Different Kiwanis clubs have different rules for quorum.
      • Different clusters have different rules for quorum.
  • If a cluster “loses quorum”, the entire cluster shuts down and ceases to exist. It stays down until quorum is regained.
      • This is very different from a resource failover, which is the reason clusters are implemented.
  • Multiple quorum models exist.

Four Options for Quorum
  • Node and Disk Majority
  • Node Majority
  • Node and File Share Majority
  • No Majority: Disk Only

Quorum in Multi-Site Clusters
  • Microsoft recommends using the Node and File Share Majority model for multi-site clusters.
  • This model provides the best protection for a full-site outage.
  • Surviving a full-site outage requires a file share witness in a third geographic location.
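
The voting behind the four models can be sketched as a small function. This is a simplified illustration, not the cluster service's actual algorithm: the function `has_quorum` and its parameters are hypothetical, every node is assumed to carry exactly one vote, and the witness (disk or file share) carries one vote in the two witness-based majority models.

```python
def has_quorum(model, nodes_online, nodes_total, witness_online=False):
    """Return True if the cluster keeps quorum under the given model."""
    if model == "No Majority: Disk Only":
        # Only the disk matters: any node that can reach it keeps the cluster up.
        return witness_online and nodes_online >= 1

    if model == "Node Majority":
        votes_total = nodes_total
        votes_online = nodes_online
    elif model in ("Node and Disk Majority", "Node and File Share Majority"):
        # The witness adds one extra vote on top of the node votes.
        votes_total = nodes_total + 1
        votes_online = nodes_online + (1 if witness_online else 0)
    else:
        raise ValueError(f"unknown quorum model: {model}")

    # Quorum means strictly more than half of all votes are present.
    return 2 * votes_online > votes_total


print(has_quorum("Node Majority", 3, 5))                        # True
print(has_quorum("Node Majority", 2, 5))                        # False
print(has_quorum("Node and File Share Majority", 2, 4, True))   # True
print(has_quorum("Node and File Share Majority", 2, 4, False))  # False
print(has_quorum("No Majority: Disk Only", 1, 4, True))         # True
```

The fourth call is the multi-site case in miniature: with four nodes split evenly across two sites, the two surviving nodes only keep quorum if they can still reach the witness.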


Quorum in Multi-Site Clusters
  • Use the Node and File Share Majority quorum model.
      • Prevents an entire-site outage from impacting quorum.
      • Enables creation of multiple clusters if necessary.
[Diagram label: Third Site for Witness Server]
  • 57. I Need a Third Site? Seriously? Here’s where Microsoft’s ridiculous quorum notion gets unnecessarily complicated… What happens if you put the quorum’s file share in the primary site? The secondary site might not automatically come online after a primary site failure. Votes in secondary site < votes in primary site. Let’s count on our fingers…
  • 58. I Need a Third Site? Seriously? Here’s where Microsoft’s ridiculous quorum notion gets unnecessarily complicated… What happens if you put the quorum’s file share in the secondary site? A failure in the secondary site could cause the primary site to go down. Votes in secondary site > votes in primary site. More fingers… This problem gets even weirder as time passes and the number of servers in each site changes.
  • 59. I Need a Third Site? Seriously? (Diagram: Third Site for Witness Server.)
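
The finger-counting on slides 57 and 58 can be sketched in a few lines of Python. This is a simplified model, not the cluster service's actual logic: it assumes every node and the file share witness holds exactly one vote, that a site outage removes every vote located in that site, and that the survivors need a strict majority of all configured votes. The function name and the site labels ("primary", "secondary", "third") are hypothetical, chosen only for illustration.

```python
def has_quorum_after_outage(primary_nodes, secondary_nodes, witness_site, failed_site):
    """Return True if the surviving votes still form a majority.

    Simplified model (an assumption, not the cluster service's real logic):
    one vote per node, plus one vote for the file share witness.
    """
    votes = {"primary": primary_nodes, "secondary": secondary_nodes, "third": 0}
    votes[witness_site] += 1            # the witness votes from the site it lives in
    total = sum(votes.values())
    needed = total // 2 + 1             # strict majority of all configured votes
    surviving = total - votes[failed_site]
    return surviving >= needed


# Two nodes per site plus the witness: five votes in total, three needed.
for witness_site, failed_site in [
    ("primary", "primary"),
    ("secondary", "secondary"),
    ("third", "primary"),
    ("third", "secondary"),
]:
    ok = has_quorum_after_outage(2, 2, witness_site, failed_site)
    print(f"witness in {witness_site}, {failed_site} site fails -> quorum kept: {ok}")
```

Under these assumptions, placing the witness in either production site means that losing that site takes the majority with it, while a witness in a third site leaves the surviving site with three of five votes in both outage cases.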
  • 61. Multi-Site Cluster Tips/Tricks. Install servers so that your primary site always contains more servers than the backup sites. This eliminates some problems with quorum during a site outage.
  • 64. Multi-Site Cluster Tips/Tricks. Manage Preferred Owners & Persistent Mode options. Make sure your servers fail over to servers in the same site first. But also make sure they have options for failing over elsewhere. Consider carefully the effects of Failback. Failback is a great solution for resetting after a failure. But Failback can be a massive problem-causer as well. Its effects are particularly pronounced in multi-site clusters. Recommendation: turn it off (until you’re ready).
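
One way to reason about "same site first, elsewhere second" is to write down the intended owner order before configuring it. The sketch below is a planning aid only: it does not call any clustering API, and the node names, site labels, and the function `preferred_owner_order` are all hypothetical.

```python
def preferred_owner_order(home_node, node_sites):
    """Order candidate owners: home node, then same-site nodes, then other sites.

    node_sites maps a node name to its site label. Names are sorted within
    each group only so the result is deterministic.
    """
    home_site = node_sites[home_node]
    same_site = sorted(
        n for n, site in node_sites.items() if site == home_site and n != home_node
    )
    other_sites = sorted(n for n, site in node_sites.items() if site != home_site)
    return [home_node] + same_site + other_sites


# Hypothetical four-node cluster, two nodes per site.
nodes = {"HV1": "primary", "HV2": "primary", "HV3": "backup", "HV4": "backup"}
print(preferred_owner_order("HV1", nodes))
print(preferred_owner_order("HV3", nodes))
```

Every node still appears in each list, so a VM keeps somewhere to go if its whole site is lost; only the order expresses the same-site preference.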
  • 68. Multi-Site Cluster Tips/Tricks. Resist creating clusters that support other services. A Hyper-V cluster is a Hyper-V cluster is a Hyper-V cluster. Use disk “dependencies” as Affinity/Anti-Affinity rules. Hyper-V all by itself doesn’t have an elegant way to affinitize. Setting disk dependencies against each other is a work-around. Add servers in pairs. This ensures that a server loss won’t cause site split brain. This is less of a problem with the File Share Witness configuration.
  • 71. Most Important! Ensure that networking remains available when VMs migrate from primary to backup site. Clustering can span subnets! This is good, but only if you plan for it… Remember that crossing subnets also means changing IP address, subnet mask, gateway, etc., at the new site. This can be done automatically by using DHCP and dynamic DNS; otherwise it must be updated manually. DNS replication is also a problem. Clients will require time to update their local cache. Consider reducing DNS TTL or clearing client cache.
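
A rough back-of-the-envelope sketch shows why a lower TTL helps. It assumes a simple model that is not stated in the deck: a client may fetch the old record from a DNS server that has not yet received the update, right up until replication completes, and then caches it for one full TTL. The function name and the sample numbers are illustrative assumptions, and intermediate resolver caching is ignored.

```python
def worst_case_stale_seconds(record_ttl_s, replication_delay_s=0):
    """Upper bound on how long a client may keep using the old address.

    Assumed model: old record served until replication completes, then
    cached by the client for one full TTL.
    """
    return replication_delay_s + record_ttl_s


# Compare a few TTL choices against an assumed 15-minute replication delay.
for ttl in (3600, 300, 60):
    stale = worst_case_stale_seconds(ttl, replication_delay_s=900)
    print(f"TTL {ttl:>4}s -> client may be stale for up to {stale}s")
```

In this model the TTL only shrinks one of the two terms, which is why the slide treats DNS replication as a separate problem from client caching.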