High Availability of Azure
Applications (PaaS)
Himanshu Sahu
Mindfire Solutions
himanshus@mindfiresolutions.com
Agenda
Introduction
Windows Azure Role Architecture
Fault Domains in Windows Azure
Update Domains in Windows Azure
Windows Azure Host OS Updates
Windows Azure Guest OS Updates
Techniques for High Availability
High Availability in Azure
Introduction
ALWAYS ON
Reliability and Scalability
Design for failure
Implement separation of function
Use a service-oriented architecture
Windows Azure Role Architecture
Fault Domains in Windows Azure
Fault Domains
A fault domain is a physical unit of failure, closely tied to the
physical infrastructure in the data center. In Windows Azure, a rack
can be thought of as a fault domain, although there is no strict 1:1
mapping between fault domains and racks.
The Windows Azure Fabric is responsible for deploying the instances
of your application across different fault domains. At present, the
Fabric makes sure that your application uses at least two fault
domains (given two or more instances of a role).
As a developer, you have no direct control over how many fault
domains your application will use.
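A minimal sketch of why spreading instances over fault domains matters. The round-robin placement and the names place_instances and survivors are illustrative assumptions, not the Fabric's actual allocation algorithm.

```python
from collections import defaultdict


def place_instances(instance_count, fault_domain_count=2):
    """Spread instances over fault domains round-robin (illustrative assumption)."""
    placement = defaultdict(list)
    for i in range(instance_count):
        placement[i % fault_domain_count].append(f"instance_{i}")
    return dict(placement)


def survivors(placement, failed_domain):
    """Return the instances still running after one fault domain fails."""
    return [
        name
        for domain, names in placement.items()
        if domain != failed_domain
        for name in names
    ]


placement = place_instances(instance_count=3)
# Losing either fault domain still leaves at least one instance running.
for domain in placement:
    assert survivors(placement, domain)
```

With a single instance there is only one fault domain in use, so its failure leaves no survivors. That is the reason a role needs at least two instances to benefit from fault domains.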
Update Domains in Windows Azure
Update Domains
An update domain (also called an upgrade domain) is a logical unit
that determines which instances of a service are upgraded together.
By default, an application is configured with five upgrade domains.
You can control how many upgrade domains your application uses
through the upgradeDomainCount attribute of the ServiceDefinition
element in your service definition file (CSDEF).
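As a sketch, the setting can be read from a service definition like this. In the CSDEF schema the attribute is upgradeDomainCount on the ServiceDefinition element. The service name, role name, and the round-robin mapping of instances to domains below are hypothetical and only for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical CSDEF fragment; "MyService" and "WebRole1" are made-up names.
CSDEF = """<ServiceDefinition name="MyService" upgradeDomainCount="3"
    xmlns="http://schemas.microsoft.com/ServiceHosting/2008/10/ServiceDefinition">
  <WebRole name="WebRole1" />
</ServiceDefinition>"""

DEFAULT_UPGRADE_DOMAINS = 5  # default number of upgrade domains


def upgrade_domain_count(csdef_text):
    """Read upgradeDomainCount, falling back to the default when it is absent."""
    root = ET.fromstring(csdef_text)
    return int(root.get("upgradeDomainCount", DEFAULT_UPGRADE_DOMAINS))


def upgrade_domain_of(instance_index, domain_count):
    """Map an instance index to an upgrade domain round-robin (assumption)."""
    return instance_index % domain_count


count = upgrade_domain_count(CSDEF)
domains = [upgrade_domain_of(i, count) for i in range(5)]
```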
Windows Azure Host Updates
When and Why
Windows Azure deploys updates to the host OS approximately
once per month, so that the platform stays reliable, efficient and
secure for hosting your applications.
The Host Agent (HA) consists of multiple subcomponents, such as the
Network Agent (NA), which manages virtual machine VLANs, and the
virtual disk driver, which connects virtual machine disks to
the blobs containing their data in Windows Azure Storage. Azure
updates the HA and its subcomponents at different
intervals, depending on when a fix or new functionality is ready.
Windows Azure Host Updates
How
A host OS update reboots the instances on a server, and the fabric
controller ensures that instances from only one upgrade domain are
rebooted at a time.
Virtual machines on the server that have an Input Endpoint in
their role’s service model are removed from the load balancer rotation,
so no new requests reach them. New requests go to other instances of
that role according to the Azure load-balancing policies.
Each virtual machine hosting a Web or Worker Role receives a Stopping
event, whereas VM Roles receive a standard Windows shutdown event.
Web, Worker, and VM Roles are allowed five minutes to
respond to the Stopping or shutdown event before they are forcibly
stopped.
Windows Azure Host Updates
How
After all guest virtual machines are stopped, the root partition OS
shuts down and the server reboots.
The updated root partition OS starts.
The virtual machines hosted on the server boot and start their
application code.
Virtual machines hosting service roles with Input Endpoints
reconnect to the load balancer, enabling them to receive client
requests again.
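The Stopping event and the five-minute window described above suggest that role code should finish or hand off its work promptly when asked to stop. The following is a generic sketch of that pattern using Python threads. It does not use the Azure role API, and run_worker and stop_with_deadline are hypothetical names.

```python
import threading
import time


def run_worker(stopping, queue, processed):
    """Process items until the stopping signal is set, then return."""
    while not stopping.is_set() and queue:
        processed.append(queue.pop(0))
        time.sleep(0.01)  # stand-in for real work


def stop_with_deadline(worker, stopping, deadline_seconds):
    """Signal the worker to stop and wait up to the deadline.

    Returns True if the worker finished in time, False if it is still
    running and would be forcibly stopped.
    """
    stopping.set()
    worker.join(timeout=deadline_seconds)
    return not worker.is_alive()


stopping = threading.Event()
queue, processed = list(range(1000)), []
worker = threading.Thread(
    target=run_worker, args=(stopping, queue, processed), daemon=True
)
worker.start()
time.sleep(0.05)
# Five minutes mirrors the window granted before a forced stop.
clean = stop_with_deadline(worker, stopping, deadline_seconds=5 * 60)
```

Because the worker checks the signal between items, it exits almost immediately. Unprocessed items stay in the queue and can be picked up by another instance.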
Windows Azure Guest Updates
Once the host OS has finished upgrading across the datacenter,
the guest OS is upgraded for services that are configured to use
automatic guest OS versions. This upgrade follows the standard
upgrade domain rules for your service.
Your VM is rebooted and the Windows partition (the D drive)
is reimaged with the upgraded OS.
The guest OS update is much faster than the host OS
update, since the fabric only has to coordinate the update within
your hosted service and its upgrade domains.
Availability
An available application accounts for the availability of its underlying
infrastructure and dependent services. Available applications
remove single points of failure through redundancy and resilient
design.
Azure SLA
More Instances in Azure
Make Guest OS Update Manual
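A rough way to see why more instances help: if a role is available whenever at least one of its instances is up, and instance failures are independent (an idealization), availability rises quickly with the instance count. The 99% per-instance figure below is a made-up example, not an Azure SLA number.

```python
def combined_availability(instance_availability, instance_count):
    """Availability of a role that is up when at least one instance is up.

    Assumes instance failures are independent, which is an idealization.
    """
    return 1 - (1 - instance_availability) ** instance_count


single = combined_availability(0.99, 1)   # one instance
double = combined_availability(0.99, 2)   # two redundant instances
```

Under these assumptions, going from one instance to two moves the role from about 99% to about 99.99% availability.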
Availability
Scalability directly affects availability—an application that fails
under increased load is no longer available. Scalable applications
are able to meet increased demand with consistent results in
acceptable time windows.
Auto Scaling in Azure
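The idea behind auto scaling can be sketched as a simple threshold rule. This is not the Azure autoscale API. The function name, the CPU thresholds, and the instance limits are hypothetical values chosen for illustration.

```python
def desired_instance_count(current, cpu_percent, low=30, high=70,
                           minimum=2, maximum=10):
    """Return the instance count a threshold-based rule would ask for.

    Scale out by one above `high`, scale in by one below `low`, and
    always stay within [minimum, maximum].
    """
    if cpu_percent > high:
        target = current + 1
    elif cpu_percent < low:
        target = current - 1
    else:
        target = current
    return max(minimum, min(maximum, target))
```

Keeping the minimum at two means scaling in never removes the redundancy that availability depends on.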
Availability
Protection against hardware failures
Because every application is made up of multiple instances of each
role, hardware failures—a disk crash, a network fault, or the death
of a server machine—won’t take down the application. To help
with this, the fabric controller doesn’t choose machines for an
application’s instances at random. Instead, different instances of
the same role are placed in different fault domains. A fault domain
is a set of hardware—computers, switches, and more—that share a
single point of failure. (For example, all of the computers in a single
fault domain might rely on the same switch to connect to the
network.) Because of this, a single hardware failure can’t take
down an entire application. The application might temporarily lose
some instances, but it will continue to behave correctly.
Availability
Protection against software failures
The fabric controller can also detect failures caused by software. If
the code in an instance crashes or the VM in which it’s running
goes down, the fabric controller will start either just the code or, if
necessary, a new VM for that role. While any work the instance
was doing when it failed will be lost, the new instance will become
part of the application as soon as it starts running.
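The recovery behavior described above can be sketched as a single monitoring pass. This is an illustration of the idea, not the fabric controller's implementation. The callbacks is_healthy, restart_code, and replace_vm are hypothetical.

```python
def supervise(instances, is_healthy, restart_code, replace_vm):
    """One monitoring pass: recover every unhealthy instance.

    Tries the cheaper recovery first (restart the code) and falls back
    to a new VM only if the restart does not bring the instance back.
    """
    actions = {}
    for name in instances:
        if is_healthy(name):
            continue
        if restart_code(name):
            actions[name] = "restarted code"
        else:
            replace_vm(name)
            actions[name] = "replaced VM"
    return actions


# Example: web_1 recovers with a code restart, web_2 needs a new VM.
health = {"web_0": True, "web_1": False, "web_2": False}
replaced = []
actions = supervise(
    health,
    is_healthy=lambda name: health[name],
    restart_code=lambda name: name == "web_1",
    replace_vm=replaced.append,
)
```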
Availability
The ability to update applications with no application
downtime
When a new version of the application needs to be deployed, the
fabric controller can shut down the instances in just one update
domain, update the code for these, then create new instances
from that new code. Once those instances are running, it can do
the same thing to instances in the next update domain, and so on.
While users might see different versions of the application during
this process, depending on which instance they happen to interact
with, the application as a whole remains continuously available.
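The rolling update described above can be simulated in a few lines. This is a sketch under simplifying assumptions: the data layout and the name rolling_upgrade are hypothetical, and real upgrades involve health checks that are omitted here.

```python
def rolling_upgrade(instances, new_version):
    """Upgrade one update domain at a time.

    `instances` maps instance name to a dict with "domain", "version",
    and "running" keys. Yields (domain, versions in service) after each
    domain has been upgraded.
    """
    domains = sorted({info["domain"] for info in instances.values()})
    for domain in domains:
        members = [n for n, info in instances.items() if info["domain"] == domain]
        for name in members:
            instances[name]["running"] = False  # taken out of service
        # The rest of the application must keep serving requests.
        if not any(info["running"] for info in instances.values()):
            raise RuntimeError("upgrade would take the whole application down")
        for name in members:
            instances[name]["version"] = new_version
            instances[name]["running"] = True
        yield domain, sorted({info["version"] for info in instances.values()})


fleet = {
    f"web_{i}": {"domain": i % 2, "version": "v1", "running": True}
    for i in range(4)
}
history = list(rolling_upgrade(fleet, "v2"))
```

After the first domain is done, both versions are in service at once, which matches the mixed-version window mentioned above. If every instance sat in a single update domain, the sketch raises an error, because no instance would be left to serve requests.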
Availability
The ability to update Windows and other supporting
software with no application downtime.
The answer is update domains. :)
Resources
https://msdn.microsoft.com/en-us/library/azure/dn251004.aspx
http://blogs.msdn.com/b/kwill/archive/2011/05/05/windows-azure-r
http://blog.toddysm.com/2010/04/upgrade-domains-and-fault-doma
http://blogs.msdn.com/b/kwill/archive/2012/09/19/role-instance-restarts-due-to-os-upgrades.aspx
Questions?
Thank you!
