SlideShare a Scribd company logo
Availability and Durability
Availability and Durability
Availability
Will I be able to access my data now and when I need it?
Percentage of time an application provides the operations expected of it
Durability
Will my data be available a er 10 or 100 or 1000 years?
Examples of measuring availability and durability:
4 9's - 99.99
11 9's - 99.999999999
Typically, an availability of four 9's is considered very good
Typically, a durability of eleven 9's is considered very good
69
Availability
Availability
Availability Downtime (in a month) Comment
99.95% 22 minutes
99.99% (4 9's) 4 and 1/2 minutes Typically online apps aim for 99.99% (4 9's) availability
99.999% (5 9's) 26 seconds Achieving 5 9's availability is tough
70
What does a durability of 11 9's
mean?
If you store one million files for ten million
years, you would expect to lose one file
Why should durability be high?
Because we hate losing data
Once we lose data, it is gone
Durability
Durability
71
Increasing Availability:
Have multiple standbys available OR
distribute the database
in multiple Zones
in multiple Regions
Increasing Durability:
Multiple copies of data (standbys,
snapshots, transaction logs and replicas)
in multiple Zones
in multiple Regions
Replicating data comes with its own
challenges!
We will talk about them a little later
Increasing Availability and Durability of Databases
Increasing Availability and Durability of Databases
72
Imagine a financial transaction being lost
Imagine a trade being lost
Imagine a stock exchange going down for an hour
Typically businesses are fine with some downtime but they
hate losing data
Availability and Durability are technical measures
How do we measure how quickly we can recover from
failure?
RPO (Recovery Point Objective): Maximum acceptable period of data loss
RTO (Recovery Time Objective): Maximum acceptable downtime
Achieving minimum RTO and RPO is expensive
Trade-off based on the criticality of the data
Database Terminology : RTO and RPO
Database Terminology : RTO and RPO
73
Question - RTO and RPO
Question - RTO and RPO
You are running an application in VM instance storing its data on a persistent
data storage. You are taking snapshots every 48 hours. If the VM instance
crashes, you can manually bring it back up in 45 minutes from the snapshot.
What is your RTO and RPO?
RTO - 45 minutes
RPO - 48 hours
74
Achieving RTO and RPO - Failover Examples
Achieving RTO and RPO - Failover Examples
Scenario Solution
Very small data loss (RPO - 1 minute)
Very small downtime (RTO - 5 minutes)
Hot standby - Automatically synchronize data
Have a standby ready to pick up load
Use automatic failover from master to standby
Very small data loss (RPO - 1 minute)
BUT I can tolerate some downtimes (RTO - 15 minutes)
Warm standby - Automatically synchronize data
Have a standby with minimum infrastructure
Scale it up when a failure happens
Data is critical (RPO - 1 minute) but I can tolerate
downtime of a few hours (RTO - few hours)
Create regular data snapshots and transaction
logs
Create database from snapshots and
transactions logs when a failure happens
Data can be lost without a problem (for example: cached
data)
Failover to a completely new server
75
New reporting and analytics applications are being
launched using the same database
These applications will ONLY read data
Within a few days you see that the database
performance is impacted
How can we fix the problem?
Vertically scale the database - increase CPU and memory
Create a database cluster (Distribute the database) - Typically
database clusters are expensive to setup
Create read replicas - Run read only applications against read
replicas
(New Scenario) Reporting and Analytics Applications
(New Scenario) Reporting and Analytics Applications
76
How do you ensure that data in multiple database instances
(standbys and replicas) is updated simultaneously?
Strong consistency - Synchronous replication to all replicas
Will be slow if you have multiple replicas or standbys
Eventual consistency - Asynchronous replication. A little lag -
few seconds - before the change is available in all replicas
In the intermediate period, different replicas might return different values
Used when scalability is more important than data integrity
Examples : Social Media Posts - Facebook status messages, Twitter tweets,
Linked in posts etc
Read-a er-Write consistency - Inserts are immediately
available
However, updates would have eventual consistency
Consistency
Consistency
77
There are several categories of databases:
Relational (OLTP and OLAP), Document, Key Value, Graph, In Memory among others
Choosing type of database for your use case is not easy. A few
factors:
Do you want a fixed schema?
Do you want flexibility in defining and changing your schema? (schemaless)
What level of transaction properties do you need? (atomicity and consistency)
What kind of latency do you want? (seconds, milliseconds or microseconds)
How many transactions do you expect? (hundreds or thousands or millions of
transactions per second)
How much data will be stored? (MBs or GBs or TBs or PBs)
and a lot more...
Database Categories
Database Categories
78

More Related Content

PPTX
Polyglot Persistence
PPTX
HbaseHivePigbyRohitDubey
PDF
Beginner's Guide to High Availability for Postgres
 
PPTX
Deepak_ppt_ver1.0.pptx
PPTX
U1-NOSQL.pptx DIFFERENT TYPES OF NOSQL DATABASES
PDF
Beginner's Guide to High Availability for Postgres - French
 
PDF
Real-world consistency explained
PPTX
Public Sector Virtual Town Hall: High Availability for PostgreSQL
 
Polyglot Persistence
HbaseHivePigbyRohitDubey
Beginner's Guide to High Availability for Postgres
 
Deepak_ppt_ver1.0.pptx
U1-NOSQL.pptx DIFFERENT TYPES OF NOSQL DATABASES
Beginner's Guide to High Availability for Postgres - French
 
Real-world consistency explained
Public Sector Virtual Town Hall: High Availability for PostgreSQL
 

Similar to AZ900-AzureFundamentals-part-8.pdf (20)

PDF
Webinar slides: Designing Open Source Databases for High Availability
PPTX
Hbase hive pig
ODP
Front Range PHP NoSQL Databases
PPTX
Beginners Guide to High Availability for Postgres
 
PDF
MongoDB: What, why, when
PPTX
Mike walsh hadr_toomanychoices_20191107
PPTX
Beginner's Guide to High Availability for Postgres
 
PPT
PDF
PPTX
MongoDB Session 1
PDF
Oracle vs NoSQL – The good, the bad and the ugly
KEY
2011 Db Intro
PPTX
NoSQL with Microsoft Azure
PPTX
CodeFutures - Scaling Your Database in the Cloud
PDF
Building on quicksand microservices indicthreads
PPTX
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
PPTX
The Rise of NoSQL and Polyglot Persistence
PPTX
NoSQLDatabases
PPTX
Data Processing in Fundamentals of IT
PPTX
Introduction to DBMS_VP.pptx
Webinar slides: Designing Open Source Databases for High Availability
Hbase hive pig
Front Range PHP NoSQL Databases
Beginners Guide to High Availability for Postgres
 
MongoDB: What, why, when
Mike walsh hadr_toomanychoices_20191107
Beginner's Guide to High Availability for Postgres
 
MongoDB Session 1
Oracle vs NoSQL – The good, the bad and the ugly
2011 Db Intro
NoSQL with Microsoft Azure
CodeFutures - Scaling Your Database in the Cloud
Building on quicksand microservices indicthreads
A Survey of Advanced Non-relational Database Systems: Approaches and Applicat...
The Rise of NoSQL and Polyglot Persistence
NoSQLDatabases
Data Processing in Fundamentals of IT
Introduction to DBMS_VP.pptx

More from ssuser2dbaee (11)

PDF
AZ900-AzureFundamentals-part-11.pdf
PDF
AZ900-AzureFundamentals-part-5.pdf
PDF
AZ900-AzureFundamentals-part-7.pdf
PDF
AZ900-AzureFundamentals-part-6.pdf
PDF
AZ900-AzureFundamentals-part-2.pdf
PDF
AZ900-AzureFundamentals-part-9.pdf
PDF
AZ900-AzureFundamentals-part-3.pdf
PDF
AZ900-AzureFundamentals-part-10.pdf
PDF
AZ900-AzureFundamentals-part-4.pdf
DOCX
NetApp CIFS Audit.docx
DOCX
Netapp_Aggregates.docx
AZ900-AzureFundamentals-part-11.pdf
AZ900-AzureFundamentals-part-5.pdf
AZ900-AzureFundamentals-part-7.pdf
AZ900-AzureFundamentals-part-6.pdf
AZ900-AzureFundamentals-part-2.pdf
AZ900-AzureFundamentals-part-9.pdf
AZ900-AzureFundamentals-part-3.pdf
AZ900-AzureFundamentals-part-10.pdf
AZ900-AzureFundamentals-part-4.pdf
NetApp CIFS Audit.docx
Netapp_Aggregates.docx

Recently uploaded (20)

PPT
Module 1.ppt Iot fundamentals and Architecture
PPTX
Tartificialntelligence_presentation.pptx
PDF
DP Operators-handbook-extract for the Mautical Institute
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
STKI Israel Market Study 2025 version august
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
A comparative study of natural language inference in Swahili using monolingua...
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PPTX
OMC Textile Division Presentation 2021.pptx
PPTX
Programs and apps: productivity, graphics, security and other tools
PPTX
cloud_computing_Infrastucture_as_cloud_p
PPTX
TLE Review Electricity (Electricity).pptx
PPTX
observCloud-Native Containerability and monitoring.pptx
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PPTX
O2C Customer Invoices to Receipt V15A.pptx
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
Module 1.ppt Iot fundamentals and Architecture
Tartificialntelligence_presentation.pptx
DP Operators-handbook-extract for the Mautical Institute
gpt5_lecture_notes_comprehensive_20250812015547.pdf
STKI Israel Market Study 2025 version august
Zenith AI: Advanced Artificial Intelligence
A comparative study of natural language inference in Swahili using monolingua...
A contest of sentiment analysis: k-nearest neighbor versus neural network
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
OMC Textile Division Presentation 2021.pptx
Programs and apps: productivity, graphics, security and other tools
cloud_computing_Infrastucture_as_cloud_p
TLE Review Electricity (Electricity).pptx
observCloud-Native Containerability and monitoring.pptx
NewMind AI Weekly Chronicles - August'25-Week II
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
O2C Customer Invoices to Receipt V15A.pptx
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf

AZ900-AzureFundamentals-part-8.pdf

  • 1. Availability and Durability Availability and Durability Availability Will I be able to access my data now and when I need it? Percentage of time an application provides the operations expected of it Durability Will my data be available a er 10 or 100 or 1000 years? Examples of measuring availability and durability: 4 9's - 99.99 11 9's - 99.999999999 Typically, an availability of four 9's is considered very good Typically, a durability of eleven 9's is considered very good 69
  • 2. Availability Availability Availability Downtime (in a month) Comment 99.95% 22 minutes 99.99% (4 9's) 4 and 1/2 minutes Typically online apps aim for 99.99% (4 9's) availability 99.999% (5 9's) 26 seconds Achieving 5 9's availability is tough 70
  • 3. What does a durability of 11 9's mean? If you store one million files for ten million years, you would expect to lose one file Why should durability be high? Because we hate losing data Once we lose data, it is gone Durability Durability 71
  • 4. Increasing Availability: Have multiple standbys available OR distribute the database in multiple Zones in multiple Regions Increasing Durability: Multiple copies of data (standbys, snapshots, transaction logs and replicas) in multiple Zones in multiple Regions Replicating data comes with its own challenges! We will talk about them a little later Increasing Availability and Durability of Databases Increasing Availability and Durability of Databases 72
  • 5. Imagine a financial transaction being lost Imagine a trade being lost Imagine a stock exchange going down for an hour Typically businesses are fine with some downtime but they hate losing data Availability and Durability are technical measures How do we measure how quickly we can recover from failure? RPO (Recovery Point Objective): Maximum acceptable period of data loss RTO (Recovery Time Objective): Maximum acceptable downtime Achieving minimum RTO and RPO is expensive Trade-off based on the criticality of the data Database Terminology : RTO and RPO Database Terminology : RTO and RPO 73
  • 6. Question - RTO and RPO Question - RTO and RPO You are running an application in VM instance storing its data on a persistent data storage. You are taking snapshots every 48 hours. If the VM instance crashes, you can manually bring it back up in 45 minutes from the snapshot. What is your RTO and RPO? RTO - 45 minutes RPO - 48 hours 74
  • 7. Achieving RTO and RPO - Failover Examples Achieving RTO and RPO - Failover Examples Scenario Solution Very small data loss (RPO - 1 minute) Very small downtime (RTO - 5 minutes) Hot standby - Automatically synchronize data Have a standby ready to pick up load Use automatic failover from master to standby Very small data loss (RPO - 1 minute) BUT I can tolerate some downtimes (RTO - 15 minutes) Warm standby - Automatically synchronize data Have a standby with minimum infrastructure Scale it up when a failure happens Data is critical (RPO - 1 minute) but I can tolerate downtime of a few hours (RTO - few hours) Create regular data snapshots and transaction logs Create database from snapshots and transactions logs when a failure happens Data can be lost without a problem (for example: cached data) Failover to a completely new server 75
  • 8. New reporting and analytics applications are being launched using the same database These applications will ONLY read data Within a few days you see that the database performance is impacted How can we fix the problem? Vertically scale the database - increase CPU and memory Create a database cluster (Distribute the database) - Typically database clusters are expensive to setup Create read replicas - Run read only applications against read replicas (New Scenario) Reporting and Analytics Applications (New Scenario) Reporting and Analytics Applications 76
  • 9. How do you ensure that data in multiple database instances (standbys and replicas) is updated simultaneously? Strong consistency - Synchronous replication to all replicas Will be slow if you have multiple replicas or standbys Eventual consistency - Asynchronous replication. A little lag - few seconds - before the change is available in all replicas In the intermediate period, different replicas might return different values Used when scalability is more important than data integrity Examples : Social Media Posts - Facebook status messages, Twitter tweets, Linked in posts etc Read-a er-Write consistency - Inserts are immediately available However, updates would have eventual consistency Consistency Consistency 77
  • 10. There are several categories of databases: Relational (OLTP and OLAP), Document, Key Value, Graph, In Memory among others Choosing type of database for your use case is not easy. A few factors: Do you want a fixed schema? Do you want flexibility in defining and changing your schema? (schemaless) What level of transaction properties do you need? (atomicity and consistency) What kind of latency do you want? (seconds, milliseconds or microseconds) How many transactions do you expect? (hundreds or thousands or millions of transactions per second) How much data will be stored? (MBs or GBs or TBs or PBs) and a lot more... Database Categories Database Categories 78