SlideShare a Scribd company logo
High Availability for
Postgres
Presented by:
Matt Lewandowski, Field CTO
Steve Foley, VP of Public Sector
14 October 2020
• Slides and recording will be available in next 48 hours
• Submit questions via chat window – will be answering at end
• We will be sharing info about EDB and Postgres later
Welcome – Housekeeping Items
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.3
Agenda
1. Concepts of High Availability
2. RPO, RTO and Uptime in High Availability
3. How does High Availability work?
4. High Availability for Postgres using
• Streaming Replication
• Logical Replication
5. Postgres parameters for High Availability (Streaming
Replication)
6. EDB tools for High Availability management and
monitoring
High Availability
Concepts
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.5
High Availability
High availability (HA) is a characteristic of a system, which aims to ensure an agreed level of
operational performance, usually uptime, for a higher than normal period.
Key principles:
• Eliminate single point of failure
• Reliable crossover
• Detection of failures
Ref: https://en.wikipedia.org/wiki/High_availability
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.6
Scheduled/Unscheduled downtime
• Scheduled/planned downtime is a result of maintenance that is disruptive to system
operation and usually cannot be avoided with a currently installed system design.
• It include patches to system software that require a reboot or system configuration
changes that only take effect upon a reboot.
• Unscheduled/Unplanned downtime is the result of downtime events due to some
physical failures/events, such as hardware or software failure or environmental anomaly.
• For example, power outages, failed CPU or RAM components (or possibly other
hardware components failure), network failure, security breaches, or various
applications, middleware, and operating system failures result in Unplanned
outage/Unscheduled downtime.
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.7
Availability calculation
Calculated/expressed as a percentage of uptime in a given year based on the service level
agreements. Some companies exclude the planned outage/scheduled downtime based on
their agreements with customers on the availability of their services.
Availability %
Downtime per
year
Downtime per
month
Downtime per
week
Downtime per
day
99.99% ("four nines") 52.60 minutes 4.38 minutes 1.01 minutes 8.64 seconds
99.995% ("four and a half
nines") 26.30 minutes 2.19 minutes 30.24 seconds 4.32 seconds
99.999% ("five nines") 5.26 minutes 26.30 seconds 6.05 seconds
864.00
milliseconds
RPO/RTO/MTTR/GR
O
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.9
Recovery Point Objective (RPO)
RPO is a measurement of time from the failure, disaster or comparable loss-causing event.
RPO can be used to measure:
• How far back must go, stretching back in time from the disaster to the last point where
data is in a usable format
• How frequently you need to back-up your data, although an RPO doesn’t represent
additional needs like restore time and recovery time.
• How much data is lost following a disaster or loss-causing event
• Ex: RPO = 2 hours
* In case of a crash I may forget everything that I did in the last 2 hours!
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.10
Recovery Time Objective (RTO)
The amount of time an application can be down and not result in significant damage to a
business and the time that it takes for the system to go from loss to recovery
Recovery process includes
• The steps that IT must take to return the application
• And its data to its pre-disaster state.
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.11
RPO vs. RTO
RPOs and RTOs are key concepts for maintaining business continuity and function as
business metrics for calculating how often your business needs to perform data backups.
• RTOs coincide with recovery point objectives (RPOs), a measurement of time from the
failure, disaster or similar loss-causing event.
• RPOs calculate back in time to when your data was last usable, probably the most
recent backup.
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.12
Mean Time To Recover (MTTR)
The average time that a device will take to recover from any failure. systems which have to be
repaired or replaced.
• Examples of such devices range from self-resetting fuses (where the MTTR would be
very short, probably seconds), up to whole systems which have to be repaired or
replaced.
• Usually part of a maintenance contract, where the user would pay more for a system
MTTR of which was 24 hours, than for one of, say, 7 days
• Does not mean the supplier is guaranteeing to have the system up and running again
within 24 hours (or 7 days) of being notified of the failure.
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.13
Geography Recovery Objectives (GRO)
If datacenter becomes unavailable, how long it takes for the service to become available
again.
• It covers RPO/RTO for making services available across the geography.
High Availability
For Postgres
Eliminate Single
Point of Failure
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.16
Eliminate Single Point of failure
• WAL shipping based replication
• Replication based on the archived WAL
• Streaming replication (SR)
• Streaming WAL files to one or more standbys
• Logical replication
• Streaming logical data modifications from the WAL.
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.17
Eliminate Single Point of failure
• Identical to primary system
• Data is still mirrored in real time
• Allows READ
• On failure, can replace primary
• Approaches
• WAL shipping based
• Streaming WAL (widely used after 9.0)
Hot Standby
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.18
Eliminate Single Point of failure
Hot Standby: WAL shipping
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.19
Eliminate Single Point of failure
Monitor: WAL shipping
• Functions on standby
• pg_is_in_recovery()
• pg_last_xlog/wal_replay_location/lsn()
• pg_last_xact_replay_timestamp()
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.20
Eliminate Single Point of failure
Hot Standby: Streaming Replication
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.21
Eliminate Single Point of failure
Streaming Replication
• Asynchronous Streaming Replication
• Synchronous Streaming Replication
• synchronous_standby_names
E.g.
• FIRST 1 (standby_east, standby_west)
• ANY 3 (standby_east, standby_west, eu_standby_east, eu_standby_west)
• 'standby_east, standby_west’
• synchronous_commit
• off/local/remote_write/on/remote_apply
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.22
Eliminate Single Point of failure
Monitor: Streaming Replication
• Views
• Master: pg_stat_replication
• Standby: pg_wal_receiver
Reliable
CrossOver &
Detection
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.24
Reliable Crossover & Detection
• In a redundant system, the crossover point itself becomes a single point of failure.
• Fault-tolerant systems must provide a reliable crossover or automatic switchover
mechanism to avoid failure.
• Detection of failures:
• If the above two principles are proactively monitored, then a user may never see a
system failure.
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.25
Reliable Crossover & Detection
EDB Postgres Failover Manager:
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.26
Reliable Crossover & Detection
EDB Postgres Failover Manager:
RPO/RTO/MTT
R/GPO
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.28
RPO/RTO/MTTR/GPO
Backup And Recovery Tool
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.29
RPO/RTO/MTTR/GPO
Backup And Recovery Tool
High
Availability
Monitoring
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.31
High Availability Monitoring
Postgres Enterprise Manager
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.32
High Availability Monitoring
Postgres Enterprise Manager
Maintenance
Window/
Planned
Downtime
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.34
Maintenance Window/Planned Downtime
Software Updates/Patching
• Three reasons for software updates
• Remedy known software issues
• General stability and reliability of the software
• Security problem
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.35
Maintenance Window/Planned Downtime
Software Updates: Strategies
• Three strategies
• All Nodes Patching
• Rolling Patching
• Minimum Downtime Patching
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.36
Conclusion
• High Availability components
• Hot Standby (Streaming Replication)
• EDB Postgres Failover Manager
• Postgres Enterprise Manager
• Backup And Recovery Tool
• Design consideration
• Near zero downtime software maintenance
• RPO/RTO/GRO
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.37
Resources
• Blog series
• What Does High Availability Really Mean
• Patching Minor Version in Postgres High Availability (HA) Database Cluster
Plans & Strategies for DBAs
• Key Parameters and Configuration for Streaming Replication in Postgres 12
• Quick and Reliable Failure Detection with EDB Postgres Failover Manager
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.38
Market Success | Public Sector focus
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.39
Core team Major contributors Contributors
EDB Open Source Leadership
Named EDB open source committers and contributors
Akshay Joshi Amul Sul Ashesh Vashi Ashutosh Sharma Jeevan Chalke
Dilip Kumar Jeevan Ladhe Mithun Cy Rushabh Lathia Amit Khandekar
Amit Langote Devrim
Gündüz
Robert
Haas
Bruce Momjian
Dave Page
Designates PostgreSQL committers
© Copyright EnterpriseDB Corporation, 2020. All rights reserved.40
Q&A
Other resources
Thank You
Postgres Pulse EDB Youtube Channel

More Related Content

PPTX
Overcoming write availability challenges of PostgreSQL
 
PPTX
PostgreSQL as a Strategic Tool
 
PPTX
Automating a PostgreSQL High Availability Architecture with Ansible
 
PDF
Best Practices in Security with PostgreSQL
 
PPTX
Beginner's Guide to High Availability for Postgres
 
PPTX
Beginners Guide to High Availability for Postgres
 
PDF
Making your PostgreSQL Database Highly Available
 
PPTX
An overview of reference architectures for Postgres
 
Overcoming write availability challenges of PostgreSQL
 
PostgreSQL as a Strategic Tool
 
Automating a PostgreSQL High Availability Architecture with Ansible
 
Best Practices in Security with PostgreSQL
 
Beginner's Guide to High Availability for Postgres
 
Beginners Guide to High Availability for Postgres
 
Making your PostgreSQL Database Highly Available
 
An overview of reference architectures for Postgres
 

What's hot (20)

PPTX
An overview of reference architectures for Postgres
 
PPTX
Expert Guide to Migrating Legacy Databases to Postgres
 
PPTX
How to Design for Database High Availability
 
PDF
Best Practices & Lessons Learned from Deployment of PostgreSQL
 
PPTX
New enhancements for security and usability in EDB 13
 
PDF
Beginner's Guide to High Availability for Postgres - French
 
PDF
Introducing Data Redaction - an enabler to data security in EDB Postgres Adva...
 
PPTX
OLTP+OLAP=HTAP
 
PDF
Beginner's Guide to High Availability for Postgres
 
PPTX
New Integration Options with Postgres Enterprise Manager 8.0
 
PPTX
Database Dumps and Backups
 
PPTX
An Expert Guide to Migrating Legacy Databases to PostgreSQL
 
PDF
Remote DBA Service: Powering your DBA needs
 
PPTX
How to use postgresql.conf to configure and tune the PostgreSQL server
 
PPTX
Migration DB2 to EDB - Project Experience
 
PPTX
PostgreSQL to Accelerate Innovation
 
PDF
EDB & ELOS Technologies - Break Free from Oracle
 
PDF
Using PEM to understand and improve performance in Postgres: Postgres Tuning ...
 
PPTX
PostgreSQL as a Strategic Tool
 
PPTX
Not all open source is the same
 
An overview of reference architectures for Postgres
 
Expert Guide to Migrating Legacy Databases to Postgres
 
How to Design for Database High Availability
 
Best Practices & Lessons Learned from Deployment of PostgreSQL
 
New enhancements for security and usability in EDB 13
 
Beginner's Guide to High Availability for Postgres - French
 
Introducing Data Redaction - an enabler to data security in EDB Postgres Adva...
 
OLTP+OLAP=HTAP
 
Beginner's Guide to High Availability for Postgres
 
New Integration Options with Postgres Enterprise Manager 8.0
 
Database Dumps and Backups
 
An Expert Guide to Migrating Legacy Databases to PostgreSQL
 
Remote DBA Service: Powering your DBA needs
 
How to use postgresql.conf to configure and tune the PostgreSQL server
 
Migration DB2 to EDB - Project Experience
 
PostgreSQL to Accelerate Innovation
 
EDB & ELOS Technologies - Break Free from Oracle
 
Using PEM to understand and improve performance in Postgres: Postgres Tuning ...
 
PostgreSQL as a Strategic Tool
 
Not all open source is the same
 
Ad

Similar to Public Sector Virtual Town Hall: High Availability for PostgreSQL (20)

PDF
From Disaster to Recovery: Preparing Your IT for the Unexpected
PPTX
Times ten 18.1_overview_meetup
PPTX
Hive Performance Dataworks Summit Melbourne February 2019
PDF
Fast SQL on Hadoop, Really?
PDF
times ten in-memory database for extreme performance
PPTX
Availability conceptin operating system.
PPTX
Open Sourcing GemFire - Apache Geode
PPTX
An Introduction to Apache Geode (incubating)
PPTX
CS_10_DR_CFD
PPTX
Gartner pace and bi-modal models
PDF
Interconnect session 3498: Deployment Topologies for Jazz Reporting Service
PDF
NVMe and Flash – Make Your Storage Great Again!
PPTX
Designing a Modern Disaster Recovery Environment
PPTX
eFolder Partner Chat Webinar — Spring Cleaning: Getting Your Clients to Ditch...
PPTX
Designing a Modern Disaster Recovery Environment
PDF
How to Integrate Hyperconverged Systems with Existing SANs
PDF
Zerto for dr migration to cloud overview
PPTX
093049ov10.pptx
PDF
ProfitBricks-white-paper-Disaster-Recovery-US
PDF
Oracle databáze - zkonsolidovat, ochránit a ještě ušetřit! (2. část)
From Disaster to Recovery: Preparing Your IT for the Unexpected
Times ten 18.1_overview_meetup
Hive Performance Dataworks Summit Melbourne February 2019
Fast SQL on Hadoop, Really?
times ten in-memory database for extreme performance
Availability conceptin operating system.
Open Sourcing GemFire - Apache Geode
An Introduction to Apache Geode (incubating)
CS_10_DR_CFD
Gartner pace and bi-modal models
Interconnect session 3498: Deployment Topologies for Jazz Reporting Service
NVMe and Flash – Make Your Storage Great Again!
Designing a Modern Disaster Recovery Environment
eFolder Partner Chat Webinar — Spring Cleaning: Getting Your Clients to Ditch...
Designing a Modern Disaster Recovery Environment
How to Integrate Hyperconverged Systems with Existing SANs
Zerto for dr migration to cloud overview
093049ov10.pptx
ProfitBricks-white-paper-Disaster-Recovery-US
Oracle databáze - zkonsolidovat, ochránit a ještě ušetřit! (2. část)
Ad

More from EDB (20)

PDF
Cloud Migration Paths: Kubernetes, IaaS, or DBaaS
 
PDF
Die 10 besten PostgreSQL-Replikationsstrategien für Ihr Unternehmen
 
PDF
Migre sus bases de datos Oracle a la nube
 
PDF
EFM Office Hours - APJ - July 29, 2021
 
PDF
Benchmarking Cloud Native PostgreSQL
 
PDF
Las Variaciones de la Replicación de PostgreSQL
 
PDF
NoSQL and Spatial Database Capabilities using PostgreSQL
 
PDF
Is There Anything PgBouncer Can’t Do?
 
PDF
Data Analysis with TensorFlow in PostgreSQL
 
PDF
Practical Partitioning in Production with Postgres
 
PDF
A Deeper Dive into EXPLAIN
 
PDF
IOT with PostgreSQL
 
PDF
A Journey from Oracle to PostgreSQL
 
PDF
Psql is awesome!
 
PDF
EDB 13 - New Enhancements for Security and Usability - APJ
 
PPTX
Comment sauvegarder correctement vos données
 
PDF
Cloud Native PostgreSQL - Italiano
 
PDF
New enhancements for security and usability in EDB 13
 
PPTX
Best Practices in Security with PostgreSQL
 
PDF
Cloud Native PostgreSQL - APJ
 
Cloud Migration Paths: Kubernetes, IaaS, or DBaaS
 
Die 10 besten PostgreSQL-Replikationsstrategien für Ihr Unternehmen
 
Migre sus bases de datos Oracle a la nube
 
EFM Office Hours - APJ - July 29, 2021
 
Benchmarking Cloud Native PostgreSQL
 
Las Variaciones de la Replicación de PostgreSQL
 
NoSQL and Spatial Database Capabilities using PostgreSQL
 
Is There Anything PgBouncer Can’t Do?
 
Data Analysis with TensorFlow in PostgreSQL
 
Practical Partitioning in Production with Postgres
 
A Deeper Dive into EXPLAIN
 
IOT with PostgreSQL
 
A Journey from Oracle to PostgreSQL
 
Psql is awesome!
 
EDB 13 - New Enhancements for Security and Usability - APJ
 
Comment sauvegarder correctement vos données
 
Cloud Native PostgreSQL - Italiano
 
New enhancements for security and usability in EDB 13
 
Best Practices in Security with PostgreSQL
 
Cloud Native PostgreSQL - APJ
 

Recently uploaded (20)

PPTX
Cloud computing and distributed systems.
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Machine learning based COVID-19 study performance prediction
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Encapsulation theory and applications.pdf
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
cuic standard and advanced reporting.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
A Presentation on Artificial Intelligence
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
Cloud computing and distributed systems.
Spectral efficient network and resource selection model in 5G networks
Machine learning based COVID-19 study performance prediction
Reach Out and Touch Someone: Haptics and Empathic Computing
NewMind AI Weekly Chronicles - August'25 Week I
Encapsulation theory and applications.pdf
Chapter 3 Spatial Domain Image Processing.pdf
cuic standard and advanced reporting.pdf
20250228 LYD VKU AI Blended-Learning.pptx
NewMind AI Monthly Chronicles - July 2025
Per capita expenditure prediction using model stacking based on satellite ima...
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Advanced methodologies resolving dimensionality complications for autism neur...
Understanding_Digital_Forensics_Presentation.pptx
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
The AUB Centre for AI in Media Proposal.docx
A Presentation on Artificial Intelligence
Review of recent advances in non-invasive hemoglobin estimation
Diabetes mellitus diagnosis method based random forest with bat algorithm

Public Sector Virtual Town Hall: High Availability for PostgreSQL

  • 1. High Availability for Postgres Presented by: Matt Lewandowski, Field CTO Steve Foley, VP of Public Sector 14 October 2020
  • 2. • Slides and recording will be available in next 48 hours • Submit questions via chat window – will be answering at end • We will be sharing info about EDB and Postgres later Welcome – Housekeeping Items
  • 3. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.3 Agenda 1. Concepts of High Availability 2. RPO, RTO and Uptime in High Availability 3. How does High Availability work? 4. High Availability for Postgres using • Streaming Replication • Logical Replication 5. Postgres parameters for High Availability (Streaming Replication) 6. EDB tools for High Availability management and monitoring
  • 5. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.5 High Availability High availability (HA) is a characteristic of a system, which aims to ensure an agreed level of operational performance, usually uptime, for a higher than normal period. Key principles: • Eliminate single point of failure • Reliable crossover • Detection of failures Ref: https://en.wikipedia.org/wiki/High_availability
  • 6. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.6 Scheduled/Unscheduled downtime • Scheduled/planned downtime is a result of maintenance that is disruptive to system operation and usually cannot be avoided with a currently installed system design. • It include patches to system software that require a reboot or system configuration changes that only take effect upon a reboot. • Unscheduled/Unplanned downtime is the result of downtime events due to some physical failures/events, such as hardware or software failure or environmental anomaly. • For example, power outages, failed CPU or RAM components (or possibly other hardware components failure), network failure, security breaches, or various applications, middleware, and operating system failures result in Unplanned outage/Unscheduled downtime.
  • 7. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.7 Availability calculation Calculated/expressed as a percentage of uptime in a given year based on the service level agreements. Some companies exclude the planned outage/scheduled downtime based on their agreements with customers on the availability of their services. Availability % Downtime per year Downtime per month Downtime per week Downtime per day 99.99% ("four nines") 52.60 minutes 4.38 minutes 1.01 minutes 8.64 seconds 99.995% ("four and a half nines") 26.30 minutes 2.19 minutes 30.24 seconds 4.32 seconds 99.999% ("five nines") 5.26 minutes 26.30 seconds 6.05 seconds 864.00 milliseconds
  • 9. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.9 Recovery Point Objective (RPO) RPO is a measurement of time from the failure, disaster or comparable loss-causing event. RPO can be used to measure: • How far back must go, stretching back in time from the disaster to the last point where data is in a usable format • How frequently you need to back-up your data, although an RPO doesn’t represent additional needs like restore time and recovery time. • How much data is lost following a disaster or loss-causing event • Ex: RPO = 2 hours * In case of a crash I may forget everything that I did in the last 2 hours!
  • 10. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.10 Recovery Time Objective (RTO) The amount of time an application can be down and not result in significant damage to a business and the time that it takes for the system to go from loss to recovery Recovery process includes • The steps that IT must take to return the application • And its data to its pre-disaster state.
  • 11. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.11 RPO vs. RTO RPOs and RTOs are key concepts for maintaining business continuity and function as business metrics for calculating how often your business needs to perform data backups. • RTOs coincide with recovery point objectives (RPOs), a measurement of time from the failure, disaster or similar loss-causing event. • RPOs calculate back in time to when your data was last usable, probably the most recent backup.
  • 12. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.12 Mean Time To Recover (MTTR) The average time that a device will take to recover from any failure. systems which have to be repaired or replaced. • Examples of such devices range from self-resetting fuses (where the MTTR would be very short, probably seconds), up to whole systems which have to be repaired or replaced. • Usually part of a maintenance contract, where the user would pay more for a system MTTR of which was 24 hours, than for one of, say, 7 days • Does not mean the supplier is guaranteeing to have the system up and running again within 24 hours (or 7 days) of being notified of the failure.
  • 13. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.13 Geography Recovery Objectives (GRO) If datacenter becomes unavailable, how long it takes for the service to become available again. • It covers RPO/RTO for making services available across the geography.
  • 16. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.16 Eliminate Single Point of failure • WAL shipping based replication • Replication based on the archived WAL • Streaming replication (SR) • Streaming WAL files to one or more standbys • Logical replication • Streaming logical data modifications from the WAL.
  • 17. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.17 Eliminate Single Point of failure • Identical to primary system • Data is still mirrored in real time • Allows READ • On failure, can replace primary • Approaches • WAL shipping based • Streaming WAL (widely used after 9.0) Hot Standby
  • 18. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.18 Eliminate Single Point of failure Hot Standby: WAL shipping
  • 19. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.19 Eliminate Single Point of failure Monitor: WAL shipping • Functions on standby • pg_is_in_recovery() • pg_last_xlog/wal_replay_location/lsn() • pg_last_xact_replay_timestamp()
  • 20. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.20 Eliminate Single Point of failure Hot Standby: Streaming Replication
  • 21. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.21 Eliminate Single Point of failure Streaming Replication • Asynchronous Streaming Replication • Synchronous Streaming Replication • synchronous_standby_names E.g. • FIRST 1 (standby_east, standby_west) • ANY 3 (standby_east, standby_west, eu_standby_east, eu_standby_west) • 'standby_east, standby_west’ • synchronous_commit • off/local/remote_write/on/remote_apply
  • 22. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.22 Eliminate Single Point of failure Monitor: Streaming Replication • Views • Master: pg_stat_replication • Standby: pg_wal_receiver
  • 24. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.24 Reliable Crossover & Detection • In a redundant system, the crossover point itself becomes a single point of failure. • Fault-tolerant systems must provide a reliable crossover or automatic switchover mechanism to avoid failure. • Detection of failures: • If the above two principles are proactively monitored, then a user may never see a system failure.
  • 25. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.25 Reliable Crossover & Detection EDB Postgres Failover Manager:
  • 26. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.26 Reliable Crossover & Detection EDB Postgres Failover Manager:
  • 28. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.28 RPO/RTO/MTTR/GPO Backup And Recovery Tool
  • 29. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.29 RPO/RTO/MTTR/GPO Backup And Recovery Tool
  • 31. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.31 High Availability Monitoring Postgres Enterprise Manager
  • 32. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.32 High Availability Monitoring Postgres Enterprise Manager
  • 34. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.34 Maintenance Window/Planned Downtime Software Updates/Patching • Three reasons for software updates • Remedy known software issues • General stability and reliability of the software • Security problem
  • 35. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.35 Maintenance Window/Planned Downtime Software Updates: Strategies • Three strategies • All Nodes Patching • Rolling Patching • Minimum Downtime Patching
  • 36. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.36 Conclusion • High Availability components • Hot Standby (Streaming Replication) • EDB Postgres Failover Manager • Postgres Enterprise Manager • Backup And Recovery Tool • Design consideration • Near zero downtime software maintenance • RPO/RTO/GRO
  • 37. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.37 Resources • Blog series • What Does High Availability Really Mean • Patching Minor Version in Postgres High Availability (HA) Database Cluster Plans & Strategies for DBAs • Key Parameters and Configuration for Streaming Replication in Postgres 12 • Quick and Reliable Failure Detection with EDB Postgres Failover Manager
  • 38. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.38 Market Success | Public Sector focus
  • 39. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.39 Core team Major contributors Contributors EDB Open Source Leadership Named EDB open source committers and contributors Akshay Joshi Amul Sul Ashesh Vashi Ashutosh Sharma Jeevan Chalke Dilip Kumar Jeevan Ladhe Mithun Cy Rushabh Lathia Amit Khandekar Amit Langote Devrim Gündüz Robert Haas Bruce Momjian Dave Page Designates PostgreSQL committers
  • 40. © Copyright EnterpriseDB Corporation, 2020. All rights reserved.40 Q&A Other resources Thank You Postgres Pulse EDB Youtube Channel