SlideShare a Scribd company logo
Scaling Database
Infrastructure @PayPal
Agenda
©2020 PayPal Inc. Confidential and proprietary. 2
1. Introduction
2. PayPal’s Scale
3. Scaling challenges
4. Scaling Methodology
5. Horizontal Scaling
6. Vertical Scaling
7. Q & A
About me
• Database Engineer at PayPal for 8+ Years
• Working on ORACLE Technologies for more than a Decade and ORACLE Certified
professional
• www.linkedin.com/in/pramodkgarre
©2020 PayPal Inc. Confidential and proprietary. 3
We are a trusted part of people’s financial lives and a partner to merchants in
200+ markets around the world
Two decades ago, our founders invented payment technology to make buying and selling
faster, secure, and easier; and put economic power where it belongs: In the hands of people
About PayPal
Our 300+ Million consumers can accept payments in > 100
currencies and interact with 20M+ Merchants across 19K+
corridors
Almost 8000 PayPal team members provide
support to our customers in over 20 languages
750+
ORACLE Instances
Database Infrastructure & Storage Footprint
5M+
Execs/Sec
32% Y-o-Y
DB Storage Growth
93 PB
Total DB Storage
740 700
2000
308
154 67
Oracle
MySQL
Aerospike
Couchbase
Cassandra
Mongo
84
8
26
Block File
Object Das
9
Host Count by Database Type
Storage Footprint (PB) by Type
(Utilization)
Scaling challenges
• Hardware Limits
• CPU
• Memory
• IOPs
• Network
• Interconnect
• Software Limitations at Scale
• Concurrent waits- Enqueues
(Table/Index/Sequences/LOBs)
• REDO – LGWR contention
• SGA contention – latches/Mutex waits
We are only as strong as the slowest component of the system
©2020 PayPal Inc. Confidential and proprietary. 6
Global Cache
SGA SGA
Shared
Storage
SGA
CPUCPU CPU
Tuning .. Scaling Methodology
Replace Tune with Scale, While the business demands change rest of the approach
is still relevant.
• Data/Application Design
• Right Data Normalization, choosing the right Datastore
• Application layer caching , pagination and connection pooling etc.
• Logical Structure of the Database
• Address object-level Bottlenecks - Divide and Conquer
• System Tuning
• Scale up
• Add more Power ( CPU, Memory, Faster Disks, Storage Cache etc. )
• Scale out
• Add more instances – Replicas, Shards, split by Domains and A/A
©2020 PayPal Inc. Confidential and proprietary.
Snippet from ORACLE 7
Document
Data/Application Design to Scale
Application Layer Design considerations
• Application level caching
• Pagination of results
• Optimal SQLs
• Intelligent Mid-Tier
• Persistent connections and Multiplexing
• Slow SQL eviction
• SQL caching/routing
Data Layer design considerations
• Right level of normalization and Design
• Avoid “hot spots”
• Design considerations based on the type of Data.
Ex: Master Data Vs Transactional Data
©2020 PayPal Inc. Confidential and proprietary.
Best way to utilize resources is not to utilize them
DatabaseApplication CacheHit
Miss
Application Caching
Scaling Database logical structures
-Divide and Conquer with in the Database
• Table/LOB Contention
At ~20k inserts/sec, table/lob can get into contention – “enq :HW
contention “
• Out-of-line Lob writes with similar size go behind the same latch
resource and causes contention.
• Partition the table and creating multiple entry point helps
Ex: Range Hash sub partition
• Number of sub-partitions and sub partitioning key was based on
studying read patterns.
• Range sub partitioning along with appropriate local/Global Hash
indexes alleviate Table contention
• Use secure file+ cache LOBs
“In computer science, divide and conquer is an algorithm design paradigm based on multi-branched recursion. A divide-and-conquer algorithm works by recursively breaking
down a problem into two or more sub-problems of the same or related type, until these become simple enough to be solved directly. The solutions to the sub-problems are then
combined to give a solution to the original problem.”
©2020 PayPal Inc. Confidential and proprietary. 9
To convert the heap table to new design chosen -
create a new table with chosen design -> Redirect reads to UNION ALL view of current and new table and Redirect writes to New table with instead of trigger on old table
Scaling Database logical structures
-Divide and Conquer
Scaling Indexes
Right-hand index contention– ”enq: TX - index contention”
• Partition the table and index to create multiple entry points
ex: Range-Hash partitions, Global Hash Indexes.
• Scatter the Index Key – Reverse-Key, Timed IDs etc.
Scaling Sequences
• CACHE, NO-ORDER Sequences in RAC
• Intelligent Mid Tier with Read-Write Split option
©2020 PayPal Inc. Confidential and proprietary. 10
Scaling Database logical structures
-Divide and Conquer
Scaling IOTs @ ~20k inserts per second
IOTS accelerate Primary Key based access. Scaling IOT writes is critical
• Reduce Write contention for same Index Blocks
• Choose appropriate partition/sub partition structure ( Range, Range Hash etc )
• Create the index on mod(Key) to scatter the records across multiple segments and blocks
©2020 PayPal Inc. Confidential and proprietary. 11
Scaling/Tuning Instance components
Scaling Interconnect traffic
Scale the Interconnect for optimal RAC performance
• Database Service isolation to avoid interconnect overwhelming
• Better Data design to avoid Multi-table join queries overwhelming Interconnect Message Bandwidth
• RDS on InfiniBand (up to 40 Gbps speeds ) to achieve ultra-low latency and throughput
• Critical Instance Background processes supporting Cache fusion to run in RT priority.
Scaling LGWR
LGWR is a single threaded process and often single most contention point in ORACLE
• Place REDO log files on faster Disks – flash, RAID 10 Disks
• Following application best practices like proper commit/sleep intervals
©2020 PayPal Inc. Confidential and proprietary. 12
Scale up and Scale out
©2020 PayPal Inc. Confidential and proprietary. 13
Scale UP
Add more resources -uplift to new powerful hardware that leverages the
latest technology and add more such nodes. Cost is the main consideration
• CPU
8 socket machines with 2.9GHz processors and up to 192 cores
• Volatile Memory
Up to 6 TB per node
• Non-Volatile Memory
Nvme flash, Nvme-SSD etc-Bandwidths of 120GB/Sec &50K IOPs
• Storage –High storage cache . 7200 RPM HDDs ,All flash storage
• InfiniBand up to 40GBPs, RoCE network Fabric up to 100GBPs
Scale out Patterns
Read Replicas/Optimized ROs
©2020 PayPal Inc. Confidential and proprietary. 14
ADG/GG
SOR RO1 ROn
R/W RO
Applications
SH0 SH1 SHn
R/W
Applications
R/W
USER
Finance
USER Finance
Monolithic
App
User App
Services
Finance
App
Services
AZ1 AZ2
AZ1 AZ2
A/A
Sharding
Domain Isolation Active/Active
Scale up involves cost and Scale out
enables elastic scaling
• Multi-AZ Read Replicas
• Sharding
• Domain Isolation
• Active-Active
Horizontal Scaling – Read Replicas
©2020 PayPal Inc. Confidential and proprietary. 15
• Scale Read Workloads by adding more replicas
• Optimized Ros ( GG Replicas) can also provide
High Availability by assuming Primary Role
• Avoid Writes on replicas with DB services
• Full copy on each replica
Horizontal Scaling - Sharding
©2018 PayPal Inc. Confidential and proprietary. 16
Sharding Rules
• Each table must have a shard key
column
• Shard Key Must be Unique across
the shards
• No cross Shard joins
• No cross shard writes
• Each SQL should have a shard key
• Tables may need to be
denormalized to support above
rules
• Each shard has a subset of Data
Horizontal scaling – Domain isolation
©2020 PayPal Inc. Confidential and proprietary. 17
• Isolate self-contained domains to different
physical database
• Logical isolation of tables and application users
followed by Physical separation
Horizontal scaling – Active/Active
©2020 PayPal Inc. Confidential and proprietary. 18
• Active/Active is mainly for multi-region high
availability But can also help in linear scaling
• Each Database has full copy of data
• Avoid mutations and collisions across the
Databases – Use UUIDs for Keys , Even/Odd
Sequences ,GG conflict resolution configurations
and application stickiness.
Reporting/Analytic workload offloading
• Changes from FSL enabled SORs are replicated to
Centralized data platform
• Near Real time replication with OGG and
Kafka/Micro batch processing
• Replication Scaling using @RANGE replicats,
Parallel extracts and parallel replicats
©2016 PayPal Inc. Confidential and proprietary. 19
What Next .. Exadata & Oracle 19c
© 2020 PayPal Inc. Confidential and proprietary.
Focusing more on business impacting innovations
Security & ComplianceReliability and Performance
• Integrated Hardware and Software
designed for Scale
• Unique software optimizations
• Highly scalable & fault tolerant
hardware
• Effective Data protection by eliminating
Database sprawl
• Standard Encryption @ Rest (TDE) ,
Performance optimization for TDE
• Standard configuration
• Automated and fast patching of all
components
Efficiency
• DB consolidation and Multi-tenancy
• Less day-to-day management, more
business focusing activities
• Autonomous DB compatibility
• With an infrastructure that’s
engineered to work together with
your Oracle Databases, Oracle
Exadata delivers far more power
with less hardware.
Optimized for Oracle Database
Q & A

More Related Content

PDF
Presentation big dataappliance-overview_oow_v3
PPTX
Hadoop World 2011: Unlocking the Value of Big Data with Oracle - Jean-Pierre ...
PDF
Deploying Full BI Platforms to Oracle Cloud
PPTX
Dataware house Introduction By Quontra Solutions
PPTX
Implementing the Business Catalog in the Modern Enterprise: Bridging Traditio...
PPTX
Oracle Big data at work
PDF
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
PDF
OBIEE12c and Embedded Essbase 12c - An Initial Look at Query Acceleration Use...
Presentation big dataappliance-overview_oow_v3
Hadoop World 2011: Unlocking the Value of Big Data with Oracle - Jean-Pierre ...
Deploying Full BI Platforms to Oracle Cloud
Dataware house Introduction By Quontra Solutions
Implementing the Business Catalog in the Modern Enterprise: Bridging Traditio...
Oracle Big data at work
Oracle NoSQL Database -- Big Data Bellevue Meetup - 02-18-15
OBIEE12c and Embedded Essbase 12c - An Initial Look at Query Acceleration Use...

What's hot (20)

PDF
A3 transforming data_management_in_the_cloud
PPTX
Build and Manage Hadoop & Oracle NoSQL DB Solutions- Impetus Webinar
PDF
Integrated Data Warehouse with Hadoop and Oracle Database
PDF
Database@Home - Data Driven : Loading, Indexing, and Searching with Text and ...
PPTX
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
PPTX
Whats new in Oracle Database 12c release 12.1.0.2
PPTX
Modern Data Warehousing with the Microsoft Analytics Platform System
PDF
Database@Home - Maps and Spatial Analyses: How to use them
PDF
Temporal Tables, Transparent Archiving in DB2 for z/OS and IDAA
PDF
Things Every Oracle DBA Needs to Know About the Hadoop Ecosystem 20170527
PDF
Machine Learning for z/OS
PDF
Deploying Full Oracle BI Platforms to Oracle Cloud - OOW2015
PDF
Meetup Oracle Database MAD_BCN: 1.2 Oracle Database 18c (autonomous database)
PPT
Choosing the Right Big Data Tools for the Job - A Polyglot Approach
PDF
MOUG17 Keynote: Oracle OpenWorld Major Announcements
PDF
Deploying OBIEE in the Cloud - Oracle Openworld 2014
PPTX
SQL Server on Linux - march 2017
PDF
Introducing Kudu, Big Data Warehousing Meetup
PPTX
Db2 analytics accelerator on ibm integrated analytics system technical over...
PDF
Database Cloud Services Office Hours : Oracle sharding hyperscale globally d...
A3 transforming data_management_in_the_cloud
Build and Manage Hadoop & Oracle NoSQL DB Solutions- Impetus Webinar
Integrated Data Warehouse with Hadoop and Oracle Database
Database@Home - Data Driven : Loading, Indexing, and Searching with Text and ...
Leveraging NoSQL Database Technology to Implement Real-time Data Architecture...
Whats new in Oracle Database 12c release 12.1.0.2
Modern Data Warehousing with the Microsoft Analytics Platform System
Database@Home - Maps and Spatial Analyses: How to use them
Temporal Tables, Transparent Archiving in DB2 for z/OS and IDAA
Things Every Oracle DBA Needs to Know About the Hadoop Ecosystem 20170527
Machine Learning for z/OS
Deploying Full Oracle BI Platforms to Oracle Cloud - OOW2015
Meetup Oracle Database MAD_BCN: 1.2 Oracle Database 18c (autonomous database)
Choosing the Right Big Data Tools for the Job - A Polyglot Approach
MOUG17 Keynote: Oracle OpenWorld Major Announcements
Deploying OBIEE in the Cloud - Oracle Openworld 2014
SQL Server on Linux - march 2017
Introducing Kudu, Big Data Warehousing Meetup
Db2 analytics accelerator on ibm integrated analytics system technical over...
Database Cloud Services Office Hours : Oracle sharding hyperscale globally d...
Ad

Similar to Scaling db infra_pay_pal (20)

PPTX
Oracle big data appliance and solutions
PDF
Oracle databáze – Konsolidovaná Data Management Platforma
PPTX
Deploy Apache Spark™ on Rackspace OnMetal™ for Cloud Big Data Platform
PPT
Kb 40 kevin_klineukug_reading20070717[1]
PDF
Novinky v Oracle Database 18c
PDF
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
PDF
Running Production CDC Ingestion Pipelines With Balaji Varadarajan and Pritam...
PDF
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
PDF
Oracle DB In-Memory technologie v kombinaci s procesorem M7
PPTX
Introducing Apache Kudu (Incubating) - Montreal HUG May 2016
PDF
What's New in Apache Hive 3.0?
PDF
What's New in Apache Hive 3.0 - Tokyo
PDF
Webinar future dataintegration-datamesh-and-goldengatekafka
PDF
Oracle Database 19c - poslední z rodiny 12.2 a co přináší nového
PPTX
Phases of Big Data Challenges @ Nokia
PPTX
The role of NoSQL in the Next Generation of Financial Informatics
PPTX
Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...
PPTX
Is OLAP Dead?: Can Next Gen Tools Take Over?
PDF
Jak konsolidovat Vaše databáze s využitím Cloud služeb?
PPTX
Big Data Analytics on the Cloud Oracle Applications AWS Redshift & Tableau
Oracle big data appliance and solutions
Oracle databáze – Konsolidovaná Data Management Platforma
Deploy Apache Spark™ on Rackspace OnMetal™ for Cloud Big Data Platform
Kb 40 kevin_klineukug_reading20070717[1]
Novinky v Oracle Database 18c
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
Running Production CDC Ingestion Pipelines With Balaji Varadarajan and Pritam...
ADV Slides: Platforming Your Data for Success – Databases, Hadoop, Managed Ha...
Oracle DB In-Memory technologie v kombinaci s procesorem M7
Introducing Apache Kudu (Incubating) - Montreal HUG May 2016
What's New in Apache Hive 3.0?
What's New in Apache Hive 3.0 - Tokyo
Webinar future dataintegration-datamesh-and-goldengatekafka
Oracle Database 19c - poslední z rodiny 12.2 a co přináší nového
Phases of Big Data Challenges @ Nokia
The role of NoSQL in the Next Generation of Financial Informatics
Building a Pluggable Analytics Stack with Cassandra (Jim Peregord, Element Co...
Is OLAP Dead?: Can Next Gen Tools Take Over?
Jak konsolidovat Vaše databáze s využitím Cloud služeb?
Big Data Analytics on the Cloud Oracle Applications AWS Redshift & Tableau
Ad

Recently uploaded (20)

PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PDF
Digital Logic Computer Design lecture notes
PPTX
Construction Project Organization Group 2.pptx
PPT
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPTX
web development for engineering and engineering
PPTX
additive manufacturing of ss316l using mig welding
PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PDF
PPT on Performance Review to get promotions
PPTX
OOP with Java - Java Introduction (Basics)
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
Internet of Things (IOT) - A guide to understanding
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
Embodied AI: Ushering in the Next Era of Intelligent Systems
Digital Logic Computer Design lecture notes
Construction Project Organization Group 2.pptx
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
web development for engineering and engineering
additive manufacturing of ss316l using mig welding
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
Foundation to blockchain - A guide to Blockchain Tech
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PPT on Performance Review to get promotions
OOP with Java - Java Introduction (Basics)
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
UNIT-1 - COAL BASED THERMAL POWER PLANTS
Internet of Things (IOT) - A guide to understanding
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf

Scaling db infra_pay_pal

  • 2. Agenda ©2020 PayPal Inc. Confidential and proprietary. 2 1. Introduction 2. PayPal’s Scale 3. Scaling challenges 4. Scaling Methodology 5. Horizontal Scaling 6. Vertical Scaling 7. Q & A
  • 3. About me • Database Engineer at PayPal for 8+ Years • Working on ORACLE Technologies for more than a Decade and ORACLE Certified professional • www.linkedin.com/in/pramodkgarre ©2020 PayPal Inc. Confidential and proprietary. 3
  • 4. We are a trusted part of people’s financial lives and a partner to merchants in 200+ markets around the world Two decades ago, our founders invented payment technology to make buying and selling faster, secure, and easier; and put economic power where it belongs: In the hands of people About PayPal Our 300+ Million consumers can accept payments in > 100 currencies and interact with 20M+ Merchants across 19K+ corridors Almost 8000 PayPal team members provide support to our customers in over 20 languages
  • 5. 750+ ORACLE Instances Database Infrastructure & Storage Footprint 5M+ Execs/Sec 32% Y-o-Y DB Storage Growth 93 PB Total DB Storage 740 700 2000 308 154 67 Oracle MySQL Aerospike Couchbase Cassandra Mongo 84 8 26 Block File Object Das 9 Host Count by Database Type Storage Footprint (PB) by Type (Utilization)
  • 6. Scaling challenges • Hardware Limits • CPU • Memory • IOPs • Network • Interconnect • Software Limitations at Scale • Concurrent waits- Enqueues (Table/Index/Sequences/LOBs) • REDO – LGWR contention • SGA contention – latches/Mutex waits We are only as strong as the slowest component of the system ©2020 PayPal Inc. Confidential and proprietary. 6 Global Cache SGA SGA Shared Storage SGA CPUCPU CPU
  • 7. Tuning .. Scaling Methodology Replace Tune with Scale, While the business demands change rest of the approach is still relevant. • Data/Application Design • Right Data Normalization, choosing the right Datastore • Application layer caching , pagination and connection pooling etc. • Logical Structure of the Database • Address object-level Bottlenecks - Divide and Conquer • System Tuning • Scale up • Add more Power ( CPU, Memory, Faster Disks, Storage Cache etc. ) • Scale out • Add more instances – Replicas, Shards, split by Domains and A/A ©2020 PayPal Inc. Confidential and proprietary. Snippet from ORACLE 7 Document
  • 8. Data/Application Design to Scale Application Layer Design considerations • Application level caching • Pagination of results • Optimal SQLs • Intelligent Mid-Tier • Persistent connections and Multiplexing • Slow SQL eviction • SQL caching/routing Data Layer design considerations • Right level of normalization and Design • Avoid “hot spots” • Design considerations based on the type of Data. Ex: Master Data Vs Transactional Data ©2020 PayPal Inc. Confidential and proprietary. Best way to utilize resources is not to utilize them DatabaseApplication CacheHit Miss Application Caching
  • 9. Scaling Database logical structures -Divide and Conquer with in the Database • Table/LOB Contention At ~20k inserts/sec, table/lob can get into contention – “enq :HW contention “ • Out-of-line Lob writes with similar size go behind the same latch resource and causes contention. • Partition the table and creating multiple entry point helps Ex: Range Hash sub partition • Number of sub-partitions and sub partitioning key was based on studying read patterns. • Range sub partitioning along with appropriate local/Global Hash indexes alleviate Table contention • Use secure file+ cache LOBs “In computer science, divide and conquer is an algorithm design paradigm based on multi-branched recursion. A divide-and-conquer algorithm works by recursively breaking down a problem into two or more sub-problems of the same or related type, until these become simple enough to be solved directly. The solutions to the sub-problems are then combined to give a solution to the original problem.” ©2020 PayPal Inc. Confidential and proprietary. 9 To convert the heap table to new design chosen - create a new table with chosen design -> Redirect reads to UNION ALL view of current and new table and Redirect writes to New table with instead of trigger on old table
  • 10. Scaling Database logical structures -Divide and Conquer Scaling Indexes Right-hand index contention– ”enq: TX - index contention” • Partition the table and index to create multiple entry points ex: Range-Hash partitions, Global Hash Indexes. • Scatter the Index Key – Reverse-Key, Timed IDs etc. Scaling Sequences • CACHE, NO-ORDER Sequences in RAC • Intelligent Mid Tier with Read-Write Split option ©2020 PayPal Inc. Confidential and proprietary. 10
  • 11. Scaling Database logical structures -Divide and Conquer Scaling IOTs @ ~20k inserts per second IOTS accelerate Primary Key based access. Scaling IOT writes is critical • Reduce Write contention for same Index Blocks • Choose appropriate partition/sub partition structure ( Range, Range Hash etc ) • Create the index on mod(Key) to scatter the records across multiple segments and blocks ©2020 PayPal Inc. Confidential and proprietary. 11
  • 12. Scaling/Tuning Instance components Scaling Interconnect traffic Scale the Interconnect for optimal RAC performance • Database Service isolation to avoid interconnect overwhelming • Better Data design to avoid Multi-table join queries overwhelming Interconnect Message Bandwidth • RDS on InfiniBand (up to 40 Gbps speeds ) to achieve ultra-low latency and throughput • Critical Instance Background processes supporting Cache fusion to run in RT priority. Scaling LGWR LGWR is a single threaded process and often single most contention point in ORACLE • Place REDO log files on faster Disks – flash, RAID 10 Disks • Following application best practices like proper commit/sleep intervals ©2020 PayPal Inc. Confidential and proprietary. 12
  • 13. Scale up and Scale out ©2020 PayPal Inc. Confidential and proprietary. 13 Scale UP Add more resources -uplift to new powerful hardware that leverages the latest technology and add more such nodes. Cost is the main consideration • CPU 8 socket machines with 2.9GHz processors and up to 192 cores • Volatile Memory Up to 6 TB per node • Non-Volatile Memory Nvme flash, Nvme-SSD etc-Bandwidths of 120GB/Sec &50K IOPs • Storage –High storage cache . 7200 RPM HDDs ,All flash storage • InfiniBand up to 40GBPs, RoCE network Fabric up to 100GBPs
  • 14. Scale out Patterns Read Replicas/Optimized ROs ©2020 PayPal Inc. Confidential and proprietary. 14 ADG/GG SOR RO1 ROn R/W RO Applications SH0 SH1 SHn R/W Applications R/W USER Finance USER Finance Monolithic App User App Services Finance App Services AZ1 AZ2 AZ1 AZ2 A/A Sharding Domain Isolation Active/Active Scale up involves cost and Scale out enables elastic scaling • Multi-AZ Read Replicas • Sharding • Domain Isolation • Active-Active
  • 15. Horizontal Scaling – Read Replicas ©2020 PayPal Inc. Confidential and proprietary. 15 • Scale Read Workloads by adding more replicas • Optimized Ros ( GG Replicas) can also provide High Availability by assuming Primary Role • Avoid Writes on replicas with DB services • Full copy on each replica
  • 16. Horizontal Scaling - Sharding ©2018 PayPal Inc. Confidential and proprietary. 16 Sharding Rules • Each table must have a shard key column • Shard Key Must be Unique across the shards • No cross Shard joins • No cross shard writes • Each SQL should have a shard key • Tables may need to be denormalized to support above rules • Each shard has a subset of Data
  • 17. Horizontal scaling – Domain isolation ©2020 PayPal Inc. Confidential and proprietary. 17 • Isolate self-contained domains to different physical database • Logical isolation of tables and application users followed by Physical separation
  • 18. Horizontal scaling – Active/Active ©2020 PayPal Inc. Confidential and proprietary. 18 • Active/Active is mainly for multi-region high availability But can also help in linear scaling • Each Database has full copy of data • Avoid mutations and collisions across the Databases – Use UUIDs for Keys , Even/Odd Sequences ,GG conflict resolution configurations and application stickiness.
  • 19. Reporting/Analytic workload offloading • Changes from FSL enabled SORs are replicated to Centralized data platform • Near Real time replication with OGG and Kafka/Micro batch processing • Replication Scaling using @RANGE replicats, Parallel extracts and parallel replicats ©2016 PayPal Inc. Confidential and proprietary. 19
  • 20. What Next .. Exadata & Oracle 19c © 2020 PayPal Inc. Confidential and proprietary. Focusing more on business impacting innovations Security & ComplianceReliability and Performance • Integrated Hardware and Software designed for Scale • Unique software optimizations • Highly scalable & fault tolerant hardware • Effective Data protection by eliminating Database sprawl • Standard Encryption @ Rest (TDE) , Performance optimization for TDE • Standard configuration • Automated and fast patching of all components Efficiency • DB consolidation and Multi-tenancy • Less day-to-day management, more business focusing activities • Autonomous DB compatibility • With an infrastructure that’s engineered to work together with your Oracle Databases, Oracle Exadata delivers far more power with less hardware. Optimized for Oracle Database
  • 21. Q & A

Editor's Notes

  • #6: 20K+ Inserts per sec and 50k+ select per sec
  • #9: 1. Application caching to reduce Database calls
  • #10: Much more .. Transparent encryption, De-Duplication etc, new caching ,locking,space /memory and undo management . • Deletion and Reuse of entire LOBs not just individual chunks. similar size goes for same hash chain latch
  • #11: Timed_id is a data type used to uniquely identify rows in a partitioned table, where the requirement is that the partitioning must be by time. 1590034363100000 1590034363100001 1590034364100002 1590034364100003 1590034365100004 1590034365100005
  • #13:  (a) commit for every and only after 50 rows.    (b) sleep for 1 or 2 seconds AFTER every commit.
  • #21: Automation – scripts , 8 node 1.5TB ,96 cores, 1PB usable, 100GB RDMA over RoCE,~20TB PMEM full Rack , Smart Flash Cache. Storage Indexes., Smart Scans and Cell Offloading, Hybrid Columnar Compression ,IORM Less systems to patch Less complexity of patching Unique features on IO operations Infinity band bug causing incidents (less vendors involved) Cloud alignment 1st attempt: early versions of Exadata before it became the main stream, Matured from last 5 years Critical workload as 1st use case