SlideShare a Scribd company logo
High Performance NoSQL Masterclass
Scaling for Performance
Felipe Cardeneti Mendes
High Performance NoSQL Masterclass
Felipe Cardeneti Mendes
● Solution Architect at ScyllaDB
● Published Author
● Linux and Open Source enthusiast
High Performance NoSQL Masterclass
Agenda
● About ScyllaDB
● Getting started with NoSQL databases
● Deployment and Production Readiness
● Observability Tips and Tricks
High Performance NoSQL Masterclass
About ScyllaDB
High Performance NoSQL Masterclass
ScyllaDB Database Architecture
Horizontal & Vertical Scaling
Built in C++
(no Java overhead)
System and Data
Center Aware
Sharding Per Core Shard-Aware Drivers
Auto-Performance
Tuning
Network
Processor NUMA
Storage
Unique Close-to-Metal Architecture
High Performance NoSQL Masterclass
Shard per Core
Threads Shards
High Performance NoSQL Masterclass
Asynchronous Architecture
Request Answer
Request Answer
Waiting for response
Time Savings
Synchronous
architecture
Asynchronous
architecture
High Performance NoSQL Masterclass
Specialized Cache
Cassandra ScyllaDB
Key
cache
Row
cache
Linux page cache
SSTables
Unified cache
SSTables
Complex Tuning
On-heap /
Off-heap
High Performance NoSQL Masterclass
Ecosystem Compatibility
+ CQL native protocol
+ JMX management protocol
+ Management command line
/REST
+ SSTable file format
+ Configuration file format
+ CQL language
High Performance NoSQL Masterclass
Getting Started with NoSQL
Databases
High Performance NoSQL Masterclass
Modern Business Challenges
Keep CapEx
& OpEx in check
Reduce complexity
Scale as the
data grows
Queries in milliseconds
Leverage massive
amounts of data
Predictable, consistent
performance
High Performance NoSQL Masterclass
Oh… The CAP Theorem
High Performance NoSQL Masterclass
Workload Types
High Performance NoSQL Masterclass
Workload Types
Decision Support
+ More complex queries - large amounts
+ Latency important but not critical
+ Seconds to hours
Fundamental business tasks
+ Simple queries
+ Latency critical
+ Milliseconds per transaction
OLAP
Time
Complexity
of
Query
Time
Complexity
of
Query
OLTP
High Performance NoSQL Masterclass
OLAP Characteristics
Main Characteristics
+ Bound Concurrency
+ Scans and aggregations
+ Rely on MapReduce paradigms
OLAP
Time
Complexity
of
Query
Examples
+ How many users are from the US?
+ How many Twitter posts happened in 2022?
+ Which devices haven’t communicated back
within the past 1 hour?
High Performance NoSQL Masterclass
OLTP Characteristics
Main Characteristics
+ Unbound Concurrency
+ Designed for speed and simplicity
+ Often user facing APIs
Examples
+ What’s the last time an user logged in?
+ What have been the last 10 temperature
measurements for a given device?
+ How many likes a given posting has?
Time
Complexity
of
Query
OLTP
High Performance NoSQL Masterclass
Why not both? Meet Workload Prioritization
100 shares
Ratio = 100:100 (1:1) means equal shares of
processing/resources to complete tasks
Ratio = 100:50 (2:1) means 2X as many shares of processing/resources
for Transactions to complete tasks compared to Analytics
100 shares
100 shares
50 shares
OLTP
OLAP
Which Task to Run
Wide Column Databases Write Path
LSM storage engine’s write path:
18
Writes
commit log
Wide Column Databases Write Path
LSM storage engine’s write path:
19
Writes
commit log
Wide Column Databases Write Path
LSM storage engine’s write path:
20
Writes
commit log
Wide Column Databases Write Path
LSM storage engine’s write path:
21
Writes
commit log
compaction
Wide Column Databases Write Path
LSM storage engine’s write path:
22
Writes
commit log
compaction
Wide Column Databases Write Path
LSM storage engine’s write path:
23
Writes
commit log
What is compaction?
LSM storage engine’s write path:
24
Hidden Gems
+ This technique of keeping sorted files and merging them is well-known and
often called Log-Structured Merge (LSM) Tree
+ Published in 1996, earliest popular application known is the Lucene search
engine, 1999
Characteristics
+ High performance write.
+ Immediately readable.
+ Reasonable performance for read.
What is a compaction strategy?
LSM storage engine’s write path:
25
▪ Which files to compact, and when?
▪ This is called the compaction strategy
▪ The goal of the strategy is low amplification:
○ Avoid read requests needing many sstables.
• read amplification
○ Avoid overwritten/deleted/expired data staying on disk.
○ Avoid excessive temporary disk space needs
• space amplification
○ Avoid compacting the same data again and again.
• write amplification
Which one to choose?
LSM storage engine’s write path:
26
Know your workload
High Performance NoSQL Masterclass
Deployment and Production
Readiness
It all starts with Data Modeling
LSM storage engine’s write path:
28
Do’s
+ Denormalize
+ Query oriented approach
+ High data distribution / cardinality
Dont’s
+ Create hotspots
+ Large partitions/rows/cells/collections/etc
+ Low cardinality tables/views/indexes
Test, test, test…!
LSM storage engine’s write path:
29
Unit Testing
+ Test your workload and access patterns in a Docker container
+ Use specialized stress tools to simulate workload
○ cassandra-stress
○ nosqlbench
○ YCSB
+ OBSERVE the results (more on that later)
Application Development
LSM storage engine’s write path:
30
Functional Testing
+ Use Prepared Statements
+ Configure your routing and load balancing policy correctly
○ DCAware and TokenAware policies
○ ShardAware Drivers if using ScyllaDB!
+ Make use of Asynchronous APIs
+ Ensure your client is NOT a bottleneck
+ Paging is important: Ensure you adjust it right
Test, test, test…!
LSM storage engine’s write path:
31
Readiness Testing
+ What’s the unreplicated data set size?
+ How many operations per second do I need to achieve?
○ Out of these, what are the reads vs writes distribution?
○ What’s the average payload size?
○ What are my latency requirements?
+ How many regions should it replicate to?
+ Do I need indexes or views to satisfy my queries?
+ What is/are the target deployment location(s)?
+ Is the use case growth predictable or unpredictable?
+ What are the data retention requirements?
+ Is the use case storage or CPU bound?
Oh mighty sizing…!
LSM storage engine’s write path:
32
A Sizing Exercise
+ 500k ops/sec with 1KB rows
+ 5TB data set size
+ P99 reads and writes < 10ms
+ Target deployment region: AWS
Simple Math
+ RF=3 / 6 nodes * 5TB = 2.5TB per node
+ 500k ops/sec / 12,5K ops/core = 40 physical cores
+ Result: 6 nodes of i4i.8xlarge
High Performance NoSQL Masterclass
Observability Tips and Tricks
High Performance NoSQL Masterclass
ScyllaDB Monitoring Architecture
High Performance NoSQL Masterclass
ScyllaDB Monitoring Architecture
High Performance NoSQL Masterclass
Keep in touch!
Felipe Cardeneti Mendes
Solutions Architect
ScyllaDB
felipemendes@scylladb.co
m
Find me on LinkedIn

More Related Content

PPTX
Modeling Data and Queries for Wide Column NoSQL
PPTX
Survey of High Performance NoSQL Systems
PPTX
Apache Arrow: In Theory, In Practice
PDF
ClickHouse Keeper
PDF
Oracle Database performance tuning using oratop
PDF
The InnoDB Storage Engine for MySQL
PDF
InnoDb Vs NDB Cluster
PDF
Tanel Poder - Scripts and Tools short
Modeling Data and Queries for Wide Column NoSQL
Survey of High Performance NoSQL Systems
Apache Arrow: In Theory, In Practice
ClickHouse Keeper
Oracle Database performance tuning using oratop
The InnoDB Storage Engine for MySQL
InnoDb Vs NDB Cluster
Tanel Poder - Scripts and Tools short

What's hot (20)

PPTX
Explain the explain_plan
PDF
Using ClickHouse for Experimentation
PDF
Scylla Summit 2022: How to Migrate a Counter Table for 68 Billion Records
PDF
Under the Hood of a Shard-per-Core Database Architecture
PPTX
From cache to in-memory data grid. Introduction to Hazelcast.
PPT
PDF
InnoDB Internal
PDF
Oracle db performance tuning
PDF
Redo log improvements MYSQL 8.0
PDF
Simplify CDC Pipeline with Spark Streaming SQL and Delta Lake
PDF
Oracle_Multitenant_19c_-_All_About_Pluggable_D.pdf
PPTX
Materialized Views and Secondary Indexes in Scylla: They Are finally here!
PPTX
Mongodb basics and architecture
PDF
MyRocks Deep Dive
PPTX
Introduction to Apache ZooKeeper
PDF
Tanel Poder - Troubleshooting Complex Oracle Performance Issues - Part 2
PDF
All about Zookeeper and ClickHouse Keeper.pdf
PDF
Smart monitoring how does oracle rac manage resource, state ukoug19
PPTX
Data Pipelines with Kafka Connect
PDF
A Thorough Comparison of Delta Lake, Iceberg and Hudi
Explain the explain_plan
Using ClickHouse for Experimentation
Scylla Summit 2022: How to Migrate a Counter Table for 68 Billion Records
Under the Hood of a Shard-per-Core Database Architecture
From cache to in-memory data grid. Introduction to Hazelcast.
InnoDB Internal
Oracle db performance tuning
Redo log improvements MYSQL 8.0
Simplify CDC Pipeline with Spark Streaming SQL and Delta Lake
Oracle_Multitenant_19c_-_All_About_Pluggable_D.pdf
Materialized Views and Secondary Indexes in Scylla: They Are finally here!
Mongodb basics and architecture
MyRocks Deep Dive
Introduction to Apache ZooKeeper
Tanel Poder - Troubleshooting Complex Oracle Performance Issues - Part 2
All about Zookeeper and ClickHouse Keeper.pdf
Smart monitoring how does oracle rac manage resource, state ukoug19
Data Pipelines with Kafka Connect
A Thorough Comparison of Delta Lake, Iceberg and Hudi
Ad

Similar to Scaling for Performance (20)

PDF
Database Performance at Scale Masterclass: Workload Characteristics by Felipe...
PPTX
Essential Data Engineering for Data Scientist
PDF
Handling the growth of data
PDF
2013 CPM Conference, Nov 6th, NoSQL Capacity Planning
PPTX
Complex Analytics with NoSQL Data Store in Real Time
PPTX
Scylla Summit 2017: Snapfish's Journey Towards Scylla
PDF
ScyllaDB Virtual Workshop: Getting Started with ScyllaDB 2024
PPTX
Augmenting MySQL with NoSQL options - Data Lifecycles
PDF
Building a High Performance Analytics Platform
PPTX
MongoDB vs ScyllaDB: Tractian’s Experience with Real-Time ML
ODP
Databases benoitg 2009-03-10
PDF
Big Data SSD Architecture: Digging Deep to Discover Where SSD Performance Pay...
PDF
NoSQL at Scale: Proven Practices & Pitfalls
PDF
Dissecting Real-World Database Performance Dilemmas
PDF
Scaling, Tuning and Maintaining the Monolith
PPTX
Scylla Virtual Workshop 2022
PDF
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
PDF
Beyond Linear Scaling: A New Path for Performance with ScyllaDB
PPTX
Hofstra University - Overview of Big Data
PPTX
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
Database Performance at Scale Masterclass: Workload Characteristics by Felipe...
Essential Data Engineering for Data Scientist
Handling the growth of data
2013 CPM Conference, Nov 6th, NoSQL Capacity Planning
Complex Analytics with NoSQL Data Store in Real Time
Scylla Summit 2017: Snapfish's Journey Towards Scylla
ScyllaDB Virtual Workshop: Getting Started with ScyllaDB 2024
Augmenting MySQL with NoSQL options - Data Lifecycles
Building a High Performance Analytics Platform
MongoDB vs ScyllaDB: Tractian’s Experience with Real-Time ML
Databases benoitg 2009-03-10
Big Data SSD Architecture: Digging Deep to Discover Where SSD Performance Pay...
NoSQL at Scale: Proven Practices & Pitfalls
Dissecting Real-World Database Performance Dilemmas
Scaling, Tuning and Maintaining the Monolith
Scylla Virtual Workshop 2022
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
Beyond Linear Scaling: A New Path for Performance with ScyllaDB
Hofstra University - Overview of Big Data
Webinar: ROI on Big Data - RDBMS, NoSQL or Both? A Simple Guide for Knowing H...
Ad

More from ScyllaDB (20)

PDF
Understanding The True Cost of DynamoDB Webinar
PDF
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
PDF
New Ways to Reduce Database Costs with ScyllaDB
PDF
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
PDF
Powering a Billion Dreams: Scaling Meesho’s E-commerce Revolution with Scylla...
PDF
Leading a High-Stakes Database Migration
PDF
Achieving Extreme Scale with ScyllaDB: Tips & Tradeoffs
PDF
Securely Serving Millions of Boot Artifacts a Day by João Pedro Lima & Matt ...
PDF
How Agoda Scaled 50x Throughput with ScyllaDB by Worakarn Isaratham
PDF
How Yieldmo Cut Database Costs and Cloud Dependencies Fast by Todd Coleman
PDF
ScyllaDB: 10 Years and Beyond by Dor Laor
PDF
Reduce Your Cloud Spend with ScyllaDB by Tzach Livyatan
PDF
Migrating 50TB Data From a Home-Grown Database to ScyllaDB, Fast by Terence Liu
PDF
Vector Search with ScyllaDB by Szymon Wasik
PDF
Workload Prioritization: How to Balance Multiple Workloads in a Cluster by Fe...
PDF
Two Leading Approaches to Data Virtualization, and Which Scales Better? by Da...
PDF
Scaling a Beast: Lessons from 400x Growth in a High-Stakes Financial System b...
PDF
Object Storage in ScyllaDB by Ran Regev, ScyllaDB
PDF
Lessons Learned from Building a Serverless Notifications System by Srushith R...
PDF
A Dist Sys Programmer's Journey into AI by Piotr Sarna
Understanding The True Cost of DynamoDB Webinar
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
New Ways to Reduce Database Costs with ScyllaDB
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Powering a Billion Dreams: Scaling Meesho’s E-commerce Revolution with Scylla...
Leading a High-Stakes Database Migration
Achieving Extreme Scale with ScyllaDB: Tips & Tradeoffs
Securely Serving Millions of Boot Artifacts a Day by João Pedro Lima & Matt ...
How Agoda Scaled 50x Throughput with ScyllaDB by Worakarn Isaratham
How Yieldmo Cut Database Costs and Cloud Dependencies Fast by Todd Coleman
ScyllaDB: 10 Years and Beyond by Dor Laor
Reduce Your Cloud Spend with ScyllaDB by Tzach Livyatan
Migrating 50TB Data From a Home-Grown Database to ScyllaDB, Fast by Terence Liu
Vector Search with ScyllaDB by Szymon Wasik
Workload Prioritization: How to Balance Multiple Workloads in a Cluster by Fe...
Two Leading Approaches to Data Virtualization, and Which Scales Better? by Da...
Scaling a Beast: Lessons from 400x Growth in a High-Stakes Financial System b...
Object Storage in ScyllaDB by Ran Regev, ScyllaDB
Lessons Learned from Building a Serverless Notifications System by Srushith R...
A Dist Sys Programmer's Journey into AI by Piotr Sarna

Recently uploaded (20)

PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
Cloud computing and distributed systems.
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Empathic Computing: Creating Shared Understanding
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Modernizing your data center with Dell and AMD
The Rise and Fall of 3GPP – Time for a Sabbatical?
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Chapter 3 Spatial Domain Image Processing.pdf
Network Security Unit 5.pdf for BCA BBA.
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Cloud computing and distributed systems.
Encapsulation_ Review paper, used for researhc scholars
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
MYSQL Presentation for SQL database connectivity
Understanding_Digital_Forensics_Presentation.pptx
Advanced methodologies resolving dimensionality complications for autism neur...
Empathic Computing: Creating Shared Understanding
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Modernizing your data center with Dell and AMD

Scaling for Performance

  • 1. High Performance NoSQL Masterclass Scaling for Performance Felipe Cardeneti Mendes
  • 2. High Performance NoSQL Masterclass Felipe Cardeneti Mendes ● Solution Architect at ScyllaDB ● Published Author ● Linux and Open Source enthusiast
  • 3. High Performance NoSQL Masterclass Agenda ● About ScyllaDB ● Getting started with NoSQL databases ● Deployment and Production Readiness ● Observability Tips and Tricks
  • 4. High Performance NoSQL Masterclass About ScyllaDB
  • 5. High Performance NoSQL Masterclass ScyllaDB Database Architecture Horizontal & Vertical Scaling Built in C++ (no Java overhead) System and Data Center Aware Sharding Per Core Shard-Aware Drivers Auto-Performance Tuning Network Processor NUMA Storage Unique Close-to-Metal Architecture
  • 6. High Performance NoSQL Masterclass Shard per Core Threads Shards
  • 7. High Performance NoSQL Masterclass Asynchronous Architecture Request Answer Request Answer Waiting for response Time Savings Synchronous architecture Asynchronous architecture
  • 8. High Performance NoSQL Masterclass Specialized Cache Cassandra ScyllaDB Key cache Row cache Linux page cache SSTables Unified cache SSTables Complex Tuning On-heap / Off-heap
  • 9. High Performance NoSQL Masterclass Ecosystem Compatibility + CQL native protocol + JMX management protocol + Management command line /REST + SSTable file format + Configuration file format + CQL language
  • 10. High Performance NoSQL Masterclass Getting Started with NoSQL Databases
  • 11. High Performance NoSQL Masterclass Modern Business Challenges Keep CapEx & OpEx in check Reduce complexity Scale as the data grows Queries in milliseconds Leverage massive amounts of data Predictable, consistent performance
  • 12. High Performance NoSQL Masterclass Oh… The CAP Theorem
  • 13. High Performance NoSQL Masterclass Workload Types
  • 14. High Performance NoSQL Masterclass Workload Types Decision Support + More complex queries - large amounts + Latency important but not critical + Seconds to hours Fundamental business tasks + Simple queries + Latency critical + Milliseconds per transaction OLAP Time Complexity of Query Time Complexity of Query OLTP
  • 15. High Performance NoSQL Masterclass OLAP Characteristics Main Characteristics + Bound Concurrency + Scans and aggregations + Rely on MapReduce paradigms OLAP Time Complexity of Query Examples + How many users are from the US? + How many Twitter posts happened in 2022? + Which devices haven’t communicated back within the past 1 hour?
  • 16. High Performance NoSQL Masterclass OLTP Characteristics Main Characteristics + Unbound Concurrency + Designed for speed and simplicity + Often user facing APIs Examples + What’s the last time an user logged in? + What have been the last 10 temperature measurements for a given device? + How many likes a given posting has? Time Complexity of Query OLTP
  • 17. High Performance NoSQL Masterclass Why not both? Meet Workload Prioritization 100 shares Ratio = 100:100 (1:1) means equal shares of processing/resources to complete tasks Ratio = 100:50 (2:1) means 2X as many shares of processing/resources for Transactions to complete tasks compared to Analytics 100 shares 100 shares 50 shares OLTP OLAP Which Task to Run
  • 18. Wide Column Databases Write Path LSM storage engine’s write path: 18 Writes commit log
  • 19. Wide Column Databases Write Path LSM storage engine’s write path: 19 Writes commit log
  • 20. Wide Column Databases Write Path LSM storage engine’s write path: 20 Writes commit log
  • 21. Wide Column Databases Write Path LSM storage engine’s write path: 21 Writes commit log compaction
  • 22. Wide Column Databases Write Path LSM storage engine’s write path: 22 Writes commit log compaction
  • 23. Wide Column Databases Write Path LSM storage engine’s write path: 23 Writes commit log
  • 24. What is compaction? LSM storage engine’s write path: 24 Hidden Gems + This technique of keeping sorted files and merging them is well-known and often called Log-Structured Merge (LSM) Tree + Published in 1996, earliest popular application known is the Lucene search engine, 1999 Characteristics + High performance write. + Immediately readable. + Reasonable performance for read.
  • 25. What is a compaction strategy? LSM storage engine’s write path: 25 ▪ Which files to compact, and when? ▪ This is called the compaction strategy ▪ The goal of the strategy is low amplification: ○ Avoid read requests needing many sstables. • read amplification ○ Avoid overwritten/deleted/expired data staying on disk. ○ Avoid excessive temporary disk space needs • space amplification ○ Avoid compacting the same data again and again. • write amplification
  • 26. Which one to choose? LSM storage engine’s write path: 26 Know your workload
  • 27. High Performance NoSQL Masterclass Deployment and Production Readiness
  • 28. It all starts with Data Modeling LSM storage engine’s write path: 28 Do’s + Denormalize + Query oriented approach + High data distribution / cardinality Dont’s + Create hotspots + Large partitions/rows/cells/collections/etc + Low cardinality tables/views/indexes
  • 29. Test, test, test…! LSM storage engine’s write path: 29 Unit Testing + Test your workload and access patterns in a Docker container + Use specialized stress tools to simulate workload ○ cassandra-stress ○ nosqlbench ○ YCSB + OBSERVE the results (more on that later)
  • 30. Application Development LSM storage engine’s write path: 30 Functional Testing + Use Prepared Statements + Configure your routing and load balancing policy correctly ○ DCAware and TokenAware policies ○ ShardAware Drivers if using ScyllaDB! + Make use of Asynchronous APIs + Ensure your client is NOT a bottleneck + Paging is important: Ensure you adjust it right
  • 31. Test, test, test…! LSM storage engine’s write path: 31 Readiness Testing + What’s the unreplicated data set size? + How many operations per second do I need to achieve? ○ Out of these, what are the reads vs writes distribution? ○ What’s the average payload size? ○ What are my latency requirements? + How many regions should it replicate to? + Do I need indexes or views to satisfy my queries? + What is/are the target deployment location(s)? + Is the use case growth predictable or unpredictable? + What are the data retention requirements? + Is the use case storage or CPU bound?
  • 32. Oh mighty sizing…! LSM storage engine’s write path: 32 A Sizing Exercise + 500k ops/sec with 1KB rows + 5TB data set size + P99 reads and writes < 10ms + Target deployment region: AWS Simple Math + RF=3 / 6 nodes * 5TB = 2.5TB per node + 500k ops/sec / 12,5K ops/core = 40 physical cores + Result: 6 nodes of i4i.8xlarge
  • 33. High Performance NoSQL Masterclass Observability Tips and Tricks
  • 34. High Performance NoSQL Masterclass ScyllaDB Monitoring Architecture
  • 35. High Performance NoSQL Masterclass ScyllaDB Monitoring Architecture
  • 36. High Performance NoSQL Masterclass Keep in touch! Felipe Cardeneti Mendes Solutions Architect ScyllaDB felipemendes@scylladb.co m Find me on LinkedIn