SlideShare a Scribd company logo
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
High-Load Storage of Users’ Actions with ScyllaDB and HDDs

More Related Content

PPTX
How Opera Syncs Tens of Millions of Browsers and Sleeps Well at Night
PPTX
Free & Open DynamoDB API for Everyone
PDF
Lookout on Scaling Security to 100 Million Devices
PPTX
Scylla @ Disney+ Hotstar
PPTX
Augury: Real-Time Insights for the Industrial IoT
PPTX
Meeting the challenges of OLTP Big Data with Scylla
PPTX
Scylla Summit 2018: Getting the Most Out of Scylla on Kubernetes
PPTX
iFood on Delivering 100 Million Events a Month to Restaurants with Scylla
How Opera Syncs Tens of Millions of Browsers and Sleeps Well at Night
Free & Open DynamoDB API for Everyone
Lookout on Scaling Security to 100 Million Devices
Scylla @ Disney+ Hotstar
Augury: Real-Time Insights for the Industrial IoT
Meeting the challenges of OLTP Big Data with Scylla
Scylla Summit 2018: Getting the Most Out of Scylla on Kubernetes
iFood on Delivering 100 Million Events a Month to Restaurants with Scylla

What's hot (20)

PPTX
How Workload Prioritization Reduces Your Datacenter Footprint
PPTX
How to be Successful with Scylla
PPTX
Using ScyllaDB with JanusGraph for Cyber Security
PPTX
Building a Distributed Data Streaming Architecture for Modern Hardware with S...
PPTX
Sizing Your Scylla Cluster
PPTX
Scylla’s Journey Towards Being an Elastic Cloud Native Database
PPTX
FireEye & Scylla: Intel Threat Analysis Using a Graph Database
PPTX
How SkyElectric Uses Scylla to Power Its Smart Energy Platform
PPTX
Scylla Summit 2018: Scylla 3.0 and Beyond
PPTX
Performance Testing: Scylla vs. Cassandra vs. Datastax
PDF
Introducing Scylla Open Source 4.0
PPTX
How ReversingLabs Serves File Reputation Service for 10B Files
PPTX
GPS Insight on Using Presto with Scylla for Data Analytics and Data Archival
PPTX
ScyllaDB's Avi Kivity on UDF, UDA, and the Future
PPTX
Scylla Summit 2019 Keynote - Avi Kivity
PDF
Scylla Summit 2022: How ScyllaDB Powers This Next Tech Cycle
PPTX
Captial One: Why Stream Data as Part of Data Transformation?
PPTX
Scylla Summit 2019 Keynote - Dor Laor - Beyond Cassandra
PDF
Seastar Summit 2019 vectorized.io
PPTX
Powering a Graph Data System with Scylla + JanusGraph
How Workload Prioritization Reduces Your Datacenter Footprint
How to be Successful with Scylla
Using ScyllaDB with JanusGraph for Cyber Security
Building a Distributed Data Streaming Architecture for Modern Hardware with S...
Sizing Your Scylla Cluster
Scylla’s Journey Towards Being an Elastic Cloud Native Database
FireEye & Scylla: Intel Threat Analysis Using a Graph Database
How SkyElectric Uses Scylla to Power Its Smart Energy Platform
Scylla Summit 2018: Scylla 3.0 and Beyond
Performance Testing: Scylla vs. Cassandra vs. Datastax
Introducing Scylla Open Source 4.0
How ReversingLabs Serves File Reputation Service for 10B Files
GPS Insight on Using Presto with Scylla for Data Analytics and Data Archival
ScyllaDB's Avi Kivity on UDF, UDA, and the Future
Scylla Summit 2019 Keynote - Avi Kivity
Scylla Summit 2022: How ScyllaDB Powers This Next Tech Cycle
Captial One: Why Stream Data as Part of Data Transformation?
Scylla Summit 2019 Keynote - Dor Laor - Beyond Cassandra
Seastar Summit 2019 vectorized.io
Powering a Graph Data System with Scylla + JanusGraph
Ad

Similar to High-Load Storage of Users’ Actions with ScyllaDB and HDDs (20)

PDF
Using ScyllaDB for Real-Time Read-Heavy Workloads.pdf
PDF
Dissecting Real-World Database Performance Dilemmas
PDF
Dissecting Real-World Database Performance Dilemmas
PDF
Using ScyllaDB for Real-Time Write-Heavy Workloads
PDF
Fast NoSQL from HDDs?
PDF
Scylla Summit 2022: Operating at Monstrous Scales: Benchmarking Petabyte Work...
PDF
Replacing Your Cache with ScyllaDB by Felipe Cardeneti Mendes and Tomasz Grabiec
PDF
Achieving Extreme Scale with ScyllaDB: Tips & Tradeoffs
PPTX
Replacing Your Cache with ScyllaDB
PPTX
7 Reasons Not to Put an External Cache in Front of Your Database.pptx
PPTX
4 use cases for C* to Scylla
PPTX
Understanding Storage I/O Under Load
PDF
How to achieve no compromise performance and availability
PDF
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
PPTX
Scylla Summit 2022: Scylla 5.0 New Features, Part 1
PPTX
Real-Time or Analytics Workloads... Why Not Both?
PDF
AdGear Use Case with Scylla - 1M Queries Per Second with Single-Digit Millise...
PDF
What Developers Need to Unlearn for High Performance NoSQL
PDF
Scylla Summit 2017: Intel Optane SSDs as the New Accelerator in Your Data Center
PPTX
A Deep Dive into ScyllaDB's Architecture
Using ScyllaDB for Real-Time Read-Heavy Workloads.pdf
Dissecting Real-World Database Performance Dilemmas
Dissecting Real-World Database Performance Dilemmas
Using ScyllaDB for Real-Time Write-Heavy Workloads
Fast NoSQL from HDDs?
Scylla Summit 2022: Operating at Monstrous Scales: Benchmarking Petabyte Work...
Replacing Your Cache with ScyllaDB by Felipe Cardeneti Mendes and Tomasz Grabiec
Achieving Extreme Scale with ScyllaDB: Tips & Tradeoffs
Replacing Your Cache with ScyllaDB
7 Reasons Not to Put an External Cache in Front of Your Database.pptx
4 use cases for C* to Scylla
Understanding Storage I/O Under Load
How to achieve no compromise performance and availability
Scaling Security on 100s of Millions of Mobile Devices Using Apache Kafka® an...
Scylla Summit 2022: Scylla 5.0 New Features, Part 1
Real-Time or Analytics Workloads... Why Not Both?
AdGear Use Case with Scylla - 1M Queries Per Second with Single-Digit Millise...
What Developers Need to Unlearn for High Performance NoSQL
Scylla Summit 2017: Intel Optane SSDs as the New Accelerator in Your Data Center
A Deep Dive into ScyllaDB's Architecture
Ad

More from ScyllaDB (20)

PDF
Understanding The True Cost of DynamoDB Webinar
PDF
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
PDF
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
PDF
New Ways to Reduce Database Costs with ScyllaDB
PDF
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
PDF
Powering a Billion Dreams: Scaling Meesho’s E-commerce Revolution with Scylla...
PDF
Leading a High-Stakes Database Migration
PDF
Securely Serving Millions of Boot Artifacts a Day by João Pedro Lima & Matt ...
PDF
How Agoda Scaled 50x Throughput with ScyllaDB by Worakarn Isaratham
PDF
How Yieldmo Cut Database Costs and Cloud Dependencies Fast by Todd Coleman
PDF
ScyllaDB: 10 Years and Beyond by Dor Laor
PDF
Reduce Your Cloud Spend with ScyllaDB by Tzach Livyatan
PDF
Migrating 50TB Data From a Home-Grown Database to ScyllaDB, Fast by Terence Liu
PDF
Vector Search with ScyllaDB by Szymon Wasik
PDF
Workload Prioritization: How to Balance Multiple Workloads in a Cluster by Fe...
PDF
Two Leading Approaches to Data Virtualization, and Which Scales Better? by Da...
PDF
Scaling a Beast: Lessons from 400x Growth in a High-Stakes Financial System b...
PDF
Object Storage in ScyllaDB by Ran Regev, ScyllaDB
PDF
Lessons Learned from Building a Serverless Notifications System by Srushith R...
PDF
A Dist Sys Programmer's Journey into AI by Piotr Sarna
Understanding The True Cost of DynamoDB Webinar
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
New Ways to Reduce Database Costs with ScyllaDB
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Powering a Billion Dreams: Scaling Meesho’s E-commerce Revolution with Scylla...
Leading a High-Stakes Database Migration
Securely Serving Millions of Boot Artifacts a Day by João Pedro Lima & Matt ...
How Agoda Scaled 50x Throughput with ScyllaDB by Worakarn Isaratham
How Yieldmo Cut Database Costs and Cloud Dependencies Fast by Todd Coleman
ScyllaDB: 10 Years and Beyond by Dor Laor
Reduce Your Cloud Spend with ScyllaDB by Tzach Livyatan
Migrating 50TB Data From a Home-Grown Database to ScyllaDB, Fast by Terence Liu
Vector Search with ScyllaDB by Szymon Wasik
Workload Prioritization: How to Balance Multiple Workloads in a Cluster by Fe...
Two Leading Approaches to Data Virtualization, and Which Scales Better? by Da...
Scaling a Beast: Lessons from 400x Growth in a High-Stakes Financial System b...
Object Storage in ScyllaDB by Ran Regev, ScyllaDB
Lessons Learned from Building a Serverless Notifications System by Srushith R...
A Dist Sys Programmer's Journey into AI by Piotr Sarna

Recently uploaded (20)

PDF
Approach and Philosophy of On baking technology
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
Cloud computing and distributed systems.
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
A Presentation on Artificial Intelligence
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
KodekX | Application Modernization Development
Approach and Philosophy of On baking technology
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Understanding_Digital_Forensics_Presentation.pptx
NewMind AI Monthly Chronicles - July 2025
Diabetes mellitus diagnosis method based random forest with bat algorithm
Digital-Transformation-Roadmap-for-Companies.pptx
MYSQL Presentation for SQL database connectivity
Cloud computing and distributed systems.
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
The AUB Centre for AI in Media Proposal.docx
20250228 LYD VKU AI Blended-Learning.pptx
Mobile App Security Testing_ A Comprehensive Guide.pdf
Dropbox Q2 2025 Financial Results & Investor Presentation
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Per capita expenditure prediction using model stacking based on satellite ima...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Encapsulation_ Review paper, used for researhc scholars
A Presentation on Artificial Intelligence
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
KodekX | Application Modernization Development

Editor's Notes

  • #4: Let’s talk numbers Does not include bots, only real users
  • #7: We store every action User may want to see what happened in his mailbox Another examples: investigating possible attacks, sorting out user complaints
  • #9: The thing that we wanted to replace in this scheme was the storage
  • #10: Writes prevail 1000 times
  • #13: The thing that we wanted to replace in this scheme was the storage
  • #14: Tell why we have different amount of nodes in different dcs Think what to answer to questions about CL=ONE We want to be available when a DC goes down, it’s ok for us to serve inconsistent read requests
  • #15: All user data is split by weeks and projects
  • #16: Ambiguous number of network requests to other nodes Can’t trasform all those writes to reads We create another table and duplicate all writes there from the app
  • #17: All user data is split by weeks and projects
  • #18: Latencies are measured from client RPS == API rps + RF + secondary index
  • #19: Remind that we are talking about HDDs
  • #21: They do not recommend hdds It is reasonable
  • #22: In ssd setups it will be probably set to some large value like number of shards The most accurate way is to run benchmarks with different values for num-io-queue
  • #24: Lets say one node failed and we know the exact moment of time when it happened Normally nodetool repair would run full scan but we now the exact moment when problem happened We need to go to nodes from a different DC, transfer data to the affected node and run nodetool repair
  • #25: Refresh will finish soon, then go compactions that do not overload cluster and in our case finished in 6 hours
  • #26: Latencies stay in a reasonable range Resharding is slow but faster than repair and does not overaload cluster
  • #27: Dedicated a whole section for problems with HDDs, what for