SlideShare a Scribd company logo
Addressing the High Cost
of Apache Cassandra
Eyal Gutkind, VP of Solutions Engineering
2
Presenter
Eyal Gutkind
Eyal Gutkind is VP of Solutions at ScyllaDB. Prior to joining
ScyllaDB, Eyal held product management roles at Mirantis
and DataStax, and spent 12 years with Mellanox
Technologies in various engineering management and
product marketing roles. Eyal holds a BSc. degree in
Electrical and Computer Engineering from Ben Gurion
University in Israel and an MBA from Fuqua School of
Business at Duke University.
+ Brief Scylla overview
+ Detailed benchmark comparisons with Cassandra and cost implications
+ The cost reduction of using large nodes
+ Storage cost benefits of Incremental Compaction Strategy
+ Using Workload Prioritization to support multiple workloads in a single cluster
Agenda
4
+ The Real-Time Big Data Database
+ Drop-in replacement for Apache Cassandra
and Amazon DynamoDB
+ 10X the performance & low tail latency
+ Open Source, Enterprise and Cloud options
+ Founded by the creators of KVM hypervisor
+ HQs: Palo Alto, CA, USA; Herzelia, Israel;
Warsaw, Poland
About ScyllaDB
 
Cassandra
Node Count
Scylla
Node Count
Datacenter
Savings
Recordings Ring 432 18 65%
Reminders Ring 96 18 41%
Recordings Secondary Ring 70 18 65%
History Ring 96 6 61%
Instruction and Lookup Ring 268 18 45%
Total 962 78 53%
Pre Scylla
Node Count
962
Post Scylla
Node Count
78
(m4.2xlarge) (i3.4xlarge & i3.8xlarge)
Don’t Take Our Word for It...
Poll Questions
The High Cost of Node Sprawl
+ Heavy administration
+ More things to fail
+ Expensive
+ Complex
“Just Add More Nodes”
…..
8
$900k/yr Datacenter Savings
Rules:
+ Need to meet an SLA of 100k/200k/300k ops at P99 < 10msec
+ Use as little as possible hardware
+ Hardware chosen ideally for each database
+ More details on https://www.scylladb.com/product/benchmarks/aws-i3-metal-benchmark/
+ 4 x i3.metal cost: $112,100
+ 40 x i3.4xlarge cost: $278,560
4 i3.metal
Scylla nodes
40 i3.4xl Cassandra
3.11 nodes
4 i3.metal
Scylla nodes
40 i3.4xl Cassandra
3.11 nodes
4 i3.metal
Scylla nodes
40 i3.4xl Cassandra
3.11 nodes
100% cpu != alert
Compaction != problem
Knowledge == Power
Summary:
+ Cost is 2.5X cheaper
+ 10X reduction in administration overhead.
+ Scylla’s P99 latency was up to 45X better.
+ Cassandra could not meet the SLA in 200k, 300k cases
+ Scylla provides 10x higher up time (MTTF)
+ Scylla is automatically tuned
Are availability and real time important to your business?
4 i3.metal
Scylla nodes
40 i3.4xl Cassandra
3.11 nodes
I3en 60TB Node
+ Reduce number of nodes by using
large nodes
+ Increase storage size when needed,
not nodes
+ Node replacement not an issue, when
your system has resources to handle
the streaming
ICS - 37% Savings and More
Major compaction
starts at this point
+ Available in Scylla Enterprise
+ Help increase storage utilization
+ Reduce compaction workload
+ Combine workloads into one cluster
+ Reduce latency penalties
+ Increase CPU and cluster utilization
Workload Prioritization
We Will Use Cloud Vendors’ DBaaS
Scylla Cloud vs. C* DBaaS Solutions
AWS CMS Vs.
Azure Cosmos Vs.
Scylla Cloud
Storage Cost [$/month/TB] 0.3 0.25
Hassle free
3 x i3.8xlarge instances
Unit Read Cost [$] 0.1095 5.84
Unit Write Cost [$] 0.5475 5.84
Total Storage cost [$/month] $614.40 $343.04
Total Write Cost [$/month] $32,850.00 $34,432.64
Total Read Cost [$/month] $54,750.00 $34,041.36
Total$ / Month $88,214 $68,817 $9,450
+ We have shown how users like Comcast and Kiwi are able to save big $$
+ Benchmarks shows time after time how we provide 10x reductions in number of nodes
+ Using large nodes is not an issue when using the right system resources
+ Compaction are smoother when using ICS
+ Using advanced features as workload prioritization helps users combine workloads into
fewer clusters, saving them money and operational resources
+ With Scylla you are able to reduce TCOs
Summary
Poll Question
Q&A
Eyal Gutkind
VP of Solutions
Book a session with me
+ If you are interested in evaluating your current workloads to learn how you can
save more, you can sign up for a Technical Evaluation session with me.
Link : https://www.scylladb.com/product/technical-consultation/
+ Or email me directly to eyal@scylladb.com if you have any questions.
We will send you these links via email along with the session recording.
22
eyal@scylladb.com
Stay in touch
United States
545 Faber Place
Palo Alto, CA 94303
Israel
11 Galgalei Haplada
Herzelia, Israel
www.scylladb.com
@scylladb
Thank you
United States
545 Faber Place
Palo Alto, CA 94303
Israel
11 Galgalei Haplada
Herzelia, Israel
www.scylladb.com
@scylladb
Poll Question
Poll Question #1
If you are using C* today what is your biggest challenge
A. Operational complexity
B. Finding talent/consultant to manage cluster
C. Cost of support contracts/consultants
D. I have no issues with my C* deployment
Poll Question #2
What is your next database deployment platform
A. On-premise, self managed
B. Cloud, self managed
C. Database as a service
D. No plans for new database deployment
Poll Question #3
What was your expectation from this session?
A.It was technical enough
B.Expected it to be more Technical
C.Was expecting more of a business perspective

More Related Content

PDF
Under the Hood of a Shard-per-Core Database Architecture
PDF
NoSQL and NewSQL: Tradeoffs between Scalable Performance & Consistency
PDF
The Do’s and Don’ts of Benchmarking Databases
PDF
Measuring Database Performance on Bare Metal AWS Instances
PPTX
Scylla Summit 2018: Scylla Feature Talks - SSTables 3.0 File Format
PDF
How to Build a Scylla Database Cluster that Fits Your Needs
PDF
Comparing Apache Cassandra 4.0, 3.0, and ScyllaDB
PPTX
Scylla Summit 2018: Keynote - 4 Years of Scylla
Under the Hood of a Shard-per-Core Database Architecture
NoSQL and NewSQL: Tradeoffs between Scalable Performance & Consistency
The Do’s and Don’ts of Benchmarking Databases
Measuring Database Performance on Bare Metal AWS Instances
Scylla Summit 2018: Scylla Feature Talks - SSTables 3.0 File Format
How to Build a Scylla Database Cluster that Fits Your Needs
Comparing Apache Cassandra 4.0, 3.0, and ScyllaDB
Scylla Summit 2018: Keynote - 4 Years of Scylla

What's hot (20)

PDF
Numberly on Joining Billions of Rows in Seconds: Replacing MongoDB and Hive w...
PDF
How to achieve no compromise performance and availability
PDF
Scylla Virtual Workshop 2020
PDF
Webinar how to build a highly available time series solution with kairos-db (1)
PDF
The True Cost of NoSQL DBaaS Options
PDF
AdGear Use Case with Scylla - 1M Queries Per Second with Single-Digit Millise...
PDF
Steering the Sea Monster - Integrating Scylla with Kubernetes
PDF
TechTalk: Reduce Your Storage Footprint with a Revolutionary New Compaction S...
PDF
Introducing Project Alternator - Scylla’s Open-Source DynamoDB-compatible API
PPTX
Overcoming Barriers of Scaling Your Database
PPTX
Powering a Graph Data System with Scylla + JanusGraph
PPTX
Lightweight Transactions in Scylla versus Apache Cassandra
PDF
Wide Column Store NoSQL vs SQL Data Modeling
PDF
Introducing Scylla Cloud
PDF
Running Scylla on Kubernetes with Scylla Operator
PPTX
Scylla Summit 2018: Getting the Most Out of Scylla on Kubernetes
PDF
Keeping your application’s latency SLAs no matter what
PDF
Webinar: Using Control Theory to Keep Compactions Under Control
PPTX
Cassandra vs. ScyllaDB: Evolutionary Differences
PDF
Introducing Scylla Open Source 4.0
Numberly on Joining Billions of Rows in Seconds: Replacing MongoDB and Hive w...
How to achieve no compromise performance and availability
Scylla Virtual Workshop 2020
Webinar how to build a highly available time series solution with kairos-db (1)
The True Cost of NoSQL DBaaS Options
AdGear Use Case with Scylla - 1M Queries Per Second with Single-Digit Millise...
Steering the Sea Monster - Integrating Scylla with Kubernetes
TechTalk: Reduce Your Storage Footprint with a Revolutionary New Compaction S...
Introducing Project Alternator - Scylla’s Open-Source DynamoDB-compatible API
Overcoming Barriers of Scaling Your Database
Powering a Graph Data System with Scylla + JanusGraph
Lightweight Transactions in Scylla versus Apache Cassandra
Wide Column Store NoSQL vs SQL Data Modeling
Introducing Scylla Cloud
Running Scylla on Kubernetes with Scylla Operator
Scylla Summit 2018: Getting the Most Out of Scylla on Kubernetes
Keeping your application’s latency SLAs no matter what
Webinar: Using Control Theory to Keep Compactions Under Control
Cassandra vs. ScyllaDB: Evolutionary Differences
Introducing Scylla Open Source 4.0
Ad

Similar to Addressing the High Cost of Apache Cassandra (20)

PDF
How Development Teams Cut Costs with ScyllaDB.pdf
PPTX
Cassandra to ScyllaDB: Technical Comparison and the Path to Success
PDF
How to Monitor and Size Workloads on AWS i3 instances
PPTX
Why We Chose ScyllaDB over DynamoDB for "User Watch Status"
PPTX
mParticle's Journey to Scylla from Cassandra
PPTX
4 use cases for C* to Scylla
PPTX
Scylla Virtual Workshop 2022
PDF
Using ScyllaDB for Extreme Scale Workloads
PDF
Scylla Summit 2022: ScyllaDB Cloud: Simplifying Deployment to the Public Cloud
PDF
Achieving Extreme Scale with ScyllaDB: Tips & Tradeoffs
PDF
New Ways to Reduce Database Costs with ScyllaDB
PDF
Scylla Summit 2022: How ScyllaDB Powers This Next Tech Cycle
PDF
Recent ScyllaDB Cloud Highlights and Future Roadmap by Michael Hollander & Iv...
PDF
ScyllaDB Virtual Workshop
PDF
5 Factors When Selecting a High Performance, Low Latency Database
PDF
Running a Cost-Effective DynamoDB-Compatible Database on Managed Kubernetes S...
PDF
Scylla Summit 2016: Why Kenshoo is about to displace Cassandra with Scylla
PDF
Reduce Your Cloud Spend with ScyllaDB by Tzach Livyatan
PDF
Workshop - How to benchmark your database
PDF
Dissecting Real-World Database Performance Dilemmas
How Development Teams Cut Costs with ScyllaDB.pdf
Cassandra to ScyllaDB: Technical Comparison and the Path to Success
How to Monitor and Size Workloads on AWS i3 instances
Why We Chose ScyllaDB over DynamoDB for "User Watch Status"
mParticle's Journey to Scylla from Cassandra
4 use cases for C* to Scylla
Scylla Virtual Workshop 2022
Using ScyllaDB for Extreme Scale Workloads
Scylla Summit 2022: ScyllaDB Cloud: Simplifying Deployment to the Public Cloud
Achieving Extreme Scale with ScyllaDB: Tips & Tradeoffs
New Ways to Reduce Database Costs with ScyllaDB
Scylla Summit 2022: How ScyllaDB Powers This Next Tech Cycle
Recent ScyllaDB Cloud Highlights and Future Roadmap by Michael Hollander & Iv...
ScyllaDB Virtual Workshop
5 Factors When Selecting a High Performance, Low Latency Database
Running a Cost-Effective DynamoDB-Compatible Database on Managed Kubernetes S...
Scylla Summit 2016: Why Kenshoo is about to displace Cassandra with Scylla
Reduce Your Cloud Spend with ScyllaDB by Tzach Livyatan
Workshop - How to benchmark your database
Dissecting Real-World Database Performance Dilemmas
Ad

More from ScyllaDB (20)

PDF
Understanding The True Cost of DynamoDB Webinar
PDF
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
PDF
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
PDF
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
PDF
Powering a Billion Dreams: Scaling Meesho’s E-commerce Revolution with Scylla...
PDF
Leading a High-Stakes Database Migration
PDF
Securely Serving Millions of Boot Artifacts a Day by João Pedro Lima & Matt ...
PDF
How Agoda Scaled 50x Throughput with ScyllaDB by Worakarn Isaratham
PDF
How Yieldmo Cut Database Costs and Cloud Dependencies Fast by Todd Coleman
PDF
ScyllaDB: 10 Years and Beyond by Dor Laor
PDF
Migrating 50TB Data From a Home-Grown Database to ScyllaDB, Fast by Terence Liu
PDF
Vector Search with ScyllaDB by Szymon Wasik
PDF
Workload Prioritization: How to Balance Multiple Workloads in a Cluster by Fe...
PDF
Two Leading Approaches to Data Virtualization, and Which Scales Better? by Da...
PDF
Scaling a Beast: Lessons from 400x Growth in a High-Stakes Financial System b...
PDF
Object Storage in ScyllaDB by Ran Regev, ScyllaDB
PDF
Lessons Learned from Building a Serverless Notifications System by Srushith R...
PDF
A Dist Sys Programmer's Journey into AI by Piotr Sarna
PDF
High Availability: Lessons Learned by Paul Preuveneers
PDF
How Natura Uses ScyllaDB and ScyllaDB Connector to Create a Real-time Data Pi...
Understanding The True Cost of DynamoDB Webinar
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Powering a Billion Dreams: Scaling Meesho’s E-commerce Revolution with Scylla...
Leading a High-Stakes Database Migration
Securely Serving Millions of Boot Artifacts a Day by João Pedro Lima & Matt ...
How Agoda Scaled 50x Throughput with ScyllaDB by Worakarn Isaratham
How Yieldmo Cut Database Costs and Cloud Dependencies Fast by Todd Coleman
ScyllaDB: 10 Years and Beyond by Dor Laor
Migrating 50TB Data From a Home-Grown Database to ScyllaDB, Fast by Terence Liu
Vector Search with ScyllaDB by Szymon Wasik
Workload Prioritization: How to Balance Multiple Workloads in a Cluster by Fe...
Two Leading Approaches to Data Virtualization, and Which Scales Better? by Da...
Scaling a Beast: Lessons from 400x Growth in a High-Stakes Financial System b...
Object Storage in ScyllaDB by Ran Regev, ScyllaDB
Lessons Learned from Building a Serverless Notifications System by Srushith R...
A Dist Sys Programmer's Journey into AI by Piotr Sarna
High Availability: Lessons Learned by Paul Preuveneers
How Natura Uses ScyllaDB and ScyllaDB Connector to Create a Real-time Data Pi...

Recently uploaded (20)

PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PDF
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
PDF
Odoo Companies in India – Driving Business Transformation.pdf
PDF
How Creative Agencies Leverage Project Management Software.pdf
PDF
Upgrade and Innovation Strategies for SAP ERP Customers
PDF
top salesforce developer skills in 2025.pdf
PDF
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
PDF
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
PPTX
history of c programming in notes for students .pptx
PDF
Design an Analysis of Algorithms I-SECS-1021-03
PPTX
CHAPTER 2 - PM Management and IT Context
PDF
Digital Strategies for Manufacturing Companies
PPTX
ai tools demonstartion for schools and inter college
PDF
Which alternative to Crystal Reports is best for small or large businesses.pdf
PDF
wealthsignaloriginal-com-DS-text-... (1).pdf
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PDF
Navsoft: AI-Powered Business Solutions & Custom Software Development
PPTX
Odoo POS Development Services by CandidRoot Solutions
PDF
Nekopoi APK 2025 free lastest update
PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 41
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
Raksha Bandhan Grocery Pricing Trends in India 2025.pdf
Odoo Companies in India – Driving Business Transformation.pdf
How Creative Agencies Leverage Project Management Software.pdf
Upgrade and Innovation Strategies for SAP ERP Customers
top salesforce developer skills in 2025.pdf
EN-Survey-Report-SAP-LeanIX-EA-Insights-2025.pdf
Flood Susceptibility Mapping Using Image-Based 2D-CNN Deep Learnin. Overview ...
history of c programming in notes for students .pptx
Design an Analysis of Algorithms I-SECS-1021-03
CHAPTER 2 - PM Management and IT Context
Digital Strategies for Manufacturing Companies
ai tools demonstartion for schools and inter college
Which alternative to Crystal Reports is best for small or large businesses.pdf
wealthsignaloriginal-com-DS-text-... (1).pdf
Design an Analysis of Algorithms II-SECS-1021-03
Navsoft: AI-Powered Business Solutions & Custom Software Development
Odoo POS Development Services by CandidRoot Solutions
Nekopoi APK 2025 free lastest update
Internet Downloader Manager (IDM) Crack 6.42 Build 41

Addressing the High Cost of Apache Cassandra

  • 1. Addressing the High Cost of Apache Cassandra Eyal Gutkind, VP of Solutions Engineering
  • 2. 2 Presenter Eyal Gutkind Eyal Gutkind is VP of Solutions at ScyllaDB. Prior to joining ScyllaDB, Eyal held product management roles at Mirantis and DataStax, and spent 12 years with Mellanox Technologies in various engineering management and product marketing roles. Eyal holds a BSc. degree in Electrical and Computer Engineering from Ben Gurion University in Israel and an MBA from Fuqua School of Business at Duke University.
  • 3. + Brief Scylla overview + Detailed benchmark comparisons with Cassandra and cost implications + The cost reduction of using large nodes + Storage cost benefits of Incremental Compaction Strategy + Using Workload Prioritization to support multiple workloads in a single cluster Agenda
  • 4. 4 + The Real-Time Big Data Database + Drop-in replacement for Apache Cassandra and Amazon DynamoDB + 10X the performance & low tail latency + Open Source, Enterprise and Cloud options + Founded by the creators of KVM hypervisor + HQs: Palo Alto, CA, USA; Herzelia, Israel; Warsaw, Poland About ScyllaDB
  • 5.   Cassandra Node Count Scylla Node Count Datacenter Savings Recordings Ring 432 18 65% Reminders Ring 96 18 41% Recordings Secondary Ring 70 18 65% History Ring 96 6 61% Instruction and Lookup Ring 268 18 45% Total 962 78 53% Pre Scylla Node Count 962 Post Scylla Node Count 78 (m4.2xlarge) (i3.4xlarge & i3.8xlarge) Don’t Take Our Word for It...
  • 7. The High Cost of Node Sprawl + Heavy administration + More things to fail + Expensive + Complex “Just Add More Nodes” …..
  • 9. Rules: + Need to meet an SLA of 100k/200k/300k ops at P99 < 10msec + Use as little as possible hardware + Hardware chosen ideally for each database + More details on https://www.scylladb.com/product/benchmarks/aws-i3-metal-benchmark/ + 4 x i3.metal cost: $112,100 + 40 x i3.4xlarge cost: $278,560 4 i3.metal Scylla nodes 40 i3.4xl Cassandra 3.11 nodes
  • 10. 4 i3.metal Scylla nodes 40 i3.4xl Cassandra 3.11 nodes
  • 11. 4 i3.metal Scylla nodes 40 i3.4xl Cassandra 3.11 nodes
  • 12. 100% cpu != alert Compaction != problem Knowledge == Power
  • 13. Summary: + Cost is 2.5X cheaper + 10X reduction in administration overhead. + Scylla’s P99 latency was up to 45X better. + Cassandra could not meet the SLA in 200k, 300k cases + Scylla provides 10x higher up time (MTTF) + Scylla is automatically tuned Are availability and real time important to your business? 4 i3.metal Scylla nodes 40 i3.4xl Cassandra 3.11 nodes
  • 14. I3en 60TB Node + Reduce number of nodes by using large nodes + Increase storage size when needed, not nodes + Node replacement not an issue, when your system has resources to handle the streaming
  • 15. ICS - 37% Savings and More Major compaction starts at this point + Available in Scylla Enterprise + Help increase storage utilization + Reduce compaction workload
  • 16. + Combine workloads into one cluster + Reduce latency penalties + Increase CPU and cluster utilization Workload Prioritization
  • 17. We Will Use Cloud Vendors’ DBaaS
  • 18. Scylla Cloud vs. C* DBaaS Solutions AWS CMS Vs. Azure Cosmos Vs. Scylla Cloud Storage Cost [$/month/TB] 0.3 0.25 Hassle free 3 x i3.8xlarge instances Unit Read Cost [$] 0.1095 5.84 Unit Write Cost [$] 0.5475 5.84 Total Storage cost [$/month] $614.40 $343.04 Total Write Cost [$/month] $32,850.00 $34,432.64 Total Read Cost [$/month] $54,750.00 $34,041.36 Total$ / Month $88,214 $68,817 $9,450
  • 19. + We have shown how users like Comcast and Kiwi are able to save big $$ + Benchmarks shows time after time how we provide 10x reductions in number of nodes + Using large nodes is not an issue when using the right system resources + Compaction are smoother when using ICS + Using advanced features as workload prioritization helps users combine workloads into fewer clusters, saving them money and operational resources + With Scylla you are able to reduce TCOs Summary
  • 22. Book a session with me + If you are interested in evaluating your current workloads to learn how you can save more, you can sign up for a Technical Evaluation session with me. Link : https://www.scylladb.com/product/technical-consultation/ + Or email me directly to eyal@scylladb.com if you have any questions. We will send you these links via email along with the session recording. 22
  • 24. United States 545 Faber Place Palo Alto, CA 94303 Israel 11 Galgalei Haplada Herzelia, Israel www.scylladb.com @scylladb Thank you
  • 25. United States 545 Faber Place Palo Alto, CA 94303 Israel 11 Galgalei Haplada Herzelia, Israel www.scylladb.com @scylladb
  • 27. Poll Question #1 If you are using C* today what is your biggest challenge A. Operational complexity B. Finding talent/consultant to manage cluster C. Cost of support contracts/consultants D. I have no issues with my C* deployment
  • 28. Poll Question #2 What is your next database deployment platform A. On-premise, self managed B. Cloud, self managed C. Database as a service D. No plans for new database deployment
  • 29. Poll Question #3 What was your expectation from this session? A.It was technical enough B.Expected it to be more Technical C.Was expecting more of a business perspective