SlideShare a Scribd company logo
How Workload Prioritization
Reduces Your Datacenter
Footprint
Eliran Sinvani, Core team leader
Presenter
Eliran Sinvani, core team leader @ScyllaDB
 Eliran is a core team leader in Scylla for the past year.
 Before that, he had 6 years of experience developing real-time and Linux-
based embedded systems.
 He started at Marvell as an L1 comm stack engineer.
 Most recently was a low-level infrastructure team leader at Airspan where he
was involved in the hardware and software planning and execution of the
second generation of Sprint MagicBox.
 Eliran has a BSc in electronics and computer engineering.
 In his spare time, he creates simulations and games for VR systems and
tinkers with open-source embedded projects.
Agenda
■ Theory:
● some basic concepts
● Define the problem
■ Getting Technical:
● Look at the most common 2 existing solutions
● Overview of workload prioritization implementation
● Configuring Workload prioritization overview
■ Getting Practical:
● Commands example
● Some examples
● Real World example
■ Conclusion
■ Questions
Basic Concepts
Different types of loads
■ OLTP
● Small work items
● Latency sensitive
● involves narrow
portion of the data
■ OLAP
● Large work items
● Throughput oriented
● Performed on large
amounts of data
OK, so why can’t I simply do both?
■ We will try to explain:
● What does it mean that OLTP and OLAP conflicts.
● How can some workloads conflict and what is the impact on the Datacenter.
What happens if we just try it?
What happened?
What happened?
What happened?
Traditional Solutions
For Conflicting Workloads
Existing Solutions
■ Divide and conquer!
● Division is space (Multi DC)
■ Wastes resources and money
● Division in time (off peak OLAP)
■ Impacts the QOS for OLTP during
The OLAP periods.
Putting it in Numbers - Multi DC solution
(based on AWS i3.metal)
■ Example:
● Capacity per instance: 15TB
● Minimum amount of instances: 10
■ Assumptions:
● Real time workload is latency sensitive. Only uses 60% of resources (of 10 instances).
● Analytics don’t run constantly, therefore only runs 60% on 100% of resources.
■ HW:
HW Costs Estimated Waste % Estimated Waste $
OLTP DC (10 instances) USD 278,560.00 40% (Resources) USD 167,136.00
OLAP DC (10 instances) USD 278,560.00 40% (Time) USD 167,136.00
● Increased maintenance costs and additional complexity
Scylla’s Solution:
Workload Prioritization
Minimizing Inter-Workload Impact
https://www.scylladb.com/2019/05/23/workload-prioritization-running-oltp-and-olap-traffic-on-
the-same-superhighway/
How does it work?
Schedulers Basics
■ Schedulers work with Shares
Schedulers Basics
100 shares
100 shares
Schedulers Basics
100 shares
50 shares
Schedulers Basics
200 shares
100 shares
Schedulers Basics
Schedulers Basics - operation highlight
■ Shares are really all there is to it :)
■ Schedulers only kicks in when there is a
conflict on the resource.
■ Schedulers maintain fairness by trying to
optimize ratios
● aggregate throughput is not the
goal.
■ Schedulers can be dynamic
● meaning you can change the
amount of shares in real time.
■ Limits the impact of one Shareholder on
another.
Scylla controllers
Scylla controllers
workload changes:
● automatic adjustment
● new equilibrium
Advantages
■ Better system utilization
■ Easier setup
■ Dynamic adjustment
How Does it work?
■ Schedulers
● Easy to configure
● Dynamically adjusted
● Doesn’t harm system utilization
● Limits the impact between different
loads.
How Does it work?
■ Schedulers
■ Converting data processing paths from serial to parallel
How Does it work?
■ Schedulers
■ Converting data processing paths from serial to parallel
■ Operation priority classification
Workload Prioritization
In Practice
So…. Does it work?
So…. Does it work? (behind the scenes)
Configuring Workload prioritization
1. Make users that generates the same workload be part of
the same group.
● Priorities are attached to groups or individual users.
2. Create a service level for the workload and set its shares:
● Share determine the amount of importance of the service level.
● It is always relative to other service levels.
3. Attach the service level to the group of users.
● This will grant the shares to the group of users.
● At that point the workload prioritization mechanizm will start to
● Treat their requests according to priorities.
Configuring Workload prioritization
1. Make users that generates the same workload be part of
the same group.
● CREATE ROLE super_high_priority;
● GRANT super_high_priority TO special_user;
2. Create a service level for the workload and set its shares:
● CREATE SERVICE_LEVEL 'important_load' WITH SHARES=1000;
3. Attach the service level to the group of users.
● ATTACH SERVICE_LEVEL 'important_load' TO ‘super_high_priority;
Making OLTP and OLAP coexist
■ To create the effect of - OLTP always get its way and OLAP utilizes all free
resources:
● OLTP gets 1000 shares and OLAP gets 10 shares.
Prioritizing between some workloads
■ Workload prioritization in general facilitates resource division between several
loads.
■ There are a lot of effects that can be achieved.
■ One constraint: The number of different workloads:
● Due to latency requirements the system can only use 16 scheduling groups.
● Some scheduling groups are used by background processes
● Workload prioritization can take advantage of the remaining scheduling groups.
● Currently we have 8 unassigned scheduling groups.
Prioritizing between some workloads
■ Load1: 200 shares, Load2: 400 shares, Load3: 800 shares
Prioritizing between some workloads
■ Load1: 200 shares, Load2: 400 shares, Load3: 800 shares
Conclusion
Summary
■ The goal is to minimize inter-workload impact without breaking the bank.
■ Schedulers are in the heart of the solution:
● Shares are all there is to it.
Future work
■ Increase visibility with per scheduling group metrics.
■ Achieve even better isolation by canceling serialization points.
■ Increase the number of available workload prioritization scheduling groups.
Thank you Stay in touch
Any questions?
Eliran Sinvani
eliransin@scylladb.com

More Related Content

PPTX
iFood on Delivering 100 Million Events a Month to Restaurants with Scylla
PDF
Lookout on Scaling Security to 100 Million Devices
PPTX
FireEye & Scylla: Intel Threat Analysis Using a Graph Database
PPTX
How Opera Syncs Tens of Millions of Browsers and Sleeps Well at Night
PPTX
SAS Institute on Changing All Four Tires While Driving an AdTech Engine at Fu...
PPTX
How SkyElectric Uses Scylla to Power Its Smart Energy Platform
PPTX
Free & Open DynamoDB API for Everyone
PPTX
How to be Successful with Scylla
iFood on Delivering 100 Million Events a Month to Restaurants with Scylla
Lookout on Scaling Security to 100 Million Devices
FireEye & Scylla: Intel Threat Analysis Using a Graph Database
How Opera Syncs Tens of Millions of Browsers and Sleeps Well at Night
SAS Institute on Changing All Four Tires While Driving an AdTech Engine at Fu...
How SkyElectric Uses Scylla to Power Its Smart Energy Platform
Free & Open DynamoDB API for Everyone
How to be Successful with Scylla

What's hot (20)

PPTX
Powering a Graph Data System with Scylla + JanusGraph
PPTX
Using ScyllaDB with JanusGraph for Cyber Security
PDF
AdGear Use Case with Scylla - 1M Queries Per Second with Single-Digit Millise...
PPTX
Sizing Your Scylla Cluster
PPTX
Scylla Summit 2019 Keynote - Dor Laor - Beyond Cassandra
PPTX
Event Streaming Architectures with Confluent and ScyllaDB
PDF
A glimpse of cassandra 4.0 features netflix
PPTX
Empowering the AWS DynamoDB™ application developer with Alternator
PPTX
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
PDF
How to Monitor and Size Workloads on AWS i3 instances
PDF
Scylla Summit 2016: Analytics Show Time - Spark and Presto Powered by Scylla
PDF
Scylla: 1 Million CQL operations per second per server
PPTX
Captial One: Why Stream Data as Part of Data Transformation?
PPTX
Scylla Summit 2018: Getting the Most Out of Scylla on Kubernetes
PPTX
Performance Testing: Scylla vs. Cassandra vs. Datastax
PPTX
Scylla’s Journey Towards Being an Elastic Cloud Native Database
PPTX
GPS Insight on Using Presto with Scylla for Data Analytics and Data Archival
PPTX
Scylla Summit 2022: Scylla 5.0 New Features, Part 2
PDF
ScyllaDB @ Apache BigData, may 2016
PDF
The True Cost of NoSQL DBaaS Options
Powering a Graph Data System with Scylla + JanusGraph
Using ScyllaDB with JanusGraph for Cyber Security
AdGear Use Case with Scylla - 1M Queries Per Second with Single-Digit Millise...
Sizing Your Scylla Cluster
Scylla Summit 2019 Keynote - Dor Laor - Beyond Cassandra
Event Streaming Architectures with Confluent and ScyllaDB
A glimpse of cassandra 4.0 features netflix
Empowering the AWS DynamoDB™ application developer with Alternator
High-Load Storage of Users’ Actions with ScyllaDB and HDDs
How to Monitor and Size Workloads on AWS i3 instances
Scylla Summit 2016: Analytics Show Time - Spark and Presto Powered by Scylla
Scylla: 1 Million CQL operations per second per server
Captial One: Why Stream Data as Part of Data Transformation?
Scylla Summit 2018: Getting the Most Out of Scylla on Kubernetes
Performance Testing: Scylla vs. Cassandra vs. Datastax
Scylla’s Journey Towards Being an Elastic Cloud Native Database
GPS Insight on Using Presto with Scylla for Data Analytics and Data Archival
Scylla Summit 2022: Scylla 5.0 New Features, Part 2
ScyllaDB @ Apache BigData, may 2016
The True Cost of NoSQL DBaaS Options
Ad

Similar to How Workload Prioritization Reduces Your Datacenter Footprint (20)

PDF
Scaling Cron at Slack by Claire Adams, Slack
PPTX
Scylla Summit 2022: Scylla 5.0 New Features, Part 1
PDF
Webinar: How to Shrink Your Datacenter Footprint by 50%
PDF
Auto Scaling Systems With Elastic Spark Streaming: Spark Summit East talk by ...
PDF
Sql server tips from the field
PDF
Netflix SRE perf meetup_slides
PPTX
How Busy Is Too Busy? Automating Your System for Maximum Throughput
PPTX
Introduction to Kafka Cruise Control
PDF
Parallel Batch Performance Considerations
PDF
Dr Elephant: LinkedIn's Self-Service System for Detecting and Treating Hadoop...
PDF
Rally--OpenStack Benchmarking at Scale
PDF
How to Meet Your P99 Goal While Overcommitting Another Workload
PPT
Stream, Stream, Stream: Different Streaming Methods with Spark and Kafka
PPTX
Wayfair Storefront Performance Monitoring with InfluxEnterprise by Richard La...
PDF
Enabling presto to handle massive scale at lightning speed
PPTX
Performance tuning Grails applications SpringOne 2GX 2014
PPT
261197832 8-performance-tuning-part i
PPTX
Functional reactive programming
PDF
Managing Apache Spark Workload and Automatic Optimizing
PPTX
Parallel Programming
Scaling Cron at Slack by Claire Adams, Slack
Scylla Summit 2022: Scylla 5.0 New Features, Part 1
Webinar: How to Shrink Your Datacenter Footprint by 50%
Auto Scaling Systems With Elastic Spark Streaming: Spark Summit East talk by ...
Sql server tips from the field
Netflix SRE perf meetup_slides
How Busy Is Too Busy? Automating Your System for Maximum Throughput
Introduction to Kafka Cruise Control
Parallel Batch Performance Considerations
Dr Elephant: LinkedIn's Self-Service System for Detecting and Treating Hadoop...
Rally--OpenStack Benchmarking at Scale
How to Meet Your P99 Goal While Overcommitting Another Workload
Stream, Stream, Stream: Different Streaming Methods with Spark and Kafka
Wayfair Storefront Performance Monitoring with InfluxEnterprise by Richard La...
Enabling presto to handle massive scale at lightning speed
Performance tuning Grails applications SpringOne 2GX 2014
261197832 8-performance-tuning-part i
Functional reactive programming
Managing Apache Spark Workload and Automatic Optimizing
Parallel Programming
Ad

More from ScyllaDB (20)

PDF
Understanding The True Cost of DynamoDB Webinar
PDF
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
PDF
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
PDF
New Ways to Reduce Database Costs with ScyllaDB
PDF
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
PDF
Powering a Billion Dreams: Scaling Meesho’s E-commerce Revolution with Scylla...
PDF
Leading a High-Stakes Database Migration
PDF
Achieving Extreme Scale with ScyllaDB: Tips & Tradeoffs
PDF
Securely Serving Millions of Boot Artifacts a Day by João Pedro Lima & Matt ...
PDF
How Agoda Scaled 50x Throughput with ScyllaDB by Worakarn Isaratham
PDF
How Yieldmo Cut Database Costs and Cloud Dependencies Fast by Todd Coleman
PDF
ScyllaDB: 10 Years and Beyond by Dor Laor
PDF
Reduce Your Cloud Spend with ScyllaDB by Tzach Livyatan
PDF
Migrating 50TB Data From a Home-Grown Database to ScyllaDB, Fast by Terence Liu
PDF
Vector Search with ScyllaDB by Szymon Wasik
PDF
Workload Prioritization: How to Balance Multiple Workloads in a Cluster by Fe...
PDF
Two Leading Approaches to Data Virtualization, and Which Scales Better? by Da...
PDF
Scaling a Beast: Lessons from 400x Growth in a High-Stakes Financial System b...
PDF
Object Storage in ScyllaDB by Ran Regev, ScyllaDB
PDF
Lessons Learned from Building a Serverless Notifications System by Srushith R...
Understanding The True Cost of DynamoDB Webinar
Database Benchmarking for Performance Masterclass: Session 2 - Data Modeling ...
Database Benchmarking for Performance Masterclass: Session 1 - Benchmarking F...
New Ways to Reduce Database Costs with ScyllaDB
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep Dive
Powering a Billion Dreams: Scaling Meesho’s E-commerce Revolution with Scylla...
Leading a High-Stakes Database Migration
Achieving Extreme Scale with ScyllaDB: Tips & Tradeoffs
Securely Serving Millions of Boot Artifacts a Day by João Pedro Lima & Matt ...
How Agoda Scaled 50x Throughput with ScyllaDB by Worakarn Isaratham
How Yieldmo Cut Database Costs and Cloud Dependencies Fast by Todd Coleman
ScyllaDB: 10 Years and Beyond by Dor Laor
Reduce Your Cloud Spend with ScyllaDB by Tzach Livyatan
Migrating 50TB Data From a Home-Grown Database to ScyllaDB, Fast by Terence Liu
Vector Search with ScyllaDB by Szymon Wasik
Workload Prioritization: How to Balance Multiple Workloads in a Cluster by Fe...
Two Leading Approaches to Data Virtualization, and Which Scales Better? by Da...
Scaling a Beast: Lessons from 400x Growth in a High-Stakes Financial System b...
Object Storage in ScyllaDB by Ran Regev, ScyllaDB
Lessons Learned from Building a Serverless Notifications System by Srushith R...

Recently uploaded (20)

PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Empathic Computing: Creating Shared Understanding
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Electronic commerce courselecture one. Pdf
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
KodekX | Application Modernization Development
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Machine learning based COVID-19 study performance prediction
PPTX
Big Data Technologies - Introduction.pptx
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
Spectroscopy.pptx food analysis technology
Spectral efficient network and resource selection model in 5G networks
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Empathic Computing: Creating Shared Understanding
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Electronic commerce courselecture one. Pdf
Understanding_Digital_Forensics_Presentation.pptx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
20250228 LYD VKU AI Blended-Learning.pptx
Network Security Unit 5.pdf for BCA BBA.
Diabetes mellitus diagnosis method based random forest with bat algorithm
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
KodekX | Application Modernization Development
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Machine learning based COVID-19 study performance prediction
Big Data Technologies - Introduction.pptx
MIND Revenue Release Quarter 2 2025 Press Release
Encapsulation_ Review paper, used for researhc scholars
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Spectroscopy.pptx food analysis technology

How Workload Prioritization Reduces Your Datacenter Footprint

  • 1. How Workload Prioritization Reduces Your Datacenter Footprint Eliran Sinvani, Core team leader
  • 2. Presenter Eliran Sinvani, core team leader @ScyllaDB  Eliran is a core team leader in Scylla for the past year.  Before that, he had 6 years of experience developing real-time and Linux- based embedded systems.  He started at Marvell as an L1 comm stack engineer.  Most recently was a low-level infrastructure team leader at Airspan where he was involved in the hardware and software planning and execution of the second generation of Sprint MagicBox.  Eliran has a BSc in electronics and computer engineering.  In his spare time, he creates simulations and games for VR systems and tinkers with open-source embedded projects.
  • 3. Agenda ■ Theory: ● some basic concepts ● Define the problem ■ Getting Technical: ● Look at the most common 2 existing solutions ● Overview of workload prioritization implementation ● Configuring Workload prioritization overview ■ Getting Practical: ● Commands example ● Some examples ● Real World example ■ Conclusion ■ Questions
  • 5. Different types of loads ■ OLTP ● Small work items ● Latency sensitive ● involves narrow portion of the data ■ OLAP ● Large work items ● Throughput oriented ● Performed on large amounts of data
  • 6. OK, so why can’t I simply do both? ■ We will try to explain: ● What does it mean that OLTP and OLAP conflicts. ● How can some workloads conflict and what is the impact on the Datacenter.
  • 7. What happens if we just try it?
  • 12. Existing Solutions ■ Divide and conquer! ● Division is space (Multi DC) ■ Wastes resources and money ● Division in time (off peak OLAP) ■ Impacts the QOS for OLTP during The OLAP periods.
  • 13. Putting it in Numbers - Multi DC solution (based on AWS i3.metal) ■ Example: ● Capacity per instance: 15TB ● Minimum amount of instances: 10 ■ Assumptions: ● Real time workload is latency sensitive. Only uses 60% of resources (of 10 instances). ● Analytics don’t run constantly, therefore only runs 60% on 100% of resources. ■ HW: HW Costs Estimated Waste % Estimated Waste $ OLTP DC (10 instances) USD 278,560.00 40% (Resources) USD 167,136.00 OLAP DC (10 instances) USD 278,560.00 40% (Time) USD 167,136.00 ● Increased maintenance costs and additional complexity
  • 16. How does it work?
  • 22. Schedulers Basics - operation highlight ■ Shares are really all there is to it :) ■ Schedulers only kicks in when there is a conflict on the resource. ■ Schedulers maintain fairness by trying to optimize ratios ● aggregate throughput is not the goal. ■ Schedulers can be dynamic ● meaning you can change the amount of shares in real time. ■ Limits the impact of one Shareholder on another.
  • 24. Scylla controllers workload changes: ● automatic adjustment ● new equilibrium
  • 25. Advantages ■ Better system utilization ■ Easier setup ■ Dynamic adjustment
  • 26. How Does it work? ■ Schedulers ● Easy to configure ● Dynamically adjusted ● Doesn’t harm system utilization ● Limits the impact between different loads.
  • 27. How Does it work? ■ Schedulers ■ Converting data processing paths from serial to parallel
  • 28. How Does it work? ■ Schedulers ■ Converting data processing paths from serial to parallel ■ Operation priority classification
  • 30. So…. Does it work?
  • 31. So…. Does it work? (behind the scenes)
  • 32. Configuring Workload prioritization 1. Make users that generates the same workload be part of the same group. ● Priorities are attached to groups or individual users. 2. Create a service level for the workload and set its shares: ● Share determine the amount of importance of the service level. ● It is always relative to other service levels. 3. Attach the service level to the group of users. ● This will grant the shares to the group of users. ● At that point the workload prioritization mechanizm will start to ● Treat their requests according to priorities.
  • 33. Configuring Workload prioritization 1. Make users that generates the same workload be part of the same group. ● CREATE ROLE super_high_priority; ● GRANT super_high_priority TO special_user; 2. Create a service level for the workload and set its shares: ● CREATE SERVICE_LEVEL 'important_load' WITH SHARES=1000; 3. Attach the service level to the group of users. ● ATTACH SERVICE_LEVEL 'important_load' TO ‘super_high_priority;
  • 34. Making OLTP and OLAP coexist ■ To create the effect of - OLTP always get its way and OLAP utilizes all free resources: ● OLTP gets 1000 shares and OLAP gets 10 shares.
  • 35. Prioritizing between some workloads ■ Workload prioritization in general facilitates resource division between several loads. ■ There are a lot of effects that can be achieved. ■ One constraint: The number of different workloads: ● Due to latency requirements the system can only use 16 scheduling groups. ● Some scheduling groups are used by background processes ● Workload prioritization can take advantage of the remaining scheduling groups. ● Currently we have 8 unassigned scheduling groups.
  • 36. Prioritizing between some workloads ■ Load1: 200 shares, Load2: 400 shares, Load3: 800 shares
  • 37. Prioritizing between some workloads ■ Load1: 200 shares, Load2: 400 shares, Load3: 800 shares
  • 39. Summary ■ The goal is to minimize inter-workload impact without breaking the bank. ■ Schedulers are in the heart of the solution: ● Shares are all there is to it.
  • 40. Future work ■ Increase visibility with per scheduling group metrics. ■ Achieve even better isolation by canceling serialization points. ■ Increase the number of available workload prioritization scheduling groups.
  • 41. Thank you Stay in touch Any questions? Eliran Sinvani eliransin@scylladb.com

Editor's Notes

  • #2: When transferring this to power point can you make sure that the embedded videos autoplay when presenting? 10x :)