SlideShare a Scribd company logo
Scheduling for Cloud Systems with Multi-level Data
Locality: Throughput and Heavy-traffic Optimality
Ali Yekkehkhany
In collaboration with Qiaomin Xie, and Professor Yi Lu
University of Illinois at Urbana-Champain (UIUC)
1
Data Processing
• Previously, storage and computing were separate
Computing StorageNetwork
2
Data-Intensive Processing
Explosion of data sets by industry and research
Computing StorageNetwork
Bottleneck
3
Data Centers
• Use separate smaller centers for storage
• Move computing to data
Bottleneck
4
Data Centers
Rack Rack
Top of Rack Switch
Core Switch
5
Data-parallel Processing
A A BC D
TA TB
C
Rack 1 Rack 2
local rack-local remote
6
Data-parallel Processing
A A BC D
TB
C
Rack 1 Rack 2
TA
7
Convention
A task type is defined by the locations of its data block
Task
Types
Servers
2,5,6
λ2,5,6
1
4,7,8
λ4,7,8
3,4,9
λ3,4,9
2 3 n
7,8,9
λ7,8,9 λi,j,k
i,j,k O(n3
)
unknown
8
Local, Rack-local, and Remote Service
9
1 2 3 4 5 6 7 8 9 10
Rack 1 Rack 2
Task (1, 3, 4)
Question
10
1 2 3 4 5 6 7 8 9 10
Rack 1 Rack 2
A new task arrives
, and scheduling?
What queue should
the task be routed to?
What algorithm to use for routing
Idle
To which queue should the server
give service when it becomes idle?
Metrics of Optimality for the Algorithm
Throughput Optimality:
Stabilizing any arrival rate vector within capacity
region.
Delay Optimality in Heavy-traffic:
Asymptotically minimizing the average delay as
the arrival rate vector approaches the boundary of
the capacity region.
11
Previous Work for Two Levels of Data Locality
1- Fluid model Planning, Harrison (98), Harrison-Lopez
(99), Bell-Williams (05).
12
Task
Types
Servers
2,5,6
λ2,5,6
1
4,7,8
λ4,7,8
3,4,9
λ3,4,9
2 3 n
7,8,9
λ7,8,9 λi,j,k
i,j,k O(n3
)
unknown
Previous Work for Two Levels of Data Locality
1- Fluid Model Planning:
1.1 Throughput optimal
1.2 Heavy-traffic optimal
But
NOT practical!
13
Previous Work for Two Levels of Data Locality
2- Join the Shortest Queue-Maxweight (JSQ-MW)
Wang et al. (13).
2.1 Throughput optimal
2.2 Not heavy-traffic optimal in all loads
2.3 Heavy-traffic optimal in SPECIFIC loads
14
Previous Work for Two Levels of Data Locality
3- Priority Algorithm for Near Data Scheduling
(Pandas), Q. Xie, Y. Lu (15)
3.1 Throughput optimal
3.2 Heavy-traffic optimal for all loads
15
Three Levels of Data Locality
1. Fluid Model planning
1. Throughput optimal
2. Heavy-traffic optimal
3. NOT practical!
2. Extension of JSQ-MaxWeight
1. Throughput optimal
2. NOT heavy-traffic optimal for all loads
3. Pandas
1. Not throughput optimal
2. Not heavy-traffic optimal
16
Extension of JSQ-MW for Three Levels of Locality
17
1,2,3
Joining the Shortest One
Extension of JSQ-MW for Three Levels of Locality
• Extension of JSQ-MaxWeight for systems
with rack structure, Xie et al. (16):
– Throughput optimal.
– Not heavy-traffic optimal in all loads. Just heavy
traffic optimal in specific loads.
18
Our Throughput and Heavy-traffic
Optimal Algorithm
• The routing and scheduling for our algorithm
is as follows:
– Routing: Weighted Workload
– Scheduling: Priority Scheduling for Local, Rack-
local, and Remote tasks queued in the 3 queues
associated to each server.
19
Weighted-Workload Routing
20
Rack 1 Rack 2
1 2 43
l k r l k rl k r l k r
Weighted-Workload Routing
21
1 2
Rack 1 Rack 2
43
l - local
k - rack-local
r - remote
workload
W
1
W
2
W
3
W
4
l k r l k rl k r l k r
Weighted-Workload Routing
22
1 2
Rack 1 Rack 2
43
W
1
W
2
W
3
W
4
local
rack-local
remote
l k r l k rl k r l k r
Weighted-Workload Routing
23
1 2
Rack 1 Rack 2
43
W
1
W
2
W
3
W
4
local
rack-local
remote
l k r l k rl k r l k r
Weighted-Workload Routing
24
1 2 43
W
1
W
2
W
3
W
4
< <<
l k r l k rl k r l k r
Rack 1 Rack 2
Priority Scheduling
25
1 2
Rack 1 Rack 2
43
Each server serves in the order of
l k r l k rl k r l k r
local,
Priority Scheduling
26
1 2
Rack 1 Rack 2
43
Each server serves in the order of
l k r l k rl k r l k r
local, rack-local, remote
Weighted Workload Algorithm
The Weighted Workload (WW) algorithm
proposed by Xie et al. (16) is proved to be both
throughput optimal and heavy traffic optimal in
all loads.
27
Evaluation
28
Comparing the Stability Regions
29
Heavy-traffic Optimality in Special Load
30
Heavy-traffic optimality of WW
31
References
• [1] Q. Xie, A. Yekkehkhany, Y. Lu. Scheduling with Multi-level Data
Locality: Throughput and Heavy-traffic Optimality. In Proceedings of
INFOCOM. IEEE, 2016.
• [2] Q. Xie, and Y. Lu. Priotrity Algorithm for Near-data Scheduling:
Throughput and Heavy-traffic Optimality. In Proceedings of INFOCOM.
IEEE, 2015.
• [3] W. Wang, K. Zhu, L. Ying, J. Tan, and L. Zhang. Map Task Schedul-
ing in MapReduce with Data Locality: Throughput and Heavy-traffic
Optimality. In Proceedings of INFOCOM. IEEE, 2013.
• [4] J. M. Harrison. Heavy traffic analysis of a system with parallel servers:
Asymptotic optimality of discrete review policies. Annals of Applied
Probability, 1998.
• [5] J. M. Harrison and M. J. L´opez. Heavy traffic resource pooling in
parallel-server systems. Queueing Syst. Theory Appl., 33(4), Apr. 1999.
32
Future Work
• Scheduling for multi-level data locality instead
of three levels of data locality.
33
Thanks for Your Attention
34
Any Questions?!
35
Ali Yekkehkhany
yekkehk2@illinois.edu

More Related Content

PDF
LOAD BALANCING ALGORITHM TO IMPROVE RESPONSE TIME ON CLOUD COMPUTING
PPT
Seattle Cassandra Meetup - Cassandra 1.2 - Eddie Satterly
PDF
Improved Performance of LEACH for WSN Using Precise Number of Cluster-Head an...
PPTX
06 how to write a map reduce version of k-means clustering
PPTX
compiler design
PDF
Determining the k in k-means with MapReduce
PPT
task scheduling in cloud datacentre using genetic algorithm
PPTX
cloud schedualing
LOAD BALANCING ALGORITHM TO IMPROVE RESPONSE TIME ON CLOUD COMPUTING
Seattle Cassandra Meetup - Cassandra 1.2 - Eddie Satterly
Improved Performance of LEACH for WSN Using Precise Number of Cluster-Head an...
06 how to write a map reduce version of k-means clustering
compiler design
Determining the k in k-means with MapReduce
task scheduling in cloud datacentre using genetic algorithm
cloud schedualing

What's hot (19)

PDF
Self-adaptive container monitoring with performance-aware Load-Shedding policies
PDF
Load Rebalancing for Distributed Hash Tables in Cloud Computing
PDF
Future Science on Future OpenStack
PPTX
Task Scheduling using Tabu Search algorithm in Cloud Computing Environment us...
PPTX
IEEE Paper Presentation by Chandan Kumar
PDF
Latency-aware Elastic Scaling for Distributed Data Stream Processing Systems
PPT
Scheduling in cloud
PPTX
Leach
PPTX
Job sequence scheduling for cloud computing
PPTX
(Slides) Task scheduling algorithm for multicore processor system for minimiz...
DOCX
Time sensitive utility-based single-copy
PDF
Staffetta: Smart Duty-Cycling for Opportunistic Data Collection
PDF
HACC: Fitting the Universe Inside a Supercomputer
PDF
FrackingPaper
PDF
DEEP-mon: Dynamic and Energy Efficient Power monitoring for container-based i...
PDF
TASK SCHEDULING USING AMALGAMATION OF MET HEURISTICS SWARM OPTIMIZATION ALGOR...
PPTX
Smallworld Data Check-Out to Microstation
PDF
Graphical Closure Rules for Unsupervised Load Classification in NILM Systems
PDF
Public Cloud Partition Using Load Status Evaluation and Cloud Division Rules
Self-adaptive container monitoring with performance-aware Load-Shedding policies
Load Rebalancing for Distributed Hash Tables in Cloud Computing
Future Science on Future OpenStack
Task Scheduling using Tabu Search algorithm in Cloud Computing Environment us...
IEEE Paper Presentation by Chandan Kumar
Latency-aware Elastic Scaling for Distributed Data Stream Processing Systems
Scheduling in cloud
Leach
Job sequence scheduling for cloud computing
(Slides) Task scheduling algorithm for multicore processor system for minimiz...
Time sensitive utility-based single-copy
Staffetta: Smart Duty-Cycling for Opportunistic Data Collection
HACC: Fitting the Universe Inside a Supercomputer
FrackingPaper
DEEP-mon: Dynamic and Energy Efficient Power monitoring for container-based i...
TASK SCHEDULING USING AMALGAMATION OF MET HEURISTICS SWARM OPTIMIZATION ALGOR...
Smallworld Data Check-Out to Microstation
Graphical Closure Rules for Unsupervised Load Classification in NILM Systems
Public Cloud Partition Using Load Status Evaluation and Cloud Division Rules
Ad

Similar to Scheduling for cloud systems with multi level data locality (20)

PDF
Network-aware Data Management for High Throughput Flows Akamai, Cambridge, ...
PDF
M.E Computer Science Mobile Computing Projects
PDF
M phil-computer-science-mobile-computing-projects
PDF
M.Phil Computer Science Mobile Computing Projects
PDF
intro lect.pdfkkpkpkpkpkpjjkojkopjjojjoj
PDF
A QOS BASED LOAD BALANCED SWITCH
DOCX
Rc maca receiver-centric mac protocol for event-driven wireless sensor networks
DOCX
Orchestrating Bulk Data Transfers across Geo-Distributed Datacenters
DOCX
Orchestrating bulk data transfers across
DOCX
ORCHESTRATING BULK DATA TRANSFERS ACROSS GEO-DISTRIBUTED DATACENTERS
PDF
Dynamic classification in silicon-based forwarding engine environments
PDF
Nearest Adjacent Node Discovery Scheme for Routing Protocol in Wireless Senso...
PPTX
The Role of Inter-Controller Traffic in SDN Controllers Placement
PDF
A Parallel Algorithm Template for Updating Single-Source Shortest Paths in La...
PPTX
Energy efficient cluster head selection in LEACH protocol
PDF
Dual-resource TCPAQM for Processing-constrained Networks
PDF
Analysis of Latency and Throughput of 2D Torus Topology using Modified XY Rou...
PPTX
SDN approach.pptx
PDF
Network-aware Data Management for Large Scale Distributed Applications, IBM R...
DOCX
JPN1413 An Energy-Balanced Routing Method Based on Forward-Aware Factor for...
Network-aware Data Management for High Throughput Flows Akamai, Cambridge, ...
M.E Computer Science Mobile Computing Projects
M phil-computer-science-mobile-computing-projects
M.Phil Computer Science Mobile Computing Projects
intro lect.pdfkkpkpkpkpkpjjkojkopjjojjoj
A QOS BASED LOAD BALANCED SWITCH
Rc maca receiver-centric mac protocol for event-driven wireless sensor networks
Orchestrating Bulk Data Transfers across Geo-Distributed Datacenters
Orchestrating bulk data transfers across
ORCHESTRATING BULK DATA TRANSFERS ACROSS GEO-DISTRIBUTED DATACENTERS
Dynamic classification in silicon-based forwarding engine environments
Nearest Adjacent Node Discovery Scheme for Routing Protocol in Wireless Senso...
The Role of Inter-Controller Traffic in SDN Controllers Placement
A Parallel Algorithm Template for Updating Single-Source Shortest Paths in La...
Energy efficient cluster head selection in LEACH protocol
Dual-resource TCPAQM for Processing-constrained Networks
Analysis of Latency and Throughput of 2D Torus Topology using Modified XY Rou...
SDN approach.pptx
Network-aware Data Management for Large Scale Distributed Applications, IBM R...
JPN1413 An Energy-Balanced Routing Method Based on Forward-Aware Factor for...
Ad

More from knowdiff (17)

PPTX
Ut talk feb 2017
PDF
Ali khalili: Towards an Open Linked Data-based Infrastructure for Studying Sc...
PPTX
Amin Milani Fard: Directed Model Inference for Testing and Analysis of Web Ap...
PDF
Knowledge based economy and power of crowd sourcing
PDF
Amin tayyebi: Big Data and Land Use Change Science
PDF
Mehdi Rezagholizadeh: Image Sensor Modeling: Color Measurement at Low Light L...
PDF
Sara Afshar: Scheduling and Resource Sharing in Multiprocessor Real-Time Systems
PPTX
Seyed Mehdi mohaghegh: Modelling material use within the low carbon energy pa...
PDF
Narjess Afzaly: Model Your Problem with Graphs and Generate your objects
PDF
Computational methods applications in air pollution modeling (Dr. Yadghar)
PDF
Somaz Kolahi : Functional Dependencies: Redundancy Analysis and Correcting Vi...
PPT
Uncalibrated Image-Based Robotic Visual Servoing (knowdiff.net)
PPT
Knowdiff visiting lecturer 140 (Azad Shademan): Uncalibrated Image-Based Robo...
PPS
Mehran Shaghaghi: Quantum Mechanics Dilemmas
PPT
Hossein Taghavi : Codes on Graphs
PDF
Dr. Amir Nejat
PDF
Alborz
Ut talk feb 2017
Ali khalili: Towards an Open Linked Data-based Infrastructure for Studying Sc...
Amin Milani Fard: Directed Model Inference for Testing and Analysis of Web Ap...
Knowledge based economy and power of crowd sourcing
Amin tayyebi: Big Data and Land Use Change Science
Mehdi Rezagholizadeh: Image Sensor Modeling: Color Measurement at Low Light L...
Sara Afshar: Scheduling and Resource Sharing in Multiprocessor Real-Time Systems
Seyed Mehdi mohaghegh: Modelling material use within the low carbon energy pa...
Narjess Afzaly: Model Your Problem with Graphs and Generate your objects
Computational methods applications in air pollution modeling (Dr. Yadghar)
Somaz Kolahi : Functional Dependencies: Redundancy Analysis and Correcting Vi...
Uncalibrated Image-Based Robotic Visual Servoing (knowdiff.net)
Knowdiff visiting lecturer 140 (Azad Shademan): Uncalibrated Image-Based Robo...
Mehran Shaghaghi: Quantum Mechanics Dilemmas
Hossein Taghavi : Codes on Graphs
Dr. Amir Nejat
Alborz

Recently uploaded (20)

PDF
PPT on Performance Review to get promotions
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PPTX
web development for engineering and engineering
PPTX
Sustainable Sites - Green Building Construction
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PPTX
UNIT 4 Total Quality Management .pptx
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PPTX
additive manufacturing of ss316l using mig welding
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PPTX
Geodesy 1.pptx...............................................
PPTX
OOP with Java - Java Introduction (Basics)
PPT on Performance Review to get promotions
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
web development for engineering and engineering
Sustainable Sites - Green Building Construction
CYBER-CRIMES AND SECURITY A guide to understanding
Automation-in-Manufacturing-Chapter-Introduction.pdf
R24 SURVEYING LAB MANUAL for civil enggi
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Foundation to blockchain - A guide to Blockchain Tech
UNIT 4 Total Quality Management .pptx
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
additive manufacturing of ss316l using mig welding
Embodied AI: Ushering in the Next Era of Intelligent Systems
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
Geodesy 1.pptx...............................................
OOP with Java - Java Introduction (Basics)

Scheduling for cloud systems with multi level data locality

  • 1. Scheduling for Cloud Systems with Multi-level Data Locality: Throughput and Heavy-traffic Optimality Ali Yekkehkhany In collaboration with Qiaomin Xie, and Professor Yi Lu University of Illinois at Urbana-Champain (UIUC) 1
  • 2. Data Processing • Previously, storage and computing were separate Computing StorageNetwork 2
  • 3. Data-Intensive Processing Explosion of data sets by industry and research Computing StorageNetwork Bottleneck 3
  • 4. Data Centers • Use separate smaller centers for storage • Move computing to data Bottleneck 4
  • 5. Data Centers Rack Rack Top of Rack Switch Core Switch 5
  • 6. Data-parallel Processing A A BC D TA TB C Rack 1 Rack 2 local rack-local remote 6
  • 7. Data-parallel Processing A A BC D TB C Rack 1 Rack 2 TA 7
  • 8. Convention A task type is defined by the locations of its data block Task Types Servers 2,5,6 λ2,5,6 1 4,7,8 λ4,7,8 3,4,9 λ3,4,9 2 3 n 7,8,9 λ7,8,9 λi,j,k i,j,k O(n3 ) unknown 8
  • 9. Local, Rack-local, and Remote Service 9 1 2 3 4 5 6 7 8 9 10 Rack 1 Rack 2 Task (1, 3, 4)
  • 10. Question 10 1 2 3 4 5 6 7 8 9 10 Rack 1 Rack 2 A new task arrives , and scheduling? What queue should the task be routed to? What algorithm to use for routing Idle To which queue should the server give service when it becomes idle?
  • 11. Metrics of Optimality for the Algorithm Throughput Optimality: Stabilizing any arrival rate vector within capacity region. Delay Optimality in Heavy-traffic: Asymptotically minimizing the average delay as the arrival rate vector approaches the boundary of the capacity region. 11
  • 12. Previous Work for Two Levels of Data Locality 1- Fluid model Planning, Harrison (98), Harrison-Lopez (99), Bell-Williams (05). 12 Task Types Servers 2,5,6 λ2,5,6 1 4,7,8 λ4,7,8 3,4,9 λ3,4,9 2 3 n 7,8,9 λ7,8,9 λi,j,k i,j,k O(n3 ) unknown
  • 13. Previous Work for Two Levels of Data Locality 1- Fluid Model Planning: 1.1 Throughput optimal 1.2 Heavy-traffic optimal But NOT practical! 13
  • 14. Previous Work for Two Levels of Data Locality 2- Join the Shortest Queue-Maxweight (JSQ-MW) Wang et al. (13). 2.1 Throughput optimal 2.2 Not heavy-traffic optimal in all loads 2.3 Heavy-traffic optimal in SPECIFIC loads 14
  • 15. Previous Work for Two Levels of Data Locality 3- Priority Algorithm for Near Data Scheduling (Pandas), Q. Xie, Y. Lu (15) 3.1 Throughput optimal 3.2 Heavy-traffic optimal for all loads 15
  • 16. Three Levels of Data Locality 1. Fluid Model planning 1. Throughput optimal 2. Heavy-traffic optimal 3. NOT practical! 2. Extension of JSQ-MaxWeight 1. Throughput optimal 2. NOT heavy-traffic optimal for all loads 3. Pandas 1. Not throughput optimal 2. Not heavy-traffic optimal 16
  • 17. Extension of JSQ-MW for Three Levels of Locality 17 1,2,3 Joining the Shortest One
  • 18. Extension of JSQ-MW for Three Levels of Locality • Extension of JSQ-MaxWeight for systems with rack structure, Xie et al. (16): – Throughput optimal. – Not heavy-traffic optimal in all loads. Just heavy traffic optimal in specific loads. 18
  • 19. Our Throughput and Heavy-traffic Optimal Algorithm • The routing and scheduling for our algorithm is as follows: – Routing: Weighted Workload – Scheduling: Priority Scheduling for Local, Rack- local, and Remote tasks queued in the 3 queues associated to each server. 19
  • 20. Weighted-Workload Routing 20 Rack 1 Rack 2 1 2 43 l k r l k rl k r l k r
  • 21. Weighted-Workload Routing 21 1 2 Rack 1 Rack 2 43 l - local k - rack-local r - remote workload W 1 W 2 W 3 W 4 l k r l k rl k r l k r
  • 22. Weighted-Workload Routing 22 1 2 Rack 1 Rack 2 43 W 1 W 2 W 3 W 4 local rack-local remote l k r l k rl k r l k r
  • 23. Weighted-Workload Routing 23 1 2 Rack 1 Rack 2 43 W 1 W 2 W 3 W 4 local rack-local remote l k r l k rl k r l k r
  • 24. Weighted-Workload Routing 24 1 2 43 W 1 W 2 W 3 W 4 < << l k r l k rl k r l k r Rack 1 Rack 2
  • 25. Priority Scheduling 25 1 2 Rack 1 Rack 2 43 Each server serves in the order of l k r l k rl k r l k r local,
  • 26. Priority Scheduling 26 1 2 Rack 1 Rack 2 43 Each server serves in the order of l k r l k rl k r l k r local, rack-local, remote
  • 27. Weighted Workload Algorithm The Weighted Workload (WW) algorithm proposed by Xie et al. (16) is proved to be both throughput optimal and heavy traffic optimal in all loads. 27
  • 30. Heavy-traffic Optimality in Special Load 30
  • 32. References • [1] Q. Xie, A. Yekkehkhany, Y. Lu. Scheduling with Multi-level Data Locality: Throughput and Heavy-traffic Optimality. In Proceedings of INFOCOM. IEEE, 2016. • [2] Q. Xie, and Y. Lu. Priotrity Algorithm for Near-data Scheduling: Throughput and Heavy-traffic Optimality. In Proceedings of INFOCOM. IEEE, 2015. • [3] W. Wang, K. Zhu, L. Ying, J. Tan, and L. Zhang. Map Task Schedul- ing in MapReduce with Data Locality: Throughput and Heavy-traffic Optimality. In Proceedings of INFOCOM. IEEE, 2013. • [4] J. M. Harrison. Heavy traffic analysis of a system with parallel servers: Asymptotic optimality of discrete review policies. Annals of Applied Probability, 1998. • [5] J. M. Harrison and M. J. L´opez. Heavy traffic resource pooling in parallel-server systems. Queueing Syst. Theory Appl., 33(4), Apr. 1999. 32
  • 33. Future Work • Scheduling for multi-level data locality instead of three levels of data locality. 33
  • 34. Thanks for Your Attention 34