ScaleJoin: a Deterministic, Disjoint-Parallel and Skew-Resilient Stream Join

Vincenzo Gulisano, Yiannis Nikolakopoulos, Marina Papatriantafilou, Philippas Tsigas
Chalmers University of Technology
2015-10-31

Agenda
• What is a stream join?
• What are the challenges of a parallel stream join?
• Why ScaleJoin?
• How well does ScaleJoin address stream joins’ challenges?
• Conclusions

Motivation
Applications in sensor networks and cyber-physical systems:
• Large and fluctuating volumes of data generated continuously
Demand for:
• Continuous processing of data streams
• In a real-time fashion
Store-then-process is not feasible!

What is a stream join?
Data stream: unbounded sequence of tuples
[Figure: two streams R and S carrying tuples t1, t2, t3, t4; sliding windows WR and WS of window size WS; predicate P]

Why parallel stream joins?
• Window size WS = 600 seconds
• R receives 500 tuples/second
• S receives 500 tuples/second
• WR (the window over R) will contain 300,000 tuples
• WS (the window over S) will contain 300,000 tuples
• Each new tuple from R gets compared with all the tuples in WS
• Each new tuple from S gets compared with all the tuples in WR
… 300,000,000 comparisons/second!
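The arithmetic behind these figures can be checked in a few lines. This is only a worked calculation using the numbers on the slide; the variable names are illustrative.

```python
# Figures taken from the slide.
window_size_s = 600      # window size, in seconds
rate_r = 500             # tuples/second arriving on R
rate_s = 500             # tuples/second arriving on S

# In steady state each window holds (window size * arrival rate) tuples.
tuples_in_wr = window_size_s * rate_r
tuples_in_ws = window_size_s * rate_s

# Every new R tuple is compared with all of WS, and every new S tuple with all of WR.
comparisons_per_second = rate_r * tuples_in_ws + rate_s * tuples_in_wr

print(tuples_in_wr, tuples_in_ws, comparisons_per_second)
# prints: 300000 300000 300000000
```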

What are the challenges of a parallel stream join?
• Scalability
• High throughput
• Low latency
• Disjoint parallelism
• Skew resilience
• Determinism

The 3-step procedure (sequential stream join)
For each incoming tuple t:
1. compare t with all tuples in opposite window given predicate P
2. add t to its window
3. remove stale tuples from t’s window
[Figure: Prod R and Prod S add tuples to R and S; the PU runs the join; Cons consumes the results]
We assume each producer delivers tuples in timestamp order.
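The three steps can be sketched as follows. This is a minimal single-threaded illustration, not the authors' implementation: the class name `SequentialJoin`, the convention that a tuple's first field is its timestamp, and the rule that a tuple is stale once it is older than the newest tuple minus the window size are all assumptions made for the example.

```python
from collections import deque

class SequentialJoin:
    """Hypothetical three-step sequential stream join over streams "R" and "S"."""

    def __init__(self, window_size, predicate):
        self.window_size = window_size
        self.predicate = predicate            # predicate(r_tuple, s_tuple) -> bool
        self.windows = {"R": deque(), "S": deque()}

    def process(self, stream, t):
        """Process tuple t (first field = timestamp) arriving on `stream`."""
        opposite = "S" if stream == "R" else "R"
        results = []
        # Step 1: compare t with all tuples in the opposite window.
        for other in self.windows[opposite]:
            r, s = (t, other) if stream == "R" else (other, t)
            if self.predicate(r, s):
                results.append((r, s))
        # Step 2: add t to its own window.
        own = self.windows[stream]
        own.append(t)
        # Step 3: remove stale tuples from t's window (assumed staleness rule).
        while own and own[0][0] < t[0] - self.window_size:
            own.popleft()
        return results
```

For example, with `join = SequentialJoin(10, lambda r, s: r[1] == s[1])`, feeding `("R", (1, "a"))` and then `("S", (2, "a"))` yields one match on the second call.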

The 3-step procedure, is it enough?
Challenges: scalability, high throughput, low latency, disjoint parallelism, skew resilience, determinism
[Figure: windows WR and WS over streams R and S, each holding t1 and t2, shown twice with incoming tuples t3 and t4]

Enforcing determinism in sequential stream joins
• Next tuple to process = earliest(tS, tR)
• The earliest(tS, tR) tuple is referred to as the next ready tuple
• Process ready tuples in timestamp order → Determinism
[Figure: a PU with pending tuples tS and tR]
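The selection of the next ready tuple can be sketched as below. The function name `next_ready_tuple`, the use of two queues of pending tuples, and the tie-break in favour of R are assumptions for illustration; the slide only states that the earlier of tS and tR is processed next.

```python
from collections import deque

def next_ready_tuple(pending_r, pending_s):
    """Return (stream, tuple) for the earliest pending tuple, or None.

    pending_r and pending_s are deques of tuples in timestamp order,
    with the timestamp as first field. A tuple is only ready when both
    streams have a pending tuple, so that no earlier tuple can still arrive.
    """
    if not pending_r or not pending_s:
        return None
    # Assumed tie-break: on equal timestamps, R goes first.
    if pending_r[0][0] <= pending_s[0][0]:
        return "R", pending_r.popleft()
    return "S", pending_s.popleft()
```

Because the choice depends only on timestamps and not on arrival interleaving, two runs over the same input produce the same processing order.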

Deterministic 3-step procedure
Pick the next ready tuple t:
1. compare t with all tuples in opposite window given predicate P
2. add t to its window
3. remove stale tuples from t’s window
[Figure: Prod R and Prod S add tuples to R and S; the PU processes ready tuples; Cons consumes the results]

Shared-nothing parallel stream join (state-of-the-art)
• Prod R, Prod S: choose a PU, then add the tuple to PUi's S or R
• Each PU (PU1, PU2, …, PUN): pick the next ready tuple t and compare t with all tuples in the opposite window given P
• Merge: take the next ready output tuple
• Cons: consume results
Challenges: scalability, high throughput, low latency, disjoint parallelism, skew resilience, determinism

Shared-nothing parallel stream join (state-of-the-art)
[Figure: the same pipeline (Prod R, Prod S, PU1, PU2, …, PUN, Merge, Cons), with enqueue() and dequeue() operations between components]

From coarse-grained to fine-grained synchronization
[Figure: Prod R, Prod S, PU1, PU2, …, PUN and Cons]

ScaleGate
addTuple(tuple, sourceID)
allows a tuple from sourceID to be merged by ScaleGate into the resulting timestamp-sorted stream of ready tuples.
getNextReadyTuple(readerID)
provides readerID with the earliest ready tuple that it has not yet consumed.
https://github.com/dcs-chalmers/ScaleGate_Java
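The semantics of the two operations can be illustrated with a toy model. This is not the ScaleGate implementation (which is a concurrent Java data structure); it is a single-threaded sketch with hypothetical names (`ToyScaleGate`, `add_tuple`, `get_next_ready_tuple`). It assumes that each source adds tuples in timestamp order and that a tuple is ready once every source has delivered a tuple with an equal or later timestamp.

```python
import bisect
import itertools

class ToyScaleGate:
    """Single-threaded toy model of the two ScaleGate operations."""

    def __init__(self, num_sources, num_readers):
        self.latest = [None] * num_sources    # latest timestamp seen per source
        self.merged = []                      # entries (timestamp, arrival_no, tuple), sorted
        self.position = [0] * num_readers     # per-reader index of next tuple to read
        self._arrival = itertools.count()     # breaks ties between equal timestamps

    def add_tuple(self, tup, source_id):
        """Merge tup (first field = timestamp) from source_id into the sorted stream."""
        bisect.insort(self.merged, (tup[0], next(self._arrival), tup))
        self.latest[source_id] = tup[0]

    def get_next_ready_tuple(self, reader_id):
        """Return the earliest ready tuple reader_id has not consumed yet, or None."""
        if None in self.latest:
            return None                       # some source has not delivered anything yet
        i = self.position[reader_id]
        if i < len(self.merged) and self.merged[i][0] <= min(self.latest):
            self.position[reader_id] = i + 1
            return self.merged[i][2]
        return None
```

Each reader keeps its own position, so every reader observes the same timestamp-sorted sequence of ready tuples independently of the others.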

ScaleJoin
• Prod R, Prod S: add tuple to SGin
• Steps for each PU: get next ready input tuple t from SGin, then
1. compare t with all tuples in opposite window given predicate P
2. add t to its window in a round-robin fashion
3. remove stale tuples from t’s window
• Cons: get next ready output tuple from SGout

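ScaleJoin (example)
[Figure: in the sequential stream join, a single window WR holds t1, t2 and t3 and the incoming S tuple t4 is compared with all of them; in ScaleJoin with 3 PUs, t1, t2 and t3 are each held in a different PU's WR and every PU compares t4 with its own part]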
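The example can be reproduced with a small simulation. It is a sketch under assumptions: the PUs are run one after another rather than in parallel, the class name `ScaleJoinPU` is hypothetical, round-robin ownership is decided by a local counter of tuples seen, and the staleness rule is the same assumed one as in a sequential join (older than the newest tuple minus the window size).

```python
from collections import deque

class ScaleJoinPU:
    """One processing unit: sees every ready tuple, stores only every N-th one."""

    def __init__(self, pu_id, num_pus, window_size, predicate):
        self.pu_id, self.num_pus = pu_id, num_pus
        self.window_size, self.predicate = window_size, predicate
        self.windows = {"R": deque(), "S": deque()}
        self.seen = 0            # number of ready tuples processed so far
        self.comparisons = 0     # bookkeeping for the example only

    def process(self, stream, t):
        opposite = "S" if stream == "R" else "R"
        results = []
        # Step 1: compare t with this PU's share of the opposite window.
        for other in self.windows[opposite]:
            self.comparisons += 1
            r, s = (t, other) if stream == "R" else (other, t)
            if self.predicate(r, s):
                results.append((r, s))
        # Step 2: only the PU whose turn it is stores t (round-robin).
        if self.seen % self.num_pus == self.pu_id:
            self.windows[stream].append(t)
        self.seen += 1
        # Step 3: remove stale tuples from t's window (assumed staleness rule).
        own = self.windows[stream]
        while own and own[0][0] < t[0] - self.window_size:
            own.popleft()
        return results

# t1, t2, t3 arrive on R, then t4 on S; tuples are (timestamp, value).
ready = [("R", (1, 5)), ("R", (2, 7)), ("R", (3, 5)), ("S", (4, 5))]
pus = [ScaleJoinPU(i, 3, 600, lambda r, s: r[1] == s[1]) for i in range(3)]
matches = [m for stream, t in ready for pu in pus for m in pu.process(stream, t)]
print(matches)                           # prints: [((1, 5), (4, 5)), ((3, 5), (4, 5))]
print(sum(pu.comparisons for pu in pus)) # prints: 3
```

Each of the three comparisons for t4 is carried out by a different PU, and together the PUs produce the same matches a sequential join would.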

ScaleJoin
[Figure: the ScaleJoin pipeline with additional Prod R and Prod S producers adding tuples to SGin; each PUi gets the next ready input tuple from SGin and adds t to its window in a round-robin fashion; Cons gets the next ready output tuple from SGout]
Challenges: scalability, high throughput, low latency, disjoint parallelism, skew resilience, determinism

Evaluation setup
• Common benchmark
• Implemented in Java
• Evaluation platform
– NUMA architecture: 2.6 GHz AMD Opteron 6230 (48 cores over 4 sockets), 64 GB of memory
– Architecture with Hyper-Threading: 2.0 GHz Intel Xeon E5-2650 (16 cores over 2 sockets), 64 GB of memory
R: <timestamp,x,y,z> S: <timestamp,a,b,c,d>
P: a−10≤x≤a+10 AND b−10≤y≤b+10
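Predicate P translates directly into code. The function name `band_predicate` and the representation of R and S tuples as plain Python tuples in the field order shown above are assumptions for illustration.

```python
def band_predicate(r, s):
    """P: a-10 <= x <= a+10 AND b-10 <= y <= b+10.

    r is an R tuple (timestamp, x, y, z); s is an S tuple (timestamp, a, b, c, d).
    """
    _, x, y, _z = r
    _, a, b, _c, _d = s
    return a - 10 <= x <= a + 10 and b - 10 <= y <= b + 10
```

For instance, `band_predicate((0, 5, 5, 0), (1, 10, -3, 0, 0))` holds because x is within 10 of a and y is within 10 of b, while changing a to 16 makes it fail.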

ScaleJoin Scalability – comparisons/second
[Figure: comparisons/second versus number of PUs]

ScaleJoin latency – milliseconds
[Figure: latency in milliseconds versus number of PUs]

ScaleJoin skew-resilience
Constant distinct rates with peaks

Conclusions
• ScaleJoin: a Deterministic, Disjoint-Parallel and Skew-Resilient Stream Join
• Challenges of parallel stream joins: scalability, high throughput, low latency, disjoint parallelism, skew resilience, determinism
• Fine-grained synchronization (ScaleGate)
• 4 billion comparisons/second, with latency lower than 60 milliseconds

Thank you! Questions?
