Netflix's Big Leap from Oracle to Cassandra

NETFLIX’S BIG LEAP
FROM
ORACLE TO C*
ROOPA TANGIRALA
Engineering Manager
Netflix

WHO AM I?
Engineering Manager @ Netflix
Twitter - @roopatangirala
Email roopa.tangirala@gmail.com
Linkedin - https://www.linkedin.com/pub/roopa-tangirala/3/960/2b

OVERVIEW
• Brief History
• Set up
• Migration Strategy
• Migration Challenges
• Example of Real use cases
• Lessons learnt

REQUIREMENTS
• HIGHLY AVAILABLE
• MULTI DATACENTER SUPPORT
• PREDICTABLE PERFORMANCE AT SCALE

WHY C* ?
• Massively scalable architecture
• Multi-datacenter, multi-directional replication
• Linear scale performance
• Transparent fault detection and recovery
• Flexible, dynamic schema data
• Guaranteed data safety
• Tunable data consistency

MICRO SERVICES
• Horizontal, Homogenous, Commoditized

DATA MODEL REVIEW
keyspace
column family
Row
column
•name
•value
•timestamp

DB/Schema
Table
Row
column
• name
• value
O
R
A
C
L
E
CASSANDRA

SCHEMALESS DESIGN
• Row-oriented
• Number of columns/Names can differ
namexyz Paul zip 95123
nameabc Adam zip 94538 sex Male
namecde XYZ

UNDERSTAND WRITE PATH
client
Commit
log (Disk)
Memtable
(memory)
sstable sstable sstable
Flush

UNDERSTAND READ PATH
client
memtable
sstable
sstable
sstable
Row cache/key cache

LOGIC IN APPLICATION
• Stored procedures
• Functions
• Triggers
• Referential integrity constraints

DATACENTER AWS
ORACLE
C*
APP
DUAL WRITESREADS
CONSISTENCY
CHECKER
WRITES
READS
WRITES
FORKLIFT

MAIN APPROACH
• DUAL WRITES
• FORKLIFT OLD DATASET
• CONSISTENCY CHECKER
• MIGRATE READS TO C* FIRST
• SWITCH OFF WRITES TO DC

Relationships – Better in App Layer

ORACLE SEQUENCE
• USE UUID for Unique keys
• Distributed Sequence Generator for Ordering
• C* counters – not so much

Heavy Transactional Use Case
RDBMS

DENORMALIZE
DENORMALIZE
DENORMALIZE
DENORMALIZE
DENORMALIZE

API
• High concurrency
• Range scans
• ~1MB of data
• Caused heap issues at Cassandra level
• Very high read latency

Range scan replaced with
inverted index
0000/odp/{ui}/pathEvaluator_2 active Scripts_1 Text datafalseallocation
0000
/xbox/{ui}/pathEvaluator_
1
active Scripts_1 Text datafalseallocation
0000/tvui/{ui}/pathEvaluator_1 active Scripts_1 Text datafalseallocation
active_Scripts_idx
1,
2
Idx_1 /tvui/{ui}/pathEvaluator 2/odp/{ui}/pathEvaluator 2/odp/{ui}/pathEvaluator 2/odp/{ui}/pathEvaluator
scripts
client
1
2

Inverted Index considerations
• Column name can be used a row key placeholder
• Hotspots!!
• Sharding

Problem
Growth Pattern: “Hockey-stick”
Retention Policy: Retain forever
Access Pattern: Retrieve all for a
customer
Scaling and performance challenges as the data
grows
48

Goals
Small Storage Footprint
Consistent Read/Write Performance
Infinite Scalability
49

Old Data Model - Pros
Simple Wide Rows
No additional processing
Fast Write
Fast Read for Majority
51

Old Data Model - Cons
Read latency increases as number of viewing records
increases
Cassandra Internal
• Number of SSTables
• Compaction
• Read Repair
Lesson learned : Wide Row Limitation
52

New Data Model
Split Viewing History into two Column Families
1. Recent
• Small records with frequent updates
• Cassandra tuning : compaction, read
repair, etc.
2. Compressed
• Large historical records with rare updates
• Rollup
• Compression
• Cassandra: rare compaction, no read repair
53

Think Data Archival
Data stores grow exponentially
Have a process in place to archive data
• Moving to a separate column family
• Moving to a separate cluster (non SSD)
• Setting right expectations w.r.t latencies
with historical data
Cassandra TTL’s

Cinematch Rating Service
• First Model
• Second Model
Movie_id12345 BLOB
Movie_id4355 BLOB
545 1 545 4 545 534512345 4 545 2
454355 2 66 2 67 4

WHAT WORKED?
12345
4355
343 674
5
Name
3
4542443 242 Name
BLOB
4 BLOB52

BLOB vs COLUMN/VALUE
Try out column/value approach first and hopefully
it satisfies avg/95/99th
Column value pro’s:
• Write payload can be smaller
• Query by specific columns
• Read path does not require reading the entire row
Blob considerations
• Read percentage is very high (90’s)
• Read latencies are very important
• All the data is read most of the time

LESSONS LEARNT
• Get the model right
• Baby Steps
• Huge Engineering effort
• Think Compression
• Performance test
• DB just for Data

WE ARE HIRING !! – CHECK OUT JOBS.NETFLIX.COM

Netflix's Big Leap from Oracle to Cassandra

More Related Content

What's hot (20)

Similar to Netflix's Big Leap from Oracle to Cassandra (20)

Recently uploaded (20)

Netflix's Big Leap from Oracle to Cassandra

Editor's Notes