CASSANDRA
Looking at the bottom
What are we learning today?


> Why a NoSQL store like Cassandra?

> Internals of Cassandra

> Basics of distributed systems theory

> Basics of scalable system design
> Consistency via ACID transactions

> Strict constraint checks

> Maturity

> Huge toolkits

Strength of SQL
CAP Theorem
Challenges with RDBMS
> Guess how long Aurora might take to handle a
network partition or a master failure?

Challenges with RDBMS
> The single-master challenge (heavy write loads are hard to handle)

Challenges with RDBMS
SHARDING

Solution for heavy writes
> Achieving high availability

> Avoiding split-brain syndrome

> Faster writes

Tackle these topics
Cassandra uses a synthesis of well known techniques to achieve scalability and
availability. Cassandra was designed to fulfill the storage needs of the Inbox
Search problem. Inbox Search is a feature that enables users to search through
their Facebook Inbox. At Facebook this meant the system was required to
handle a very high write throughput, billions of writes per day, and also scale
with the number of users. Since users are served from data centers that are
geographically distributed, being able to replicate data across data centers was
key to keep search latencies down. Inbox Search was launched in June of 2008
for around 100 million users and today we are at over 250 million users and
Cassandra has kept up the promise so far. Cassandra is now deployed as the
backend storage system for multiple services within Facebook.
History of CASSANDRA
High Availability
In Cassandra, cluster nodes communicate with each
other using a protocol called “Gossip”

How does a node know about other nodes?
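To make the gossip idea concrete, here is a toy simulation. It is a minimal sketch, not Cassandra's actual gossip messages: the names `make_cluster`, `gossip_round`, and `simulate` are hypothetical, and it assumes every node starts out knowing only itself and one seed node (`node0`). In each round a node picks a random peer it already knows and the two merge their membership views, keeping the highest heartbeat version seen for each node.

```python
import random


def make_cluster(num_nodes):
    """Each node knows itself (heartbeat 1) and the seed node0 (heartbeat 0 = address only)."""
    views = {}
    for i in range(num_nodes):
        name = f"node{i}"
        views[name] = {name: 1}
        views[name].setdefault("node0", 0)  # assumption: node0 is the configured seed
    return views


def gossip_round(views, rng):
    """Every node exchanges its view with one randomly chosen known peer."""
    for node in list(views):
        peers = [n for n in views[node] if n != node]
        if not peers:
            continue  # the seed knows nobody until someone contacts it
        peer = rng.choice(peers)
        views[node][node] += 1  # bump own heartbeat before gossiping
        names = set(views[node]) | set(views[peer])
        merged = {
            name: max(views[node].get(name, 0), views[peer].get(name, 0))
            for name in names
        }
        # Push-pull: both sides end up with the merged view.
        views[node] = dict(merged)
        views[peer] = dict(merged)


def simulate(num_nodes=8, seed=1):
    """Run rounds until every node knows every other node; return (rounds, views)."""
    rng = random.Random(seed)
    views = make_cluster(num_nodes)
    rounds = 0
    while any(len(view) < num_nodes for view in views.values()):
        gossip_round(views, rng)
        rounds += 1
    return rounds, views
```

Calling `simulate(num_nodes=8)` returns the number of rounds it took and the final views. The point of the sketch is that no node needs a central registry: membership spreads through pairwise exchanges.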
Consistent Hashing
> Fault tolerance is gained through replication

> Scaling out or in is handled easily by consistent hashing

Fault Tolerance
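The two points above can be sketched with a small hash ring. This is an illustration under assumptions, not Cassandra's partitioner: the class `HashRing`, the MD5-based token function, and the `vnodes` and `replicas` defaults are all hypothetical choices. Each key is stored on the next `replicas` distinct nodes found walking clockwise from the key's token.

```python
import bisect
import hashlib


class HashRing:
    def __init__(self, nodes, vnodes=64, replicas=3):
        self.vnodes = vnodes      # tokens per node, to smooth the distribution
        self.replicas = replicas  # how many distinct nodes store each key
        self._ring = []           # sorted list of (token, node)
        for node in nodes:
            self.add_node(node)

    @staticmethod
    def _token(key):
        # Assumption: MD5 as a stand-in hash; any well-mixed hash would do.
        return int.from_bytes(hashlib.md5(key.encode()).digest()[:8], "big")

    def add_node(self, node):
        for i in range(self.vnodes):
            bisect.insort(self._ring, (self._token(f"{node}#{i}"), node))

    def remove_node(self, node):
        self._ring = [(t, n) for t, n in self._ring if n != node]

    def owners(self, key):
        """Walk clockwise from the key's token and collect distinct nodes."""
        start = bisect.bisect_right(self._ring, (self._token(key), ""))
        found = []
        for i in range(len(self._ring)):
            node = self._ring[(start + i) % len(self._ring)][1]
            if node not in found:
                found.append(node)
            if len(found) == self.replicas:
                break
        return found
```

When a node is added, only the keys that fall into its new token ranges change their primary owner. When a node is removed, the surviving replicas of each key stay in its owner list, so the data remains reachable.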
Internal data structures that make a
massive volume of writes fast

Achieving faster writes
Number to remember
commit_log (disk) -> write to replica -> response to client
What happens when we run a read query like the one below after
inserting billions of rows?

select * from activities where
activity_id=875858858 and timestamp>x and
timestamp<y

How do writes work?
INDEXING

(add more details and challenges)

Answer
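One way to see why an index answers the query above: if rows are kept sorted by (activity_id, timestamp), the matching range can be found with two binary searches instead of a scan over billions of rows. The sketch below assumes an in-memory sorted list and a hypothetical helper `range_scan`; it illustrates the idea only and says nothing about how Cassandra lays out its data.

```python
import bisect


def range_scan(keys, rows, activity_id, ts_low, ts_high):
    """Return rows with this activity_id and ts_low < timestamp < ts_high.

    `keys` is a list of (activity_id, timestamp) tuples sorted ascending,
    and `rows[i]` is the row stored under `keys[i]`.
    """
    # First position strictly after (activity_id, ts_low): excludes ts_low itself.
    lo = bisect.bisect_right(keys, (activity_id, ts_low))
    # First position at or after (activity_id, ts_high): excludes ts_high itself.
    hi = bisect.bisect_left(keys, (activity_id, ts_high))
    return rows[lo:hi]
```

Each lookup costs O(log n) comparisons plus the size of the result, which is what makes the timestamp range query practical on a very large table.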
What is a log-structured merge tree?
LSM tree
Answer
Steps:

* Log the record (for recovery in case of a
crash)

* Add it to the C0 component (in-memory writes)

* Repeatedly merge into C1 (compaction)
and write to SSTables on disk.

How does it work?
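The steps above can be sketched as a tiny in-memory model. This is a simplified illustration with hypothetical names (`TinyLSM`, `memtable_limit`): the commit log and the SSTables are plain Python lists standing in for files on disk, and compaction merges everything into one sorted run.

```python
import bisect


class TinyLSM:
    def __init__(self, memtable_limit=3):
        self.commit_log = []   # stand-in for the append-only log on disk
        self.memtable = {}     # C0: in-memory writes
        self.sstables = []     # C1: immutable sorted runs, oldest first
        self.memtable_limit = memtable_limit

    def put(self, key, value):
        self.commit_log.append((key, value))  # 1. log the record for crash recovery
        self.memtable[key] = value            # 2. add to C0
        if len(self.memtable) >= self.memtable_limit:
            self.flush()

    def flush(self):
        """3. Write C0 out as a sorted, immutable run (an SSTable)."""
        self.sstables.append(sorted(self.memtable.items()))
        self.memtable = {}
        self.commit_log = []   # flushed data no longer needs replay

    def compact(self):
        """Merge all runs into one; newer values overwrite older ones."""
        merged = {}
        for table in self.sstables:  # oldest first, so later updates win
            merged.update(table)
        self.sstables = [sorted(merged.items())]

    def get(self, key):
        """Check C0 first, then the runs from newest to oldest."""
        if key in self.memtable:
            return self.memtable[key]
        for table in reversed(self.sstables):
            i = bisect.bisect_left(table, (key,))
            if i < len(table) and table[i][0] == key:
                return table[i][1]
        return None
```

Writes only append to the log and update memory, which is why they are fast. Reads may have to look in several places until compaction has merged the runs.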
What does it look like?
How do reads work?
Thanks
The Omar
Future Principal Software Engineer
