It's about time: Deep dive into event store using Apache Cassandra
by Nikunj Thakkar
Agenda
Big Data at-a-glance
● What is Big Data?
● So far in AJM Big Data Series
● Where is it? Am I using it?
Introduction to Apache Cassandra
● What, When and Why of Cassandra
● Protocol, Architecture, Queries and Everything else
● Interesting Use-cases
● Demo
Big Data at-a-glance
What is Big Data?
Large amount of data that can only be processed during night hours
"What was missing from my job description?"
So far in AJM Big Data Series
4 V's of Big Data
Volume
Variety
Velocity
Veracity
CAP Theorem
Consistency
Availability
Partition tolerance
Family of NoSQL Databases
Wide Column Store / Column Families
Document Store
Key Value / Tuple Store
Graph Databases
Multimodel Databases
Object Databases
Grid & Cloud Database Solutions
XML Databases
Multidimensional Databases
Multivalue Databases
Big Data: Where is it?
Big Data: Am I using it?
Targeted marketing
Public sector
Health care
Social media and web data
Global personal location tracking
Automated device generated data
Introduction to Apache Cassandra
Hey wait, first tell me about events and event stores.
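To make the question concrete, here is a minimal sketch of the idea, assuming the usual definition: an event is an immutable record of something that happened, and an event store is an append-only log of such records grouped by stream. The class and field names (`Event`, `InMemoryEventStore`, `stream_id`, `balance`) are hypothetical and chosen for illustration only; nothing here talks to Cassandra.

```python
from dataclasses import dataclass
from typing import Any, Dict, List


@dataclass(frozen=True)
class Event:
    """One immutable fact. All field names are illustrative assumptions."""
    stream_id: str          # the entity the event belongs to
    version: int            # position inside the stream, starting at 1
    kind: str               # what happened, e.g. "Deposited"
    data: Dict[str, Any]    # event payload


class InMemoryEventStore:
    """Append-only store: events are added and read back, never modified."""

    def __init__(self) -> None:
        self._streams: Dict[str, List[Event]] = {}

    def append(self, stream_id: str, kind: str, data: Dict[str, Any]) -> Event:
        stream = self._streams.setdefault(stream_id, [])
        event = Event(stream_id, len(stream) + 1, kind, dict(data))
        stream.append(event)
        return event

    def read(self, stream_id: str) -> List[Event]:
        # Return a copy so callers cannot alter the stored history.
        return list(self._streams.get(stream_id, []))


def balance(events: List[Event]) -> int:
    """Rebuild current state by replaying the events in order."""
    total = 0
    for event in events:
        if event.kind == "Deposited":
            total += event.data["amount"]
        elif event.kind == "Withdrawn":
            total -= event.data["amount"]
    return total
```

Current state is never stored directly in this sketch; it is derived by replaying the stream, which is the property that makes an append-friendly database a plausible backing store.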
What is Apache Cassandra?
Top level Apache Project
Born at Facebook
Google's Bigtable + Amazon's Dynamo = Cassandra
Demo Time
Network Topology – Multiple DC
Why Apache Cassandra?
Elastic scalability
Always-on architecture - no single point of failure
Fast linear-scale performance
Flexible data storage
Easy data distribution
Operational simplicity
Transaction support
Apache Cassandra: When to use?
Just kidding.... We will cover this part in use cases. :) :)
Apache Cassandra: Interesting Facts
Protocol
Thrift vs CQL Binary Protocol
Architecture
Key structures
➔ Node
➔ Data Center
➔ Cluster
➔ Commit Log
➔ Table
➔ SSTable
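How the commit log, the in-memory table and SSTables relate on a single node can be sketched roughly as follows. This is a toy model, not Cassandra's implementation: the class name `TinyNode`, the `flush_threshold` parameter and the choice to clear the whole commit log on flush are simplifying assumptions. It only illustrates the general write path: append to the commit log, update the in-memory table, and flush to an immutable sorted table on disk.

```python
class TinyNode:
    """Toy single-node write path (hypothetical names, simplified behaviour)."""

    def __init__(self, flush_threshold=2):
        self.commit_log = []     # append-only record of writes, for crash recovery
        self.memtable = {}       # in-memory latest value per key
        self.sstables = []       # immutable sorted snapshots, oldest first
        self.flush_threshold = flush_threshold

    def write(self, key, value):
        # 1. Record the write durably first, 2. then apply it in memory.
        self.commit_log.append((key, value))
        self.memtable[key] = value
        if len(self.memtable) >= self.flush_threshold:
            self.flush()

    def flush(self):
        # Freeze the memtable into a sorted, immutable table.
        if not self.memtable:
            return
        self.sstables.append(tuple(sorted(self.memtable.items())))
        self.memtable = {}
        # Simplification: the flushed writes no longer need the log.
        self.commit_log.clear()

    def read(self, key):
        # Newest data wins: check memory first, then tables from newest to oldest.
        if key in self.memtable:
            return self.memtable[key]
        for sstable in reversed(self.sstables):
            for stored_key, stored_value in sstable:
                if stored_key == key:
                    return stored_value
        return None
```

Because flushed tables are never rewritten in place, an update to an existing key simply lands in a newer structure and shadows the older value on read.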
Key components
➔ Gossip
➔ Partitioner
➔ Replication factor
➔ Replica placement strategy
➔ Snitch
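The interplay of partitioner, replication factor and replica placement can be sketched with a small token ring. This is an illustration under stated assumptions, not Cassandra's code: `Ring` and `token` are hypothetical names, each node gets a single token derived from its name, MD5 stands in for the partitioner's hash, and replicas are simply the next nodes clockwise on the ring, with no awareness of racks or data centers.

```python
import bisect
import hashlib


def token(value: str) -> int:
    """Stand-in partitioner: map a string to a position on the ring."""
    return int(hashlib.md5(value.encode("utf-8")).hexdigest(), 16)


class Ring:
    """Toy token ring with one token per node (hypothetical, simplified)."""

    def __init__(self, nodes, replication_factor):
        if not 1 <= replication_factor <= len(nodes):
            raise ValueError("replication factor must be between 1 and the node count")
        self.replication_factor = replication_factor
        self._ring = sorted((token(name), name) for name in nodes)
        self._tokens = [t for t, _ in self._ring]

    def replicas(self, partition_key: str):
        """Return the nodes that hold copies of the given partition key."""
        # First node whose token is >= the key's token, wrapping around the ring.
        start = bisect.bisect_left(self._tokens, token(partition_key)) % len(self._ring)
        return [
            self._ring[(start + offset) % len(self._ring)][1]
            for offset in range(self.replication_factor)
        ]
```

With four nodes and a replication factor of 3, `Ring(["node-a", "node-b", "node-c", "node-d"], 3).replicas("user-42")` returns three distinct node names, and the same key always maps to the same three nodes.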
Cassandra Query Language
➔ CRUD
➔ Data Modeling
➔ Indexing
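As a rough illustration of the three topics above, the sketch below keeps a handful of CQL statements as Python strings. The table name `events`, its columns and the index name `events_by_kind` are assumptions made up for this example, not taken from the talk. The statements are not executed here; running them would need a Cassandra driver and a live cluster, with the `?` placeholders bound through prepared statements.

```python
# Hypothetical schema: one partition per event stream, rows ordered by time.
EVENTS_CQL = {
    # Data modeling: stream_id is the partition key, event_time the clustering column.
    "create_table": """
        CREATE TABLE IF NOT EXISTS events (
            stream_id  text,
            event_time timeuuid,
            kind       text,
            payload    text,
            PRIMARY KEY ((stream_id), event_time)
        ) WITH CLUSTERING ORDER BY (event_time DESC);
    """,
    # CRUD
    "insert": "INSERT INTO events (stream_id, event_time, kind, payload) VALUES (?, now(), ?, ?);",
    "select": "SELECT kind, payload FROM events WHERE stream_id = ? LIMIT 10;",
    "update": "UPDATE events SET payload = ? WHERE stream_id = ? AND event_time = ?;",
    "delete": "DELETE FROM events WHERE stream_id = ? AND event_time = ?;",
    # Indexing: a secondary index on a non-key column.
    "create_index": "CREATE INDEX IF NOT EXISTS events_by_kind ON events (kind);",
}


def crud_statements():
    """Return the CRUD statements in create, read, update, delete order."""
    return [EVENTS_CQL[name] for name in ("insert", "select", "update", "delete")]
```

In an event store the `update` and `delete` statements would normally go unused, since events are meant to be immutable; they are listed only to round out CRUD. The `select` reads the ten newest events of one stream because the clustering order is descending.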
Apache Cassandra @ Disqus
➔ Disqus is a discussion platform for the web. It connects publishers with users and allows them to have a public discourse in a medium that allows communication across the web.
➔ Disqus uses Cassandra in a number of different places, mainly in the product: it's used for content recommendation and also a little bit of advertising. Let's say you're reading an article about the war in Syria and you notice another interesting article about what the British PMs have released as a public statement on whether or not it's legal to go to war, and maybe you're interested in reading that response. Cassandra powers the analytics and content engine behind how Disqus recommends content.
➔ Main cluster - 24 nodes
➔ CPU - 6-core 3 GHz Xeons (the biggest), because CPU is turning out to be a small bottleneck at times
➔ 24 GB RAM per node, 8 GB heap size
➔ 32 or 48 GB of RAM wasn't helping much
➔ It's handling our load of about 30,000 reads a second
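A back-of-the-envelope reading of these figures, assuming the read load is spread evenly across the 24 nodes (an assumption; the slide does not say so, and replica reads at higher consistency levels would add to the per-node work). The function names are hypothetical.

```python
def reads_per_node(cluster_reads_per_second: float, node_count: int) -> float:
    """Average client reads per second per node, assuming an even spread."""
    if node_count <= 0:
        raise ValueError("node_count must be positive")
    return cluster_reads_per_second / node_count


def heap_fraction(heap_gb: float, ram_gb: float) -> float:
    """Share of a node's RAM given to the JVM heap."""
    if ram_gb <= 0:
        raise ValueError("ram_gb must be positive")
    return heap_gb / ram_gb
```

Under that assumption, `reads_per_node(30_000, 24)` gives 1250 reads per second per node, and `heap_fraction(8, 24)` shows the heap taking one third of each node's memory, leaving the rest to the operating system.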
Apache Cassandra @ Many other companies
Thank you :)
