SlideShare a Scribd company logo
SOUTH BAY CASSANDRA USERS MARCH 2016
BACKUP AND RESTORE FOR
APACHE CASSANDRA
Aaron Morton
@aaronmorton
CEO
Licensed under a Creative Commons Attribution-NonCommercial 3.0 New Zealand License
AboutThe Last Pickle.
Work with clients to deliver and improve Apache Cassandra
based solutions.
Apache Cassandra Committer and DataStax MVPs.
Based in New Zealand,Australia, France & USA.
Why Backup
Commit Log Archiving
Table Snap
Why Backup?
Replication is for Availability.
Why Backup?
Replicate good data as fast as
bad data.
Three ReasonsTo Backup…
Business Continuity Planning /
Disaster Recovery Planning
(AKA Data Centre is on fire.)
Three ReasonsTo Backup…
Environment Cloning
(AKA Let’s make a new Data Centre.)
Three ReasonsTo Backup…
Point In Time Recovery
(AKA Bad deploy.)
Why Backup
Commit Log Archiving
Table Snap
Commit Log
Writes are first written to the
Commit Log (on each node).
Commit Log
Commit Log can grow up to
8GB in size.
Commit Log
Commit Log is made up of 32
MB Segments.
Commit Log
Commit Log contains
Mutations, which have row
fragments.
Commit Log
Mutations are serialised in the
form they are sent over the
wire.
Commit Log Archiving
Archive Segment when full.
Restore Segments at startup
(if specified).
commitlog_archiving.properties
archive_command=
Run this command when a Segment
is full.
commitlog_archiving.properties
restore_directories=
Read all files in this CSV list of
directories at startup and run
restore_command for each.
commitlog_archiving.properties
restore_point_in_time=
Stop processing mutations with a
timestamp higher than this.
commitlog_archiving.properties
precision=MICROSECONDS
Precision used for timestamps.
Cassandra Parameter
-Dcassandra.replayList=
CSV white list of keyspace.table to
replay.
Why Backup
Commit Log Archiving
Table Snap
Table Snap
Table Snap
Continually Backup and
Restore SSTables to S3.
tablesnap
Watch for files closed or
moved into the data
directories.
tablesnap
Upload all SSTable
components, splitting large
files, using multiple threads.
tablesnap
Includes a list of SSTables in
the directory.
tablesnap
Skips file if it was removed by
compaction during processing.
tablechop
Deletes old files from the
backup set to implement a
rolling window.
tablechop
Specify how many days to
keep.
tablechop
Use - -debug to reduce the
stress.
(AKA Dry Run, does not delete the files.)
tableslurp
Slurp SSTables from S3 to a
local directory for restoring.
tableslurp
Restores the latest backup set,
or a named backup set.
Table Snap Pros
Simple.
Table Snap Cons
No monitoring.
Manual restore into cluster.
No support for topology
change.
Thanks.
Aaron Morton
@aaronmorton
Co-Founder & Principal Consultant
www.thelastpickle.com

More Related Content

PDF
Apache Con NA 2013 - Cassandra Internals
PDF
Cassandra SF Meetup - CQL Performance With Apache Cassandra 3.X
PPTX
Reduce Resource Consumption & Clone in Seconds your Oracle Virtual Environmen...
PPTX
Cassandra Backups and Restorations Using Ansible (Joshua Wickman, Knewton) | ...
PDF
Cassandra London March 2016 - Lightening talk - introduction to incremental ...
PPTX
Oracle: Binding versus caging
PPTX
Example R usage for oracle DBA UKOUG 2013
PDF
Advanced Apache Cassandra Operations with JMX
Apache Con NA 2013 - Cassandra Internals
Cassandra SF Meetup - CQL Performance With Apache Cassandra 3.X
Reduce Resource Consumption & Clone in Seconds your Oracle Virtual Environmen...
Cassandra Backups and Restorations Using Ansible (Joshua Wickman, Knewton) | ...
Cassandra London March 2016 - Lightening talk - introduction to incremental ...
Oracle: Binding versus caging
Example R usage for oracle DBA UKOUG 2013
Advanced Apache Cassandra Operations with JMX

What's hot (20)

DOCX
Enable archivelod mode in oracle rac12cR1 with asm location
PDF
The Best and Worst of Cassandra-stress Tool (Christopher Batey, The Last Pick...
PDF
Cassandra 2.1 boot camp, Read/Write path
PPTX
Building Spark as Service in Cloud
PPTX
Montreal User Group - Cloning Cassandra
PDF
Database High Availability Using SHADOW Systems
ODP
Clug 2011 March web server optimisation
PDF
Cassandra Community Webinar | In Case of Emergency Break Glass
KEY
PostgreSQL
PDF
Learning postgresql
PDF
Case Studies on PostgreSQL
PDF
Accumulo Summit 2015: Using Fluo to incrementally process data in Accumulo [API]
PDF
A Detailed Look At cassandra.yaml (Edward Capriolo, The Last Pickle) | Cassan...
PPTX
Accumulo Summit 2015: Reactive programming in Accumulo: The Observable WAL [I...
ODP
Introduction to apache_cassandra_for_developers-lhg
PDF
InfluxDB IOx Tech Talks: Intro to the InfluxDB IOx Read Buffer - A Read-Optim...
PPT
Introduction to apache_cassandra_for_develope
PPTX
mesos-devoxx14
PDF
Cassandra at Instagram (August 2013)
PPTX
HBase at Flurry
Enable archivelod mode in oracle rac12cR1 with asm location
The Best and Worst of Cassandra-stress Tool (Christopher Batey, The Last Pick...
Cassandra 2.1 boot camp, Read/Write path
Building Spark as Service in Cloud
Montreal User Group - Cloning Cassandra
Database High Availability Using SHADOW Systems
Clug 2011 March web server optimisation
Cassandra Community Webinar | In Case of Emergency Break Glass
PostgreSQL
Learning postgresql
Case Studies on PostgreSQL
Accumulo Summit 2015: Using Fluo to incrementally process data in Accumulo [API]
A Detailed Look At cassandra.yaml (Edward Capriolo, The Last Pickle) | Cassan...
Accumulo Summit 2015: Reactive programming in Accumulo: The Observable WAL [I...
Introduction to apache_cassandra_for_developers-lhg
InfluxDB IOx Tech Talks: Intro to the InfluxDB IOx Read Buffer - A Read-Optim...
Introduction to apache_cassandra_for_develope
mesos-devoxx14
Cassandra at Instagram (August 2013)
HBase at Flurry
Ad

Similar to Cassandra South Bay Meetup - Backup And Restore For Apache Cassandra (11)

PDF
DataStax: Backup and Restore in Cassandra and OpsCenter
PDF
Think_your_Postgres_backups_and_recovery_are_safe_lets_talk.pptx
PDF
MySQL Server Backup, Restoration, and Disaster Recovery Planning
PPTX
Backups
PDF
MySQL Server Backup, Restoration, And Disaster Recovery Planning Presentation
PPTX
Datos io meetup 2016
PPT
Les 05 Create Bu
PDF
Rman Reporting Opertions - LIST & REPORT
PDF
1 Dundee - Cassandra 101
ODP
Pdb my sql backup london percona live 2012
PDF
DB2 LUW - Backup and Recovery
DataStax: Backup and Restore in Cassandra and OpsCenter
Think_your_Postgres_backups_and_recovery_are_safe_lets_talk.pptx
MySQL Server Backup, Restoration, and Disaster Recovery Planning
Backups
MySQL Server Backup, Restoration, And Disaster Recovery Planning Presentation
Datos io meetup 2016
Les 05 Create Bu
Rman Reporting Opertions - LIST & REPORT
1 Dundee - Cassandra 101
Pdb my sql backup london percona live 2012
DB2 LUW - Backup and Recovery
Ad

More from aaronmorton (17)

PDF
Cassandra Day Atlanta 2016 - Monitoring Cassandra
PDF
Cassandra SF 2015 - Repeatable, Scalable, Reliable, Observable Cassandra
PDF
Cassandra sf 2015 - Steady State Data Size With Compaction, Tombstones, and TTL
PDF
Cassandra TK 2014 - Large Nodes
PDF
Cassandra Community Webinar August 29th 2013 - In Case Of Emergency, Break Glass
PDF
Cassandra Community Webinar - August 22 2013 - Cassandra Internals
PDF
Cassandra SF 2013 - In Case Of Emergency Break Glass
PDF
Cassandra SF 2013 - Cassandra Internals
PDF
Cassandra Community Webinar - Introduction To Apache Cassandra 1.2
PDF
Apache Cassandra in Bangalore - Cassandra Internals and Performance
KEY
Cassandra SF 2012 - Technical Deep Dive: query performance
KEY
Hello @world #cassandra
KEY
Cassandra does what ? Code Mania 2012
PDF
Nzpug welly-cassandra-02-12-2010
PDF
Introduction to Cassandra
PDF
Building a distributed Key-Value store with Cassandra
PDF
Cassandra - Wellington No Sql
Cassandra Day Atlanta 2016 - Monitoring Cassandra
Cassandra SF 2015 - Repeatable, Scalable, Reliable, Observable Cassandra
Cassandra sf 2015 - Steady State Data Size With Compaction, Tombstones, and TTL
Cassandra TK 2014 - Large Nodes
Cassandra Community Webinar August 29th 2013 - In Case Of Emergency, Break Glass
Cassandra Community Webinar - August 22 2013 - Cassandra Internals
Cassandra SF 2013 - In Case Of Emergency Break Glass
Cassandra SF 2013 - Cassandra Internals
Cassandra Community Webinar - Introduction To Apache Cassandra 1.2
Apache Cassandra in Bangalore - Cassandra Internals and Performance
Cassandra SF 2012 - Technical Deep Dive: query performance
Hello @world #cassandra
Cassandra does what ? Code Mania 2012
Nzpug welly-cassandra-02-12-2010
Introduction to Cassandra
Building a distributed Key-Value store with Cassandra
Cassandra - Wellington No Sql

Recently uploaded (20)

PDF
cuic standard and advanced reporting.pdf
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Electronic commerce courselecture one. Pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Empathic Computing: Creating Shared Understanding
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
cuic standard and advanced reporting.pdf
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Encapsulation_ Review paper, used for researhc scholars
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Building Integrated photovoltaic BIPV_UPV.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
MYSQL Presentation for SQL database connectivity
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Digital-Transformation-Roadmap-for-Companies.pptx
Chapter 3 Spatial Domain Image Processing.pdf
Review of recent advances in non-invasive hemoglobin estimation
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Electronic commerce courselecture one. Pdf
NewMind AI Weekly Chronicles - August'25 Week I
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Empathic Computing: Creating Shared Understanding
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy

Cassandra South Bay Meetup - Backup And Restore For Apache Cassandra