April 25-26, San Francisco
cloud success starts here
Building RightScale’s Globally Distributed Datastore
Josep M. Blanquer, Chief Architect
#RightscaleCompute
In this talk…
• Intro
• Data Taxonomy
• Data Storage Design
• Scale, HA and DR considerations
• Conclusion
Intro: Expectations and scope
What this is and what it is not
• IS a talk about:
  • How RightScale has designed and implemented its backing datastores
  • …for a few of the most representative internal systems
  • …with the rationale behind it
• Is NOT a talk about:
  • RightScale’s overall architecture
  • Nodes or hosts; it’s about systems
  • RightScale’s data modeling
Intro: Tools and Technologies
• RightScale uses a mix of RDBMS and NoSQL technologies:
  • MySQL, Cassandra and S3 (for backups and archiving)
• Transactionality:
  • MySQL: strong ACID properties
  • Cassandra: no Atomicity, eventually Consistent, some Isolation, Durable
• Availability:
  • MySQL: async replication, Master to N slaves or Master-Master
  • Cassandra: distributed, master-less, highly replicated (multi-DC)
• Queryability:
  • MySQL: extremely flexible at adding indexes and changing the data model
  • Cassandra: more difficult to change the querying patterns
Taxonomy of RightScale’s Data
Representative systems with different data semantics:
• Global Objects
  • Marketplace Assets
• Dashboard Objects
  • Audits
  • Tags
  • Recent Events
• Cloud Polling Data
• Routing Data
• Monitoring/Syslog
Global Objects are common across accounts:
• Users
• Plans
• Settings
• MultiCloud Marketplace:
  • Published Assets
  • Sharing Groups
• …
Dashboard Objects are private to each account:
• Deployments
• Imported assets
• Alert Specifications
• Server Inputs
• Audits
• Tags
• User Events
• …
Cloud Polling Data is private to each account:
• Cloud resource states (cache)
• Cloud credentials
Routing Data is private to each account:
• Instance agents location
• Core agents location
• Agent action registry
• …
Monitoring/Syslog data is private to each account:
• Collected metric data
• Collected syslog data
• …
Data Placement
Who uses the data?
• Users through the Dash/API: data close to the users
• Instances from the cloud: data close to the cloud
Data scope and containment
Which data do we need?
• Data for all accounts (X-acct): data shared between accounts
• Data for a single account (Account): data required within the scope of a single account
Who uses the data? Proximity to user vs. cloud
Which data do we need? Scope of data available
• Global Objects: close to user, globally accessible data
• Dashboard Objects: close to user, account-shardable data
• Cloud Polling Data, Routing Data, Monitoring/Syslog: close to cloud resources, account-shardable* data
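The two questions above (who uses the data, and which scope it has) can be read as a small lookup table. The following is a minimal sketch, not RightScale's implementation: the names `Consumer`, `Scope`, `Placement` and `place` are hypothetical, and the pairing of each combination with a placement is an assumption drawn from the three placements listed on the slide.

```python
from dataclasses import dataclass
from enum import Enum


class Consumer(Enum):
    USERS = "users"          # reach the data through the Dash/API
    INSTANCES = "instances"  # reach the data from inside a cloud


class Scope(Enum):
    ACCOUNT = "account"      # needed within the scope of a single account
    X_ACCOUNT = "x-account"  # shared between accounts


@dataclass(frozen=True)
class Placement:
    proximity: str
    availability: str


# The three placements named on the slide. Which consumer/scope pair maps
# to which placement is an assumption made for this sketch.
PLACEMENTS = {
    (Consumer.USERS, Scope.X_ACCOUNT): Placement("close to user", "globally accessible"),
    (Consumer.USERS, Scope.ACCOUNT): Placement("close to user", "account-shardable"),
    (Consumer.INSTANCES, Scope.ACCOUNT): Placement("close to cloud resources", "account-shardable"),
}


def place(consumer: Consumer, scope: Scope) -> Placement:
    """Return the placement for a pair, or raise if the slide does not cover it."""
    try:
        return PLACEMENTS[(consumer, scope)]
    except KeyError:
        raise ValueError(f"no placement defined for {consumer.value}/{scope.value}") from None


print(place(Consumer.INSTANCES, Scope.ACCOUNT))
```

The fourth combination (instances needing cross-account data) is left undefined on purpose, since the slide does not name a placement for it.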
[Diagram: the global datastore, laid out along the Users/Instances and X-Account axes]
Custom replication
Why custom? More control
• Multiple sources
• Individual columns
• Apply transformations
• Smart re-sync features
Global: MySQL
• ACID semantics
• Master-Slave replication
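Three of the reasons given for custom replication (multiple sources, individual columns, transformations) can be illustrated with a few lines of code. This is a minimal in-memory sketch under stated assumptions: rows are plain dictionaries, the function name `replicate` and the sample column names are hypothetical, and the "smart re-sync" feature is not modeled at all. It does not describe how RightScale's replicator is actually built.

```python
from typing import Callable, Dict, Iterable, List, Optional

Row = Dict[str, object]
Transform = Callable[[object], object]


def replicate(
    sources: Iterable[Iterable[Row]],
    columns: List[str],
    transforms: Optional[Dict[str, Transform]] = None,
) -> List[Row]:
    """Copy only the selected columns from several sources, transforming some values."""
    transforms = transforms or {}
    replica: List[Row] = []
    for source in sources:              # multiple sources
        for row in source:
            copied: Row = {}
            for column in columns:      # individual columns only
                value = row[column]
                if column in transforms:
                    value = transforms[column](value)  # apply transformations
                copied[column] = value
            replica.append(copied)
    return replica


# Hypothetical sample data: two source tables holding user rows.
source_a = [{"id": 1, "email": "Ann@Example.com", "password_hash": "x1"}]
source_b = [{"id": 2, "email": "BOB@example.com", "password_hash": "x2"}]

replica = replicate(
    [source_a, source_b],
    columns=["id", "email"],
    transforms={"email": lambda v: str(v).lower()},
)
print(replica)
```

In this example the `password_hash` column never reaches the replica, which is the kind of control that whole-table replication does not offer.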
[Diagram: global, dash, events, tags, audit and S3 datastores, laid out along the Users/Instances and Account/X-Account axes]
Dashboard: MySQL
• ACID semantics
• Master to N slaves replication
• Slave reads
• Rows tagged by account
Other systems: Cassandra
• Simpler Key-Value access
• Great scalability
• Great replica control
• High write availability
• Time-to-live (TTL) expiration, used as a cache
• Rows tagged by account
Data archive: S3
• Low read rate
• Globally accessible
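Two of the properties listed above, rows tagged by account and time-to-live expiration used as a cache, can be sketched with an in-memory stand-in. This is not the Cassandra API: the class `AccountTtlStore`, its methods and the injectable `clock` parameter are hypothetical names chosen for the illustration.

```python
import time
from typing import Callable, Dict, Optional, Tuple


class AccountTtlStore:
    """Key-value store whose rows are tagged by account and expire after a TTL."""

    def __init__(self, clock: Callable[[], float] = time.monotonic) -> None:
        self._clock = clock
        # (account_id, key) -> (expiry time, value)
        self._rows: Dict[Tuple[int, str], Tuple[float, object]] = {}

    def put(self, account_id: int, key: str, value: object, ttl_seconds: float) -> None:
        self._rows[(account_id, key)] = (self._clock() + ttl_seconds, value)

    def get(self, account_id: int, key: str) -> Optional[object]:
        entry = self._rows.get((account_id, key))
        if entry is None:
            return None
        expires_at, value = entry
        if self._clock() >= expires_at:
            # Expired rows are dropped lazily, on first read after expiry.
            del self._rows[(account_id, key)]
            return None
        return value


# Usage with a fake clock so that expiry can be shown without sleeping.
now = [0.0]
store = AccountTtlStore(clock=lambda: now[0])
store.put(7, "recent-event", "server launched", ttl_seconds=10)
print(store.get(7, "recent-event"))  # still cached
now[0] = 10.0
print(store.get(7, "recent-event"))  # expired
```

Because every key carries the account identifier, all rows of one account can later be moved or partitioned together.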
So we can horizontally scale our dashboard by partitioning objects based on account groups: Clusters
[Diagram: RightScale accounts split into account sets, each set served by one cluster (Cluster1 … ClusterN) with its own dash, events, tags, audit and S3 stores]
Features:
• 1 cluster: N accounts
• 1 account: 1 home
• Migratable accounts
Benefits:
• Great horizontal growth
• Better failure isolation
• Independent scale
• Load rebalancing
• Versionable code
• Differentiated service
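The cluster features listed above (one cluster serves N accounts, each account has exactly one home, accounts can be migrated) amount to a directory that maps accounts to clusters. The sketch below is an assumption about how such a directory could look; `ClusterDirectory` and the cluster names are hypothetical, and the data movement that a real migration needs is not shown.

```python
from typing import Dict, List


class ClusterDirectory:
    """Tracks the single home cluster of every account."""

    def __init__(self) -> None:
        self._home: Dict[int, str] = {}  # account_id -> cluster name

    def assign(self, account_id: int, cluster: str) -> None:
        # 1 account: 1 home, so a second assignment is rejected.
        if account_id in self._home:
            raise ValueError(f"account {account_id} already has a home cluster")
        self._home[account_id] = cluster

    def home_of(self, account_id: int) -> str:
        return self._home[account_id]

    def migrate(self, account_id: int, new_cluster: str) -> None:
        # Migratable accounts: only the directory entry changes in this sketch.
        if account_id not in self._home:
            raise KeyError(account_id)
        self._home[account_id] = new_cluster

    def accounts_in(self, cluster: str) -> List[int]:
        # 1 cluster: N accounts.
        return sorted(a for a, c in self._home.items() if c == cluster)


directory = ClusterDirectory()
directory.assign(101, "cluster1")
directory.assign(102, "cluster1")
directory.assign(201, "cluster2")
directory.migrate(102, "cluster2")  # rebalance load between clusters
print(directory.accounts_in("cluster2"))
```

Load rebalancing and differentiated service then become a matter of choosing which cluster an account is assigned or migrated to.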
[Diagram: routing, polling and monitor datastores added next to global, dash, events, tags, audit and S3]
And partition our cloud objects based on the cloud the instances of an account run on: Islands
[Diagram: Island1 … IslandN, each with routing, polling and monitor services co-located with the resources of Cloud 1 … Cloud N]
Features:
• 1 instance: 1 home island
• 1 Island can serve N clouds
• Core Agents: global data
Benefits:
• Close to cloud resources
• Good failure isolation
• As good as the cloud
• Good scale: global replicas across Cassandra DCs
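The island features above can be sketched as a mapping from clouds to islands: an instance's home island follows from the cloud it runs on, and one island may serve several clouds. The class `IslandMap` and the cloud and island names below are hypothetical, used only to illustrate the relationship.

```python
from typing import Dict, List


class IslandMap:
    """Resolves the home island of an instance from the cloud it runs on."""

    def __init__(self, cloud_to_island: Dict[str, str]) -> None:
        self._cloud_to_island = dict(cloud_to_island)

    def home_island(self, instance_cloud: str) -> str:
        # 1 instance: 1 home island, determined by the instance's cloud.
        return self._cloud_to_island[instance_cloud]

    def clouds_served_by(self, island: str) -> List[str]:
        # 1 island can serve N clouds.
        return sorted(c for c, i in self._cloud_to_island.items() if i == island)


islands = IslandMap({
    "cloud-1": "island1",
    "cloud-2": "island2",
    "cloud-3": "island2",  # island2 serves two clouds
})
print(islands.home_island("cloud-3"))
print(islands.clouds_served_by("island2"))
```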
Polling Clouds: MySQL
• Master-Slave replication
• Can port to NoSQL easily
• Mostly a resource cache
• But cloud partitionable
Monitoring: Custom
• Replicated files
• Backup to S3
• Archive to S3
Routing: Cassandra
• Simpler Key-Value access
• Very high availability
• Great scalability
• Great replica control
• Plus cross-DC replication*
Different Geographies
Different Clouds
What if the cloud the cluster is deployed on fails?
What if the cloud the island is deployed on fails?
Sister Clusters
Full replica
Features:
• Each master has an extra remote slave
• Each cluster in a pair is a DC replica of the other’s local ring
At Disaster Recovery time:
• Apps are told to start serving an extra shard
• No need to provision more infrastructure to recover (try to avoid it, since everybody is in the same boat)
• New resources can be allocated over time to help offload existing ones
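The disaster-recovery step above, where the surviving sister starts serving an extra shard, can be sketched as a small bookkeeping operation. This is an illustration under assumptions: the function `fail_over`, the `sisters` pairing table and the shard names are hypothetical, and promoting the remote slave to master is outside the sketch.

```python
from typing import Dict, Set


def fail_over(
    serving: Dict[str, Set[str]],
    sisters: Dict[str, str],
    failed_cluster: str,
) -> str:
    """Hand the shards of a failed cluster to its sister and return the sister's name."""
    sister = sisters[failed_cluster]
    orphaned = serving.pop(failed_cluster)
    # The sister already holds a full replica, so it only needs to be told
    # to serve the extra shard(s); no new infrastructure is provisioned here.
    serving[sister] |= orphaned
    return sister


# Hypothetical pair of sister clusters, each serving its own shard.
sisters = {"cluster1": "cluster2", "cluster2": "cluster1"}
serving = {"cluster1": {"shard1"}, "cluster2": {"shard2"}}

survivor = fail_over(serving, sisters, "cluster1")
print(survivor, sorted(serving[survivor]))
```

Allocating new resources later, to offload the survivor, would be the reverse operation: moving a shard out of the survivor's set once a replacement cluster is ready.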
Conclusions
• Shown that RightScale uses multiple database technologies:
  • RDBMS: MySQL for the ACID semantics and ‘queryability’
    • Using a Master to N slaves for read-only (RO) scale and quick failure recovery
    • And read-only provisioning, to increase RO availability and scale remote systems
  • NoSQL: Cassandra for availability and scalability
    • For higher read/write availability within a cluster
    • For fully replicated regions across the globe (for read/write!)
• Shown how RightScale uses them in different techniques:
  • It partitions resource data into Islands based on cloud proximity
    • Can achieve in-cloud polling, and keep monitoring/syslog data storage next to instances
    • Can provide routing availability, co-located with instances for any world region
  • It partitions core data into Clusters based on account groups
    • To scale the core horizontally and independently, and achieve account isolation/differentiation
    • Enhances fault isolation: accounts are assigned to Clusters deployed away from their cloud resources
  • It maintains cluster pairs (sister sites)
    • To recover from full cloud region failures
    • It doesn’t require massive amounts of new resources to recover
Questions?

Building RightScale's Globally Distributed Datastore - RightScale Compute 2013