SlideShare a Scribd company logo
Getting Started with

   Riak in the
     Cloud
                       Big Nerd Ranch
About Me



 Ines
Sombra
@RandomMood
Our Agenda
    About Us
    Our Customers
    The Problem
    The Solution
Engine Yard
About US            Clouds
                             Regions
100% Cloud
Based                             Stacks
                       Support

Critical services                      DR

MUST be up                   HA
24/7/365
Our Customers




http://www.flickr.com/photos/meetrajesh/6972883401
An APPLICATION

App


App      DB
        Master
         us-east-1a
                        DB
                      Replica
App                    us-east-1b
Then AWS Crashed




 http://www.flickr.com/photos/brandongrasley/8227882239/
Our App AFTER

App


App
       XDB
       Master
        us-east-1a
                       DB
                     Replica
App                   us-east-1b
This sucked, A LOT
!   Time to strengthen
    our HA/DR Story
$   Customers and us
    need to be up all
    the time!
Our Research
Data Models   Consistency   Partitions
Relational    Are ACID Consistency
              properties
Document                 Availability
              really
Key/Value     needed?
Column
Impedance Mismatch
  Relational Model
Impedance Mismatch
Memory data-structure
We needed a fault
tolerant and highly
available (clustered)
     datastore
OSS, Dynamo-based,
distributed K/V store
                        Hello RIAK!
                        Linearly scalable

                        Fault-tolerant

                        Finally provide
                        HA/DR solution
Riak’s Use Cases
Application Type           Key          Value

Session            User/Session id Data
Advertising        Campaign id      Data

Logs               Date             Log file
                                    text, json,
Content            title, integer
                                    xml, media..
When is Riak a good fit
When availability is more
important than consistency
When data can be
modeled as keys/values
When the problem fits
Riak is OPS FRIENDLY


                     X
            node
             1.2.1
                               Growth
 node                node
    1.2.1             1.2.1
                               Reduction
 node
  1.2.1
                     node
                       1.2.1
                               Upgrades
            node
             1.3.0             Failures
World’s most famous slide
Riak is SO MUCH MORE
 Consistent Hashing
 Hinted Handoff
 Active Anti Entropy
 Tuneable consistency
 N|W|R
OUR APP AFTER
  App
                         node
                          1.3.0



 App    node                               node
         1.3.0                              1.3.0




            node                  node
  App            1.3.0             1.3.0
We need a Riak
Cluster but we have
   no hardware
RIAK In the Wild




          18th and Harrison, San Francisco, CA
Let’s build A PRODUCT




        http://www.flickr.com/photos/worldbank/8261699153/
Your CHOICES
     You are trying
    to create a Riak
     cluster. Do you
     need any help?
                       1   Number of Nodes
                       2   Instance Type
                       3   Data Location
                       4   Backend Type
What we GIVE YOU
Our SUPPORT



       X2
HOWTO


 What you
OPS to know
Prepare
                 Yourself


    More instances to
   compensate for the
performance variability of
  virtualized resources
Choosing INSTANCES
 ^   m1.large & m1.xlarge

     EBS-optimized or PIOPs

     High I/O Quadruple XL
     (hi1.4xlarge) - SSDs!
Cluster Sizes
        5 nodes or more
        Best performance
        & growth
        Scales linearly
        Don’t be stingy
Ring SIZE
      Change Default
      CANNOT BE RESET
      Ring Size at 256
       (8-64 vnodes per node)

      DONT GO OVER 512
Our PROVISIONING
                            Riak Cluster
                         Riak Cluster

    App                  MyRiakCluster: node-0

   haproxy
               HTTP
                or       MyRiakCluster: node-1
             Protobuff
    App

   haproxy
                         MyRiakCluster: node-2


    App
                         MyRiakCluster: node-3
   haproxy



                         MyRiakCluster: node-4
Disable SWAP
 ^   Riak process pages
     should not hit swap
     Swap usage can result
     in unresponsive server
     Let the kernel kill it
Mount & SCHEDULER
 ^   Mount with noatime flag
     Use deadline scheduler
     on EBS
     cat /sys/block/xvdj1/queue/scheduler
     noop [deadline] cfq

     echo deadline > /sys/block/xvdj1/queue/
     scheduler
Linux FILESYSTEMS
 ^   xfs, ext3
     ext4
       barrier = 0
       data = writeback
     ZFS not recommended
Backup STRATEGY
 ^   Backup both the ring
     and data directories
     scp or rsync data dir
     EBS snapshots
Monitoring STRATEGY
        Stats Interface
        riak-admin status
        Collectd
        Nagios
Scaling RIAK
^   Vertical
    riak-admin cluster replace <old>
    <new>


^   Horizontal
    riak-admin cluster add <new>
How we SCALE RIAK
Hybrid Solutions FTW

RIAK
       + PostgreSQL
RIAK
     + FTS Engine*
RIAK
     + Redis
Try OUR RIAK
How difficult IS IT?
Finally!


 A few
thoughts
Polyglot PERSISTENCE
  Pick the right tool
  for the job
  Apps & Services
  can have multiple
  databases
On SOA
         My super nice e-commerce site


Shopping Cart &    Completed    Inventory and       Recommendations
 Session Data       Orders       Item Pricing           Engine




Session             Order       Inventory &             Nodes and
Storage           Persistence   Price service            relations
service             service                               service
                                   RDBMS
K/V Store          Document                              Graph Store
                     Store

                                            NoSQL Distilled: Fowler & Sadalage
We RECOMMEND
 Don’t optimize too quickly
 Take time to understand
 your problem
 Complexity increases with
 more databases
Do KEEP IN MIND
 Different paradigms
 are different
                    Document
  Relational   !=     Store
Whatever


Give me
MOAR!
Great RESOURCES
 Little Riak Book
 (https://github.com/coderoshi/little_riak_book/)

 docs.basho.com
 blog.engineyard.com
 nosql.mypopescu.com
Questions?

More Related Content

PDF
Computational Patterns of the Cloud - QCon NYC 2014
PDF
Riak CS Build Your Own Cloud Storage
PPTX
When the Cloud is a Rockin: High Availability in Apache CloudStack
PDF
Logging infrastructure for Microservices using StreamSets Data Collector
PDF
Building Complete Private Clouds with Apache CloudStack and Riak CS
PDF
An AWS DMS Replication Journey from Oracle to Aurora MySQL
PDF
The Last Pickle: Distributed Tracing from Application to Database
PDF
Data Antipatterns
Computational Patterns of the Cloud - QCon NYC 2014
Riak CS Build Your Own Cloud Storage
When the Cloud is a Rockin: High Availability in Apache CloudStack
Logging infrastructure for Microservices using StreamSets Data Collector
Building Complete Private Clouds with Apache CloudStack and Riak CS
An AWS DMS Replication Journey from Oracle to Aurora MySQL
The Last Pickle: Distributed Tracing from Application to Database
Data Antipatterns

What's hot (17)

PDF
Battery Ventures: Simulating and Visualizing Large Scale Cassandra Deployments
PDF
Data Pipelines with Spark & DataStax Enterprise
PPTX
Kafka On YARN (KOYA): An Open Source Initiative to integrate Kafka & YARN
PPTX
Cloud native data platform
PPTX
How Tencent Applies Apache Pulsar to Apache InLong - Pulsar Summit Asia 2021
PDF
Data Lake and the rise of the microservices
PPTX
Triple C - Centralize, Cloudify and Consolidate Dozens of Oracle Databases (O...
PDF
Streaming Sensor Data Slides_Virender
PDF
Lean Enterprise, Microservices and Big Data
PDF
Riak at shareaholic
PDF
Novinky v Oracle Database 18c
PDF
Azure + DataStax Enterprise Powers Office 365 Per User Store
PDF
Trend Micro Big Data Platform and Apache Bigtop
PPTX
MySQL Cluster - Latest Developments (up to and including MySQL Cluster 7.4)
PPTX
Real time analytics
PPTX
Svccg nosql 2011_v4
PDF
Genomics on aws-webinar-april2018
Battery Ventures: Simulating and Visualizing Large Scale Cassandra Deployments
Data Pipelines with Spark & DataStax Enterprise
Kafka On YARN (KOYA): An Open Source Initiative to integrate Kafka & YARN
Cloud native data platform
How Tencent Applies Apache Pulsar to Apache InLong - Pulsar Summit Asia 2021
Data Lake and the rise of the microservices
Triple C - Centralize, Cloudify and Consolidate Dozens of Oracle Databases (O...
Streaming Sensor Data Slides_Virender
Lean Enterprise, Microservices and Big Data
Riak at shareaholic
Novinky v Oracle Database 18c
Azure + DataStax Enterprise Powers Office 365 Per User Store
Trend Micro Big Data Platform and Apache Bigtop
MySQL Cluster - Latest Developments (up to and including MySQL Cluster 7.4)
Real time analytics
Svccg nosql 2011_v4
Genomics on aws-webinar-april2018
Ad

Similar to Riak at Engine Yard Cloud (20)

PDF
Scaling Spark Workloads on YARN - Boulder/Denver July 2015
KEY
Cloud Computing & Scaling Web Apps
PPTX
Deliver Big Data, Database and AI/ML as-a-Service anywhere
PDF
Migrating to Riak at Shareaholic
PDF
The practice of big data - making big data approachable
PPTX
SQL and NoSQL in SQL Server
PDF
Transformation Track AWS Cloud Experience Argentina - Bases de Datos en AWS
PPTX
Minnebar 2013 - Scaling with Cassandra
PPTX
Ai tour 2019 Mejores Practicas en Entornos de Produccion Big Data Open Source...
PPT
Rolling With Riak
PDF
End of RAID as we know it with Ceph Replication
PPTX
Moving Beyond Cache by Yiftach Shoolman Redis Labs - Redis Day Seattle 2020
PDF
Amazon ElastiCache (Dan Zamansky) - AWS DB Day
PDF
Getting started with MariaDB with Docker
PPTX
Containerized Hadoop beyond Kubernetes
PDF
5 Factors When Selecting a High Performance, Low Latency Database
KEY
Introduction to Riak - Red Dirt Ruby Conf Training
PDF
[Pgday.Seoul 2018] AWS Cloud 환경에서 PostgreSQL 구축하기
PPTX
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
PPTX
Global Big Data Conference Sept 2014 AWS Kinesis Spark Streaming Approximatio...
Scaling Spark Workloads on YARN - Boulder/Denver July 2015
Cloud Computing & Scaling Web Apps
Deliver Big Data, Database and AI/ML as-a-Service anywhere
Migrating to Riak at Shareaholic
The practice of big data - making big data approachable
SQL and NoSQL in SQL Server
Transformation Track AWS Cloud Experience Argentina - Bases de Datos en AWS
Minnebar 2013 - Scaling with Cassandra
Ai tour 2019 Mejores Practicas en Entornos de Produccion Big Data Open Source...
Rolling With Riak
End of RAID as we know it with Ceph Replication
Moving Beyond Cache by Yiftach Shoolman Redis Labs - Redis Day Seattle 2020
Amazon ElastiCache (Dan Zamansky) - AWS DB Day
Getting started with MariaDB with Docker
Containerized Hadoop beyond Kubernetes
5 Factors When Selecting a High Performance, Low Latency Database
Introduction to Riak - Red Dirt Ruby Conf Training
[Pgday.Seoul 2018] AWS Cloud 환경에서 PostgreSQL 구축하기
Big Data Goes Airborne. Propelling Your Big Data Initiative with Ironcluster ...
Global Big Data Conference Sept 2014 AWS Kinesis Spark Streaming Approximatio...
Ad

More from Ines Sombra (15)

PDF
Architectural Patterns of Resilient Distributed Systems
PDF
We hear you like papers
PDF
Testing & Integration (The Remix)
PDF
From 0 to Capacity Planning
PDF
Agile, Rugged, and Lean - The Paper Edition
PDF
Data antipatterns NYC Devops - 2014
PDF
How the Cloud is changing the world
PDF
NoSQL Databases in the Cloud - Great Wide Open 2014
PDF
Relational Databases in the Cloud - Great Wide Open 2014
PDF
Hello data
PDF
Ricon east
PDF
PgPyDay
PDF
Getting started with Riak in the Cloud
PDF
Postgres Open
KEY
North Bay Ruby Meetup 101911
Architectural Patterns of Resilient Distributed Systems
We hear you like papers
Testing & Integration (The Remix)
From 0 to Capacity Planning
Agile, Rugged, and Lean - The Paper Edition
Data antipatterns NYC Devops - 2014
How the Cloud is changing the world
NoSQL Databases in the Cloud - Great Wide Open 2014
Relational Databases in the Cloud - Great Wide Open 2014
Hello data
Ricon east
PgPyDay
Getting started with Riak in the Cloud
Postgres Open
North Bay Ruby Meetup 101911

Recently uploaded (20)

PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
Big Data Technologies - Introduction.pptx
PDF
Encapsulation theory and applications.pdf
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Empathic Computing: Creating Shared Understanding
PDF
KodekX | Application Modernization Development
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PPTX
A Presentation on Artificial Intelligence
PDF
Modernizing your data center with Dell and AMD
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Cloud computing and distributed systems.
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Electronic commerce courselecture one. Pdf
Chapter 3 Spatial Domain Image Processing.pdf
Big Data Technologies - Introduction.pptx
Encapsulation theory and applications.pdf
Per capita expenditure prediction using model stacking based on satellite ima...
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Empathic Computing: Creating Shared Understanding
KodekX | Application Modernization Development
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
The Rise and Fall of 3GPP – Time for a Sabbatical?
A Presentation on Artificial Intelligence
Modernizing your data center with Dell and AMD
The AUB Centre for AI in Media Proposal.docx
Cloud computing and distributed systems.
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Unlocking AI with Model Context Protocol (MCP)
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
“AI and Expert System Decision Support & Business Intelligence Systems”
Electronic commerce courselecture one. Pdf

Riak at Engine Yard Cloud