Runa on AWS
Big Data & Machine Intelligence for a SaaS
Startup
Runa
a
SaaS
converts
Shoppers to Buyers
for
Online
Commerce Sites
by presenting
Dynamic
Personalized
Promotions
on the
Merchant’s
Website
in
Real-Time
in the
Shopping Flow
Tech
Challenges
Big Data
JavaScript client
collects activity on
every Merchant page
for every Shopper
One or more
Ajax calls & Event
Stores to Runa per
Merchant page
view
Step function
increase of calls
and stores as
each new
Merchant is added
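
A rough sketch of the step-function growth described above, assuming each merchant adds a fixed daily number of page views. The function name and every figure here are hypothetical, chosen only to illustrate the arithmetic; none of them come from Runa.

```python
def daily_calls(merchants, page_views_per_merchant, calls_per_page_view):
    """Total Ajax calls (and event stores) per day across all merchants."""
    return merchants * page_views_per_merchant * calls_per_page_view


# Hypothetical figures for illustration only; not taken from the deck.
PAGE_VIEWS_PER_MERCHANT = 100_000
CALLS_PER_PAGE_VIEW = 2

# Load after the first, second and third merchant is added.
growth = [
    daily_calls(m, PAGE_VIEWS_PER_MERCHANT, CALLS_PER_PAGE_VIEW)
    for m in (1, 2, 3)
]

# Each new merchant adds one full "step" of load.
steps = [later - earlier for earlier, later in zip(growth, growth[1:])]
```

Under these assumptions every added merchant raises the daily call volume by the same fixed step, which is why capacity has to be added horizontally rather than tuned once.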
We capture
everything we
can and store it
forever
Expecting to
grow to
thousands of
merchants
That’s a lot
of Data
Processing Data with
Machine
Intelligence
Batch Processing for
Statistical Analysis
and Reports
Real-Time
Rule based
inserts of
Promotions
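
A minimal sketch of what a real-time, rule-based promotion insert could look like. This is not Runa's rule engine: the `Session` fields, the rule names and the promotion texts are all assumptions made up for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Session:
    """Hypothetical snapshot of one shopper's activity on a merchant site."""
    cart_total: float
    pages_viewed: int


@dataclass
class Rule:
    """A named predicate over a session plus the promotion it triggers."""
    name: str
    matches: Callable[[Session], bool]
    promotion: str


# Hypothetical rules, evaluated in order; the first match wins.
RULES: List[Rule] = [
    Rule("big-cart", lambda s: s.cart_total >= 100.0, "Free shipping"),
    Rule("window-shopper",
         lambda s: s.pages_viewed >= 5 and s.cart_total == 0,
         "10% off first item"),
]


def pick_promotion(session: Session, rules: List[Rule] = RULES) -> Optional[str]:
    """Return the promotion of the first matching rule, or None."""
    for rule in rules:
        if rule.matches(session):
            return rule.promotion
    return None
```

For example, `pick_promotion(Session(cart_total=120.0, pages_viewed=3))` returns `"Free shipping"`, while a session that matches no rule returns `None` and the page is left unchanged.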
Why AWS for
Runa?
At First
(a couple years ago)
Not Much
Money in
the Bank
Didn’t Know
exactly
what we were
making
Or exactly
how we were
going to do it
Prototyped with
Ruby / Rails / MySQL
Then
Prototype
became
Production
EC2 & AWS let
us scale the
prototype to
Beta
Production
Flexibility to
incrementally
refine service &
infrastructure
Confidence
we could scale
as we added
Merchants
More Recently
Incrementally
added next-gen
Tech & Full
Production
Goal:
Everything
Horizontally
Scalable
Batch Processing &
Infinite Storage
Map / Reduce
& BigTable
via
Hadoop & HBase
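
To make the Map / Reduce idea concrete, here is a small in-memory sketch in plain Python. It does not use the Hadoop or HBase APIs; the event fields (`merchant`, `page`) and the function names are hypothetical stand-ins for collected shopper activity.

```python
from collections import defaultdict


def map_event(event):
    """Map step: emit (merchant, 1) for every collected page-view event."""
    yield event["merchant"], 1


def reduce_counts(merchant, counts):
    """Reduce step: sum all counts emitted for one merchant."""
    return merchant, sum(counts)


def run_map_reduce(events):
    """Run map, group by key (the 'shuffle'), then reduce each group."""
    groups = defaultdict(list)
    for event in events:
        for key, value in map_event(event):
            groups[key].append(value)
    return dict(reduce_counts(key, values) for key, values in groups.items())


# Hypothetical events for illustration only.
events = [
    {"merchant": "shop-a", "page": "/cart"},
    {"merchant": "shop-b", "page": "/home"},
    {"merchant": "shop-a", "page": "/checkout"},
]
page_views = run_map_reduce(events)
```

Because the map and reduce steps only ever see one event or one key at a time, the same logic can be spread across many machines, which is the property that makes this style of batch job horizontally scalable.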
Flexible Real-Time
parallel processing
via
Clojure / Swarmiji
Opscode Chef
Management
& Monitoring
Consumers on
Merchant Websites
Internet
Admin &
Merchant
Dashboard
(Rails)
Runtime
Rules
Merchant
Info
Merchants
Internet
Analytics / Reporting
Monitor &
Recovery
Data
Collectors
Hadoop / HBase
Map / Reduce
Petabyte Store
Load
Balancer
HTTP
Shared
Session
Memory
HTTP
Dispatchers
Redis / Mem Cache (5 nodes shown in diagram)
HBase (36 nodes shown in diagram)
Amazon S3
Data Backup
9+ Amazon EC2 Instances
Amazon Elastic
Load Balancer
3+ Amazon EC2
Instances
Cheshire / Swarmiji
Dynamic
Runtime
Queue
AWS Elastic Load Balancer
Rails App Servers
Nginx / Unicorn EC2 m1.xlarge
MySQL
Master / Slave
EC2 m1.xlarge EBS
Legacy Runtime
Rails App Nginx/Unicorn
MySQL Master/Slave EC2 m1.xlarge / EBS
Merchant Dashboard
EC2 m1.xlarge
HBase / Hadoop
EC2 m1.xlarge EBS
RabbitMQ
Cheshire / Swarmiji
Redis
EC2 m1.xlarge
Clojure Based Runtime
AWS Elastic
Load Balancer
EC2 m1.large
Opscode Chef
Monitoring
EC2 m1.large
All Deployed on
Deployment &
Configuration
Management
via
Opscode Chef
Good Things
Able to Start Small
Then
GROW
BIGGER
Having the
flexibility to throw
“Hardware” at
our Prototype got
us to market
faster
Ability to
launch test
and staging
environments
almost at will
“Hardware”
as
“Software”
Scaling Runa Inc Big Data e-commerce service with AWS
Living in
“interesting”
times
Managing
Complexity
lots of moving parts
Easy to launch a
few instances
Impossible to
manage
horizontal stacks
“by hand”
Must have
tool like
Opscode Chef
Chef automates
deployment &
puts it under
Revision Control
There’s going to
be some blood
when using
cutting edge tech
Lots of
Learning
Curves to
climb
Useful
Monitoring is
hard but
Critical
HBase on
AWS may be
dangerous
because of Hadoop namenode
SPOF
EC2 bill can surprise
you if you cavalierly
deploy multiple
versions of
horizontally scalable
environments
Could not do our
startup without
AWS or lots more
VC Funding
