Tapjoy & OpenStack 
Delivering Billions of 
Requests Daily 
Wes Jossey 
Head of Operations @Tapjoy
Tapjoy 
● Global App-Tech Startup 
● For Mobile Developers, We Power: 
○ Monetization 
○ Analytics 
○ User Acquisition 
○ User Retention 
● 450M+ Monthly Users Across 270k+ Apps 
● Worldwide Presence
Technical Details 
● Early AWS Adopter. 
● Grew Predominantly on AWS. 
● Over 1,100 AWS VMs Daily (10/2014) 
● Active Regions in Asia, Europe, N.A. 
● Over One Trillion Requests Handled 
Annually
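As a rough sanity check on how "one trillion annually" relates to the "billions of requests daily" in the title, the arithmetic can be sketched as below. The figure is treated as a lower bound, and the result is an average that ignores peaks and valleys.

```python
# Back-of-the-envelope: convert "over one trillion requests per year"
# into average daily and per-second rates.
REQUESTS_PER_YEAR = 1_000_000_000_000  # lower bound taken from the slide
DAYS_PER_YEAR = 365
SECONDS_PER_DAY = 24 * 60 * 60


def average_rates(requests_per_year):
    """Return (requests per day, requests per second) as averages."""
    per_day = requests_per_year / DAYS_PER_YEAR
    per_second = per_day / SECONDS_PER_DAY
    return per_day, per_second


per_day, per_second = average_rates(REQUESTS_PER_YEAR)
print(f"{per_day / 1e9:.2f} billion requests/day on average")
print(f"{per_second:,.0f} requests/second on average")
```

So the annual figure works out to a little under three billion requests per day, or roughly thirty thousand per second on average.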
Tech Philosophy 
● Compute (EC2 & Nova) Driven Company 
○ Operate Your Own Infrastructure 
■ But Not Necessarily Built-From-Scratch 
○ Zero Heart-Attack Nodes 
■ All Nodes Are Ephemeral 
■ Data is Always Distributed 
■ Failure is Always Tolerated 
■ Misbehaving Instances Are Terminated Quickly
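The "terminate misbehaving instances quickly" rule can be illustrated with a minimal sketch. The `Instance` record, the failure threshold, and the instance names are all hypothetical; the talk does not describe how Tapjoy actually detects or terminates unhealthy nodes.

```python
from dataclasses import dataclass


@dataclass
class Instance:
    instance_id: str
    consecutive_failures: int  # failed health checks in a row


MAX_FAILURES = 3  # assumed threshold, not a number from the talk


def select_for_termination(fleet, max_failures=MAX_FAILURES):
    """Return IDs of instances that have failed too many checks in a row.

    Because every node is ephemeral and holds no unique data, the caller
    can terminate these and let replacements boot, rather than debug in place.
    """
    return [i.instance_id for i in fleet if i.consecutive_failures >= max_failures]


fleet = [Instance("web-1", 0), Instance("web-2", 5), Instance("web-3", 3)]
doomed = select_for_termination(fleet)
print(doomed)
```

The point of the design is that the decision needs no human judgment: if no node is special, terminating one is always safe.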
Services We Use 
● SQS 
○ Simple, Inexpensive, Durable. 
○ Currently Building New Internal System Influenced 
by SQS, but with Different Guarantees 
○ No Lock-In (See https://github.com/Tapjoy/chore) 
● RDS 
○ No Lock-In. Simple. Easy. 
● CloudWatch (but also statsd)
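One common way to keep a managed queue such as SQS free of lock-in is to put application code behind a small queue interface, so the backend can be swapped. The sketch below is an illustration of that idea only; `JobQueue`, `InMemoryQueue`, and `drain` are hypothetical names and do not reflect the API of chore or of any AWS SDK.

```python
from abc import ABC, abstractmethod
from collections import deque
from typing import List, Optional


class JobQueue(ABC):
    """Minimal queue contract that application code depends on."""

    @abstractmethod
    def publish(self, body: str) -> None: ...

    @abstractmethod
    def receive(self) -> Optional[str]: ...


class InMemoryQueue(JobQueue):
    """Stand-in backend; an SQS-backed class would implement the same methods."""

    def __init__(self) -> None:
        self._items = deque()

    def publish(self, body: str) -> None:
        self._items.append(body)

    def receive(self) -> Optional[str]:
        return self._items.popleft() if self._items else None


def drain(queue: JobQueue) -> List[str]:
    """Consume every pending message, knowing only the JobQueue contract."""
    messages = []
    while (msg := queue.receive()) is not None:
        messages.append(msg)
    return messages


q = InMemoryQueue()
q.publish("job-a")
q.publish("job-b")
print(drain(q))
```

Swapping SQS for an internal system then means writing one new `JobQueue` subclass rather than touching every producer and consumer.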
Services We Use Cont. 
● ELB 
○ SSL Termination Only. Routing Handled Elsewhere. 
● Auto-Scaling 
○ Traffic can fluctuate 30% peak to valley 
● S3 
○ Where we store ALL the things 
○ Still price competitive for what it provides. No plans 
to leave as of today.
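To show why a 30% swing makes auto-scaling worthwhile, here is a small sizing sketch. It assumes "30% peak to valley" means the valley sits 30% below the peak; the peak request rate, per-instance capacity, and headroom are made-up numbers for illustration, not figures from the talk.

```python
# All inputs below are illustrative assumptions.
PEAK_RPS = 40_000            # hypothetical peak request rate
SWING_PERCENT = 30           # valley assumed to be 30% below peak
CAPACITY_PER_INSTANCE = 50   # hypothetical requests/second one instance serves
HEADROOM_PERCENT = 20        # spare capacity kept above measured load


def instances_needed(load_rps):
    """Ceiling of load * (1 + headroom) / capacity, using integer math only."""
    numerator = load_rps * (100 + HEADROOM_PERCENT)
    denominator = 100 * CAPACITY_PER_INSTANCE
    return -(-numerator // denominator)  # ceiling division


valley_rps = PEAK_RPS * (100 - SWING_PERCENT) // 100
at_peak = instances_needed(PEAK_RPS)
at_valley = instances_needed(valley_rps)
print(f"peak: {at_peak} instances, valley: {at_valley} instances")
print(f"instances released at the valley: {at_peak - at_valley}")
```

Under these assumptions, a fleet sized statically for peak would leave about 30% of its instances idle at the valley.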
Use Compute Everywhere 
● Every Dev Has Access to Either AWS or 
Tapjoy-1 (Tapjoy’s OpenStack Deployment) 
● Simulate Changes Against Useful Data 
● Test Algorithms on Large Hadoop Clusters 
● Practice for Failure With Access to Real 
Services (not mock endpoints)
Going Hybrid 
● We Spend in the Millions on AWS 
● Picked Data-Science Infrastructure because 
of Portability, and Ability to Leverage More 
Nodes 
● Lower Risk than Tier-1 Production Services 
● Wanted a Partner to Maintain OpenStack 
like Amazon ‘Maintains’ AWS 
● We Want to Operate Apps
OpenStack Timeline
Vendors (It Matters) 
● Metacloud 
○ Verified our Design 
○ Deployed OpenStack 
○ Provisioned Network 
○ Allowed Us to Focus on Business Applications 
● Equinix 
○ Cooling & Power Design 
○ Remote Hands 
○ Went Above and Beyond on Numerous Occasions
Vendors: Full List 
● Metacloud 
● Equinix 
● Quanta 
● Cumulus 
● Level3 
● Newegg
Challenges 
● Hardware Delays Killed Our Timelines 
○ Blew through our contingency windows. 
○ Hurt our budgets. 
○ Delayed subsequent purchases 
● Setting Up IP Transit Can Be Slow 
● No Physical Presence in DC 
○ Also a Pro 
● No Previous Internal Success Story… So 
Lots of Skepticism
The Not So Glamorous Job 
● Negotiations Can Be Exhausting 
● If You’re An Engineer, the Turnaround Time 
Can Be Frustrating 
● You Probably Need a Gantt Chart 
● There’s Nothing Agile About Writing a Big 
Check
Tapjoy-1: Data Nodes 
348 ‘Data’ All Purpose Nodes 
● Quanta S910-X31E: 12 Node Configuration 
● Per Node 
○ Intel Xeon E3-1265L v3 @ 2.5GHz 
○ 4x1TB 7200RPM 
○ 32GB RAM 
○ Dual 1Gig NIC 
● ‘Recyclable’ for Other Tasks if we Evolve
Tapjoy-1: Management Nodes 
12 ‘Management’ Nodes 
● Quanta S180: 4 Node Configuration 
● Per Node 
○ 2x Intel Xeon E5-2650 v2 @ 2.60GHz 
○ 128GB RAM 
○ 6x480GB SSD 
○ Dual 10Gig NIC
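The per-node specs on the two Tapjoy-1 slides can be rolled up into cluster totals. This sketch assumes "12 Node Configuration" and "4 Node Configuration" mean nodes per chassis, and it reports raw capacity only, before any replication or filesystem overhead.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class NodeSpec:
    name: str
    node_count: int
    nodes_per_chassis: int  # assumed reading of "N Node Configuration"
    drives_per_node: int
    drive_gb: int
    ram_gb: int

    @property
    def chassis(self):
        # Ceiling division so a partly filled chassis still counts.
        return -(-self.node_count // self.nodes_per_chassis)

    @property
    def raw_storage_gb(self):
        return self.node_count * self.drives_per_node * self.drive_gb

    @property
    def total_ram_gb(self):
        return self.node_count * self.ram_gb


DATA = NodeSpec("data", 348, 12, 4, 1000, 32)       # 4x1TB, 32GB RAM per node
MGMT = NodeSpec("management", 12, 4, 6, 480, 128)   # 6x480GB SSD, 128GB RAM per node

for spec in (DATA, MGMT):
    print(
        f"{spec.name}: {spec.chassis} chassis, "
        f"{spec.raw_storage_gb / 1000:,.1f} TB raw storage, "
        f"{spec.total_ram_gb:,} GB RAM"
    )
```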
Glamor Shot
Same Price, Different Outcome
Diagrams!
High-Level 
Request Flow 
Architecture
Detailed Flow
Data Pipeline 
Tapjoy-1
Plan For Failure 
● Hardware 
○ I’m Not Saying You Shouldn’t Use Ceph… 
■ But You’ll Notice it’s Absent Here 
● Service Boundaries 
○ Have Hardware & Software Contingencies 
■ Backup Links 
■ Temporary Cache(s) 
○ Actually Test Failure in Production
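The "backup links" and "temporary cache(s)" contingencies at a service boundary might look something like the following sketch. The function and the stand-in services are hypothetical; the talk does not describe Tapjoy's actual fallback logic.

```python
def fetch_with_fallback(key, primary, backup, cache):
    """Try the primary link, then the backup link, then a temporary cache.

    `primary` and `backup` are callables that raise ConnectionError on failure.
    `cache` is a dict holding the last value successfully fetched per key.
    """
    for source in (primary, backup):
        try:
            value = source(key)
        except ConnectionError:
            continue  # this link is down, try the next one
        cache[key] = value  # refresh the cache on every success
        return value
    if key in cache:
        return cache[key]  # possibly stale, but better than failing outright
    raise ConnectionError(f"no link or cached value available for {key!r}")


def healthy(key):
    return f"value-for-{key}"


def broken(key):
    raise ConnectionError("link down")


cache = {}
first = fetch_with_fallback("user:1", broken, healthy, cache)   # served by backup link
second = fetch_with_fallback("user:1", broken, broken, cache)   # served from cache
print(first, second)
```

Passing `broken` for both links is also a cheap way to rehearse the failure path, in the spirit of testing failure rather than assuming the contingency works.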
Info 
● Twitter! @dustywes 
● Email: wes@tapjoy.com

Tapjoy OpenStack Summit Paris Breakout Session
