SlideShare a Scribd company logo
BERLIN • NEW YORK • SAN FRANCISCO • SÃO PAULO • PARIS • LONDON • MOSCOW • ISTANBUL
SEOUL • SHANGHAI • BEIJING • TOKYO • MUMBAI • SINGAPORE
Scaling a platform for
real-time fraud detection
2
Why is no one prepared for
success?
3
‣ Focus on idea
‣ Focus on execution is limited in time to market
‣ Underestimating importance of scalability and profitability
Startup Zeitgeist
4
‣ Director of Engineering
‣ Consulted with early design decisions
‣ Joined Adjust 2012 as Head of IT Operations
Who am I?
Who is Adjust?
1B+
Daily active users tracked
400+ Billion
Data points tracked monthly
22K+
Apps tracked
6
‣ Automatically reject fraudulent data before it gets paid
‣ Sends rejection reason callbacks to all parties
‣ Customers and their networks have full transparency
‣ Real-time statistical analysis of all ad engagements and app activity
Fraud Prevention Suite
7
How did we do that?
8
Bootstrapping a product
Current approach
‣ Using cloud services
‣ Base product on 3rd party IP
‣ Faster development
9
Bootstrapping a product
Current approach Our approach
‣ Using cloud services
‣ Base product on 3rd party IP
‣ Faster development
‣ Own infrastructure
‣ Develop own IP
‣ Slower ramp up
10
Bootstrapping a product
Current approach Our approach
‣ Using cloud services
‣ Base product on 3rd party IP
‣ Faster development
‣ Potential lock in
‣ More cost efficient in the beginning
‣ Shared environment
‣ Own infrastructure
‣ Develop own IP
‣ Slower ramp up
11
Bootstrapping a product
Current approach Our approach
‣ Using cloud services
‣ Base product on 3rd party IP
‣ Faster development
‣ Potential lock in
‣ More cost efficient in the beginning
‣ Shared environment
‣ Own infrastructure
‣ Develop own IP
‣ Slower ramp up
‣ Keep flexibility
‣ More cost efficient in the long run
‣ Dedicated environment
12
The machine room
13
General Overview
Realtime Callbacks
Raw Data Upload
Realtime Fraud
Prevention
Realtime
Aggregation
SDK Traffic Tracking Server Device Data
Store
14
So it begins…
15
Redis Small
haproxy
Redis Leader
Redis Replica
16
A little more please?
17
Redis Big
haproxy
Redis Leader
Redis Replica
18
We need to go further
19
It’s all connected :)
Leader 1 Replica 1 Leader n Replica n
Sentinel 1 Sentinel n
twemproxy twemproxy
20
And we felt like…
21
22
‣ Automation is tricky
Smoke in the machine room
23
‣ Automation is tricky
‣ Weird latency spikes
Smoke in the machine room
24
‣ Automation is tricky
‣ Weird latency spikes
‣ Increased average response time
Smoke in the machine room
25
‣ Automation is tricky
‣ Weird latency spikes
‣ Increased average response time
‣ Long failover times
‣ Disruptive failovers
Smoke in the machine room
26
And we felt like…
27
28
Sometimes you have to let go…
29
‣ Only index is in RAM
‣ Data is on SSD
‣ Cost reduced by 85%
‣ Server count reduced from 40 to 6
Migrating to Aerospike
30
‣ 1,1 million reads per second
‣ 500.000 writes per second
‣ 180TB of data
‣ 3 locations
Using Aerospike
31
What if?
32
‣ Easy to scale
‣ Redis interface
‣ Online resizing
‣ All dirty work is done by Amazon
Elasticache
33
Q: Can I use Amazon ElastiCache for use cases other
than caching?
A: Yes. ElastiCache for Redis can be used as a primary
in-memory key-value data store, providing fast, sub
millisecond data performance, high availability and
scalability up to 15 nodes plus up to 5 read replicas,
each of up to 9.5 TiB of in-memory data.
34
“Once we are successful, we will
take care of scalability and
profitability”
35
What will happen to my
infrastructure when I will be
successful?
36
Questions and answers
New York
Paris
São Paulo
San Francisco
London Berlin
Istanbul
Moscow
Mumbai
Beijing
Seoul
Tokyo
Shanghai
Singapore
Robert Abraham
DIRECTOR OF ENGINEERING

robert@adjust.com
ADJUST HQ

Saarbrücker Str. 37a

10405 Berlin

Germany

More Related Content

PDF
New Relic Infrastructure: Servers Transition August 2017
PPTX
Customer case - Dynatrace Monitoring Redefined
PDF
Operational Analytics at Credit Suisse from ThousandEyes Connect
PDF
How to Handle the Realities of DevOps Monitoring Today
ODP
Monitoring via Datadog
PDF
2016 09 measurecamp - event data modeling
PDF
Top-Down Approach to Monitoring
PPTX
Production Operations An Architect And Developers Perspective (Without Notes)
New Relic Infrastructure: Servers Transition August 2017
Customer case - Dynatrace Monitoring Redefined
Operational Analytics at Credit Suisse from ThousandEyes Connect
How to Handle the Realities of DevOps Monitoring Today
Monitoring via Datadog
2016 09 measurecamp - event data modeling
Top-Down Approach to Monitoring
Production Operations An Architect And Developers Perspective (Without Notes)

What's hot (12)

PDF
Cloud-Native Workshop New York- Dynatrace
PDF
Accelerate to Cloud
PPTX
SplunkLive! Utrecht 2017 - ASML Customer Presentation
PDF
Turning Cloud Metrics into Results
PPTX
SplunkLive! Utrecht 2016 - Exact
PDF
Provide Company Overview
PDF
Microservices: A foundational approach for fully managed cloud data analytics
PDF
Monitoring with Elastic Machine Learning at Sky
PDF
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
PDF
LogicMonitor: An Overview
PPT
Implementing Powerful IT Search on the Cloud
Cloud-Native Workshop New York- Dynatrace
Accelerate to Cloud
SplunkLive! Utrecht 2017 - ASML Customer Presentation
Turning Cloud Metrics into Results
SplunkLive! Utrecht 2016 - Exact
Provide Company Overview
Microservices: A foundational approach for fully managed cloud data analytics
Monitoring with Elastic Machine Learning at Sky
Future-Proof Your Streaming Analytics Architecture- StreamAnalytix Webinar
LogicMonitor: An Overview
Implementing Powerful IT Search on the Cloud
Ad

Similar to Big Data LDN 2018: SCALING A PLATFORM FOR REAL-TIME FRAUD DETECTION WITHOUT BREAKING THE BANK (20)

PDF
Lambda Architectures in Practice
PDF
SACON - Incident Response Automation & Orchestration (Amit Modi)
PPTX
State of Streams | Gwen Shapira, Fall 2018
PDF
Fifteen Years of DevOps -- LISA 2012 keynote
PDF
AppSphere 15 - Deep Dive into AppDynamics Application Analytics
PDF
Acting on Real-time Behavior: How Peak Games Won Transactions
PPTX
Pivoting to Cloud: How an MSP Brokers Cloud Services
PPTX
Office 365 Monitoring Best Practices
PPTX
(R)evolutionize APM
PDF
VMworld 2013: How to make most out of your Hybrid Cloud
PDF
Taming Big Data With Modern Software Architecture
PPTX
Buisness drivers for real-time streaming analytics integrated to action frame...
PPTX
Disaster Recovery: Don't risk it--automate it
KEY
Selecting SaaS providers
PDF
RightScale Roadtrip - Accelerate To Cloud
PDF
A Trifecta of Real-Time Applications: Apache Kafka, Flink, and Druid
PPTX
The Business Justification for APM
PPTX
The new dominant companies are running on data
PPTX
The Future of Infrastructure: Key Trends to consider
PDF
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Lambda Architectures in Practice
SACON - Incident Response Automation & Orchestration (Amit Modi)
State of Streams | Gwen Shapira, Fall 2018
Fifteen Years of DevOps -- LISA 2012 keynote
AppSphere 15 - Deep Dive into AppDynamics Application Analytics
Acting on Real-time Behavior: How Peak Games Won Transactions
Pivoting to Cloud: How an MSP Brokers Cloud Services
Office 365 Monitoring Best Practices
(R)evolutionize APM
VMworld 2013: How to make most out of your Hybrid Cloud
Taming Big Data With Modern Software Architecture
Buisness drivers for real-time streaming analytics integrated to action frame...
Disaster Recovery: Don't risk it--automate it
Selecting SaaS providers
RightScale Roadtrip - Accelerate To Cloud
A Trifecta of Real-Time Applications: Apache Kafka, Flink, and Druid
The Business Justification for APM
The new dominant companies are running on data
The Future of Infrastructure: Key Trends to consider
Big Data LDN 2017: The New Dominant Companies Are Running on Data
Ad

More from Matt Stubbs (20)

PDF
Blueprint Series: Banking In The Cloud – Ultra-high Reliability Architectures
PDF
Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...
PDF
Blueprint Series: Expedia Partner Solutions, Data Platform
PDF
Blueprint Series: Architecture Patterns for Implementing Serverless Microserv...
PDF
Big Data LDN 2018: DATA, WHAT PEOPLE THINK AND WHAT YOU CAN DO TO BUILD TRUST.
PDF
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
PDF
Big Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQL
PDF
Big Data LDN 2018: ENABLING DATA-DRIVEN DECISIONS WITH AUTOMATED INSIGHTS
PDF
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
PDF
Big Data LDN 2018: AI VS. GDPR
PDF
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
PDF
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
PDF
Big Data LDN 2018: MICROSOFT AZURE AND CLOUDERA – FLEXIBLE CLOUD, WHATEVER TH...
PDF
Big Data LDN 2018: CONSISTENT SECURITY, GOVERNANCE AND FLEXIBILITY FOR ALL WO...
PDF
Big Data LDN 2018: MICROLISE: USING BIG DATA AND AI IN TRANSPORT AND LOGISTICS
PDF
Big Data LDN 2018: EXPERIAN: MAXIMISE EVERY OPPORTUNITY IN THE BIG DATA UNIVERSE
PDF
Big Data LDN 2018: A LOOK INSIDE APPLIED MACHINE LEARNING
PDF
Big Data LDN 2018: DEUTSCHE BANK: THE PATH TO AUTOMATION IN A HIGHLY REGULATE...
PDF
Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...
PDF
Big Data LDN 2018: DATA APIS DON’T DISCRIMINATE
Blueprint Series: Banking In The Cloud – Ultra-high Reliability Architectures
Speed Up Your Apache Cassandra™ Applications: A Practical Guide to Reactive P...
Blueprint Series: Expedia Partner Solutions, Data Platform
Blueprint Series: Architecture Patterns for Implementing Serverless Microserv...
Big Data LDN 2018: DATA, WHAT PEOPLE THINK AND WHAT YOU CAN DO TO BUILD TRUST.
Big Data LDN 2018: DATABASE FOR THE INSTANT EXPERIENCE
Big Data LDN 2018: BIG DATA TOO SLOW? SPRINKLE IN SOME NOSQL
Big Data LDN 2018: ENABLING DATA-DRIVEN DECISIONS WITH AUTOMATED INSIGHTS
Big Data LDN 2018: DATA MANAGEMENT AUTOMATION AND THE INFORMATION SUPPLY CHAI...
Big Data LDN 2018: AI VS. GDPR
Big Data LDN 2018: REALISING THE PROMISE OF SELF-SERVICE ANALYTICS WITH DATA ...
Big Data LDN 2018: TURNING MULTIPLE DATA LAKES INTO A UNIFIED ANALYTIC DATA L...
Big Data LDN 2018: MICROSOFT AZURE AND CLOUDERA – FLEXIBLE CLOUD, WHATEVER TH...
Big Data LDN 2018: CONSISTENT SECURITY, GOVERNANCE AND FLEXIBILITY FOR ALL WO...
Big Data LDN 2018: MICROLISE: USING BIG DATA AND AI IN TRANSPORT AND LOGISTICS
Big Data LDN 2018: EXPERIAN: MAXIMISE EVERY OPPORTUNITY IN THE BIG DATA UNIVERSE
Big Data LDN 2018: A LOOK INSIDE APPLIED MACHINE LEARNING
Big Data LDN 2018: DEUTSCHE BANK: THE PATH TO AUTOMATION IN A HIGHLY REGULATE...
Big Data LDN 2018: FROM PROLIFERATION TO PRODUCTIVITY: MACHINE LEARNING DATA ...
Big Data LDN 2018: DATA APIS DON’T DISCRIMINATE

Recently uploaded (20)

PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PDF
Mega Projects Data Mega Projects Data
PPTX
Introduction to Knowledge Engineering Part 1
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PPTX
IB Computer Science - Internal Assessment.pptx
PDF
Lecture1 pattern recognition............
PPTX
1_Introduction to advance data techniques.pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
Introduction to machine learning and Linear Models
PPT
Quality review (1)_presentation of this 21
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPT
Reliability_Chapter_ presentation 1221.5784
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
STUDY DESIGN details- Lt Col Maksud (21).pptx
Mega Projects Data Mega Projects Data
Introduction to Knowledge Engineering Part 1
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
IB Computer Science - Internal Assessment.pptx
Lecture1 pattern recognition............
1_Introduction to advance data techniques.pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Business Acumen Training GuidePresentation.pptx
Introduction to machine learning and Linear Models
Quality review (1)_presentation of this 21
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Reliability_Chapter_ presentation 1221.5784

Big Data LDN 2018: SCALING A PLATFORM FOR REAL-TIME FRAUD DETECTION WITHOUT BREAKING THE BANK

  • 1. BERLIN • NEW YORK • SAN FRANCISCO • SÃO PAULO • PARIS • LONDON • MOSCOW • ISTANBUL SEOUL • SHANGHAI • BEIJING • TOKYO • MUMBAI • SINGAPORE Scaling a platform for real-time fraud detection
  • 2. 2 Why is no one prepared for success?
  • 3. 3 ‣ Focus on idea ‣ Focus on execution is limited in time to market ‣ Underestimating importance of scalability and profitability Startup Zeitgeist
  • 4. 4 ‣ Director of Engineering ‣ Consulted with early design decisions ‣ Joined Adjust 2012 as Head of IT Operations Who am I?
  • 5. Who is Adjust? 1B+ Daily active users tracked 400+ Billion Data points tracked monthly 22K+ Apps tracked
  • 6. 6 ‣ Automatically reject fraudulent data before it gets paid ‣ Sends rejection reason callbacks to all parties ‣ Customers and their networks have full transparency ‣ Real-time statistical analysis of all ad engagements and app activity Fraud Prevention Suite
  • 7. 7 How did we do that?
  • 8. 8 Bootstrapping a product Current approach ‣ Using cloud services ‣ Base product on 3rd party IP ‣ Faster development
  • 9. 9 Bootstrapping a product Current approach Our approach ‣ Using cloud services ‣ Base product on 3rd party IP ‣ Faster development ‣ Own infrastructure ‣ Develop own IP ‣ Slower ramp up
  • 10. 10 Bootstrapping a product Current approach Our approach ‣ Using cloud services ‣ Base product on 3rd party IP ‣ Faster development ‣ Potential lock in ‣ More cost efficient in the beginning ‣ Shared environment ‣ Own infrastructure ‣ Develop own IP ‣ Slower ramp up
  • 11. 11 Bootstrapping a product Current approach Our approach ‣ Using cloud services ‣ Base product on 3rd party IP ‣ Faster development ‣ Potential lock in ‣ More cost efficient in the beginning ‣ Shared environment ‣ Own infrastructure ‣ Develop own IP ‣ Slower ramp up ‣ Keep flexibility ‣ More cost efficient in the long run ‣ Dedicated environment
  • 13. 13 General Overview Realtime Callbacks Raw Data Upload Realtime Fraud Prevention Realtime Aggregation SDK Traffic Tracking Server Device Data Store
  • 16. 16 A little more please?
  • 18. 18 We need to go further
  • 19. 19 It’s all connected :) Leader 1 Replica 1 Leader n Replica n Sentinel 1 Sentinel n twemproxy twemproxy
  • 20. 20 And we felt like…
  • 21. 21
  • 22. 22 ‣ Automation is tricky Smoke in the machine room
  • 23. 23 ‣ Automation is tricky ‣ Weird latency spikes Smoke in the machine room
  • 24. 24 ‣ Automation is tricky ‣ Weird latency spikes ‣ Increased average response time Smoke in the machine room
  • 25. 25 ‣ Automation is tricky ‣ Weird latency spikes ‣ Increased average response time ‣ Long failover times ‣ Disruptive failovers Smoke in the machine room
  • 26. 26 And we felt like…
  • 27. 27
  • 28. 28 Sometimes you have to let go…
  • 29. 29 ‣ Only index is in RAM ‣ Data is on SSD ‣ Cost reduced by 85% ‣ Server count reduced from 40 to 6 Migrating to Aerospike
  • 30. 30 ‣ 1,1 million reads per second ‣ 500.000 writes per second ‣ 180TB of data ‣ 3 locations Using Aerospike
  • 32. 32 ‣ Easy to scale ‣ Redis interface ‣ Online resizing ‣ All dirty work is done by Amazon Elasticache
  • 33. 33 Q: Can I use Amazon ElastiCache for use cases other than caching? A: Yes. ElastiCache for Redis can be used as a primary in-memory key-value data store, providing fast, sub millisecond data performance, high availability and scalability up to 15 nodes plus up to 5 read replicas, each of up to 9.5 TiB of in-memory data.
  • 34. 34 “Once we are successful, we will take care of scalability and profitability”
  • 35. 35 What will happen to my infrastructure when I will be successful?
  • 37. New York Paris São Paulo San Francisco London Berlin Istanbul Moscow Mumbai Beijing Seoul Tokyo Shanghai Singapore Robert Abraham DIRECTOR OF ENGINEERING
 robert@adjust.com ADJUST HQ
 Saarbrücker Str. 37a
 10405 Berlin
 Germany