SomeSQL 
Scaling in a changing world 
of databases and hardware 
Alistair Hann 
CTO, Skyscanner
Buzzwords
Web 2.0
Year of Mobile
Big Data
NoSQL
Scaling the live pricing 
cache
SomeSQL at Skyscanner - Scaling in a changing world of databases and hardware
Website Native Apps APIs and White 
Labels 
Traditional Airlines Budget Airlines Online Travel Agencies
SomeSQL at Skyscanner - Scaling in a changing world of databases and hardware
SomeSQL at Skyscanner - Scaling in a changing world of databases and hardware
Prices +Timetables 
Data Collection 
Services 
1) Which websites 
should we show? 
2) What prices do 
we already have 
cached? 
3) Live update 
what we still 
need. 
5) Return the 
prices to the user. 
4) Clean up and 
save the new data 
Live Pricing 
Service
Live Pricing 
Service
Live Pricing 
Service 
Cached Prices (key/value)
2000 quotes per second 
2 bn itineraries and quotes 
270 GB table 
250 GB indices
What we really needed 
Consistency 
Horizontal Scaling 
Elasticity 
Persistence 
Speed 
Resilience 
Simplicity
Live Pricing 
Service 
Cached Prices (key/value)
SomeSQL at Skyscanner - Scaling in a changing world of databases and hardware
SomeSQL at Skyscanner - Scaling in a changing world of databases and hardware
SomeSQL at Skyscanner - Scaling in a changing world of databases and hardware
Beyond key value
SomeSQL at Skyscanner - Scaling in a changing world of databases and hardware
Couchbase – Map Reduce Views 
{ 
"website": { 
"published": true, 
"id": "affd", ... 
}, 
"office_id": "1", 
"city_id": "AUHA", 
"raw_data": [...] 
"address": 
"closing_time": "00:00", 
"routenodeid": "9618", 
"type": "office“ 
}
What about the 
hardware?
Disk for VMs 
c.f. 250,000 
iops Fusion I/O 
Standard $0.03 / GB 
Glacier $0.01 / GB
Quote 
Bus 
staging 
UK1 
Thrift 
long-term archive 
GZIP 
queryable 
hierarchical 
LZO 
queryable 
flat 
filter 
Loader 
GZIP 
Quote 
Bus 
UK2 
Thrift 
Loader
hierarchical 
flat 
Hadoop cluster 
or 
Elastic MapReduce 
analysts 
query 
load 
analytical tools 
feed 
export
The death of the data 
warehouse
A distributed database… 
Fluentd 
Kafka reporting 
Graphite 
Stitched 
events 
Fluentd 
Stitched 
events 
Operational 
metrics 
Raw JSON events Errors 
Elastic 
MapReduce 
Raw 
Events 
Trigger and view materialization Indexes on the data
Some things don’t 
change
CC Images courtesy of chaya760 on Flickr
We still face the same challenges 
RAM and Disk i/o concerns 
Administration 
Security 
Data insert and retrieval 
Monitoring and alerting 
Performance optimization
The report of my death 
was an exaggeration
Elastic Search
NoSQL
Microsoft SQL Server
Relational Vs NoSQL
Edinburgh 
Quartermile One 
15 Lauriston Place 
Edinburgh 
EH3 9EN 
Glasgow 
5th floor, 
151-155 St Vincent 
St, Glasgow 
G2 5NW 
Singapore 
No. 08-01&04 & 09- 
04 
8th floor, 
Robinson Point, 
39 Robinson Rd, 
Singapore 
Beijing 
Level 19, Tower E2, 
Oriental Plaza, 
No. 1 East Chang An 
Avenue, 
Dong Cheng District, 
Beijing 100738 
Miami 
1395 Brickell Ave, 
Suite 900, 
Miami, 
Florida 33131 
Barcelona 
Torre NN, 
Calle Tarragona, 157, 
4a Planta, 
Barcelona, 08014 
thank you

More Related Content

PDF
Data science challenges in flight search
PPTX
Operating samza at skyscanner
PPTX
Alistair Hann of Skyscanner
PPTX
Hack UPC
PPTX
Conversion Camp: The complexity of love and conversions, Skyscanner
PDF
Resume_General
PPTX
Spark Summit East Keynote by Anjul Bhambhri
PPTX
Postgres Sql at TripAdvisor
Data science challenges in flight search
Operating samza at skyscanner
Alistair Hann of Skyscanner
Hack UPC
Conversion Camp: The complexity of love and conversions, Skyscanner
Resume_General
Spark Summit East Keynote by Anjul Bhambhri
Postgres Sql at TripAdvisor

Viewers also liked (16)

PPTX
THack @ WIT - Skyscanner presentation
PPT
What is inventor2
PPT
Walkercare vs Obamacare: Which Do You Choose?
PPTX
Skyscanner: to be local or not to be.
PPTX
Skyscanner: Abandoning conventional wisdom for hypergrowth
PPTX
Alexa, ask Skyscanner...
PPT
Tweeting beyond Facts – The Need for a Linguistic Perspective
PPTX
Agile @Skyscanner : From Theory to Practice
PDF
High flying technology (Greg Urquhart) SCOT100915
PPT
Got Healthcare? The Affordable Care Act
PDF
trivago - Just a website?
PPTX
Salesforce Revenue model
PPTX
Demystifying hotel metasearch
PPT
AIS, Airline Information System, Pilot Project
PPTX
Webinar 1 what is meta
PPTX
Direct booking summit
THack @ WIT - Skyscanner presentation
What is inventor2
Walkercare vs Obamacare: Which Do You Choose?
Skyscanner: to be local or not to be.
Skyscanner: Abandoning conventional wisdom for hypergrowth
Alexa, ask Skyscanner...
Tweeting beyond Facts – The Need for a Linguistic Perspective
Agile @Skyscanner : From Theory to Practice
High flying technology (Greg Urquhart) SCOT100915
Got Healthcare? The Affordable Care Act
trivago - Just a website?
Salesforce Revenue model
Demystifying hotel metasearch
AIS, Airline Information System, Pilot Project
Webinar 1 what is meta
Direct booking summit
Ad

Similar to SomeSQL at Skyscanner - Scaling in a changing world of databases and hardware (20)

PPTX
StrongLoop Overview
PPTX
Coud-based Data Lake for Analytics and AI
PPT
Technology Overview
PDF
Unifying Analytics
PDF
EDA Meets Data Engineering – What's the Big Deal?
PPTX
Turbocharged Data - Leveraging Azure Data Explorer for Real-Time Insights fro...
PPTX
Flink Forward San Francisco 2018: - Jinkui Shi and Radu Tudoran "Flink real-t...
PDF
Elasticsearch in Netflix
PDF
Transforming Mobile Push Notifications with Big Data
PPTX
Unify Analytics: Combine Strengths of Data Lake and Data Warehouse
PDF
Real time analytics on deep learning @ strata data 2019
PPTX
Understanding The Azure Platform March 2010
PPT
Tspbug 2 24_2014_final
PDF
MongoDB World 2019: Streaming ETL on the Shoulders of Giants
PDF
Keynote sp summit 2014 final
PPTX
Azure Stream Analytics : Analyse Data in Motion
PDF
AWS Analytics Immersion Day - Build BI System from Scratch (Day1, Day2 Full V...
PDF
Customer migration to azure sql database from on-premises SQL, for a SaaS app...
PDF
AWS를 활용한 Big Data 실전 배치 사례 :: 이한주 :: AWS Summit Seoul 2016
PPTX
Enabling Machine Learning with Apache Flink - Sherin Thomas, Lyft
StrongLoop Overview
Coud-based Data Lake for Analytics and AI
Technology Overview
Unifying Analytics
EDA Meets Data Engineering – What's the Big Deal?
Turbocharged Data - Leveraging Azure Data Explorer for Real-Time Insights fro...
Flink Forward San Francisco 2018: - Jinkui Shi and Radu Tudoran "Flink real-t...
Elasticsearch in Netflix
Transforming Mobile Push Notifications with Big Data
Unify Analytics: Combine Strengths of Data Lake and Data Warehouse
Real time analytics on deep learning @ strata data 2019
Understanding The Azure Platform March 2010
Tspbug 2 24_2014_final
MongoDB World 2019: Streaming ETL on the Shoulders of Giants
Keynote sp summit 2014 final
Azure Stream Analytics : Analyse Data in Motion
AWS Analytics Immersion Day - Build BI System from Scratch (Day1, Day2 Full V...
Customer migration to azure sql database from on-premises SQL, for a SaaS app...
AWS를 활용한 Big Data 실전 배치 사례 :: 이한주 :: AWS Summit Seoul 2016
Enabling Machine Learning with Apache Flink - Sherin Thomas, Lyft
Ad

Recently uploaded (20)

PDF
How AI/LLM recommend to you ? GDG meetup 16 Aug by Fariman Guliev
PDF
Ableton Live Suite for MacOS Crack Full Download (Latest 2025)
PDF
AI-Powered Threat Modeling: The Future of Cybersecurity by Arun Kumar Elengov...
PPTX
Monitoring Stack: Grafana, Loki & Promtail
PPTX
assetexplorer- product-overview - presentation
PDF
DNT Brochure 2025 – ISV Solutions @ D365
PDF
Topaz Photo AI Crack New Download (Latest 2025)
PDF
AI/ML Infra Meetup | LLM Agents and Implementation Challenges
PDF
Time Tracking Features That Teams and Organizations Actually Need
PDF
Cost to Outsource Software Development in 2025
DOCX
Modern SharePoint Intranet Templates That Boost Employee Engagement in 2025.docx
PDF
DuckDuckGo Private Browser Premium APK for Android Crack Latest 2025
DOCX
How to Use SharePoint as an ISO-Compliant Document Management System
PPTX
"Secure File Sharing Solutions on AWS".pptx
PDF
Types of Token_ From Utility to Security.pdf
PPTX
Patient Appointment Booking in Odoo with online payment
PDF
EaseUS PDF Editor Pro 6.2.0.2 Crack with License Key 2025
PDF
CCleaner 6.39.11548 Crack 2025 License Key
PPTX
Log360_SIEM_Solutions Overview PPT_Feb 2020.pptx
PDF
AI Guide for Business Growth - Arna Softech
How AI/LLM recommend to you ? GDG meetup 16 Aug by Fariman Guliev
Ableton Live Suite for MacOS Crack Full Download (Latest 2025)
AI-Powered Threat Modeling: The Future of Cybersecurity by Arun Kumar Elengov...
Monitoring Stack: Grafana, Loki & Promtail
assetexplorer- product-overview - presentation
DNT Brochure 2025 – ISV Solutions @ D365
Topaz Photo AI Crack New Download (Latest 2025)
AI/ML Infra Meetup | LLM Agents and Implementation Challenges
Time Tracking Features That Teams and Organizations Actually Need
Cost to Outsource Software Development in 2025
Modern SharePoint Intranet Templates That Boost Employee Engagement in 2025.docx
DuckDuckGo Private Browser Premium APK for Android Crack Latest 2025
How to Use SharePoint as an ISO-Compliant Document Management System
"Secure File Sharing Solutions on AWS".pptx
Types of Token_ From Utility to Security.pdf
Patient Appointment Booking in Odoo with online payment
EaseUS PDF Editor Pro 6.2.0.2 Crack with License Key 2025
CCleaner 6.39.11548 Crack 2025 License Key
Log360_SIEM_Solutions Overview PPT_Feb 2020.pptx
AI Guide for Business Growth - Arna Softech

SomeSQL at Skyscanner - Scaling in a changing world of databases and hardware

Editor's Notes

  • #30: Approach heavily inspired by http://engineering.linkedin.com/distributed-systems/log-what-every-software-engineer-should-know-about-real-time-datas-unifying