SlideShare a Scribd company logo
Fighting with scale
things that “Getting started” doesn’t cover
Adam Dubiel
We’re building a New World
We used to have these..
..and now we have these
Microservices are needy
discovery
messaging
deployment
metrics
monitoring
storage
Need for automation
env business
env business
Choosing the right tools
can be hard
Fighting with scale
and then came docker..
Metrics - graphite
building blocks
frontend
dashboard
aggregator
storage
building blocks
graphite-web
tessera
carbon
whisper
building blocks
graphite-web
tessera
carbon
ceres
building blocks
graphite-web
tessera
carbon
cyanite
frontend
dashboard
influxdb
building blocks
“getting started” setup
graphite-web
tessera
carbon
whisper
1st iteration
1st iteration - overall
1st iteration - memory
1st iteration - cause
2nd iteration
OTHER
TECH
OTHER
TECH
2nd iteration - memory
2nd iteration - overall
not only soft, drivers too
Metrics - StatsD
architecture
Host1
Host2
Graphite
metric.host1.rate
metric.host2.rate
OK
architecture
Host1
Host2
Graphite
metric.rate
metric.rate
CLASH!
architecture
Host1
Host2
Graphite
metric.rate
metric.rate
OKStatsD
problem?
how to scale?
StatsDStatsD
StatsD proxy
how to scale proxy…?
Messaging - Hermes
architecture
Hermes
Frontend
Hermes
Consumer
Kafka is great!
performant
fault-tolerant
robust
but what with SLA?
strict SLA, p99 < 100ms
stable p999 would be great
cluster manipulations can be painful
and beware of abusive clients
1st iteration - Kafka leaders
1st iteration - Kafka leaders
1st iteration - Kafka leaders
Kafka does not distribute leaders uniformly
simple take-first algorithm
no tools to make it happen
go with #!/usr/bin/python
2nd iteration - buffering
simple solution to complex problem
use Kafka producer buffer
we can operate without Kafka for 1h!
Fighting with scale
and when we thought we have it all covered..
bam!
corrupt message logs
corrupt message logs
Kafka High Level Consumer can’t skip corrupt parts
no easy solution
time pressure - 24h retency
#!/usr/bin/python to the rescue!
Lessons learned
Production traffic is only on.. Production
prepare for the Unknown
..but mostly for problems
scripts are your friends when time is the enemy
centralisation sounds appealing
(but seriously, don’t)
focus on
knowhow
agility
ability to create clusters on demand
isolate
allegrotech.io
@allegrotechblog

More Related Content

PDF
JDD 2014: Adam Dubiel - Import allegro.tech.internal.*
PDF
Scaling infrastructure beyond containers
PDF
RxJava & Hystrix - Perfect match for distributed applications
PPTX
allegro.tech Data Science Meetup #2: Elasticsearch w praktyce
PDF
Microservices architecture pitfalls
PDF
RxJava - introduction & design
PDF
Confitura 2015 - Mikrousługi nie lubią być samotne
PDF
[WHUG] Wielki brat patrzy - czyli jak zbieramy dane o użytkownikach allegro
JDD 2014: Adam Dubiel - Import allegro.tech.internal.*
Scaling infrastructure beyond containers
RxJava & Hystrix - Perfect match for distributed applications
allegro.tech Data Science Meetup #2: Elasticsearch w praktyce
Microservices architecture pitfalls
RxJava - introduction & design
Confitura 2015 - Mikrousługi nie lubią być samotne
[WHUG] Wielki brat patrzy - czyli jak zbieramy dane o użytkownikach allegro

Similar to Fighting with scale (20)

PPTX
Kubernetes at Spreadshirt - First steps to production
PDF
Introduction to MANTL Data Platform
PDF
Best Practices for Building and Deploying Data Pipelines in Apache Spark
PDF
QConSF 2022 - Backends in Dart
PDF
Bimodal IT and EDW Modernization
PDF
5 Ways to Optimize Your LiDAR Data
PPT
100424 teradata cloud computing 3rd party influencers2c
PDF
Building The Modern IT
PDF
Js foo - Sept 8 upload
PPTX
Crm Saturday Madrid - Test Automation for Dynamics 365
PPTX
Crm saturday madrid 2017 jordi montaña - test automation
PPTX
Applying AI to Performance Engineering: Shift-Left, Shift-Right, Self-Healing
PPT
16 greg hope_com_wics
PDF
What’s new in Microsoft ALM 2013, hosted in Windows Azure, VISUAL STUDIO ONLI...
PDF
Extending 3D Model Visualization with FME 2017
PPTX
Kisters 3DViewStation - your data: how, when, where you want
PDF
Building Reliable Data Lakes at Scale with Delta Lake
PDF
Brand Commerce - We all know the shiny stuff at the front. But what magic is ...
PPTX
Highway to heaven - Voxxed Days Belgrade
PDF
InfoTrack: Creating a single source of truth with the Elastic Stack
Kubernetes at Spreadshirt - First steps to production
Introduction to MANTL Data Platform
Best Practices for Building and Deploying Data Pipelines in Apache Spark
QConSF 2022 - Backends in Dart
Bimodal IT and EDW Modernization
5 Ways to Optimize Your LiDAR Data
100424 teradata cloud computing 3rd party influencers2c
Building The Modern IT
Js foo - Sept 8 upload
Crm Saturday Madrid - Test Automation for Dynamics 365
Crm saturday madrid 2017 jordi montaña - test automation
Applying AI to Performance Engineering: Shift-Left, Shift-Right, Self-Healing
16 greg hope_com_wics
What’s new in Microsoft ALM 2013, hosted in Windows Azure, VISUAL STUDIO ONLI...
Extending 3D Model Visualization with FME 2017
Kisters 3DViewStation - your data: how, when, where you want
Building Reliable Data Lakes at Scale with Delta Lake
Brand Commerce - We all know the shiny stuff at the front. But what magic is ...
Highway to heaven - Voxxed Days Belgrade
InfoTrack: Creating a single source of truth with the Elastic Stack
Ad

Recently uploaded (20)

PDF
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
PDF
Odoo Companies in India – Driving Business Transformation.pdf
PPTX
Log360_SIEM_Solutions Overview PPT_Feb 2020.pptx
PDF
Nekopoi APK 2025 free lastest update
PPTX
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
PDF
wealthsignaloriginal-com-DS-text-... (1).pdf
PDF
medical staffing services at VALiNTRY
PPTX
Transform Your Business with a Software ERP System
PDF
Digital Systems & Binary Numbers (comprehensive )
PDF
Digital Strategies for Manufacturing Companies
PDF
Design an Analysis of Algorithms II-SECS-1021-03
PDF
Softaken Excel to vCard Converter Software.pdf
PPTX
history of c programming in notes for students .pptx
PDF
Design an Analysis of Algorithms I-SECS-1021-03
PPTX
Reimagine Home Health with the Power of Agentic AI​
PDF
Cost to Outsource Software Development in 2025
PDF
top salesforce developer skills in 2025.pdf
PPTX
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
PDF
Product Update: Alluxio AI 3.7 Now with Sub-Millisecond Latency
PDF
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
Internet Downloader Manager (IDM) Crack 6.42 Build 42 Updates Latest 2025
Odoo Companies in India – Driving Business Transformation.pdf
Log360_SIEM_Solutions Overview PPT_Feb 2020.pptx
Nekopoi APK 2025 free lastest update
Agentic AI Use Case- Contract Lifecycle Management (CLM).pptx
wealthsignaloriginal-com-DS-text-... (1).pdf
medical staffing services at VALiNTRY
Transform Your Business with a Software ERP System
Digital Systems & Binary Numbers (comprehensive )
Digital Strategies for Manufacturing Companies
Design an Analysis of Algorithms II-SECS-1021-03
Softaken Excel to vCard Converter Software.pdf
history of c programming in notes for students .pptx
Design an Analysis of Algorithms I-SECS-1021-03
Reimagine Home Health with the Power of Agentic AI​
Cost to Outsource Software Development in 2025
top salesforce developer skills in 2025.pdf
Agentic AI : A Practical Guide. Undersating, Implementing and Scaling Autono...
Product Update: Alluxio AI 3.7 Now with Sub-Millisecond Latency
T3DD25 TYPO3 Content Blocks - Deep Dive by André Kraus
Ad

Fighting with scale