SlideShare a Scribd company logo
Jimmy Lim SGNOG 5
jhalim@cloudflare.com Singapore, 5 September 2017
Managing Global Distributed Network
Cloudflare in a glance
This is the new style.
● 4+ million zones/domains
● 43+ billion DNS queries/day
● How?
○ Orange cloud
○ Global distributed network
in 120+ locations
Still growing fast!
○ Anycast routing
Protect and accelerate any website online
Benefit of orange cloud
● Direct visitors to the nearest entry point
○ Fast!
■ Lesser hops
■ Reduced latency
● Save bandwidth!
○ Lesser requests to origin
○ Mitigate DDoS
● Resiliency
○ 120+ locations!
Orange cloud vs grey cloud
Building like crazy
1 new PoP per week!
Challenges
● 120+ locations
● 500+ transit/exchange ports
● 500+ network equipments
● Countless alerts!
Managing 120+ PoPs
Building Resilience Network
● Stable hardware and software
● Automatic configuration template/peer review
● Solid monitoring system
● Global network engineering
● Network automation
Hardware and Software
● Proper evaluation and testing
○ Fits requirement
○ Bugs free
○ Scalable
● Global standardization
○ Same models of hardware
○ Same software codes
● No mass software upgrade!
○ Small PoP first
○ Deploy in batches
Solid Monitoring System
● Reduced unwanted alerts
○ Only gets relevant alerts
○ Silence PoP/ports during maintenances
● Monitor the performance of transit providers
○ Detects packet loss on their backbone
○ Provides automatic related traceroutes
○ Actions based on severity
■ Disabling the PoP automatically
■ Disabling traffic on related transit provider automatically
■ Suggests on actions to do
Alerts Channel and Dashboard
Alerts Channel and Dashboard
Global Network Engineering
● Follow the sun approach
○ San Francisco -> Singapore -> London -> San Francisco
● Doing all stuffs
○ Technical operations
○ Network engineering
○ Network expansion projects
○ New PoPs deployment
○ Peering stuffs
● Very fast response to network issues and escalation
Network Automation
● Open source recipe: napalm-salt
Apricot'2017 Network Automation by Salt+Napalm by Mircea Ulinic
Some daily usage examples
● salt [target_device] net.cli "junos_show_command"
● salt-run net.find [target_device]
● salt-run bgp.neighbors [bgp_asn]
● salt [target_device] [anycast.disable | anycast.enable]
● salt [target_device] [transit.disable | transit.enable]
[transit_name]
Net finder salt runner example
BGP salt runner example
Get advertised routes to particular BGP peer
PeeringDB lookup on IXPs info
BGP peering configuration push
Global configuration push
● salt [target_device] net.load_template [config_template]
○ Configuration template with Jinja2 format
○ Dynamic configuration loader
■ Flexible
○ Allow dry-run
○ Example: salt -N junos-cdn-edges net.load_template test-lar
Configuration template file
Global configuration push using template
Peering Update in Asia
In August 2015
● AKL-IX (Auckland)
● APE (Auckland)
● BBIX (Tokyo, Osaka, Singapore)
● Equinix (Hong Kong, Osaka, Singapore, Sydney, Tokyo)
● HKIX (Hong Kong)
● JPIX (Tokyo, Osaka)
● JPNAP (Tokyo, Osaka)
● Megaport (Auckland, Singapore, Sydney)
● PIPE (Sydney)
Addition of Peering Exchange as for now
● AMS-IX (Hong Kong)
● KINX (Incheon)
● MYIX (Kuala Lumpur)
● NSW-IX (Sydney)
● SGIX (Singapore)
● VIC-IX (Melbourne)
● WAIX (Perth)
Globally
● Registered in 150 public peering exchange points
● About 120 of them are up and running!
● More details: https://www.peeringdb.com/asn/13335
● On-going 100G deployment
● Peering Contact: peering@cloudflare.com
Q&A
Managing Global Distributed Network

More Related Content

PDF
MY Orange Cloud - MyIX Peering Forum 2016
PDF
Infrastructure as Code with Terraform: Koombea TechTalks
PDF
OSMC 2013 | Zabbix: A Practical Demo by Rihards Olups
PDF
Go frugal with web services
PDF
21 - IDNOG03 - Jimmy Halim (Cloudflare) - Brief Introduction of CloudFlare, t...
PDF
Nyc storm meetup_robdoherty
PDF
PyConIE 2017 Writing and deploying serverless python applications
PDF
Architecting for the Cloud: Hoping for the best, prepared for the worst
MY Orange Cloud - MyIX Peering Forum 2016
Infrastructure as Code with Terraform: Koombea TechTalks
OSMC 2013 | Zabbix: A Practical Demo by Rihards Olups
Go frugal with web services
21 - IDNOG03 - Jimmy Halim (Cloudflare) - Brief Introduction of CloudFlare, t...
Nyc storm meetup_robdoherty
PyConIE 2017 Writing and deploying serverless python applications
Architecting for the Cloud: Hoping for the best, prepared for the worst

What's hot (19)

PDF
Kamailio presence + json
PPTX
Nagios Conference 2014 - Luis Contreras - Monitoring SAP System with Nagios Core
PDF
re:Invent 2018 recap
PDF
Scalr: Setting Up Automated Scaling
PDF
A Cheapskates Guide to AWS v2.0
PPTX
Architecting for the Cloud: Hoping for the Best, Prepared for the Worst
PPTX
Reduce IT Spend with Software Load Balancing
PDF
Life of a startup - Sjoerd Mulder - Codemotion Amsterdam 2017
ODP
A Cheapskates Guide to AWS
PDF
P99CONF — What We Need to Unlearn About Persistent Storage
PDF
Stream Processing Live Traffic Data with Kafka Streams
PDF
The Dark Side Of Go -- Go runtime related problems in TiDB in production
PPTX
Going Serverless
PDF
Unikraft: Fast, Specialized Unikernels the Easy Way
PDF
Austin bdug 2011_01_27_small_and_big_data
PDF
OpenNebulaConf2018 - Our Journey to OpenNebula - Germán Gutierrez - Booking.com
PDF
Couchbase live 2016
PPTX
AnsibleFest 2019 - Greenfielding Network and Systems Automation in a Large an...
PDF
SOLR Power FTW: short version
Kamailio presence + json
Nagios Conference 2014 - Luis Contreras - Monitoring SAP System with Nagios Core
re:Invent 2018 recap
Scalr: Setting Up Automated Scaling
A Cheapskates Guide to AWS v2.0
Architecting for the Cloud: Hoping for the Best, Prepared for the Worst
Reduce IT Spend with Software Load Balancing
Life of a startup - Sjoerd Mulder - Codemotion Amsterdam 2017
A Cheapskates Guide to AWS
P99CONF — What We Need to Unlearn About Persistent Storage
Stream Processing Live Traffic Data with Kafka Streams
The Dark Side Of Go -- Go runtime related problems in TiDB in production
Going Serverless
Unikraft: Fast, Specialized Unikernels the Easy Way
Austin bdug 2011_01_27_small_and_big_data
OpenNebulaConf2018 - Our Journey to OpenNebula - Germán Gutierrez - Booking.com
Couchbase live 2016
AnsibleFest 2019 - Greenfielding Network and Systems Automation in a Large an...
SOLR Power FTW: short version
Ad

Similar to Managing Global Distributed Network (20)

PDF
The bond between automation and network engineering
PDF
IDNOG3-Jimmy-CloudFlare
PDF
Technical Debt: An Anycast Story
PDF
giip service brochure (en) 150705
PDF
Dev.bg DevOps March 2024 Monitoring & Logging
PDF
Pivotal Greenplum Cloud Marketplaces - Greenplum Summit 2019
PDF
OpenFlow @ Google
PDF
Netty training
PDF
FastNetMon and Metrics
PDF
PDF
TRHUG 2015 - Veloxity Big Data Migration Use Case
PDF
DevOpsDays Tel Aviv DEC 2022 | Building A Cloud-Native Platform Brick by Bric...
PDF
Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uber
PDF
USENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a Month
PDF
An EyeWitness View into your Network
PDF
Building a Small DC
PDF
Multiplier Effect: Case Studies in Distributions for Publishers
PDF
How Sysbee Manages Infrastructures and Provides Advanced Monitoring by Using ...
PDF
SPDY and What to Consider for HTTP/2.0
PDF
SPDY @Zynga
The bond between automation and network engineering
IDNOG3-Jimmy-CloudFlare
Technical Debt: An Anycast Story
giip service brochure (en) 150705
Dev.bg DevOps March 2024 Monitoring & Logging
Pivotal Greenplum Cloud Marketplaces - Greenplum Summit 2019
OpenFlow @ Google
Netty training
FastNetMon and Metrics
TRHUG 2015 - Veloxity Big Data Migration Use Case
DevOpsDays Tel Aviv DEC 2022 | Building A Cloud-Native Platform Brick by Bric...
Kafka Summit NYC 2017 - Scalable Real-Time Complex Event Processing @ Uber
USENIX LISA15: How TubeMogul Handles over One Trillion HTTP Requests a Month
An EyeWitness View into your Network
Building a Small DC
Multiplier Effect: Case Studies in Distributions for Publishers
How Sysbee Manages Infrastructures and Provides Advanced Monitoring by Using ...
SPDY and What to Consider for HTTP/2.0
SPDY @Zynga
Ad

Recently uploaded (20)

PPTX
Funds Management Learning Material for Beg
PPTX
Introduction to Information and Communication Technology
PPT
Design_with_Watersergyerge45hrbgre4top (1).ppt
PPTX
artificial intelligence overview of it and more
PPTX
Introduction about ICD -10 and ICD11 on 5.8.25.pptx
PDF
Cloud-Scale Log Monitoring _ Datadog.pdf
PPTX
INTERNET------BASICS-------UPDATED PPT PRESENTATION
PPTX
newyork.pptxirantrafgshenepalchinachinane
PDF
Best Practices for Testing and Debugging Shopify Third-Party API Integrations...
PPTX
June-4-Sermon-Powerpoint.pptx USE THIS FOR YOUR MOTIVATION
PDF
SASE Traffic Flow - ZTNA Connector-1.pdf
PPTX
Internet___Basics___Styled_ presentation
PDF
Introduction to the IoT system, how the IoT system works
PPTX
SAP Ariba Sourcing PPT for learning material
PPTX
PptxGenJS_Demo_Chart_20250317130215833.pptx
PPTX
introduction about ICD -10 & ICD-11 ppt.pptx
PDF
Exploring VPS Hosting Trends for SMBs in 2025
PDF
Tenda Login Guide: Access Your Router in 5 Easy Steps
PPTX
Slides PPTX World Game (s) Eco Economic Epochs.pptx
PDF
Unit-1 introduction to cyber security discuss about how to secure a system
Funds Management Learning Material for Beg
Introduction to Information and Communication Technology
Design_with_Watersergyerge45hrbgre4top (1).ppt
artificial intelligence overview of it and more
Introduction about ICD -10 and ICD11 on 5.8.25.pptx
Cloud-Scale Log Monitoring _ Datadog.pdf
INTERNET------BASICS-------UPDATED PPT PRESENTATION
newyork.pptxirantrafgshenepalchinachinane
Best Practices for Testing and Debugging Shopify Third-Party API Integrations...
June-4-Sermon-Powerpoint.pptx USE THIS FOR YOUR MOTIVATION
SASE Traffic Flow - ZTNA Connector-1.pdf
Internet___Basics___Styled_ presentation
Introduction to the IoT system, how the IoT system works
SAP Ariba Sourcing PPT for learning material
PptxGenJS_Demo_Chart_20250317130215833.pptx
introduction about ICD -10 & ICD-11 ppt.pptx
Exploring VPS Hosting Trends for SMBs in 2025
Tenda Login Guide: Access Your Router in 5 Easy Steps
Slides PPTX World Game (s) Eco Economic Epochs.pptx
Unit-1 introduction to cyber security discuss about how to secure a system

Managing Global Distributed Network

  • 1. Jimmy Lim SGNOG 5 jhalim@cloudflare.com Singapore, 5 September 2017 Managing Global Distributed Network
  • 3. This is the new style. ● 4+ million zones/domains ● 43+ billion DNS queries/day ● How? ○ Orange cloud ○ Global distributed network in 120+ locations Still growing fast! ○ Anycast routing Protect and accelerate any website online
  • 4. Benefit of orange cloud ● Direct visitors to the nearest entry point ○ Fast! ■ Lesser hops ■ Reduced latency ● Save bandwidth! ○ Lesser requests to origin ○ Mitigate DDoS ● Resiliency ○ 120+ locations!
  • 5. Orange cloud vs grey cloud
  • 7. 1 new PoP per week!
  • 8. Challenges ● 120+ locations ● 500+ transit/exchange ports ● 500+ network equipments ● Countless alerts!
  • 10. Building Resilience Network ● Stable hardware and software ● Automatic configuration template/peer review ● Solid monitoring system ● Global network engineering ● Network automation
  • 11. Hardware and Software ● Proper evaluation and testing ○ Fits requirement ○ Bugs free ○ Scalable ● Global standardization ○ Same models of hardware ○ Same software codes ● No mass software upgrade! ○ Small PoP first ○ Deploy in batches
  • 12. Solid Monitoring System ● Reduced unwanted alerts ○ Only gets relevant alerts ○ Silence PoP/ports during maintenances ● Monitor the performance of transit providers ○ Detects packet loss on their backbone ○ Provides automatic related traceroutes ○ Actions based on severity ■ Disabling the PoP automatically ■ Disabling traffic on related transit provider automatically ■ Suggests on actions to do
  • 13. Alerts Channel and Dashboard
  • 14. Alerts Channel and Dashboard
  • 15. Global Network Engineering ● Follow the sun approach ○ San Francisco -> Singapore -> London -> San Francisco ● Doing all stuffs ○ Technical operations ○ Network engineering ○ Network expansion projects ○ New PoPs deployment ○ Peering stuffs ● Very fast response to network issues and escalation
  • 16. Network Automation ● Open source recipe: napalm-salt Apricot'2017 Network Automation by Salt+Napalm by Mircea Ulinic
  • 17. Some daily usage examples ● salt [target_device] net.cli "junos_show_command" ● salt-run net.find [target_device] ● salt-run bgp.neighbors [bgp_asn] ● salt [target_device] [anycast.disable | anycast.enable] ● salt [target_device] [transit.disable | transit.enable] [transit_name]
  • 18. Net finder salt runner example
  • 19. BGP salt runner example
  • 20. Get advertised routes to particular BGP peer
  • 21. PeeringDB lookup on IXPs info
  • 23. Global configuration push ● salt [target_device] net.load_template [config_template] ○ Configuration template with Jinja2 format ○ Dynamic configuration loader ■ Flexible ○ Allow dry-run ○ Example: salt -N junos-cdn-edges net.load_template test-lar
  • 25. Global configuration push using template
  • 27. In August 2015 ● AKL-IX (Auckland) ● APE (Auckland) ● BBIX (Tokyo, Osaka, Singapore) ● Equinix (Hong Kong, Osaka, Singapore, Sydney, Tokyo) ● HKIX (Hong Kong) ● JPIX (Tokyo, Osaka) ● JPNAP (Tokyo, Osaka) ● Megaport (Auckland, Singapore, Sydney) ● PIPE (Sydney)
  • 28. Addition of Peering Exchange as for now ● AMS-IX (Hong Kong) ● KINX (Incheon) ● MYIX (Kuala Lumpur) ● NSW-IX (Sydney) ● SGIX (Singapore) ● VIC-IX (Melbourne) ● WAIX (Perth)
  • 29. Globally ● Registered in 150 public peering exchange points ● About 120 of them are up and running! ● More details: https://www.peeringdb.com/asn/13335 ● On-going 100G deployment ● Peering Contact: peering@cloudflare.com
  • 30. Q&A