SlideShare a Scribd company logo
Cloud-Scale BGP and NetFlow Analysis
Jim Frey, VP Product, Kentik Technologies
December 15, 2015
2
• Common NetOps Stress points
• Helpful Data Sets – NetFlow, BGP
• Handling NetFlow and BGP at Cloud Scale
• Kentik’s Approach
• Wrap-Up / Q&A
Agenda
R
R
S
S
S
S
S
R
R
S
S
S
S
S
NetOps Stress Points: Needing Instant Answers
How should I allocate my
resources in the future?
Does performance
meet expectations?
Is this an attack or
legitimate traffic?
Where in my network
is the problem?
Things You Need Answers to About/From Your Network
$$$
$$$
$$$
X
4
• Accurate Visibility, Without Delay
• Relevant Alerts: No False Positives or Negatives
• Complete Data: Breadth + Depth
• Fast/Flexible Data Exploration
• Tools that don’t suck (time or $$)
What We Hear….
To Address These Questions, NetOps Needs:
5
What Data Sets Can Help?
And which ones can do the job cost effectively?
6
Primary Network Monitoring Data Choices
Examples
- SNMP, WMI
Advantages
- Ubiquitous
- Good for monitoringdevice
health/status/activity
Disadvantages
- Notraffic detail
- Typically nofrequentthan
every 5 minutes truly anti-
real-time
Polled Stats
Examples
- NetFlow, sFlow, IPFIX
Advantages
- Details on traffic
src/dest/content, etc.
- Very costeffective
Disadvantages
- NRT(near real-time)atbest
- Incomplete app-layer detail
- Limitedperformance metrics
- Data volumes can be massive
Flow Records
Examples
- Packets -> xFlow
- Long term stream-to-disk
Advantages
- Mostcomplete app layer detail
- True real-time (millisecondlvl)
- Complete vendor independent
Disadvantages
- Expensive todeploy at scale
- Requires network tapor SPAN
- Packetcaptures can be massive
Packet Inspection
7
Secondary Network Monitoring Data Choices
Examples
- Syslog
Advantages
- Continuous/streaming
- Unique, device-specific info
- True real-time
Disadvantages
- Nostandards – musthave very
flexible search/mappingtools
- Data volumes can be massive
Log Records
Examples
- OSPF, IGRP, BGP
Advantages
- Details on traffic paths and
provider volumes
- Insights intoInternetfactors
Disadvantages
- Address data only – no
awareness of traffic
- Mustpeer with routers to get
updates
Routing/Path Data
Examples
- IP SLA, Independenttestsw
Advantages
- Assess functions/services 24x7
- Provides both availability and
performance measures
Disadvantages
- Deploying/maintainingenough
agents to achieve full coverage
- Only an approximation of real
user experience (atbest)
Synthetic Agents
8
• You never know which data set will present the specific
insights you need
• The challenge (real magic) comes from correlating
multiple datasets, i.e.:
• Behavioral observations with configuration changes
• Trends with underlying traffic details
• Routing data with traffic data
Key Assertion:
Use Multiple Data Types for Best Results
9
For Providers
• Recognizing newservice opportunities basedon subscriber(and peer) behavior
• Optimizing peering relationships forcostcontrol
For Web Services/ Commerce
• Recognizing where yourcustomers are andhowtheyreach you
• Managing peering relationships forbestcustomerexperience
For Enterprise
• Assessing howyourconnectivityproviders perform/compare
• Building InternetIQ – howyou connect/relate to the outside world
Why Correlate Routing Data with Traffic Data?
10
Cloud Scale for NetFlow
and BGP:
The Big Data Challenge
Why can’t we just use our existing tools?
Cloud, SaaS,
Big Data
Network traffic has grown exponentially;
Legacy tools/tech haven’t kept pace.
Result? Fragmented tools, visibility gaps,
unanswered questions.
Existing Tools: Falling Behind
10M
100M
1G
10G
100G
12
- Network Monitoring Data IS Big Data
- Meets Volume/Variety/Velocity Test
- Billions of records/day (millions/second)
- Big Data architectures are considered best practices today for open/flexible
correlation, analytics
Why Big Data?
13
Existing solutions shortfalls:
- Flexibility for moving between viewpoints and into full details
- Data Completeness due to reliance on summarized/aggregated flow data
- Speed: Generating new analysis in a timely manner
Specific Challenges For NetFlow + BGP
- Network Monitoring Data IS Big Data
- Meets Volume/Variety/Velocity Test
- Billions of records/day (millions/second)
- Big Data architectures are considered best practices today for open/flexible
correlation, analytics
Why Big Data?
14
How to Get/Use Big Data Approach?
15
1. BYO – Build Your Own
• Pick back end & reporting/analysis tools (open source = free?)
• Procure operating platforms (hard, virtual, or cloud servers = $$)
• Integrate, add data sources, and get it up and running (dev = $$)
• Keep it up and running (ops/admin = $$)
How to Get/Use Big Data Approach?
16
1. BYO – Build Your Own
• Pick back end & reporting/analysis tools (open source = free?)
• Procure operating platforms (hard, virtual, or cloud servers = $$)
• Integrate, add data sources, and get it up and running (dev = $$)
• Keep it up and running (ops/admin = $$)
2. Let SOMEONE ELSE build/optimize/operate
• Subscribe to SaaS (ops $$)
• Just Send Your Data and enjoy the ride!
How to Get/Use Big Data Approach?
17
Kentik’s Answer
How we address the Big Data challenge to meet the needs of
Network Operators now
Kentik Detect: the first and only SaaS Solution
For Network Ops Management & Visibility at Terabit Scale
CL OU D -B A S E D RE A L -TIM E M U LTI-TE N A N T OP E N G L OB A L
Analyze & Take Action
Big Data Network
Telemetry Platform
S
S
S
R
R
The Network is
the Sensor
Web Portal
Real-time & historical
queries
NetFlow/
sFlow/IPFIX
SNMP
BGP
Alerts
E-mail / Syslog / JSON
Open API
SQL / RESTful
Kentik
Data Engine
Multi-tiered/Clustered for Scale / Load Balancing / HA, Hosted by Kentik
What’s Behind the Kentik Data Engine
POSTGRES
SERVERS
SQL
DATA STORAGE CLUSTER
NetFlow
SNMP
BGP
INGEST CLUSTER
CLIENTS
N M
Optimized forMassive DataIngest & Rapid Query Response
20
Kentik Portal Dashboard
21
Top Traffic Flows
22
Traffic by Source Geography
23
AS Path Changes
24
AS Top Talkers and Drill Down Options
25
Peering Analytics: ASN by Dest Country Paths
26
Peering Analytics: Traffic by BGP Paths
27
Peering Analytics: Traffic by Origin AS (“Last Hop”)
28
Peering Analytics: Traffic by Transit AS
Key Takeaways: Cloud Scale NetFlow + BGP
Why You Need It
- Clear Insight into external/Internet network traffic behaviors
- Improved customer/subscriber engagement
- Reduced network operating costs
Technical Path to Success
- This is a big data problem, requiring high capacity/speed for data
management, correlation, exploration, and analytics
- SaaS solutions are a fully viable option
Network Intelligence at Terabit Scale
Thank You!
Jim Frey
VP Product
KentikTechnologies
jfrey@kentik.com
@jfrey80

More Related Content

PDF
Kentik Network@Scale (Dan Ellis)
PDF
Cloud Aware Network Management
PDF
Kentik Detect Engine - Network Field Day 2017
PPTX
The Network Knows—Avi Freedman, CEO & Co-Founder of Kentik
PDF
Next-Gen DDoS Detection
PPTX
SIEM Modernization: Build a Situationally Aware Organization with Apache Kafka®
PPTX
Monitoring and Troubleshooting a Real Time Pipeline
PPTX
Flink Case Study: Bouygues Telecom
Kentik Network@Scale (Dan Ellis)
Cloud Aware Network Management
Kentik Detect Engine - Network Field Day 2017
The Network Knows—Avi Freedman, CEO & Co-Founder of Kentik
Next-Gen DDoS Detection
SIEM Modernization: Build a Situationally Aware Organization with Apache Kafka®
Monitoring and Troubleshooting a Real Time Pipeline
Flink Case Study: Bouygues Telecom

What's hot (20)

PDF
Kafka Migration for Satellite Event Streaming Data | Eric Velte, ASRC Federal
PDF
PaNDA - a platform for Network Data Analytics: an overview
PDF
PNDA - Platform for Network Data Analytics
PPTX
Streaming real time data with Vibe Data Stream
PDF
Digital transformation: Highly resilient streaming architecture and strategie...
PDF
Kafka in the Enterprise—A Two-Year Journey to Build a Data Streaming Platform...
PPTX
Shaping a Digital Vision
PPTX
Stream processing IoT time series data with Kafka & InfluxDB | Al Sargent, In...
PDF
How a Data Mesh is Driving our Platform | Trey Hicks, Gloo
PPTX
Kurt Schneider [Discover Financial] | How Discover Modernizes Observability w...
PDF
Apache Flink for IoT: How Event-Time Processing Enables Easy and Accurate Ana...
PDF
Streaming Data Lakes using Kafka Connect + Apache Hudi | Vinoth Chandar, Apac...
PDF
Digital Transformation in Healthcare with Kafka—Building a Low Latency Data P...
PDF
user Behavior Analysis with Session Windows and Apache Kafka's Streams API
PDF
Apache Spark vs Apache Spark: An On-Prem Comparison of Databricks and Open-So...
PPTX
Insight into Hyperconverged Infrastructure
PDF
Javantura v3 - Microservice – no fluff the REAL stuff – Nakul Mishra
PDF
Low-latency real-time data processing at giga-scale with Kafka | John DesJard...
PDF
Safer Commutes & Streaming Data | George Padavick, Ohio Department of Transpo...
PDF
Joe witt may2015_kafka_nyc_apachenifi-overview
Kafka Migration for Satellite Event Streaming Data | Eric Velte, ASRC Federal
PaNDA - a platform for Network Data Analytics: an overview
PNDA - Platform for Network Data Analytics
Streaming real time data with Vibe Data Stream
Digital transformation: Highly resilient streaming architecture and strategie...
Kafka in the Enterprise—A Two-Year Journey to Build a Data Streaming Platform...
Shaping a Digital Vision
Stream processing IoT time series data with Kafka & InfluxDB | Al Sargent, In...
How a Data Mesh is Driving our Platform | Trey Hicks, Gloo
Kurt Schneider [Discover Financial] | How Discover Modernizes Observability w...
Apache Flink for IoT: How Event-Time Processing Enables Easy and Accurate Ana...
Streaming Data Lakes using Kafka Connect + Apache Hudi | Vinoth Chandar, Apac...
Digital Transformation in Healthcare with Kafka—Building a Low Latency Data P...
user Behavior Analysis with Session Windows and Apache Kafka's Streams API
Apache Spark vs Apache Spark: An On-Prem Comparison of Databricks and Open-So...
Insight into Hyperconverged Infrastructure
Javantura v3 - Microservice – no fluff the REAL stuff – Nakul Mishra
Low-latency real-time data processing at giga-scale with Kafka | John DesJard...
Safer Commutes & Streaming Data | George Padavick, Ohio Department of Transpo...
Joe witt may2015_kafka_nyc_apachenifi-overview
Ad

Viewers also liked (20)

PPTX
Nokia Big Data and Analytics
PPTX
Big Data Expo 2015 - Schiphol Big Data @ Schiphol
PDF
2016-05-30 Venia Legendi (CEITER): Luis Pablo Prieto
PPTX
Tecnologia
PDF
Alpha Bank – Property Xpress (PropertyXpress.com)
ODP
Twilightful Alphabetacy Chapter 1.2
PDF
Visión Artificial, Accesibilidad y Android
DOCX
El gran impacto de las redes sociales
PPTX
La lírica y la ópera
PDF
Kongsklide Supra vac 2000 parts catalog
PPTX
Curso fitoterapia
PDF
Bruno García.
PDF
Ebook Gatilhos Mentais - Armas de Vendas
PDF
La vida de una abeja
PPTX
Trabajo en clases informatica 17 05-2014
PPTX
Evento SugarCRM y Redes Sociales
PDF
PDF
Varsavsky
PDF
Escoex. Cómo disparar mi Productividad con las Nuevas Tecnologías
PPTX
Caligramas
Nokia Big Data and Analytics
Big Data Expo 2015 - Schiphol Big Data @ Schiphol
2016-05-30 Venia Legendi (CEITER): Luis Pablo Prieto
Tecnologia
Alpha Bank – Property Xpress (PropertyXpress.com)
Twilightful Alphabetacy Chapter 1.2
Visión Artificial, Accesibilidad y Android
El gran impacto de las redes sociales
La lírica y la ópera
Kongsklide Supra vac 2000 parts catalog
Curso fitoterapia
Bruno García.
Ebook Gatilhos Mentais - Armas de Vendas
La vida de una abeja
Trabajo en clases informatica 17 05-2014
Evento SugarCRM y Redes Sociales
Varsavsky
Escoex. Cómo disparar mi Productividad con las Nuevas Tecnologías
Caligramas
Ad

Similar to Cloud-Scale BGP and NetFlow Analysis (20)

PDF
Data Platform Architecture Principles and Evaluation Criteria
PPTX
Bitkom Cray presentation - on HPC affecting big data analytics in FS
PDF
Data-Driven Transformation: Leveraging Big Data at Showtime with Apache Spark
PDF
Analytics&IoT
PPTX
Splunk MINT for Mobile Intelligence and Splunk App for Stream for Enhanced Op...
PDF
Pivotal - Advanced Analytics for Telecommunications
PPS
Qo Introduction V2
PDF
Igniting Audience Measurement at Time Warner Cable
PPTX
What’s New: Splunk App for Stream and Splunk MINT
PPTX
SplunkLive! Munich 2018: Data Onboarding Overview
PDF
Cisco Analytics: Accelerate Network Optimization with Virtualization
PPTX
Big Data Analytics and Advanced Computer Networking Scenarios
PPTX
Sql 2017 net raf
PPTX
SplunkLive! Frankfurt 2018 - Data Onboarding Overview
PDF
Architecting Petabyte Scale AI Applications
PDF
Horses for Courses: Database Roundtable
PDF
Big data for Telco: opportunity or threat?
PPTX
Using Familiar BI Tools and Hadoop to Analyze Enterprise Networks
PPTX
Agile Gurugram 2023 | Observability for Modern Applications. How does it help...
Data Platform Architecture Principles and Evaluation Criteria
Bitkom Cray presentation - on HPC affecting big data analytics in FS
Data-Driven Transformation: Leveraging Big Data at Showtime with Apache Spark
Analytics&IoT
Splunk MINT for Mobile Intelligence and Splunk App for Stream for Enhanced Op...
Pivotal - Advanced Analytics for Telecommunications
Qo Introduction V2
Igniting Audience Measurement at Time Warner Cable
What’s New: Splunk App for Stream and Splunk MINT
SplunkLive! Munich 2018: Data Onboarding Overview
Cisco Analytics: Accelerate Network Optimization with Virtualization
Big Data Analytics and Advanced Computer Networking Scenarios
Sql 2017 net raf
SplunkLive! Frankfurt 2018 - Data Onboarding Overview
Architecting Petabyte Scale AI Applications
Horses for Courses: Database Roundtable
Big data for Telco: opportunity or threat?
Using Familiar BI Tools and Hadoop to Analyze Enterprise Networks
Agile Gurugram 2023 | Observability for Modern Applications. How does it help...

Recently uploaded (20)

PDF
Modernizing your data center with Dell and AMD
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
A Presentation on Artificial Intelligence
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Approach and Philosophy of On baking technology
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
cuic standard and advanced reporting.pdf
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Empathic Computing: Creating Shared Understanding
PPTX
Cloud computing and distributed systems.
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
Modernizing your data center with Dell and AMD
Network Security Unit 5.pdf for BCA BBA.
Reach Out and Touch Someone: Haptics and Empathic Computing
Digital-Transformation-Roadmap-for-Companies.pptx
A Presentation on Artificial Intelligence
Building Integrated photovoltaic BIPV_UPV.pdf
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Approach and Philosophy of On baking technology
“AI and Expert System Decision Support & Business Intelligence Systems”
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
cuic standard and advanced reporting.pdf
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
NewMind AI Weekly Chronicles - August'25 Week I
Spectral efficient network and resource selection model in 5G networks
Empathic Computing: Creating Shared Understanding
Cloud computing and distributed systems.
The Rise and Fall of 3GPP – Time for a Sabbatical?

Cloud-Scale BGP and NetFlow Analysis

  • 1. Cloud-Scale BGP and NetFlow Analysis Jim Frey, VP Product, Kentik Technologies December 15, 2015
  • 2. 2 • Common NetOps Stress points • Helpful Data Sets – NetFlow, BGP • Handling NetFlow and BGP at Cloud Scale • Kentik’s Approach • Wrap-Up / Q&A Agenda
  • 3. R R S S S S S R R S S S S S NetOps Stress Points: Needing Instant Answers How should I allocate my resources in the future? Does performance meet expectations? Is this an attack or legitimate traffic? Where in my network is the problem? Things You Need Answers to About/From Your Network $$$ $$$ $$$ X
  • 4. 4 • Accurate Visibility, Without Delay • Relevant Alerts: No False Positives or Negatives • Complete Data: Breadth + Depth • Fast/Flexible Data Exploration • Tools that don’t suck (time or $$) What We Hear…. To Address These Questions, NetOps Needs:
  • 5. 5 What Data Sets Can Help? And which ones can do the job cost effectively?
  • 6. 6 Primary Network Monitoring Data Choices Examples - SNMP, WMI Advantages - Ubiquitous - Good for monitoringdevice health/status/activity Disadvantages - Notraffic detail - Typically nofrequentthan every 5 minutes truly anti- real-time Polled Stats Examples - NetFlow, sFlow, IPFIX Advantages - Details on traffic src/dest/content, etc. - Very costeffective Disadvantages - NRT(near real-time)atbest - Incomplete app-layer detail - Limitedperformance metrics - Data volumes can be massive Flow Records Examples - Packets -> xFlow - Long term stream-to-disk Advantages - Mostcomplete app layer detail - True real-time (millisecondlvl) - Complete vendor independent Disadvantages - Expensive todeploy at scale - Requires network tapor SPAN - Packetcaptures can be massive Packet Inspection
  • 7. 7 Secondary Network Monitoring Data Choices Examples - Syslog Advantages - Continuous/streaming - Unique, device-specific info - True real-time Disadvantages - Nostandards – musthave very flexible search/mappingtools - Data volumes can be massive Log Records Examples - OSPF, IGRP, BGP Advantages - Details on traffic paths and provider volumes - Insights intoInternetfactors Disadvantages - Address data only – no awareness of traffic - Mustpeer with routers to get updates Routing/Path Data Examples - IP SLA, Independenttestsw Advantages - Assess functions/services 24x7 - Provides both availability and performance measures Disadvantages - Deploying/maintainingenough agents to achieve full coverage - Only an approximation of real user experience (atbest) Synthetic Agents
  • 8. 8 • You never know which data set will present the specific insights you need • The challenge (real magic) comes from correlating multiple datasets, i.e.: • Behavioral observations with configuration changes • Trends with underlying traffic details • Routing data with traffic data Key Assertion: Use Multiple Data Types for Best Results
  • 9. 9 For Providers • Recognizing newservice opportunities basedon subscriber(and peer) behavior • Optimizing peering relationships forcostcontrol For Web Services/ Commerce • Recognizing where yourcustomers are andhowtheyreach you • Managing peering relationships forbestcustomerexperience For Enterprise • Assessing howyourconnectivityproviders perform/compare • Building InternetIQ – howyou connect/relate to the outside world Why Correlate Routing Data with Traffic Data?
  • 10. 10 Cloud Scale for NetFlow and BGP: The Big Data Challenge Why can’t we just use our existing tools?
  • 11. Cloud, SaaS, Big Data Network traffic has grown exponentially; Legacy tools/tech haven’t kept pace. Result? Fragmented tools, visibility gaps, unanswered questions. Existing Tools: Falling Behind 10M 100M 1G 10G 100G
  • 12. 12 - Network Monitoring Data IS Big Data - Meets Volume/Variety/Velocity Test - Billions of records/day (millions/second) - Big Data architectures are considered best practices today for open/flexible correlation, analytics Why Big Data?
  • 13. 13 Existing solutions shortfalls: - Flexibility for moving between viewpoints and into full details - Data Completeness due to reliance on summarized/aggregated flow data - Speed: Generating new analysis in a timely manner Specific Challenges For NetFlow + BGP - Network Monitoring Data IS Big Data - Meets Volume/Variety/Velocity Test - Billions of records/day (millions/second) - Big Data architectures are considered best practices today for open/flexible correlation, analytics Why Big Data?
  • 14. 14 How to Get/Use Big Data Approach?
  • 15. 15 1. BYO – Build Your Own • Pick back end & reporting/analysis tools (open source = free?) • Procure operating platforms (hard, virtual, or cloud servers = $$) • Integrate, add data sources, and get it up and running (dev = $$) • Keep it up and running (ops/admin = $$) How to Get/Use Big Data Approach?
  • 16. 16 1. BYO – Build Your Own • Pick back end & reporting/analysis tools (open source = free?) • Procure operating platforms (hard, virtual, or cloud servers = $$) • Integrate, add data sources, and get it up and running (dev = $$) • Keep it up and running (ops/admin = $$) 2. Let SOMEONE ELSE build/optimize/operate • Subscribe to SaaS (ops $$) • Just Send Your Data and enjoy the ride! How to Get/Use Big Data Approach?
  • 17. 17 Kentik’s Answer How we address the Big Data challenge to meet the needs of Network Operators now
  • 18. Kentik Detect: the first and only SaaS Solution For Network Ops Management & Visibility at Terabit Scale CL OU D -B A S E D RE A L -TIM E M U LTI-TE N A N T OP E N G L OB A L Analyze & Take Action Big Data Network Telemetry Platform S S S R R The Network is the Sensor Web Portal Real-time & historical queries NetFlow/ sFlow/IPFIX SNMP BGP Alerts E-mail / Syslog / JSON Open API SQL / RESTful Kentik Data Engine
  • 19. Multi-tiered/Clustered for Scale / Load Balancing / HA, Hosted by Kentik What’s Behind the Kentik Data Engine POSTGRES SERVERS SQL DATA STORAGE CLUSTER NetFlow SNMP BGP INGEST CLUSTER CLIENTS N M Optimized forMassive DataIngest & Rapid Query Response
  • 22. 22 Traffic by Source Geography
  • 24. 24 AS Top Talkers and Drill Down Options
  • 25. 25 Peering Analytics: ASN by Dest Country Paths
  • 27. 27 Peering Analytics: Traffic by Origin AS (“Last Hop”)
  • 29. Key Takeaways: Cloud Scale NetFlow + BGP Why You Need It - Clear Insight into external/Internet network traffic behaviors - Improved customer/subscriber engagement - Reduced network operating costs Technical Path to Success - This is a big data problem, requiring high capacity/speed for data management, correlation, exploration, and analytics - SaaS solutions are a fully viable option
  • 30. Network Intelligence at Terabit Scale Thank You! Jim Frey VP Product KentikTechnologies jfrey@kentik.com @jfrey80