SlideShare a Scribd company logo
Searching	
  pa,erns	
  in	
  SNMP	
  data	
  

           Mehmet	
  Balman	
  and	
  Doron	
  Rotem	
  
Searching	
  pa,erns	
  in	
  SNMP	
  data	
  
•    Looked	
  at	
  two	
  busy	
  links:	
  	
  
      –  star-­‐sdn1/interface/xe-­‐7_3_0	
  
      –  denv-­‐cr2/interface/xe-­‐1_1_0	
  
      	
  
      	
  
      	
  
      	
  
      -­‐Generated	
  graphs	
  for	
  the	
  last	
  6	
  
      months	
  (including	
  graphs	
  per	
  month,	
  
      per	
  week	
  )	
  
      	
  
      -­‐	
  Visually	
  inspected	
  whether	
  there	
  is	
  
      any	
  Ime	
  related	
  specific	
  pa,ern	
  in	
  
      bandwidth	
  usage	
  
Searching	
  pa,erns	
  in	
  SNMP	
  data	
  
–  star-­‐sdn1/interface/xe-­‐7_3_0	
  
	
  star-­‐>wash	
  
	
  h6ps://sdm.lbl.gov/~balman/temp/1-­‐a/	
  
	
  
–  denv-­‐cr2/interface/xe-­‐1_1_0	
  
	
  
sunn-­‐>denv	
  
	
  
h6ps://sdm.lbl.gov/~balman/temp/1/	
  
	
  
Searching	
  pa,erns	
  in	
  SNMP	
  data	
  
•    Collected	
  data	
  for	
  those	
  two	
  links	
  (one	
  year	
  long)	
  and	
  tried	
  to	
  analyze	
  the	
  data	
  
     with	
  a	
  machine	
  learning	
  soMware	
  
•    Converted	
  data	
  into	
  arff	
  format	
  
•    Used	
  Weka	
  
•    Evaluated	
  the	
  bandwidth	
  vs.	
  Ime	
  data	
  (Ime	
  series	
  analysis)	
  to	
  see	
  whether	
  day	
  of	
  
     the	
  week,	
  PM	
  or	
  AM,	
  day	
  of	
  the	
  year,	
  etc.	
  have	
  any	
  visible	
  effect	
  on	
  bandwidth	
  
     usage	
  
Sunn-­‐>denv	
  
Star-­‐>wash	
  
APM project meeting - June 13, 2012 - LBNL, Berkeley, CA
Searching	
  pa,erns	
  in	
  SNMP	
  data	
  
•  Our	
  iniIal	
  results	
  on	
  Ime	
  series	
  predicIon	
  gave	
  	
  40-­‐50%	
  error	
  rate.	
  	
  	
  
•  By	
  using	
  some	
  other	
  techniques,	
  we	
  were	
  able	
  to	
  achieve	
  30-­‐40	
  %	
  
   error	
  rate.	
  

•  At	
  this	
  moment,	
  	
  taking	
  average	
  link	
  usage	
  may	
  be	
  a	
  reasonable	
  
   way	
  to	
  start	
  with.	
  

•  Further	
  study	
  is	
  required	
  to	
  make	
  useful	
  predicIons	
  
    –  Gretl	
  is	
  also	
  another	
  alternaIve	
  
    –  Using	
  R	
  instead	
  of	
  Weka	
  

More Related Content

PPTX
Route Stability Prediction Using Machine Learning Modelling of Route Table Fe...
DOCX
Network Flow Pattern Extraction by Clustering Eugine Kang
PDF
Icacci presentation-ssh traffic
PDF
Performance analysis and optimal cooperative cluster size for randomly distri...
PDF
Network tomography to enhance the performance of software defined network mon...
PDF
Icacci presentation-isi-ssh traffic
PDF
rscript_paper-1
PDF
Solve Production Allocation and Reconciliation Problems using the same Network
Route Stability Prediction Using Machine Learning Modelling of Route Table Fe...
Network Flow Pattern Extraction by Clustering Eugine Kang
Icacci presentation-ssh traffic
Performance analysis and optimal cooperative cluster size for randomly distri...
Network tomography to enhance the performance of software defined network mon...
Icacci presentation-isi-ssh traffic
rscript_paper-1
Solve Production Allocation and Reconciliation Problems using the same Network

What's hot (9)

PDF
Pregel - Paper Review
PDF
A New Bi-level Program Based on Unblocked Reliability for a Continuous Road N...
DOCX
AN INFORMATION THEORY-BASED FEATURE SELECTIONFRAMEWORK FOR BIG DATA UNDER APA...
PDF
PPTX
Domain research presentation Final
PDF
A Study on Hardware and Software Link Quality Metrics for Wireless Multimedia...
PDF
Data Retrieval Scheduling For Unsynchronized Channel in Wireless Broadcast Sy...
DOCX
EXPLOITING EFFICIENT AND SCALABLE SHUFFLE TRANSFERS IN FUTURE DATA CENTER NET...
PDF
MetOp Satellites Data Processing for Air Pollution Monitoring in Morocco
Pregel - Paper Review
A New Bi-level Program Based on Unblocked Reliability for a Continuous Road N...
AN INFORMATION THEORY-BASED FEATURE SELECTIONFRAMEWORK FOR BIG DATA UNDER APA...
Domain research presentation Final
A Study on Hardware and Software Link Quality Metrics for Wireless Multimedia...
Data Retrieval Scheduling For Unsynchronized Channel in Wireless Broadcast Sy...
EXPLOITING EFFICIENT AND SCALABLE SHUFFLE TRANSFERS IN FUTURE DATA CENTER NET...
MetOp Satellites Data Processing for Air Pollution Monitoring in Morocco
Ad

Viewers also liked (9)

PPT
Navidad Y Belenes Hortaleza 2009
PPTX
Mumiy troll
PPTX
Uniformity PITCH: DOES IT REALLY MATTER?
PPTX
Jeremiah leaves and trees
PPT
2004 Metro Dortmund Bahnazubis Neu
PPS
生命教育(洪蘭)
PDF
Sintesis informativa 01 08 2012
PDF
Cartelloni e loghi
PDF
Avda+escoles
Navidad Y Belenes Hortaleza 2009
Mumiy troll
Uniformity PITCH: DOES IT REALLY MATTER?
Jeremiah leaves and trees
2004 Metro Dortmund Bahnazubis Neu
生命教育(洪蘭)
Sintesis informativa 01 08 2012
Cartelloni e loghi
Avda+escoles
Ad

Similar to APM project meeting - June 13, 2012 - LBNL, Berkeley, CA (20)

PPTX
Large scale social networks analysis joclad 2013
PPTX
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
PPTX
TAO Refresh - Automation of Data Spike Flagging Quality
PPTX
Building Electricity Demand Forecasting
PDF
Data mining projects topics for java and dot net
PPTX
Aug2013 bioinformatics working group
PDF
IRJET- Univariate Time Series Prediction of Reservoir Inflow using Artifi...
PPTX
EFFICIENT DATA EXTRACTION USING ARTIFICIAL INTELLIGENCE
PDF
Common Design Elements for Data Movement Eli Dart
PDF
YAPC2007 Remote System Monitoring (w. Notes)
PDF
Cross Domain Data Fusion
PPT
Moving Towards a Streaming Architecture
PDF
Internet ttraffic monitering anomalous behiviour detection
PPTX
FME-Enabled Address Management Ecosystem in Arizona - A Technical Introductio...
DOC
TransPAC2 Workplan - Measurement (v9)
PPT
A brief introduction to 'R' statistical package
PDF
How to do accurate RE forecasting & scheduling
PDF
CROP YIELD PREDICTION USING ARTIFICIAL NEURAL NETWORK
PDF
Use of Spark MLib for Predicting the Offlining of Digital Media-(Christopher ...
PDF
The Earth System Modeling Framework
Large scale social networks analysis joclad 2013
Hadoop for High-Performance Climate Analytics - Use Cases and Lessons Learned
TAO Refresh - Automation of Data Spike Flagging Quality
Building Electricity Demand Forecasting
Data mining projects topics for java and dot net
Aug2013 bioinformatics working group
IRJET- Univariate Time Series Prediction of Reservoir Inflow using Artifi...
EFFICIENT DATA EXTRACTION USING ARTIFICIAL INTELLIGENCE
Common Design Elements for Data Movement Eli Dart
YAPC2007 Remote System Monitoring (w. Notes)
Cross Domain Data Fusion
Moving Towards a Streaming Architecture
Internet ttraffic monitering anomalous behiviour detection
FME-Enabled Address Management Ecosystem in Arizona - A Technical Introductio...
TransPAC2 Workplan - Measurement (v9)
A brief introduction to 'R' statistical package
How to do accurate RE forecasting & scheduling
CROP YIELD PREDICTION USING ARTIFICIAL NEURAL NETWORK
Use of Spark MLib for Predicting the Offlining of Digital Media-(Christopher ...
The Earth System Modeling Framework

More from balmanme (20)

PDF
Network-aware Data Management for Large Scale Distributed Applications, IBM R...
PDF
Network-aware Data Management for High Throughput Flows Akamai, Cambridge, ...
PDF
Hpcwire100gnetworktosupportbigscience 130725203822-phpapp01-1
PDF
Experiences with High-bandwidth Networks
PDF
A 100 gigabit highway for science: researchers take a 'test drive' on ani tes...
PDF
Balman stork cw09
PDF
Available technologies: algorithm for flexible bandwidth reservations for dat...
PDF
Berkeley lab team develops flexible reservation algorithm for advance network...
PDF
Dynamic adaptation balman
PDF
Nersc dtn-perf-100121.test_results-nercmeeting-jan21-2010
PDF
Cybertools stork-2009-cybertools allhandmeeting-poster
PDF
Presentation summerstudent 2009-aug09-lbl-summer
PDF
Lblc sseminar jun09-2009-jun09-lblcsseminar
PDF
Presentation southernstork 2009-nov-southernworkshop
PDF
Balman dissertation Copyright @ 2010 Mehmet Balman
PDF
Aug17presentation.v2 2009-aug09-lblc sseminar
PDF
Pdcs2010 balman-presentation
PDF
Analyzing Data Movements and Identifying Techniques for Next-generation Networks
PDF
MemzNet: Memory-Mapped Zero-copy Network Channel -- Streaming exascala data o...
PDF
Opening ndm2012 sc12
Network-aware Data Management for Large Scale Distributed Applications, IBM R...
Network-aware Data Management for High Throughput Flows Akamai, Cambridge, ...
Hpcwire100gnetworktosupportbigscience 130725203822-phpapp01-1
Experiences with High-bandwidth Networks
A 100 gigabit highway for science: researchers take a 'test drive' on ani tes...
Balman stork cw09
Available technologies: algorithm for flexible bandwidth reservations for dat...
Berkeley lab team develops flexible reservation algorithm for advance network...
Dynamic adaptation balman
Nersc dtn-perf-100121.test_results-nercmeeting-jan21-2010
Cybertools stork-2009-cybertools allhandmeeting-poster
Presentation summerstudent 2009-aug09-lbl-summer
Lblc sseminar jun09-2009-jun09-lblcsseminar
Presentation southernstork 2009-nov-southernworkshop
Balman dissertation Copyright @ 2010 Mehmet Balman
Aug17presentation.v2 2009-aug09-lblc sseminar
Pdcs2010 balman-presentation
Analyzing Data Movements and Identifying Techniques for Next-generation Networks
MemzNet: Memory-Mapped Zero-copy Network Channel -- Streaming exascala data o...
Opening ndm2012 sc12

Recently uploaded (20)

PDF
Univ-Connecticut-ChatGPT-Presentaion.pdf
PDF
Encapsulation theory and applications.pdf
PPTX
Chapter 5: Probability Theory and Statistics
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Hindi spoken digit analysis for native and non-native speakers
PDF
Enhancing emotion recognition model for a student engagement use case through...
PDF
Getting Started with Data Integration: FME Form 101
PDF
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PDF
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
PPTX
A Presentation on Artificial Intelligence
PPTX
Tartificialntelligence_presentation.pptx
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
PDF
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PDF
A comparative analysis of optical character recognition models for extracting...
Univ-Connecticut-ChatGPT-Presentaion.pdf
Encapsulation theory and applications.pdf
Chapter 5: Probability Theory and Statistics
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Hindi spoken digit analysis for native and non-native speakers
Enhancing emotion recognition model for a student engagement use case through...
Getting Started with Data Integration: FME Form 101
DASA ADMISSION 2024_FirstRound_FirstRank_LastRank.pdf
gpt5_lecture_notes_comprehensive_20250812015547.pdf
Transform Your ITIL® 4 & ITSM Strategy with AI in 2025.pdf
A Presentation on Artificial Intelligence
Tartificialntelligence_presentation.pptx
1 - Historical Antecedents, Social Consideration.pdf
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Building Integrated photovoltaic BIPV_UPV.pdf
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Video forgery: An extensive analysis of inter-and intra-frame manipulation al...
Microsoft Solutions Partner Drive Digital Transformation with D365.pdf
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
A comparative analysis of optical character recognition models for extracting...

APM project meeting - June 13, 2012 - LBNL, Berkeley, CA

  • 1. Searching  pa,erns  in  SNMP  data   Mehmet  Balman  and  Doron  Rotem  
  • 2. Searching  pa,erns  in  SNMP  data   •  Looked  at  two  busy  links:     –  star-­‐sdn1/interface/xe-­‐7_3_0   –  denv-­‐cr2/interface/xe-­‐1_1_0           -­‐Generated  graphs  for  the  last  6   months  (including  graphs  per  month,   per  week  )     -­‐  Visually  inspected  whether  there  is   any  Ime  related  specific  pa,ern  in   bandwidth  usage  
  • 3. Searching  pa,erns  in  SNMP  data   –  star-­‐sdn1/interface/xe-­‐7_3_0    star-­‐>wash    h6ps://sdm.lbl.gov/~balman/temp/1-­‐a/     –  denv-­‐cr2/interface/xe-­‐1_1_0     sunn-­‐>denv     h6ps://sdm.lbl.gov/~balman/temp/1/    
  • 4. Searching  pa,erns  in  SNMP  data   •  Collected  data  for  those  two  links  (one  year  long)  and  tried  to  analyze  the  data   with  a  machine  learning  soMware   •  Converted  data  into  arff  format   •  Used  Weka   •  Evaluated  the  bandwidth  vs.  Ime  data  (Ime  series  analysis)  to  see  whether  day  of   the  week,  PM  or  AM,  day  of  the  year,  etc.  have  any  visible  effect  on  bandwidth   usage  
  • 8. Searching  pa,erns  in  SNMP  data   •  Our  iniIal  results  on  Ime  series  predicIon  gave    40-­‐50%  error  rate.       •  By  using  some  other  techniques,  we  were  able  to  achieve  30-­‐40  %   error  rate.   •  At  this  moment,    taking  average  link  usage  may  be  a  reasonable   way  to  start  with.   •  Further  study  is  required  to  make  useful  predicIons   –  Gretl  is  also  another  alternaIve   –  Using  R  instead  of  Weka