SlideShare a Scribd company logo
Jun Liu ; Feng Liu and Ansari, N. 
Beijing Univ. of Posts & Telecommun., Beijing, China 
IEEE Network • July/August 2014 
Advisor : Dr. Jenq-Shiou Leu 
Student : Chia-Yun Chan 
Date : 2014/12/09
Introduction 
System Architecture 
Traffic Analysis Algorithms 
Experimental Results 
Conclusions
Network traffic monitoring and analysis is significance 
for optimizing network resource and improving user 
experience 
Existing solutions usually rely on a high-performance 
server with large storage capacity, are not scalable for 
detailed analysis of big traffic data
The features of Hadoop 
Distributed parallel computing 
Low-cost scale-out capability 
High fault tolerance 
But some important issues in large-scale commercial 
telecommunication networks have not been solved
Monitoring and Analyzing Big Traffic Data of a Large-Scale Cellular Network with Hadoop
Application-layer analysis 
Web service provider analysis 
User behavior analysis
Monitoring and Analyzing Big Traffic Data of a Large-Scale Cellular Network with Hadoop
Monitoring and Analyzing Big Traffic Data of a Large-Scale Cellular Network with Hadoop
d 
30 30 
80 54 
a 
e 
90 
b 
c 
120 
60 
80 
100 
200 
64 
20 10
We develop a three-step algorithm: 
1. Measuring affinity 
2. Sparsifying a graph 
3. Identifying communities
Mobile operators want to know the user behaviors of 
cellular devices including models, prices, and features 
We design a novel Jaccard-based learning method to 
build a cellular device model database 
1. Extract all keywords of a device model 
2. Filter candidate keywords 
3. Calculate the Jaccard coefficient index using statistical 
information, and select the keyword with the highest 
Jaccard index to represent the device model
Monitoring and Analyzing Big Traffic Data of a Large-Scale Cellular Network with Hadoop
Monitoring and Analyzing Big Traffic Data of a Large-Scale Cellular Network with Hadoop
Monitoring and Analyzing Big Traffic Data of a Large-Scale Cellular Network with Hadoop
Monitoring and Analyzing Big Traffic Data of a Large-Scale Cellular Network with Hadoop
Monitoring and Analyzing Big Traffic Data of a Large-Scale Cellular Network with Hadoop
A novel system for monitoring and analyzing large-scale 
network traffic data 
Designed algorithms and implemented MapReduce 
programs for network traffic analysis from different 
perspectives 
Revealed a number of network traffic and user 
behavior phenomena not shown before
Monitoring and Analyzing Big Traffic Data of a Large-Scale Cellular Network with Hadoop

More Related Content

PPTX
TRAFFIC DATA ANALYSIS USING HADOOP
PPTX
Traffic data analysis using HADOOP
PDF
PDF
Traffic Profiles and Management for Support of Community Networks
PDF
ShibiaoNong_Resume_ColumbiaMS (1)
PPTX
Actibump resultat
PPT
Tweeting hadoop
PPT
Hadoop
TRAFFIC DATA ANALYSIS USING HADOOP
Traffic data analysis using HADOOP
Traffic Profiles and Management for Support of Community Networks
ShibiaoNong_Resume_ColumbiaMS (1)
Actibump resultat
Tweeting hadoop
Hadoop

Viewers also liked (13)

PDF
Hadoop Network Performance profile
PPTX
Accelerating Apache Hadoop through High-Performance Networking and I/O Techno...
PPTX
Performing Network & Security Analytics with Hadoop
PDF
Deploying pNFS over Distributed File Storage w/ Jiffin Tony Thottan and Niels...
PPTX
ahepburn MDES PRES2 Production Tech Its only a Comic
ODP
Kkeithley ufonfs-gluster summit
PPT
The Big Traffic
PPTX
Network for the Large-scale Hadoop cluster at Yahoo! JAPAN
PPT
Hadoop World 2011: Hadoop Network and Compute Architecture Considerations - J...
PPT
Solving Big Data Problems
PPT
Hadoop Security Architecture
PPT
Hadoop Monitoring best Practices
PPT
Free Download Powerpoint Slides
Hadoop Network Performance profile
Accelerating Apache Hadoop through High-Performance Networking and I/O Techno...
Performing Network & Security Analytics with Hadoop
Deploying pNFS over Distributed File Storage w/ Jiffin Tony Thottan and Niels...
ahepburn MDES PRES2 Production Tech Its only a Comic
Kkeithley ufonfs-gluster summit
The Big Traffic
Network for the Large-scale Hadoop cluster at Yahoo! JAPAN
Hadoop World 2011: Hadoop Network and Compute Architecture Considerations - J...
Solving Big Data Problems
Hadoop Security Architecture
Hadoop Monitoring best Practices
Free Download Powerpoint Slides
Ad

Similar to Monitoring and Analyzing Big Traffic Data of a Large-Scale Cellular Network with Hadoop (10)

PDF
A Research Framework for the Clean-Slate Design of Next-Generation Optical Ac...
PDF
T rec-e.503-198811-s!!pdf-e
PDF
ADAPTIVE An Object-Oriented Framework for Flexible and Adaptive Communicatio...
DOCX
Proposal for System Analysis and Desing
PDF
SzaboGeza_disszertacio
PDF
Approximation of regression-based fault minimization for network traffic
PDF
Teletraffic engineering handbook
PDF
THE DEVELOPMENT AND STUDY OF THE METHODS AND ALGORITHMS FOR THE CLASSIFICATIO...
PPT
big data analytics in mobile cellular network
PDF
Study and development of methods and tools for testing, validation and verif...
A Research Framework for the Clean-Slate Design of Next-Generation Optical Ac...
T rec-e.503-198811-s!!pdf-e
ADAPTIVE An Object-Oriented Framework for Flexible and Adaptive Communicatio...
Proposal for System Analysis and Desing
SzaboGeza_disszertacio
Approximation of regression-based fault minimization for network traffic
Teletraffic engineering handbook
THE DEVELOPMENT AND STUDY OF THE METHODS AND ALGORITHMS FOR THE CLASSIFICATIO...
big data analytics in mobile cellular network
Study and development of methods and tools for testing, validation and verif...
Ad

Recently uploaded (20)

PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Empathic Computing: Creating Shared Understanding
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Machine learning based COVID-19 study performance prediction
PPTX
Big Data Technologies - Introduction.pptx
PDF
Assigned Numbers - 2025 - Bluetooth® Document
PPTX
Cloud computing and distributed systems.
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPT
Teaching material agriculture food technology
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
gpt5_lecture_notes_comprehensive_20250812015547.pdf
PPTX
A Presentation on Artificial Intelligence
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Advanced methodologies resolving dimensionality complications for autism neur...
Review of recent advances in non-invasive hemoglobin estimation
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Reach Out and Touch Someone: Haptics and Empathic Computing
Empathic Computing: Creating Shared Understanding
MYSQL Presentation for SQL database connectivity
Machine learning based COVID-19 study performance prediction
Big Data Technologies - Introduction.pptx
Assigned Numbers - 2025 - Bluetooth® Document
Cloud computing and distributed systems.
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Teaching material agriculture food technology
Encapsulation_ Review paper, used for researhc scholars
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
gpt5_lecture_notes_comprehensive_20250812015547.pdf
A Presentation on Artificial Intelligence
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
The AUB Centre for AI in Media Proposal.docx
Programs and apps: productivity, graphics, security and other tools
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...

Monitoring and Analyzing Big Traffic Data of a Large-Scale Cellular Network with Hadoop

  • 1. Jun Liu ; Feng Liu and Ansari, N. Beijing Univ. of Posts & Telecommun., Beijing, China IEEE Network • July/August 2014 Advisor : Dr. Jenq-Shiou Leu Student : Chia-Yun Chan Date : 2014/12/09
  • 2. Introduction System Architecture Traffic Analysis Algorithms Experimental Results Conclusions
  • 3. Network traffic monitoring and analysis is significance for optimizing network resource and improving user experience Existing solutions usually rely on a high-performance server with large storage capacity, are not scalable for detailed analysis of big traffic data
  • 4. The features of Hadoop Distributed parallel computing Low-cost scale-out capability High fault tolerance But some important issues in large-scale commercial telecommunication networks have not been solved
  • 6. Application-layer analysis Web service provider analysis User behavior analysis
  • 9. d 30 30 80 54 a e 90 b c 120 60 80 100 200 64 20 10
  • 10. We develop a three-step algorithm: 1. Measuring affinity 2. Sparsifying a graph 3. Identifying communities
  • 11. Mobile operators want to know the user behaviors of cellular devices including models, prices, and features We design a novel Jaccard-based learning method to build a cellular device model database 1. Extract all keywords of a device model 2. Filter candidate keywords 3. Calculate the Jaccard coefficient index using statistical information, and select the keyword with the highest Jaccard index to represent the device model
  • 17. A novel system for monitoring and analyzing large-scale network traffic data Designed algorithms and implemented MapReduce programs for network traffic analysis from different perspectives Revealed a number of network traffic and user behavior phenomena not shown before

Editor's Notes

  • #2: 監測和分析與Hadoop的大型蜂窩網絡的大流量數據
  • #4: 網絡流量監測和分析是優化網絡資源,提升用戶體驗的意義。 現有的解決方案通常依賴具有大存儲容量的高性能服務器上,都沒有可擴展為大量的業務數據的詳細分析。
  • #5: Hadoop的具有幾個重要特點:高效分散平行運算,低成本的向外擴展的能力,和高容錯性。 HADOOP用於分析網絡流量的數據,一些重要的在大規模商用的電信網絡問題還沒有得到解決。
  • #12: 移動運營商希望了解移動設備,包括型號,價格和功能的用戶行為 我們設計了一種新的杰卡德為基礎的學習方法來建立一個蜂窩設備模型數據庫 1.提取有關的器件模型描述的所有關鍵字。 2.篩選候選關鍵字,通過評估每個關鍵字和設備型號之間的條件概率值。 3.使用的統計信息,計算杰卡德係數索引,並選擇具有最高的Jaccard指數來表示該設備模型的關鍵字。
  • #18: 一種新的系統,用於監測和分析大規模網絡流量的數據。 從不同的角度的網絡流量分析算法設計並實現了MapReduce程序 使我們能夠揭示了一些之前未顯示網絡流量和用戶行為的現象。