SlideShare a Scribd company logo
Team No 24
Integrating Network
Discovery and
Community Detection
Nikhil Daliya - 201301142
Athresh G - 201505565
Overview
Integrating network discovery and community
detection routines for nodes in the
given network and identifying the
characteristics of the nodes (constant or rapidly
changing) in the network.
Dataset
Railway dataset
Railway network, proposed by [Ghosh et al.
2011] consists of nodes representing railway
stations in India, where two stations si and sj
are connected by an edge if there exists at
least one train-route such that both si and sj
are scheduled halts on that route. Here the
communities are states/provinces of India since
the number of trains within each state is much
higher than the trains in-between two states.
Dataset
Football
Football network, proposed by [Girvan and
Newman 2002a] contains the network of
American football games between Division IA
colleges during the regular season of Fall 2000.
The vertices in the graph represent teams
(identified by their college names) and edges
represent regular-season games between the
two teams they connect.
Dataset
Football
The teams are divided into conferences
(indicating communities) containing around 8-
12 teams each. Games are more frequent
between members of the same conference
than between members of different
conferences. Teams that are geographically
close to one another but belong to different
conferences are more likely to play one another
than teams separated by large geographic
distances.
Application
●Exploring the adversarial networks(such as
terrorist networks).
●Clustering in social networks.
●Politeness policies on crawling website
makes it difficult to mine the whole network
on social networking sites. There are space
and bandwidth limits which put constraints
on the size of network that can be mined.
Challenges
● Dynamic discovery of the network imposes
problems in clustering of nodes .
● Identifying the characteristic of
nodes(constant , changing or rapidly changing)
is difficult problem.
●The dataset grows rapidly with network
discovery and keeping
track of probability distribution of each node for
different communities is
challenging task.
Tools Used
● Third party package
(​https://sites.google.com/site/santofortunato/in
thepress2​) for generating synthetic graphs as
input.
● Language to be used: Python and Java.
Packages such as panda, numpy, scikitlearn,
networkX and igraph will be used accordingly.
● matplotlib package for plotting the results for
better visualization and understanding.
Implementation
●We have used 2 modules mainly
ChooseNode which chooses node in each
iteration to be merged to the network and
UpdateCommunity which will update the
community or clusters from the choosen
node.
●Spectral clustering is applied on the initial
set of target nodes.
Implementation
During ChooseNode we use 2 measures to
choose the node for updation.
Ncut measure : minimize the similarity across a cut,
while simultaneously maximizing the similarity within
the same community.
Modularity : additional fraction of the edges that fall
within the given communities over the expected fraction
Implementation
I/P :
●Initial set of clustering , Initial network, cost
and budget.
O/P :
●Final network and nodes with clusters formed
from nodes we have discovered.
●List of rapidly changing nodes in the network.
Results and Analysis
- We have used Average Clustering Purity
(ACP) and Average Clustering Entropy (ACE)
to measure effectiveness of our algorithm.
- Both these measures incorporates the
fraction of nodes of particular cluster
belonging to same class as their measure.
Results and Analysis
Railway Dataset :
Total no. of target nodes : 80
Average cluster purity : 0.79
Average Cluster entropy : 0.17
Rapidly changing nodes : 6,47,84,91
Results and Analysis
Railway Dataset :
Results and Analysis
Railway Dataset :
Results and Analysis
Football Dataset :
Total no. of target nodes : 48
Average cluster purity : 0.91
Average Cluster entropy : 0.11
Changing nodes : 51 , 63 , 49
Results and Analysis
Football Dataset :
References
Research paper : On integrating Network and
Community Discovery
http://hanj.cs.illinois.edu/pdf/wsdm15_jliu.pdf
Thank You !!!!

More Related Content

PPTX
Community detection
PDF
Social network analysis basics
PPTX
Community detection algorithms
PDF
Community Detection in Social Media
PDF
Community detection in social networks[1]
PPT
Clique-based Network Clustering
PDF
Large graph analysis using g mine system
PDF
Scalable Similarity-Based Neighborhood Methods with MapReduce
Community detection
Social network analysis basics
Community detection algorithms
Community Detection in Social Media
Community detection in social networks[1]
Clique-based Network Clustering
Large graph analysis using g mine system
Scalable Similarity-Based Neighborhood Methods with MapReduce

What's hot (17)

PDF
An innovative approach to solve the network design problem concerning intelli...
PPTX
Election in Wireless Environment
PDF
Cross domain sentiment classification via spectral feature alignment
 
PDF
Propose a Method to Improve Performance in Grid Environment, Using Multi-Crit...
PDF
Community detection of political blogs network based on structure-attribute g...
PPTX
Recomendation system: Community Detection Based Recomendation System using Hy...
PDF
A Dependent Set Based Approach for Large Graph Analysis
PPTX
Using content and interactions for discovering communities in
PPTX
A technical paper presentation on Evaluation of Deep Learning techniques in S...
PDF
Community detection in graphs
PPTX
Load balancing
PDF
A NOBEL HYBRID APPROACH FOR EDGE DETECTION
PPTX
Ppt 2 d ploting k10998
PDF
Distributed and Cloud Computing 1st Edition Hwang Solutions Manual
PDF
TASK-DECOMPOSITION BASED ANOMALY DETECTION OF MASSIVE AND HIGH-VOLATILITY SES...
PDF
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
PDF
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
An innovative approach to solve the network design problem concerning intelli...
Election in Wireless Environment
Cross domain sentiment classification via spectral feature alignment
 
Propose a Method to Improve Performance in Grid Environment, Using Multi-Crit...
Community detection of political blogs network based on structure-attribute g...
Recomendation system: Community Detection Based Recomendation System using Hy...
A Dependent Set Based Approach for Large Graph Analysis
Using content and interactions for discovering communities in
A technical paper presentation on Evaluation of Deep Learning techniques in S...
Community detection in graphs
Load balancing
A NOBEL HYBRID APPROACH FOR EDGE DETECTION
Ppt 2 d ploting k10998
Distributed and Cloud Computing 1st Edition Hwang Solutions Manual
TASK-DECOMPOSITION BASED ANOMALY DETECTION OF MASSIVE AND HIGH-VOLATILITY SES...
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
Ad

Viewers also liked (13)

PDF
Kaseya Monitoring Suite Overview
PPTX
ADC/DAC
PDF
Remote Control Architecture: How We Are Building The World’s Fastest Remote C...
PDF
Reporting and Dashboards: The Present and Future Direction of VSA Reporting
PPT
OpManager Major Features
PPTX
Incident Response: SIEM
PPTX
LIquidity and Validity - Jan Gill
PPTX
Discovering Computers: Chapter 09
PPTX
OpManager Technical Overview
PPTX
Incident response: Advanced Network Forensics
PPTX
Network Traffic Analysis at a financial institution with 788 branches for 350...
PPTX
What's new in NetFlow Analyzer 12.2
PDF
Network discovery - Inside out by Aakash Goel
Kaseya Monitoring Suite Overview
ADC/DAC
Remote Control Architecture: How We Are Building The World’s Fastest Remote C...
Reporting and Dashboards: The Present and Future Direction of VSA Reporting
OpManager Major Features
Incident Response: SIEM
LIquidity and Validity - Jan Gill
Discovering Computers: Chapter 09
OpManager Technical Overview
Incident response: Advanced Network Forensics
Network Traffic Analysis at a financial institution with 788 branches for 350...
What's new in NetFlow Analyzer 12.2
Network discovery - Inside out by Aakash Goel
Ad

Similar to Integrating Network Discovery and Community Detection (IRE IIITH) Team 24 (20)

PDF
Greedy Incremental approach for unfolding of communities in massive networks
PDF
Community profiling for social networks
PDF
Overlapping community detection survey
PPTX
03 Communities in Networks (2017)
PDF
Community detection
PPTX
Discovering Overlapping Community Structure in Networks through Co-clustering
PDF
Mesoscale Structures in Networks
PDF
Mesoscale Structures in Networks - Mason A. Porter
PDF
thesis
PPTX
06 Community Detection
PPTX
Optimized Greedy Algorithm (OPA) for Graph Theory
PPTX
Data mining.pptx
PDF
FAST AND ACCURATE MINING THE COMMUNITY STRUCTURE: INTEGRATING CENTER LOCATING...
PDF
COMMUNITY DETECTION IN NETWORKS USING PAGE RANK VECTORS
PDF
Community Detection in Networks Using Page Rank Vectors
PDF
Community Detection in Networks Using Page Rank Vectors
PDF
Application Areas of Community Detection: A Review : NOTES
PDF
Community detection in social networks an overview
PDF
Node similarity
PPTX
Algorithm in Social network of graph and social network analysis
Greedy Incremental approach for unfolding of communities in massive networks
Community profiling for social networks
Overlapping community detection survey
03 Communities in Networks (2017)
Community detection
Discovering Overlapping Community Structure in Networks through Co-clustering
Mesoscale Structures in Networks
Mesoscale Structures in Networks - Mason A. Porter
thesis
06 Community Detection
Optimized Greedy Algorithm (OPA) for Graph Theory
Data mining.pptx
FAST AND ACCURATE MINING THE COMMUNITY STRUCTURE: INTEGRATING CENTER LOCATING...
COMMUNITY DETECTION IN NETWORKS USING PAGE RANK VECTORS
Community Detection in Networks Using Page Rank Vectors
Community Detection in Networks Using Page Rank Vectors
Application Areas of Community Detection: A Review : NOTES
Community detection in social networks an overview
Node similarity
Algorithm in Social network of graph and social network analysis

Recently uploaded (20)

PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PPTX
Sustainable Sites - Green Building Construction
PPTX
CH1 Production IntroductoryConcepts.pptx
PPTX
Construction Project Organization Group 2.pptx
PPTX
Safety Seminar civil to be ensured for safe working.
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PPTX
Internet of Things (IOT) - A guide to understanding
PPTX
Artificial Intelligence
PPTX
Lecture Notes Electrical Wiring System Components
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PDF
composite construction of structures.pdf
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
Sustainable Sites - Green Building Construction
CH1 Production IntroductoryConcepts.pptx
Construction Project Organization Group 2.pptx
Safety Seminar civil to be ensured for safe working.
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
Automation-in-Manufacturing-Chapter-Introduction.pdf
Internet of Things (IOT) - A guide to understanding
Artificial Intelligence
Lecture Notes Electrical Wiring System Components
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
composite construction of structures.pdf
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
UNIT-1 - COAL BASED THERMAL POWER PLANTS
Operating System & Kernel Study Guide-1 - converted.pdf
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...

Integrating Network Discovery and Community Detection (IRE IIITH) Team 24

  • 1. Team No 24 Integrating Network Discovery and Community Detection Nikhil Daliya - 201301142 Athresh G - 201505565
  • 2. Overview Integrating network discovery and community detection routines for nodes in the given network and identifying the characteristics of the nodes (constant or rapidly changing) in the network.
  • 3. Dataset Railway dataset Railway network, proposed by [Ghosh et al. 2011] consists of nodes representing railway stations in India, where two stations si and sj are connected by an edge if there exists at least one train-route such that both si and sj are scheduled halts on that route. Here the communities are states/provinces of India since the number of trains within each state is much higher than the trains in-between two states.
  • 4. Dataset Football Football network, proposed by [Girvan and Newman 2002a] contains the network of American football games between Division IA colleges during the regular season of Fall 2000. The vertices in the graph represent teams (identified by their college names) and edges represent regular-season games between the two teams they connect.
  • 5. Dataset Football The teams are divided into conferences (indicating communities) containing around 8- 12 teams each. Games are more frequent between members of the same conference than between members of different conferences. Teams that are geographically close to one another but belong to different conferences are more likely to play one another than teams separated by large geographic distances.
  • 6. Application ●Exploring the adversarial networks(such as terrorist networks). ●Clustering in social networks. ●Politeness policies on crawling website makes it difficult to mine the whole network on social networking sites. There are space and bandwidth limits which put constraints on the size of network that can be mined.
  • 7. Challenges ● Dynamic discovery of the network imposes problems in clustering of nodes . ● Identifying the characteristic of nodes(constant , changing or rapidly changing) is difficult problem. ●The dataset grows rapidly with network discovery and keeping track of probability distribution of each node for different communities is challenging task.
  • 8. Tools Used ● Third party package (​https://sites.google.com/site/santofortunato/in thepress2​) for generating synthetic graphs as input. ● Language to be used: Python and Java. Packages such as panda, numpy, scikitlearn, networkX and igraph will be used accordingly. ● matplotlib package for plotting the results for better visualization and understanding.
  • 9. Implementation ●We have used 2 modules mainly ChooseNode which chooses node in each iteration to be merged to the network and UpdateCommunity which will update the community or clusters from the choosen node. ●Spectral clustering is applied on the initial set of target nodes.
  • 10. Implementation During ChooseNode we use 2 measures to choose the node for updation. Ncut measure : minimize the similarity across a cut, while simultaneously maximizing the similarity within the same community. Modularity : additional fraction of the edges that fall within the given communities over the expected fraction
  • 11. Implementation I/P : ●Initial set of clustering , Initial network, cost and budget. O/P : ●Final network and nodes with clusters formed from nodes we have discovered. ●List of rapidly changing nodes in the network.
  • 12. Results and Analysis - We have used Average Clustering Purity (ACP) and Average Clustering Entropy (ACE) to measure effectiveness of our algorithm. - Both these measures incorporates the fraction of nodes of particular cluster belonging to same class as their measure.
  • 13. Results and Analysis Railway Dataset : Total no. of target nodes : 80 Average cluster purity : 0.79 Average Cluster entropy : 0.17 Rapidly changing nodes : 6,47,84,91
  • 16. Results and Analysis Football Dataset : Total no. of target nodes : 48 Average cluster purity : 0.91 Average Cluster entropy : 0.11 Changing nodes : 51 , 63 , 49
  • 18. References Research paper : On integrating Network and Community Discovery http://hanj.cs.illinois.edu/pdf/wsdm15_jliu.pdf