SlideShare a Scribd company logo
ECWAY TECHNOLOGIES
IEEE PROJECTS & SOFTWARE DEVELOPMENTS
OUR OFFICES @ CHENNAI / TRICHY / KARUR / ERODE / MADURAI / SALEM / COIMBATORE
CELL: +91 98949 17187, +91 875487 2111 / 3111 / 4111 / 5111 / 6111
VISIT: www.ecwayprojects.com MAIL TO: ecwaytechnologies@gmail.com

CLUSTERING LARGE PROBABILISTIC GRAPHS

ABSTRACT:

We study the problem of clustering probabilistic graphs. Similar to the problem of clustering
standard graphs, probabilistic graph clustering has numerous applications, such as finding
complexes in probabilistic protein-protein interaction (PPI) networks and discovering groups of
users in affiliation networks.

We extend the edit-distance-based definition of graph clustering to probabilistic graphs. We
establish a connection between our objective function and correlation clustering to propose
practical approximation algorithms for our problem. A benefit of our approach is that our
objective function is parameter-free. Therefore, the number of clusters is part of the output.

We develop methods for testing the statistical significance of the output clustering and study the
case of noisy clusterings. Using a real protein-protein interaction network and ground-truth data,
we show that our methods discover the correct number of clusters and identify established
protein relationships. Finally, we show the practicality of our techniques using a large social
network of Yahoo! users consisting of one billion edges.

More Related Content

PPTX
Application's of Numerical Math in CSE
PDF
Master's degree thesis testing algorithms for image & video understanding
PPTX
Finding Maximum Edge Biclique in Bipartite Networks by Integer Programming
PPTX
Learning to learn with meta learning
PDF
Bellman Equation in Dynamic Programming
DOCX
CONFLICT-AWARE WEIGHTED BIPARTITE B-MATCHING AND ITS APPLICATION TO E-COMMERCE
PPTX
Numerical Integral using NNI
Application's of Numerical Math in CSE
Master's degree thesis testing algorithms for image & video understanding
Finding Maximum Edge Biclique in Bipartite Networks by Integer Programming
Learning to learn with meta learning
Bellman Equation in Dynamic Programming
CONFLICT-AWARE WEIGHTED BIPARTITE B-MATCHING AND ITS APPLICATION TO E-COMMERCE
Numerical Integral using NNI

What's hot (8)

PDF
Fast activity detection indexing for temporal stochastic automaton based acti...
PDF
Ideas on Machine Learning Interpretability
PPTX
PPT
Dexa2007 Orsi V1.5
PDF
MediaEval 2015 - Geo_ML @ MediaEval Placing Task 2015
PDF
Introduction to Model-Based Machine Learning
PPT
1 00-introduction to computer graphics
PDF
Slides - Summary of: "Automating Data Preparation: Can We? Should We? Must We?"
Fast activity detection indexing for temporal stochastic automaton based acti...
Ideas on Machine Learning Interpretability
Dexa2007 Orsi V1.5
MediaEval 2015 - Geo_ML @ MediaEval Placing Task 2015
Introduction to Model-Based Machine Learning
1 00-introduction to computer graphics
Slides - Summary of: "Automating Data Preparation: Can We? Should We? Must We?"
Ad

Viewers also liked (19)

DOCX
Nomenclatura quimica.
PDF
ใบงานแบบสำรวจและประวัติ
DOCX
DR TOOSI REFERENCE LETTER
PDF
MONIKA BISSELL REFERENCE LETTER
PPTX
PDF
συγγραμμα για την φιλοσοφικη συμβουλευτικη
PPTX
GHEI Poster (1)
PDF
Queens College PDF
PPTX
ClimateLaunchpad_LT
PDF
Nacion las piedras formulario inscripcion autoridades_tucuman
DOCX
Madison Hoyle's Resume
PDF
How to make people listen
PDF
επισκόπηση - φιλοσοφικη συμβουλετικη
PDF
Getting started
PPTX
Hardware y-software
PPTX
Tablets
PDF
Thanks for visiting my page!
PPTX
DOCX
Nomenclatura quimica.
ใบงานแบบสำรวจและประวัติ
DR TOOSI REFERENCE LETTER
MONIKA BISSELL REFERENCE LETTER
συγγραμμα για την φιλοσοφικη συμβουλευτικη
GHEI Poster (1)
Queens College PDF
ClimateLaunchpad_LT
Nacion las piedras formulario inscripcion autoridades_tucuman
Madison Hoyle's Resume
How to make people listen
επισκόπηση - φιλοσοφικη συμβουλετικη
Getting started
Hardware y-software
Tablets
Thanks for visiting my page!
Ad

Similar to Clustering large probabilistic graphs (20)

PDF
Clustering large probabilistic graphs
PDF
Estimating project development effort using clustered regression approach
PDF
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
PDF
A Literature Survey on Image Linguistic Visual Question Answering
PDF
A Machine learning based framework for Verification and Validation of Massive...
PDF
50120130406017
PDF
MultiObjective(11) - Copy
PDF
IEEE Pattern analysis and machine intelligence 2016 Title and Abstract
PDF
IEEE Datamining 2016 Title and Abstract
PPTX
K anonymity for crowdsourcing database
DOCX
BULK IEEE PROJECTS IN MATLAB ,BULK IEEE PROJECTS, IEEE 2015-16 MATLAB PROJEC...
DOCX
final year ieee pojects in pondicherry,bulk ieee projects ,bulk 2015-16 i...
PDF
IRJET- E-MORES: Efficient Multiple Output Regression for Streaming Data
PDF
A simplified predictive framework for cost evaluation to fault assessment usi...
PDF
IRJET- Analysis of Vehicle Number Plate Recognition
PDF
Deepcoder to Self-Code with Machine Learning
PDF
algorithms
PDF
Demonstrated Deep Learning Techniques for the Resolution of CAPTCHA images
PDF
Partial Object Detection in Inclined Weather Conditions
PDF
Comparative Study of Pre-Trained Neural Network Models in Detection of Glaucoma
Clustering large probabilistic graphs
Estimating project development effort using clustered regression approach
ESTIMATING PROJECT DEVELOPMENT EFFORT USING CLUSTERED REGRESSION APPROACH
A Literature Survey on Image Linguistic Visual Question Answering
A Machine learning based framework for Verification and Validation of Massive...
50120130406017
MultiObjective(11) - Copy
IEEE Pattern analysis and machine intelligence 2016 Title and Abstract
IEEE Datamining 2016 Title and Abstract
K anonymity for crowdsourcing database
BULK IEEE PROJECTS IN MATLAB ,BULK IEEE PROJECTS, IEEE 2015-16 MATLAB PROJEC...
final year ieee pojects in pondicherry,bulk ieee projects ,bulk 2015-16 i...
IRJET- E-MORES: Efficient Multiple Output Regression for Streaming Data
A simplified predictive framework for cost evaluation to fault assessment usi...
IRJET- Analysis of Vehicle Number Plate Recognition
Deepcoder to Self-Code with Machine Learning
algorithms
Demonstrated Deep Learning Techniques for the Resolution of CAPTCHA images
Partial Object Detection in Inclined Weather Conditions
Comparative Study of Pre-Trained Neural Network Models in Detection of Glaucoma

More from Ecwaytechnoz (20)

PPTX
Wheelztracker.pptx
PDF
Coloring based inter-wban scheduling for mobile wireless body area networks
DOC
Code modulation based encryption & decryption technique for secure communicat...
PDF
Clustering sentence level text using a novel fuzzy relational clustering algo...
PDF
Cloudsim t-drive enhancing driving directions with taxi drivers’ intelligence
PDF
Cloudsim ranking on data manifold with sink points
PDF
Cloudsim quality-differentiated video multicast in multirate wireless networks
PDF
Cloudsim power allocation for statistical qo s provisioning in opportunistic...
PDF
Cloudsim distributed web systems performance forecasting using turning bands...
PDF
Cloudsim distributed processing of probabilistic top-k queries in wireless s...
DOCX
Civil 2013 titles
DOC
Chopper based dc motor speed control
PDF
Channel assignment for throughput optimization in multichannel multiradio wir...
PDF
Channel allocation and routing in hybrid multichannel multiradio wireless mes...
PDF
Casual stereoscopic photo authoring
DOCX
Casual stereoscopic photo authoring
PDF
Capacity of hybrid wireless mesh networks with random a ps
DOC
Bomb detection robot with wireless camera
DOC
Bed side patients monitoring system with emergency alert
PDF
Autonomous sensing order selection strategies exploiting channel access infor...
Wheelztracker.pptx
Coloring based inter-wban scheduling for mobile wireless body area networks
Code modulation based encryption & decryption technique for secure communicat...
Clustering sentence level text using a novel fuzzy relational clustering algo...
Cloudsim t-drive enhancing driving directions with taxi drivers’ intelligence
Cloudsim ranking on data manifold with sink points
Cloudsim quality-differentiated video multicast in multirate wireless networks
Cloudsim power allocation for statistical qo s provisioning in opportunistic...
Cloudsim distributed web systems performance forecasting using turning bands...
Cloudsim distributed processing of probabilistic top-k queries in wireless s...
Civil 2013 titles
Chopper based dc motor speed control
Channel assignment for throughput optimization in multichannel multiradio wir...
Channel allocation and routing in hybrid multichannel multiradio wireless mes...
Casual stereoscopic photo authoring
Casual stereoscopic photo authoring
Capacity of hybrid wireless mesh networks with random a ps
Bomb detection robot with wireless camera
Bed side patients monitoring system with emergency alert
Autonomous sensing order selection strategies exploiting channel access infor...

Clustering large probabilistic graphs

  • 1. ECWAY TECHNOLOGIES IEEE PROJECTS & SOFTWARE DEVELOPMENTS OUR OFFICES @ CHENNAI / TRICHY / KARUR / ERODE / MADURAI / SALEM / COIMBATORE CELL: +91 98949 17187, +91 875487 2111 / 3111 / 4111 / 5111 / 6111 VISIT: www.ecwayprojects.com MAIL TO: ecwaytechnologies@gmail.com CLUSTERING LARGE PROBABILISTIC GRAPHS ABSTRACT: We study the problem of clustering probabilistic graphs. Similar to the problem of clustering standard graphs, probabilistic graph clustering has numerous applications, such as finding complexes in probabilistic protein-protein interaction (PPI) networks and discovering groups of users in affiliation networks. We extend the edit-distance-based definition of graph clustering to probabilistic graphs. We establish a connection between our objective function and correlation clustering to propose practical approximation algorithms for our problem. A benefit of our approach is that our objective function is parameter-free. Therefore, the number of clusters is part of the output. We develop methods for testing the statistical significance of the output clustering and study the case of noisy clusterings. Using a real protein-protein interaction network and ground-truth data, we show that our methods discover the correct number of clusters and identify established protein relationships. Finally, we show the practicality of our techniques using a large social network of Yahoo! users consisting of one billion edges.