SlideShare a Scribd company logo
ECWAY TECHNOLOGIES
IEEE PROJECTS & SOFTWARE DEVELOPMENTS
OUR OFFICES @ CHENNAI / TRICHY / KARUR / ERODE / MADURAI / SALEM / COIMBATORE
CELL: +91 98949 17187, +91 875487 2111 / 3111 / 4111 / 5111 / 6111
VISIT: www.ecwayprojects.com MAIL TO: ecwaytechnologies@gmail.com

CLUSTERING SENTENCE-LEVEL TEXT USING A NOVEL FUZZY
RELATIONAL CLUSTERING ALGORITHM

ABSTRACT:

In comparison with hard clustering methods, in which a pattern belongs to a single cluster, fuzzy
clustering algorithms allow patterns to belong to all clusters with differing degrees of
membership. This is important in domains such as sentence clustering, since a sentence is likely
to be related to more than one theme or topic present within a document or set of documents.
However, because most sentence similarity measures do not represent sentences in a common
metric space, conventional fuzzy clustering approaches based on prototypes or mixtures of
Gaussians are generally not applicable to sentence clustering.

This paper presents a novel fuzzy clustering algorithm that operates on relational input data; i.e.,
data in the form of a square matrix of pair wise similarities between data objects. The algorithm
uses a graph representation of the data, and operates in an Expectation-Maximization framework
in which the graph centrality of an object in the graph is interpreted as likelihood. Results of
applying the algorithm to sentence clustering tasks demonstrate that the algorithm is capable of
identifying overlapping clusters of semantically related sentences, and that it is therefore of
potential use in a variety of text mining tasks. We also include results of applying the algorithm
to benchmark data sets in several other domains.

More Related Content

PDF
Clustering sentence level text using a novel fuzzy relational clustering algo...
PPT
Deciding Behaviour Compatibility of Complex Correspondences between Process ...
PPTX
DOCX
Clustering sentence level text using a novel fuzzy relational clustering algo...
PDF
Bellman Equation in Dynamic Programming
PDF
Semantic Ordering Relation- Applied to Agatha Christie Crime Thrillers
PPTX
Bayesian probabilistic interference
PDF
Clustering large probabilistic graphs
Clustering sentence level text using a novel fuzzy relational clustering algo...
Deciding Behaviour Compatibility of Complex Correspondences between Process ...
Clustering sentence level text using a novel fuzzy relational clustering algo...
Bellman Equation in Dynamic Programming
Semantic Ordering Relation- Applied to Agatha Christie Crime Thrillers
Bayesian probabilistic interference
Clustering large probabilistic graphs

Viewers also liked (13)

PDF
Cloudsim distributed processing of probabilistic top-k queries in wireless s...
PPTX
What Is News?
PDF
KUMU Orientation EAT1 poster SP15
PPTX
Automóviles deportivos
DOC
Chopper based dc motor speed control
PPTX
Πώς παράγουμε το κρασί
PDF
Cloudsim t-drive enhancing driving directions with taxi drivers’ intelligence
DOC
Code modulation based encryption & decryption technique for secure communicat...
PDF
Channel assignment for throughput optimization in multichannel multiradio wir...
PDF
To the extract add a mixture of zinc dust and conc. Hydrochloric acid. It giv...
DOC
Design, analysis, and implementation of solar power optimizer for dc distribu...
PPTX
Порядок восстановления денежных средств на счетах образовательных организаций...
PDF
Straw pellet machine
Cloudsim distributed processing of probabilistic top-k queries in wireless s...
What Is News?
KUMU Orientation EAT1 poster SP15
Automóviles deportivos
Chopper based dc motor speed control
Πώς παράγουμε το κρασί
Cloudsim t-drive enhancing driving directions with taxi drivers’ intelligence
Code modulation based encryption & decryption technique for secure communicat...
Channel assignment for throughput optimization in multichannel multiradio wir...
To the extract add a mixture of zinc dust and conc. Hydrochloric acid. It giv...
Design, analysis, and implementation of solar power optimizer for dc distribu...
Порядок восстановления денежных средств на счетах образовательных организаций...
Straw pellet machine
Ad

More from Ecwayt (20)

PDF
Covering points of interest with mobile sensors
PDF
Coloring based inter-wban scheduling for mobile wireless body area networks
DOC
Code modulation based encryption & decryption technique for secure communicat...
PDF
Clustering sentence level text using a novel fuzzy relational clustering algo...
PDF
Clustering large probabilistic graphs
PDF
Cloudsim t-drive enhancing driving directions with taxi drivers’ intelligence
PDF
Cloudsim ranking on data manifold with sink points
PDF
Cloudsim quality-differentiated video multicast in multirate wireless networks
PDF
Cloudsim power allocation for statistical qo s provisioning in opportunistic...
PDF
Cloudsim distributed web systems performance forecasting using turning bands...
PDF
Cloudsim distributed processing of probabilistic top-k queries in wireless s...
DOCX
Civil 2013 titles
DOC
Chopper based dc motor speed control
PDF
Channel assignment for throughput optimization in multichannel multiradio wir...
PDF
Channel allocation and routing in hybrid multichannel multiradio wireless mes...
PDF
Casual stereoscopic photo authoring
DOCX
Casual stereoscopic photo authoring
PDF
Capacity of hybrid wireless mesh networks with random a ps
DOC
Bomb detection robot with wireless camera
DOC
Bed side patients monitoring system with emergency alert
Covering points of interest with mobile sensors
Coloring based inter-wban scheduling for mobile wireless body area networks
Code modulation based encryption & decryption technique for secure communicat...
Clustering sentence level text using a novel fuzzy relational clustering algo...
Clustering large probabilistic graphs
Cloudsim t-drive enhancing driving directions with taxi drivers’ intelligence
Cloudsim ranking on data manifold with sink points
Cloudsim quality-differentiated video multicast in multirate wireless networks
Cloudsim power allocation for statistical qo s provisioning in opportunistic...
Cloudsim distributed web systems performance forecasting using turning bands...
Cloudsim distributed processing of probabilistic top-k queries in wireless s...
Civil 2013 titles
Chopper based dc motor speed control
Channel assignment for throughput optimization in multichannel multiradio wir...
Channel allocation and routing in hybrid multichannel multiradio wireless mes...
Casual stereoscopic photo authoring
Casual stereoscopic photo authoring
Capacity of hybrid wireless mesh networks with random a ps
Bomb detection robot with wireless camera
Bed side patients monitoring system with emergency alert
Ad

Clustering sentence level text using a novel fuzzy relational clustering algorithm

  • 1. ECWAY TECHNOLOGIES IEEE PROJECTS & SOFTWARE DEVELOPMENTS OUR OFFICES @ CHENNAI / TRICHY / KARUR / ERODE / MADURAI / SALEM / COIMBATORE CELL: +91 98949 17187, +91 875487 2111 / 3111 / 4111 / 5111 / 6111 VISIT: www.ecwayprojects.com MAIL TO: ecwaytechnologies@gmail.com CLUSTERING SENTENCE-LEVEL TEXT USING A NOVEL FUZZY RELATIONAL CLUSTERING ALGORITHM ABSTRACT: In comparison with hard clustering methods, in which a pattern belongs to a single cluster, fuzzy clustering algorithms allow patterns to belong to all clusters with differing degrees of membership. This is important in domains such as sentence clustering, since a sentence is likely to be related to more than one theme or topic present within a document or set of documents. However, because most sentence similarity measures do not represent sentences in a common metric space, conventional fuzzy clustering approaches based on prototypes or mixtures of Gaussians are generally not applicable to sentence clustering. This paper presents a novel fuzzy clustering algorithm that operates on relational input data; i.e., data in the form of a square matrix of pair wise similarities between data objects. The algorithm uses a graph representation of the data, and operates in an Expectation-Maximization framework in which the graph centrality of an object in the graph is interpreted as likelihood. Results of applying the algorithm to sentence clustering tasks demonstrate that the algorithm is capable of identifying overlapping clusters of semantically related sentences, and that it is therefore of potential use in a variety of text mining tasks. We also include results of applying the algorithm to benchmark data sets in several other domains.