SlideShare a Scribd company logo
Journal of Computer Science andand Engineering Research and
 Journal of Computer Science Engineering Research and Development (JCSERD), ISSN XXXX –
                                                                  JCSERD
DevelopmentISSN XXXX – ISSN XXXX – XXXX(Print), 1, May -October (2011)
 XXXX(Print), (JCSERD), XXXX(Online), Volume 1, Number
ISSN XXXX – XXXX(Online)
Volume 1, Number 1, May -October (2011)
                                                                © PRJ PUBLICATION
pp. 30-35 © PRJ Publication
http://www.prjpublication.com/JCSERD.asp



        A COMPREHENSIVE SURVEY OF CONTEMPORARY
       RESEARCHES ON IMAGE SEGMENTATION THROUGH
                       CLUSTERING
                         Ritu Agrawal1 , Prof. Manisha Sharma2
         1 Electronics and Telecommunication , C.I.T, Rajnandgaon, C.G, India
             2 Electronics and Telecommunication , B.I.T, Durg, C.G, India
               ritube_03@yahoo.co.in , manishasharma1@rediffmail.com


ABSTRACT

This paper presents an analysis on different clustering techniques for image
segmentation. Clustering is the unsupervised classification of patterns (observations, data
items, or feature vectors) into groups (clusters). Clustering is most widely spread
approach in Image segmentation because of its robust characteristics for data
classification. Clustering is done on different attributes of an image such as size, color,
texture etc. The purpose of clustering is to get meaningful result, effective storage and
fast retrieval in various areas.

Keywords: Clustering, Image segmentation, Exclusive Clustering ,Overlapping
Clustering , Hierarchical clustering , Probabilistic D-Clustering

1. INTRODUCTION

Clustering in Image segmentation is defined as the process of identifying groups of
similar objects. A cluster is a collection of data points that are similar to one another
within the same cluster and dissimilar to data points in other clusters [1]. Clustering
techniques can broadly be classified as Unsupervised Clustering (calculated by software)
and Supervised (human-guided) clustering. Unsupervised clustering is defined as
groupings of pixels with common characteristics which are based on the software
analysis of an image. Supervised clustering is based on the idea that a user can select
sample pixels in an image that are representative of specific classes. Segmentation is a
process of subdividing an image into its constituent regions or objects. The level of
details, to which the subdivision is carried on, depends on the problem being solved So,
the segmentation should stop when the objects or regions of interest in an application
have been detected.




                                            30
Journal of Computer Science and Engineering Research and Development (JCSERD), ISSN XXXX –
XXXX(Print), ISSN XXXX – XXXX(Online), Volume 1, Number 1, May -October (2011)

      Image segmentation is a technique that partitions an image into uniform and non-
overlapping regions. This technique has a variety of applications including computer
vision, image analysis, medical image processing, remote sensing and geographical
information system.

2. Clustering

Clustering is a common technique for data analysis, which is used in many fields,
including machine learning, data mining, pattern recognition, image analysis and
bioinformatics. Clustering is the method of classification of similar
objects into different groups or more precisely the partitioning of a data set into subsets
(clusters), so that the data in each subset (ideally) share some common trait . The goal of
clustering algorithm is to maximize the intra-cluster similarity and minimize the inter-
cluster similarity.

A variety of clustering technique has been introduced to make the segmentation more
effective. The clustering technique can be broadly classified as: 1. Exclusive Clustering
2. Overlapping Clustering 3. Hierarchical Clustering 4. Probabilistic D- Clustering.

3. Segmentation

Segmentation is a process of partitioning a digital image into multiple segments or a sets
of pixels. The goal of segmentation is to simplify an image into some more meaningful
and easier to analyze. Image segmentation is typically used to locate objects and
boundaries (lines, curves, etc.) in images. More precisely, image segmentation is the
process of assigning a label to every pixel in an image such that pixels with the same
label share certain visual characteristics.

        Segmentation has been used in a wide range of applications. Different
applications require different types of images. The most commonly used images are light
intensity (LI), range (depth) image(RI), computerized tomography(CT), magnetic
resonance images(MRI). Image segmentation is highly dependent on the image type,
hence there is no single generalized technique that is suitable for all images.
There are numerous image segmentation techniques in the literature , which can be
broadly classified into two categories, namely i) classical ii) fuzzy mathematical . Fuzzy
mathematical techniques are widely used in computer vision applications as they are far
better able to handle and segment images, particularly noisy images.

4. Clustering Techniques

       An image may contain more than one objects and to segment an image in a
meaningful feature is a very difficult job.
This paper is a review and summarizes different clustering technique.




                                            31
Journal of Computer Science and Engineering Research and Development (JCSERD), ISSN XXXX –
XXXX(Print), ISSN XXXX – XXXX(Online), Volume 1, Number 1, May -October (2011)

4.1 Exclusive Clustering

        In this case data are grouped in an exclusive way, so that if a certain datum
belongs to a definite cluster then it could not be included in another cluster. K-means
clustering is one of the type of exclusive clustering and is one of the simplest
unsupervised learning algorithms.
In case of K means clustering , k centroid must be defined for each cluster. The algorithm
is composed of the following steps:
Step 1: Place K points into the space represented by the objects that are being clustered .
These points represent initial group centroids.
Step2: Assign each object to the group that has the closest centroid.
Step3: When all objects have been assigned, recalculate the positions of the K centroids.
Step 4: Repeat Step2 and 3 until the centriods no longer moves. This produces a
separation of the objects into groups which the metric to be minimized can be calculated.

4.2 Overlapping Clustering

The overlapping clustering, uses fuzzy sets to cluster data, so that each point may belong
to two or more clusters with different degrees of membership. In this case, data will be
associated to an appropriate membership value. Fuzzy C means is one of the type of
overlapping clustering algorithm. Fuzzy c-means (FCM) is a method of clustering which
allows one piece of data to belong to two or more clusters. This method is frequently
used in pattern recognition.
The algorithm is composed of the following steps:

Step 1: Initialize prototype
V= {v1,v2,…vc}
Repeat Vprevious ← V
Compute membership function using equation
                   k
µci(x) = 1 /     ∑       [(║ x-vi ║2) /
                  i =1
(║ x-vj ║2 )] 1/m-1 1≤i≤ k, xЄ X
update the prototype , vi in V using equation
                                n
vi= ∑ (µci(x))m X x / ∑ (µci(x))m 1≤i≤ k
   x∈X                         x∈ X
         c
until   ∑      ║ vi previous - vi ║ ≤ ε
        i =1
where, X : an unlabeled data set
c: the number of clusters to form
m: the parameter in the objective function.
ε: a threshold for the convergence criteria.




                                                32
Journal of Computer Science and Engineering Research and Development (JCSERD), ISSN XXXX –
XXXX(Print), ISSN XXXX – XXXX(Online), Volume 1, Number 1, May -October (2011)

4.3 Hierarchical clustering

     Hierarchical clustering creates a hierarchy of clusters which may be represented in a
tree structure. The root of the tree consists of a single cluster containing all observations,
and the leaves correspond to individual observations.
For example, suppose this data is to be clustered
                          a



                 b            d

                 c            e       f


In this example, we have six elements {a} {b} {c} {d} {e} and {f}. The first step is to
determine which elements is to merge in a cluster. Usually, we want to take the two
closest elements, according to the chosen distance by using Euclidian distance.
In this example, cutting the row yield clusters {a} {b c} {d e} {f}. Then again Cutting
another row yields clusters {a} {b c} {d e f}.The combination of clusters depend on the
distace between the clusters. As clustering progresses, rows and columns are merged and
as the
clusters are merged, the distances updated. This is a common way to implement this type
of clustering. To

 a           b        c           d        e   f



                 bc                       de



                                          de


                 bcde


     abcde

Hierarchical representation

stop clustering either when the clusters are too far apart to be merged or when there is a
sufficiently small number of clusters.
     The algorithm used are Given a set of N items to be clustered, and an N*N distance
matrix then.
     The basic algorithm of hierarchical clustering is the N*N matrix is D = [d(i,j)]. The
clustering’s are assigned sequence numbers 0,1,......, (n-1) and L(k) is the level of the kth
clustering. A cluster with sequence number “m” is denoted (m) and the proximity
between clusters (r) and (s) is denoted d [(r),(s)].
The algorithm is composed of the following steps:


                                                   33
Journal of Computer Science and Engineering Research and Development (JCSERD), ISSN XXXX –
XXXX(Print), ISSN XXXX – XXXX(Online), Volume 1, Number 1, May -October (2011)

Step1: Begin with the disjoint clustering having level L(0)= 0 and the sequence number
m=0 .
Step2: Find the least dissimilar pair of clusters in the current clustering , say pair (r ), (s)
according to d[(r ),(s)]= min d[(i),(j)] where the minimum is over all pairs of clusters in
the current clustering.
Step 3: Increment the sequence number m= m+1 . Merge clusters (r )and (s) into single
clusters to form the next clustering m. Set the level of this clustering to L(m)= d[(r ),(s)].
Step 4: Update the proximity matrix D , by deleting the rows and columns corresponding
to clusters (r ) and (s) and adding a row and columns corresponding to the newly formed
cluster. The proximity between the new cluster, denoted(r,s) and the old cluster (k) is
defined in this way
d[(k),(r,s)] = min d[(k),(r )],d[(k),(s)]
if all objects are in one cluster , stop else go to Step 2 .

4.4 Probabilistic – D Clustering:
 Here the word D means distance (Euclidean/ Exponential). The probability of cluster
membership at any point is assumed to be inversely proportional to the distance from the
centre of cluster.
   If, Pk(x) = probability that the point x belongs to cluster Ck.
dk(x) = distance of point x from cluster            Ck.
Then: Pk(x) . dk(x) = constant , depending on (x).
The clustering criterion used here is Euclidean distance

Mathematically
                            k
Pk(x) =   ∏
          j ≠k
                 dj(x) /   ∑∏ di( x)
                           i =1 j ≠ i

If we consider the distance as Exponential then, Probability equation will be changed as
dj(x) will be replaced by e dj(x).

5       CONCLUSION
The paper presents an analysis on different clustering techniques used for image
segmentation. Through clustering algorithms, image segmentation can be done in an
effective way. Fuzzy C-means algorithm proved to be superior over other clustering
approaches in terms of segmentation efficiency. The major drawback of FCM is the huge
computational time required for convergence. To avoid the computational time of FCM ,
probabilistic – D clustering is proposed




                                              34
Journal of Computer Science and Engineering Research and Development (JCSERD), ISSN XXXX –
XXXX(Print), ISSN XXXX – XXXX(Online), Volume 1, Number 1, May -October (2011)

REFERENCES
[1] www.wikipedia.com
[2] Andrew Moore: “K-means and Hierarchical Clustering - Tutorial Slides”
[3]          Brian          T.         Luke:           “K-Means             Clustering”
[4] J. C. Dunn (1973): "A Fuzzy Relative of the ISODATA Process and Its Use in
Detecting Compact Well-Separated Clusters", Journal of Cybernetics 3: 32-5
[5] J. C. Bezdek (1981): "Pattern Recognition with Fuzzy Objective Function
Algorithms", Plenum Press, New York.
[6]Data Clustering A Review ACM Computing Surveys, Vol. 31, No. 3, September 1999
[7] Paper 193-2011
Comparison of Probabilistic-D and k-Means Clustering in Segment Profiles for B2B
Markets SAS Global Forum 2011
[8] A Survey on Image Segmentation Through Clustering International Journal
of Research and Reviews in Information Sciences Vol. 1, No. 1, March 2011
[9] Image Segmentation using Fuzzy Clustering: A Survey. 6th International Conference
on Emerging Technologies (ICET) 2010




                                           35

More Related Content

PDF
A comprehensive survey of contemporary
PDF
DOMAIN SPECIFIC CBIR FOR HIGHLY TEXTURED IMAGES
PDF
I1803026164
PDF
SEGMENTATION USING ‘NEW’ TEXTURE FEATURE
PDF
Textural Feature Extraction of Natural Objects for Image Classification
PDF
Fractal Image Compression By Range Block Classification
PDF
A new block cipher for image encryption based on multi chaotic systems
PDF
A010210106
A comprehensive survey of contemporary
DOMAIN SPECIFIC CBIR FOR HIGHLY TEXTURED IMAGES
I1803026164
SEGMENTATION USING ‘NEW’ TEXTURE FEATURE
Textural Feature Extraction of Natural Objects for Image Classification
Fractal Image Compression By Range Block Classification
A new block cipher for image encryption based on multi chaotic systems
A010210106

What's hot (20)

PDF
Web image annotation by diffusion maps manifold learning algorithm
PDF
Image Segmentation Using Two Weighted Variable Fuzzy K Means
PDF
AUTOMATIC THRESHOLDING TECHNIQUES FOR OPTICAL IMAGES
PDF
Introduction to Multi-Objective Clustering Ensemble
PDF
SCAF – AN EFFECTIVE APPROACH TO CLASSIFY SUBSPACE CLUSTERING ALGORITHMS
PDF
Survey on Unsupervised Learning in Datamining
PDF
FUZZY SET THEORETIC APPROACH TO IMAGE THRESHOLDING
PDF
AUTOMATIC THRESHOLDING TECHNIQUES FOR SAR IMAGES
PDF
AUTOMATIC THRESHOLDING TECHNIQUES FOR SAR IMAGES
PDF
B colouring
PDF
Improved probabilistic distance based locality preserving projections method ...
PPT
3.1 clustering
PPT
Capter10 cluster basic
PDF
Kandemir Inferring Object Relevance From Gaze In Dynamic Scenes
PDF
Fuzzy In Remote Classification
PDF
Fuzzy c-means
PDF
PERFORMANCE ANALYSIS OF CLUSTERING BASED IMAGE SEGMENTATION AND OPTIMIZATION ...
PDF
Colour Image Segmentation Using Soft Rough Fuzzy-C-Means and Multi Class SVM
PDF
Texture Segmentation Based on Multifractal Dimension
PPTX
Clusters techniques
Web image annotation by diffusion maps manifold learning algorithm
Image Segmentation Using Two Weighted Variable Fuzzy K Means
AUTOMATIC THRESHOLDING TECHNIQUES FOR OPTICAL IMAGES
Introduction to Multi-Objective Clustering Ensemble
SCAF – AN EFFECTIVE APPROACH TO CLASSIFY SUBSPACE CLUSTERING ALGORITHMS
Survey on Unsupervised Learning in Datamining
FUZZY SET THEORETIC APPROACH TO IMAGE THRESHOLDING
AUTOMATIC THRESHOLDING TECHNIQUES FOR SAR IMAGES
AUTOMATIC THRESHOLDING TECHNIQUES FOR SAR IMAGES
B colouring
Improved probabilistic distance based locality preserving projections method ...
3.1 clustering
Capter10 cluster basic
Kandemir Inferring Object Relevance From Gaze In Dynamic Scenes
Fuzzy In Remote Classification
Fuzzy c-means
PERFORMANCE ANALYSIS OF CLUSTERING BASED IMAGE SEGMENTATION AND OPTIMIZATION ...
Colour Image Segmentation Using Soft Rough Fuzzy-C-Means and Multi Class SVM
Texture Segmentation Based on Multifractal Dimension
Clusters techniques
Ad

Similar to 4 image segmentation through clustering (20)

PDF
Fuzzy c means_realestate_application
PDF
H0114857
PDF
Machine Learning Algorithms for Image Classification of Hand Digits and Face ...
DOCX
Clustering techniques final
PDF
A Novel Algorithm for Design Tree Classification with PCA
PDF
1376846406 14447221
PDF
iiit delhi unsupervised pdf.pdf
PPT
K_MeansK_MeansK_MeansK_MeansK_MeansK_MeansK_Means.ppt
PPT
UniT_A_Clustering machine learning .ppt
PDF
Comparison Between Clustering Algorithms for Microarray Data Analysis
PDF
Extended fuzzy c means clustering algorithm in segmentation of noisy images
PDF
RFNM-Aranda-Final.PDF
PDF
CLUSTERING HYPERSPECTRAL DATA
PPT
Clustering
PDF
MAGNETIC RESONANCE BRAIN IMAGE SEGMENTATION
PDF
B0343011014
PDF
Ensemble based Distributed K-Modes Clustering
PDF
GREY LEVEL CO-OCCURRENCE MATRICES: GENERALISATION AND SOME NEW FEATURES
PDF
Volume 2-issue-6-2143-2147
PDF
Volume 2-issue-6-2143-2147
Fuzzy c means_realestate_application
H0114857
Machine Learning Algorithms for Image Classification of Hand Digits and Face ...
Clustering techniques final
A Novel Algorithm for Design Tree Classification with PCA
1376846406 14447221
iiit delhi unsupervised pdf.pdf
K_MeansK_MeansK_MeansK_MeansK_MeansK_MeansK_Means.ppt
UniT_A_Clustering machine learning .ppt
Comparison Between Clustering Algorithms for Microarray Data Analysis
Extended fuzzy c means clustering algorithm in segmentation of noisy images
RFNM-Aranda-Final.PDF
CLUSTERING HYPERSPECTRAL DATA
Clustering
MAGNETIC RESONANCE BRAIN IMAGE SEGMENTATION
B0343011014
Ensemble based Distributed K-Modes Clustering
GREY LEVEL CO-OCCURRENCE MATRICES: GENERALISATION AND SOME NEW FEATURES
Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147
Ad

More from prjpublications (20)

PDF
Mems based optical sensor for salinity measurement
PDF
Implementation and analysis of multiple criteria decision routing algorithm f...
PDF
An approach to design a rectangular microstrip patch antenna in s band by tlm...
PDF
A design and simulation of optical pressure sensor based on photonic crystal ...
PDF
Pattern recognition using video surveillance for wildlife applications
PDF
Precision face image retrieval by extracting the face features and comparing ...
PDF
Keyless approach of separable hiding data into encrypted image
PDF
Encryption based multi user manner secured data sharing and storing in cloud
PDF
A secure payment scheme in multihop wireless network by trusted node identifi...
PDF
Preparation gade and idol model for preventing multiple spoofing attackers in...
PDF
Study on gis simulated water quality model
PDF
Smes role in reduction of the unemployment problem in the area located in sa...
PDF
Review of three categories of fingerprint recognition
PDF
Reduction of executive stress by development of emotional intelligence a stu...
PDF
Mathematical modeling approach for flood management
PDF
Influences of child endorsers on the consumers
PDF
Impact of stress management by development of emotional intelligence in cmts,...
PDF
Faulty node recovery and replacement algorithm for wireless sensor network
PDF
Extended information technology enabled service quality model for life insura...
PDF
Employee spirituality and job engagement a correlational study across organi...
Mems based optical sensor for salinity measurement
Implementation and analysis of multiple criteria decision routing algorithm f...
An approach to design a rectangular microstrip patch antenna in s band by tlm...
A design and simulation of optical pressure sensor based on photonic crystal ...
Pattern recognition using video surveillance for wildlife applications
Precision face image retrieval by extracting the face features and comparing ...
Keyless approach of separable hiding data into encrypted image
Encryption based multi user manner secured data sharing and storing in cloud
A secure payment scheme in multihop wireless network by trusted node identifi...
Preparation gade and idol model for preventing multiple spoofing attackers in...
Study on gis simulated water quality model
Smes role in reduction of the unemployment problem in the area located in sa...
Review of three categories of fingerprint recognition
Reduction of executive stress by development of emotional intelligence a stu...
Mathematical modeling approach for flood management
Influences of child endorsers on the consumers
Impact of stress management by development of emotional intelligence in cmts,...
Faulty node recovery and replacement algorithm for wireless sensor network
Extended information technology enabled service quality model for life insura...
Employee spirituality and job engagement a correlational study across organi...

4 image segmentation through clustering

  • 1. Journal of Computer Science andand Engineering Research and Journal of Computer Science Engineering Research and Development (JCSERD), ISSN XXXX – JCSERD DevelopmentISSN XXXX – ISSN XXXX – XXXX(Print), 1, May -October (2011) XXXX(Print), (JCSERD), XXXX(Online), Volume 1, Number ISSN XXXX – XXXX(Online) Volume 1, Number 1, May -October (2011) © PRJ PUBLICATION pp. 30-35 © PRJ Publication http://www.prjpublication.com/JCSERD.asp A COMPREHENSIVE SURVEY OF CONTEMPORARY RESEARCHES ON IMAGE SEGMENTATION THROUGH CLUSTERING Ritu Agrawal1 , Prof. Manisha Sharma2 1 Electronics and Telecommunication , C.I.T, Rajnandgaon, C.G, India 2 Electronics and Telecommunication , B.I.T, Durg, C.G, India ritube_03@yahoo.co.in , manishasharma1@rediffmail.com ABSTRACT This paper presents an analysis on different clustering techniques for image segmentation. Clustering is the unsupervised classification of patterns (observations, data items, or feature vectors) into groups (clusters). Clustering is most widely spread approach in Image segmentation because of its robust characteristics for data classification. Clustering is done on different attributes of an image such as size, color, texture etc. The purpose of clustering is to get meaningful result, effective storage and fast retrieval in various areas. Keywords: Clustering, Image segmentation, Exclusive Clustering ,Overlapping Clustering , Hierarchical clustering , Probabilistic D-Clustering 1. INTRODUCTION Clustering in Image segmentation is defined as the process of identifying groups of similar objects. A cluster is a collection of data points that are similar to one another within the same cluster and dissimilar to data points in other clusters [1]. Clustering techniques can broadly be classified as Unsupervised Clustering (calculated by software) and Supervised (human-guided) clustering. Unsupervised clustering is defined as groupings of pixels with common characteristics which are based on the software analysis of an image. Supervised clustering is based on the idea that a user can select sample pixels in an image that are representative of specific classes. Segmentation is a process of subdividing an image into its constituent regions or objects. The level of details, to which the subdivision is carried on, depends on the problem being solved So, the segmentation should stop when the objects or regions of interest in an application have been detected. 30
  • 2. Journal of Computer Science and Engineering Research and Development (JCSERD), ISSN XXXX – XXXX(Print), ISSN XXXX – XXXX(Online), Volume 1, Number 1, May -October (2011) Image segmentation is a technique that partitions an image into uniform and non- overlapping regions. This technique has a variety of applications including computer vision, image analysis, medical image processing, remote sensing and geographical information system. 2. Clustering Clustering is a common technique for data analysis, which is used in many fields, including machine learning, data mining, pattern recognition, image analysis and bioinformatics. Clustering is the method of classification of similar objects into different groups or more precisely the partitioning of a data set into subsets (clusters), so that the data in each subset (ideally) share some common trait . The goal of clustering algorithm is to maximize the intra-cluster similarity and minimize the inter- cluster similarity. A variety of clustering technique has been introduced to make the segmentation more effective. The clustering technique can be broadly classified as: 1. Exclusive Clustering 2. Overlapping Clustering 3. Hierarchical Clustering 4. Probabilistic D- Clustering. 3. Segmentation Segmentation is a process of partitioning a digital image into multiple segments or a sets of pixels. The goal of segmentation is to simplify an image into some more meaningful and easier to analyze. Image segmentation is typically used to locate objects and boundaries (lines, curves, etc.) in images. More precisely, image segmentation is the process of assigning a label to every pixel in an image such that pixels with the same label share certain visual characteristics. Segmentation has been used in a wide range of applications. Different applications require different types of images. The most commonly used images are light intensity (LI), range (depth) image(RI), computerized tomography(CT), magnetic resonance images(MRI). Image segmentation is highly dependent on the image type, hence there is no single generalized technique that is suitable for all images. There are numerous image segmentation techniques in the literature , which can be broadly classified into two categories, namely i) classical ii) fuzzy mathematical . Fuzzy mathematical techniques are widely used in computer vision applications as they are far better able to handle and segment images, particularly noisy images. 4. Clustering Techniques An image may contain more than one objects and to segment an image in a meaningful feature is a very difficult job. This paper is a review and summarizes different clustering technique. 31
  • 3. Journal of Computer Science and Engineering Research and Development (JCSERD), ISSN XXXX – XXXX(Print), ISSN XXXX – XXXX(Online), Volume 1, Number 1, May -October (2011) 4.1 Exclusive Clustering In this case data are grouped in an exclusive way, so that if a certain datum belongs to a definite cluster then it could not be included in another cluster. K-means clustering is one of the type of exclusive clustering and is one of the simplest unsupervised learning algorithms. In case of K means clustering , k centroid must be defined for each cluster. The algorithm is composed of the following steps: Step 1: Place K points into the space represented by the objects that are being clustered . These points represent initial group centroids. Step2: Assign each object to the group that has the closest centroid. Step3: When all objects have been assigned, recalculate the positions of the K centroids. Step 4: Repeat Step2 and 3 until the centriods no longer moves. This produces a separation of the objects into groups which the metric to be minimized can be calculated. 4.2 Overlapping Clustering The overlapping clustering, uses fuzzy sets to cluster data, so that each point may belong to two or more clusters with different degrees of membership. In this case, data will be associated to an appropriate membership value. Fuzzy C means is one of the type of overlapping clustering algorithm. Fuzzy c-means (FCM) is a method of clustering which allows one piece of data to belong to two or more clusters. This method is frequently used in pattern recognition. The algorithm is composed of the following steps: Step 1: Initialize prototype V= {v1,v2,…vc} Repeat Vprevious ← V Compute membership function using equation k µci(x) = 1 / ∑ [(║ x-vi ║2) / i =1 (║ x-vj ║2 )] 1/m-1 1≤i≤ k, xЄ X update the prototype , vi in V using equation n vi= ∑ (µci(x))m X x / ∑ (µci(x))m 1≤i≤ k x∈X x∈ X c until ∑ ║ vi previous - vi ║ ≤ ε i =1 where, X : an unlabeled data set c: the number of clusters to form m: the parameter in the objective function. ε: a threshold for the convergence criteria. 32
  • 4. Journal of Computer Science and Engineering Research and Development (JCSERD), ISSN XXXX – XXXX(Print), ISSN XXXX – XXXX(Online), Volume 1, Number 1, May -October (2011) 4.3 Hierarchical clustering Hierarchical clustering creates a hierarchy of clusters which may be represented in a tree structure. The root of the tree consists of a single cluster containing all observations, and the leaves correspond to individual observations. For example, suppose this data is to be clustered a b d c e f In this example, we have six elements {a} {b} {c} {d} {e} and {f}. The first step is to determine which elements is to merge in a cluster. Usually, we want to take the two closest elements, according to the chosen distance by using Euclidian distance. In this example, cutting the row yield clusters {a} {b c} {d e} {f}. Then again Cutting another row yields clusters {a} {b c} {d e f}.The combination of clusters depend on the distace between the clusters. As clustering progresses, rows and columns are merged and as the clusters are merged, the distances updated. This is a common way to implement this type of clustering. To a b c d e f bc de de bcde abcde Hierarchical representation stop clustering either when the clusters are too far apart to be merged or when there is a sufficiently small number of clusters. The algorithm used are Given a set of N items to be clustered, and an N*N distance matrix then. The basic algorithm of hierarchical clustering is the N*N matrix is D = [d(i,j)]. The clustering’s are assigned sequence numbers 0,1,......, (n-1) and L(k) is the level of the kth clustering. A cluster with sequence number “m” is denoted (m) and the proximity between clusters (r) and (s) is denoted d [(r),(s)]. The algorithm is composed of the following steps: 33
  • 5. Journal of Computer Science and Engineering Research and Development (JCSERD), ISSN XXXX – XXXX(Print), ISSN XXXX – XXXX(Online), Volume 1, Number 1, May -October (2011) Step1: Begin with the disjoint clustering having level L(0)= 0 and the sequence number m=0 . Step2: Find the least dissimilar pair of clusters in the current clustering , say pair (r ), (s) according to d[(r ),(s)]= min d[(i),(j)] where the minimum is over all pairs of clusters in the current clustering. Step 3: Increment the sequence number m= m+1 . Merge clusters (r )and (s) into single clusters to form the next clustering m. Set the level of this clustering to L(m)= d[(r ),(s)]. Step 4: Update the proximity matrix D , by deleting the rows and columns corresponding to clusters (r ) and (s) and adding a row and columns corresponding to the newly formed cluster. The proximity between the new cluster, denoted(r,s) and the old cluster (k) is defined in this way d[(k),(r,s)] = min d[(k),(r )],d[(k),(s)] if all objects are in one cluster , stop else go to Step 2 . 4.4 Probabilistic – D Clustering: Here the word D means distance (Euclidean/ Exponential). The probability of cluster membership at any point is assumed to be inversely proportional to the distance from the centre of cluster. If, Pk(x) = probability that the point x belongs to cluster Ck. dk(x) = distance of point x from cluster Ck. Then: Pk(x) . dk(x) = constant , depending on (x). The clustering criterion used here is Euclidean distance Mathematically k Pk(x) = ∏ j ≠k dj(x) / ∑∏ di( x) i =1 j ≠ i If we consider the distance as Exponential then, Probability equation will be changed as dj(x) will be replaced by e dj(x). 5 CONCLUSION The paper presents an analysis on different clustering techniques used for image segmentation. Through clustering algorithms, image segmentation can be done in an effective way. Fuzzy C-means algorithm proved to be superior over other clustering approaches in terms of segmentation efficiency. The major drawback of FCM is the huge computational time required for convergence. To avoid the computational time of FCM , probabilistic – D clustering is proposed 34
  • 6. Journal of Computer Science and Engineering Research and Development (JCSERD), ISSN XXXX – XXXX(Print), ISSN XXXX – XXXX(Online), Volume 1, Number 1, May -October (2011) REFERENCES [1] www.wikipedia.com [2] Andrew Moore: “K-means and Hierarchical Clustering - Tutorial Slides” [3] Brian T. Luke: “K-Means Clustering” [4] J. C. Dunn (1973): "A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters", Journal of Cybernetics 3: 32-5 [5] J. C. Bezdek (1981): "Pattern Recognition with Fuzzy Objective Function Algorithms", Plenum Press, New York. [6]Data Clustering A Review ACM Computing Surveys, Vol. 31, No. 3, September 1999 [7] Paper 193-2011 Comparison of Probabilistic-D and k-Means Clustering in Segment Profiles for B2B Markets SAS Global Forum 2011 [8] A Survey on Image Segmentation Through Clustering International Journal of Research and Reviews in Information Sciences Vol. 1, No. 1, March 2011 [9] Image Segmentation using Fuzzy Clustering: A Survey. 6th International Conference on Emerging Technologies (ICET) 2010 35