SlideShare a Scribd company logo
Clustering algorithms
Presented by :
Issra’a Haider Hashim
Depatment :Electronic &
Communication Engineering
Outline:
1
What is clustering?
2
Clustering Algorithms
3
Applications
What is Clustering?
 Is grouping similar objects together.
 Is the task of grouping a set of objects in such a
way that objects in the same group (called
a cluster) are more similar (in some sense or
another) to each other than to those in other
groups (clusters) .
 Clustering can therefore be formulated as
a multi-objective optimization problem.
A BBB
Females Males
A BBB
BBB
A
SIMPSONS Family SIMPSONS neighbors
Children Adults
BBB
BBB
A
A
Tall People
With hair
Short People
Hairless
“You need to know the question you
are trying to answer “
- Jason Bell (2014)
Bell,2014,Machine Learning: Hands-on for developers and technical professionals
Clustering
Algorithms
Connectivity-based
(hierarchical clustering)
Centroid -
based
Distribution-
based
Density-
based
Other
Algorithms
•Connectivity-based clustering (hierarchical
clustering):
These algorithms connect "objects" to form "clusters"
based on their distance. At different distances,
different clusters will form, which can be represented
using a dendrogram (from Greek dendro "tree" and
gramma "drawing" is a tree diagram).
Names Formulas
Euclidean distance
Manhattan distance
Squared Euclidean distance
Maximum distance
Mahalanobis distance
Clustering Algorithms.pptx
• Centroid-based clustering
In centroid-based clustering, clusters are represented by a central vector, which
may not necessarily be a member of the data set. When the number of clusters
is fixed to k, k-means clustering gives a formal definition as an optimization
problem: find the k cluster centers and assign the objects to the nearest cluster
center, such that the squared distances from the cluster are minimized.
The optimization problem itself is known to be NP-hard, and thus the common
approach is to search only for approximate solutions. A particularly well known
approximate method is Lloyd's algorithm,[8] often just referred to as "k-means
algorithm" (although another algorithm introduced this name). It does however
only find a local optimum, and is commonly run multiple times with different
random initializations. Variations of k-means often include such optimizations as
choosing the best of multiple runs, but also restricting the centroids to members
of the data set (k-medoids), choosing medians (k-medians clustering), choosing
the initial centers less randomly (k-means++) or allowing a fuzzy cluster
assignment (fuzzy c-means).
Clustering Algorithms.pptx
• Distribution-based clustering
The clustering model most closely related to statistics
is based on distribution models. Clusters can then
easily be defined as objects belonging most likely to
the same distribution. A convenient property of this
approach is that this closely resembles the way
artificial data sets are generated: by sampling random
objects from a distribution.
• Density based clustering:
In density-based clustering , clusters are defined as
areas of higher density than the remainder of the data
set. Objects in these sparse areas - that are required to
separate clusters - are usually considered to be noise
and border points.
Applications
Biology
Market
research
Medicine Social Science
World Wide
Web
Others
Computer
science
Confused , Comment , or any questions
Clustering Algorithms.pptx

More Related Content

PPTX
clustering ppt.pptx
PPTX
Data clustring
PPT
Capter10 cluster basic
PPT
Capter10 cluster basic : Han & Kamber
PDF
Paper id 26201478
PPT
UniT_A_Clustering machine learning .ppt
PPT
10 clusbasic
PPTX
pratik meshram-Unit 5 (contemporary mkt r sch)
clustering ppt.pptx
Data clustring
Capter10 cluster basic
Capter10 cluster basic : Han & Kamber
Paper id 26201478
UniT_A_Clustering machine learning .ppt
10 clusbasic
pratik meshram-Unit 5 (contemporary mkt r sch)

Similar to Clustering Algorithms.pptx (20)

PPT
CLUSTERING
PPT
data mining cocepts and techniques chapter
PPT
data mining cocepts and techniques chapter
PDF
10 clusbasic
PPT
Data Mining Concepts and Techniques, Chapter 10. Cluster Analysis: Basic Conc...
PPT
Clustering
PPT
Data mining concepts and techniques Chapter 10
PPT
Chapter 10. Cluster Analysis Basic Concepts and Methods.ppt
PPTX
cluster.pptx
PPT
Chapter 10 ClusBasic ppt file for clear understaning
PPT
Chapter -10-Clus_Basic.ppt -DataMinning
PDF
4.Unit 4 ML Q&A.pdf machine learning qb
PDF
Chapter 5.pdf
PDF
ACO-medoids
PDF
Improved K-mean Clustering Algorithm for Prediction Analysis using Classifica...
PDF
iiit delhi unsupervised pdf.pdf
PDF
A Novel Clustering Method for Similarity Measuring in Text Documents
PDF
CLUSTERING IN DATA MINING.pdf
PPTX
Data Clustering Using Swarm Intelligence Algorithms An Overview
PDF
Multilevel techniques for the clustering problem
CLUSTERING
data mining cocepts and techniques chapter
data mining cocepts and techniques chapter
10 clusbasic
Data Mining Concepts and Techniques, Chapter 10. Cluster Analysis: Basic Conc...
Clustering
Data mining concepts and techniques Chapter 10
Chapter 10. Cluster Analysis Basic Concepts and Methods.ppt
cluster.pptx
Chapter 10 ClusBasic ppt file for clear understaning
Chapter -10-Clus_Basic.ppt -DataMinning
4.Unit 4 ML Q&A.pdf machine learning qb
Chapter 5.pdf
ACO-medoids
Improved K-mean Clustering Algorithm for Prediction Analysis using Classifica...
iiit delhi unsupervised pdf.pdf
A Novel Clustering Method for Similarity Measuring in Text Documents
CLUSTERING IN DATA MINING.pdf
Data Clustering Using Swarm Intelligence Algorithms An Overview
Multilevel techniques for the clustering problem
Ad

Recently uploaded (20)

PDF
Lymphatic System MCQs & Practice Quiz – Functions, Organs, Nodes, Ducts
PPTX
BIOMOLECULES PPT........................
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
PDF
The scientific heritage No 166 (166) (2025)
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PPTX
Introduction to Fisheries Biotechnology_Lesson 1.pptx
PPT
POSITIONING IN OPERATION THEATRE ROOM.ppt
PDF
. Radiology Case Scenariosssssssssssssss
PDF
Looking into the jet cone of the neutrino-associated very high-energy blazar ...
PDF
CHAPTER 3 Cell Structures and Their Functions Lecture Outline.pdf
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PDF
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
PDF
Warm, water-depleted rocky exoplanets with surfaceionic liquids: A proposed c...
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
PPTX
Pharmacology of Autonomic nervous system
PDF
lecture 2026 of Sjogren's syndrome l .pdf
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
DOCX
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
Lymphatic System MCQs & Practice Quiz – Functions, Organs, Nodes, Ducts
BIOMOLECULES PPT........................
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
The scientific heritage No 166 (166) (2025)
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
Introduction to Fisheries Biotechnology_Lesson 1.pptx
POSITIONING IN OPERATION THEATRE ROOM.ppt
. Radiology Case Scenariosssssssssssssss
Looking into the jet cone of the neutrino-associated very high-energy blazar ...
CHAPTER 3 Cell Structures and Their Functions Lecture Outline.pdf
Biophysics 2.pdffffffffffffffffffffffffff
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
Warm, water-depleted rocky exoplanets with surfaceionic liquids: A proposed c...
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
Pharmacology of Autonomic nervous system
lecture 2026 of Sjogren's syndrome l .pdf
Classification Systems_TAXONOMY_SCIENCE8.pptx
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
Ad

Clustering Algorithms.pptx

  • 1. Clustering algorithms Presented by : Issra’a Haider Hashim Depatment :Electronic & Communication Engineering
  • 3. What is Clustering?  Is grouping similar objects together.  Is the task of grouping a set of objects in such a way that objects in the same group (called a cluster) are more similar (in some sense or another) to each other than to those in other groups (clusters) .  Clustering can therefore be formulated as a multi-objective optimization problem.
  • 5. A BBB BBB A SIMPSONS Family SIMPSONS neighbors Children Adults
  • 7. “You need to know the question you are trying to answer “ - Jason Bell (2014) Bell,2014,Machine Learning: Hands-on for developers and technical professionals
  • 9. •Connectivity-based clustering (hierarchical clustering): These algorithms connect "objects" to form "clusters" based on their distance. At different distances, different clusters will form, which can be represented using a dendrogram (from Greek dendro "tree" and gramma "drawing" is a tree diagram). Names Formulas Euclidean distance Manhattan distance Squared Euclidean distance Maximum distance Mahalanobis distance
  • 11. • Centroid-based clustering In centroid-based clustering, clusters are represented by a central vector, which may not necessarily be a member of the data set. When the number of clusters is fixed to k, k-means clustering gives a formal definition as an optimization problem: find the k cluster centers and assign the objects to the nearest cluster center, such that the squared distances from the cluster are minimized. The optimization problem itself is known to be NP-hard, and thus the common approach is to search only for approximate solutions. A particularly well known approximate method is Lloyd's algorithm,[8] often just referred to as "k-means algorithm" (although another algorithm introduced this name). It does however only find a local optimum, and is commonly run multiple times with different random initializations. Variations of k-means often include such optimizations as choosing the best of multiple runs, but also restricting the centroids to members of the data set (k-medoids), choosing medians (k-medians clustering), choosing the initial centers less randomly (k-means++) or allowing a fuzzy cluster assignment (fuzzy c-means).
  • 13. • Distribution-based clustering The clustering model most closely related to statistics is based on distribution models. Clusters can then easily be defined as objects belonging most likely to the same distribution. A convenient property of this approach is that this closely resembles the way artificial data sets are generated: by sampling random objects from a distribution.
  • 14. • Density based clustering: In density-based clustering , clusters are defined as areas of higher density than the remainder of the data set. Objects in these sparse areas - that are required to separate clusters - are usually considered to be noise and border points.
  • 16. Confused , Comment , or any questions