SlideShare a Scribd company logo
Clustering for New Discovery in Data
Houston Machine Learning Meetup
2
SCRŠ
Roadmap: Method
• Tour of machine learning algorithms (1 session)
• Feature engineering (1 session)
– Feature selection - Yan
• Supervised learning (4 sessions)
– Regression models -Yan
– SVM and kernel SVM - Yan
– Tree-based models - Dario
– Bayesian method - Xiaoyang
– Ensemble models - Yan
• Unsupervised learning (3 sessions)
– K-means clustering
– DBSCAN - Cheng
– Mean shift
– Agglomerative clustering - Kunal
– Dimension reduction for data visualization - Yan
• Deep learning (4 sessions)
_ Neural network
– From neural network to deep learning
– Convolutional neural network
– Train deep nets with open-source tools
3
SCRŠ
Roadmap: Application
• Business analytics
• Recommendation system
• Natural language processing
• Computer vision
• Energy industry
4
SCRŠ
Agenda
• Introduction
• Application of clustering
• K-means
• DBSCAN
• Cluster validation
5
SCRŠ
What is clustering
Clustering: to discover the natural groupings of a set of objects/patterns in the
unlabeled data
6
SCRŠ
Application: Recommendation
7
SCRŠ
Application: Document Clustering
https://www.noggle.online/knowledgebase/document-clustering/
8
SCRŠ
Application: Pizza Hut Center
Delivery locations
9
SCRŠ
Application: Discovering Gene functions
Important to discover diseases
and treatment
10
SCRŠ
Clustering Algorithm
• K-Means (King of clustering, many variants)
• DBSCAN (group neighboring points)
• Mean shift (locating the maxima of density)
• Spectral clustering (cares about connectivity instead of proximity)
• Hierarchical clustering (a hierarchical structure, multiple levels)
• Expectation Maximization (k-means is a variant of EM)
• Latent Dirichlet Allocation (natural language processing)
……
11
SCRŠ
• K-Means
• DBSCAN
12
SCRŠ
Cluster Validation
13
SCRŠ
Cluster Validity
• For cluster analysis, the question is how to evaluate the
“goodness” of the resulting clusters?
• Then why do we want to evaluate them?
– To avoid finding patterns in noise
– To compare clustering algorithms
– To determine the optimal number of clusters
14
SCRŠ
Cluster Validity
• Numerical measures:
– External: Used to measure the extent to which cluster labels match
externally supplied class labels.
• Entropy
– Internal: Used to measure the goodness of a clustering structure without
respect to external information.
• Sum of Squared Error (SSE)
– Relative: Used to compare two different clusterings.
• Often an external or internal measurement is used for this function, e.g., SSE or entropy
• Visualization
15
SCRŠ
Internal Measures: WSE and BSE
• Cluster Cohesion: Measures how closely related are objects in a
cluster
– Example: SSE
• Cluster Separation: Measure how distinct or well-separated a
cluster is from other clusters
• Example: Squared Error
– Cohesion is measured by the within cluster sum of squares (SSE)
– Separation is measured by the between cluster sum of squares
– Where |Ci| is the size of cluster i
 


i Cx
i
i
mxWSS 2
)(
 
i
ii mmCBSS 2
)(
16
SCRŠ
Internal Measures: WSE and BSE
• Example: SSE
– BSS + WSS = constant
1091
9)35.4(2)5.13(2
1)5.45()5.44()5.12()5.11(
22
2222



Total
BSS
WSS
1 2 3 4 5
 
m1 m2
m
K=2 clusters:
10010
0)33(4
10)35()34()32()31(
2
2222



Total
BSS
WSSK=1 cluster:
17
SCRŠ
Internal Measures: WSE and BSE
• Can be used to estimate the number of clusters
2 5 10 15 20 25 30
0
1
2
3
4
5
6
7
8
9
10
KSSE5 10 15
-6
-4
-2
0
2
4
6
WSS
18
SCRŠ
Internal Measures: Proximity graph measures
• Cluster cohesion is the sum of the weight of all links within a
cluster.
• Cluster separation is the sum of the weights between nodes in the
cluster and nodes outside the cluster.
cohesion separation
19
SCRŠ
Correlation between affinity matrix and
incidence matrix
• Given affinity distance matrix D = {d11,d12, …, dnn }
Incidence matrix C= { c11, c12,…, cnn } from clustering
• Correlation r between D and C is given by








n
ji
ij
n
ji
ij
n
ji
ijij
ccdd
ccdd
r
1,1
2
_
1,1
2
_
1,1
__
)()(
))((
20
SCRŠ
Correlation with Incidence matrix








n
ji
ij
n
ji
ij
n
ji
ijij
ccdd
ccdd
r
1,1
2
_
1,1
2
_
1,1
__
)()(
))((
0 0.2 0.4 0.6 0.8 1
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
x
y
0 0.2 0.4 0.6 0.8 1
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
x
y
r = -0.9235 r = -0.5810
21
SCRŠ
Visualization of similarity matrix
• Order the similarity matrix with respect to cluster labels and
inspect visually.
0 0.2 0.4 0.6 0.8 1
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
x
y
Points
Points
20 40 60 80 100
10
20
30
40
50
60
70
80
90
100
Similarity
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
22
SCRŠ
• Clusters in random data are not so crisp
Points
Points
20 40 60 80 100
10
20
30
40
50
60
70
80
90
100
Similarity
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 0.2 0.4 0.6 0.8 1
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
x
y
Visualization of similarity matrix
23
SCRŠ
Final Comment on Cluster Validity
“The validation of clustering structures is the most difficult and frustrating part
of cluster analysis.
Without a strong effort in this direction, cluster analysis will remain a black art
accessible only to those true believers who have experience and great
courage.”
Algorithms for Clustering Data, Jain and Dubes
24
SCRŠ
Roadmap: Method
• Tour of machine learning algorithms (1 session)
• Feature engineering (1 session)
– Feature selection - Yan
• Supervised learning (4 sessions)
– Regression models -Yan
– SVM and kernel SVM - Yan
– Tree-based models - Dario
– Bayesian method - Xiaoyang
– Ensemble models - Yan
• Unsupervised learning (3 sessions)
– K-means clustering
– DBSCAN - Cheng
– Mean shift
– Hierarchical clustering - Kunal
– Dimension reduction for data visualization - Yan
• Deep learning (4 sessions)
_ Neural network
– From neural network to deep learning - Yan
– Convolutional neural network
– Train deep nets with open-source tools
25
SCRŠ
Thank you
Slides will be posted on slide share:
http://www.slideshare.net/xuyangela

More Related Content

PDF
Nonlinear dimension reduction
 
PPTX
Spectral clustering - Houston ML Meetup
 
PDF
Case Study of Convolutional Neural Network
PDF
Multidimension Scaling and Isomap
PDF
Deep Feed Forward Neural Networks and Regularization
 
PDF
Introduction to Convolutional Neural Networks
PDF
Score based Generative Modeling through Stochastic Differential Equations
PPTX
Convolutional neural network
Nonlinear dimension reduction
 
Spectral clustering - Houston ML Meetup
 
Case Study of Convolutional Neural Network
Multidimension Scaling and Isomap
Deep Feed Forward Neural Networks and Regularization
 
Introduction to Convolutional Neural Networks
Score based Generative Modeling through Stochastic Differential Equations
Convolutional neural network

What's hot (20)

PDF
CNNs: from the Basics to Recent Advances
PDF
Understanding Convolutional Neural Networks
PDF
Convolutional Neural Networks : Popular Architectures
 
PDF
Introduction to Diffusion Models
PPTX
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
PDF
Deep learning and image analytics using Python by Dr Sanparit
PPTX
Clustering on database systems rkm
PPTX
Image classification with Deep Neural Networks
PDF
Face recognition and deep learning โดย ดร. สรรพฤทธิ์ มฤคทัต NECTEC
PPTX
K-means Clustering
PPTX
Invertible Denoising Network: A Light Solution for Real Noise Removal
PPTX
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
PPTX
Image classification using cnn
PPTX
Deep Learning Tutorial
PPTX
Fast Single-pass K-means Clusterting at Oxford
PDF
Super resolution in deep learning era - Jaejun Yoo
PDF
Birch
PPTX
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
 
PPTX
Machine Learning - Introduction to Convolutional Neural Networks
PDF
Deep Learning behind Prisma
CNNs: from the Basics to Recent Advances
Understanding Convolutional Neural Networks
Convolutional Neural Networks : Popular Architectures
 
Introduction to Diffusion Models
Traffic Demand Prediction Based Dynamic Transition Convolutional Neural Network
Deep learning and image analytics using Python by Dr Sanparit
Clustering on database systems rkm
Image classification with Deep Neural Networks
Face recognition and deep learning โดย ดร. สรรพฤทธิ์ มฤคทัต NECTEC
K-means Clustering
Invertible Denoising Network: A Light Solution for Real Noise Removal
Lecture 29 Convolutional Neural Networks - Computer Vision Spring2015
Image classification using cnn
Deep Learning Tutorial
Fast Single-pass K-means Clusterting at Oxford
Super resolution in deep learning era - Jaejun Yoo
Birch
AlexNet(ImageNet Classification with Deep Convolutional Neural Networks)
 
Machine Learning - Introduction to Convolutional Neural Networks
Deep Learning behind Prisma
Ad

Viewers also liked (20)

PDF
K means and dbscan
 
PDF
Kernel Bayes Rule
 
PPTX
Cloud-based Storage, Processing and Rendering for Gegabytes 3D Biomedical Images
 
PDF
Mean shift and Hierarchical clustering
 
PPTX
Visualization using tSNE
 
PPTX
machine learning - Clustering in R
DOC
Clustering overview
PDF
Unidad 9.
PPTX
my fabourite house
PDF
Unidad 5.
PPTX
Transporte em nanoestruturas_3_algumas_consideracoes_fisicas
PPSX
Water conservation
PPTX
Contato Metal-semicondutor
PPTX
Ekologi
PPTX
Evaluation question 6..
DOCX
Asbal
PPTX
O modelo bĂĄsico dos MOSFETs - 3
PPTX
my fabourite house
K means and dbscan
 
Kernel Bayes Rule
 
Cloud-based Storage, Processing and Rendering for Gegabytes 3D Biomedical Images
 
Mean shift and Hierarchical clustering
 
Visualization using tSNE
 
machine learning - Clustering in R
Clustering overview
Unidad 9.
my fabourite house
Unidad 5.
Transporte em nanoestruturas_3_algumas_consideracoes_fisicas
Water conservation
Contato Metal-semicondutor
Ekologi
Evaluation question 6..
Asbal
O modelo bĂĄsico dos MOSFETs - 3
my fabourite house
Ad

Similar to Clustering introduction (20)

PDF
dbscan clusteringdbscan clusteringdbscan clusteringdbscan clustering.pdf
PPTX
Data Mining Lecture_8(a).pptx
PPTX
datamining-lect8a-amachinelearningapproach.pptx
PPT
ClustIII.ppt
 
PPT
multiarmed bandit.ppt
PPT
Clustering_Unsupervised learning Unsupervised learning.ppt
PDF
An Analysis On Clustering Algorithms In Data Mining
PPT
4 DM Clustering ifor computerscience.ppt
PPTX
Part 2
PPTX
Unsupervised%20Learninffffg (2).pptx. application
PPT
Data Mining Concepts and Techniques, Chapter 10. Cluster Analysis: Basic Conc...
PPTX
Evaluation of clustering in data warehouse and data mining.pptx
PPTX
QUALITY AND VALIDITY of cluster analysis in data minig
PDF
QUALITY AND VALIDITY OF CLUSTER ANALYSIS
PPT
DM_clustering.ppt
PPTX
Machine learning algorithms for data mining
PDF
clustering using different methods in .pdf
PPT
Chap8 basic cluster_analysis
PPTX
Cluster
PDF
iiit delhi unsupervised pdf.pdf
dbscan clusteringdbscan clusteringdbscan clusteringdbscan clustering.pdf
Data Mining Lecture_8(a).pptx
datamining-lect8a-amachinelearningapproach.pptx
ClustIII.ppt
 
multiarmed bandit.ppt
Clustering_Unsupervised learning Unsupervised learning.ppt
An Analysis On Clustering Algorithms In Data Mining
4 DM Clustering ifor computerscience.ppt
Part 2
Unsupervised%20Learninffffg (2).pptx. application
Data Mining Concepts and Techniques, Chapter 10. Cluster Analysis: Basic Conc...
Evaluation of clustering in data warehouse and data mining.pptx
QUALITY AND VALIDITY of cluster analysis in data minig
QUALITY AND VALIDITY OF CLUSTER ANALYSIS
DM_clustering.ppt
Machine learning algorithms for data mining
clustering using different methods in .pdf
Chap8 basic cluster_analysis
Cluster
iiit delhi unsupervised pdf.pdf

More from Yan Xu (20)

PPTX
Kaggle winning solutions: Retail Sales Forecasting
 
PDF
Basics of Dynamic programming
 
PPTX
Walking through Tensorflow 2.0
 
PPTX
Practical contextual bandits for business
 
PDF
Introduction to Multi-armed Bandits
 
PDF
A Data-Driven Question Generation Model for Educational Content - by Jack Wang
 
PDF
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
 
PDF
Deep Hierarchical Profiling & Pattern Discovery: Application to Whole Brain R...
 
PDF
Detecting anomalies on rotating equipment using Deep Stacked Autoencoders - b...
 
PDF
Introduction to Autoencoders
 
PPTX
State of enterprise data science
 
PDF
Long Short Term Memory
 
PPTX
Linear algebra and probability (Deep Learning chapter 2&3)
 
PPTX
HML: Historical View and Trends of Deep Learning
 
PDF
Secrets behind AlphaGo
 
PPTX
Optimization in Deep Learning
 
PDF
Introduction to Recurrent Neural Network
 
PDF
Convolutional neural network
 
PDF
Introduction to Neural Network
 
PPTX
Introduction to data integration in bioinformatics
 
Kaggle winning solutions: Retail Sales Forecasting
 
Basics of Dynamic programming
 
Walking through Tensorflow 2.0
 
Practical contextual bandits for business
 
Introduction to Multi-armed Bandits
 
A Data-Driven Question Generation Model for Educational Content - by Jack Wang
 
Deep Learning Approach in Characterizing Salt Body on Seismic Images - by Zhe...
 
Deep Hierarchical Profiling & Pattern Discovery: Application to Whole Brain R...
 
Detecting anomalies on rotating equipment using Deep Stacked Autoencoders - b...
 
Introduction to Autoencoders
 
State of enterprise data science
 
Long Short Term Memory
 
Linear algebra and probability (Deep Learning chapter 2&3)
 
HML: Historical View and Trends of Deep Learning
 
Secrets behind AlphaGo
 
Optimization in Deep Learning
 
Introduction to Recurrent Neural Network
 
Convolutional neural network
 
Introduction to Neural Network
 
Introduction to data integration in bioinformatics
 

Recently uploaded (20)

PDF
Lecture1 pattern recognition............
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PDF
Mega Projects Data Mega Projects Data
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
Database Infoormation System (DBIS).pptx
PDF
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
PPTX
Computer network topology notes for revision
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PDF
Business Analytics and business intelligence.pdf
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
1_Introduction to advance data techniques.pptx
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
Introduction to Knowledge Engineering Part 1
Lecture1 pattern recognition............
oil_refinery_comprehensive_20250804084928 (1).pptx
Mega Projects Data Mega Projects Data
Data_Analytics_and_PowerBI_Presentation.pptx
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
Qualitative Qantitative and Mixed Methods.pptx
Database Infoormation System (DBIS).pptx
Recruitment and Placement PPT.pdfbjfibjdfbjfobj
Computer network topology notes for revision
Introduction-to-Cloud-ComputingFinal.pptx
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Business Analytics and business intelligence.pdf
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
Supervised vs unsupervised machine learning algorithms
1_Introduction to advance data techniques.pptx
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Introduction to Knowledge Engineering Part 1

Clustering introduction

  • 1. Clustering for New Discovery in Data Houston Machine Learning Meetup
  • 2. 2 SCRŠ Roadmap: Method • Tour of machine learning algorithms (1 session) • Feature engineering (1 session) – Feature selection - Yan • Supervised learning (4 sessions) – Regression models -Yan – SVM and kernel SVM - Yan – Tree-based models - Dario – Bayesian method - Xiaoyang – Ensemble models - Yan • Unsupervised learning (3 sessions) – K-means clustering – DBSCAN - Cheng – Mean shift – Agglomerative clustering - Kunal – Dimension reduction for data visualization - Yan • Deep learning (4 sessions) _ Neural network – From neural network to deep learning – Convolutional neural network – Train deep nets with open-source tools
  • 3. 3 SCRŠ Roadmap: Application • Business analytics • Recommendation system • Natural language processing • Computer vision • Energy industry
  • 4. 4 SCRŠ Agenda • Introduction • Application of clustering • K-means • DBSCAN • Cluster validation
  • 5. 5 SCRŠ What is clustering Clustering: to discover the natural groupings of a set of objects/patterns in the unlabeled data
  • 8. 8 SCRŠ Application: Pizza Hut Center Delivery locations
  • 9. 9 SCRŠ Application: Discovering Gene functions Important to discover diseases and treatment
  • 10. 10 SCRŠ Clustering Algorithm • K-Means (King of clustering, many variants) • DBSCAN (group neighboring points) • Mean shift (locating the maxima of density) • Spectral clustering (cares about connectivity instead of proximity) • Hierarchical clustering (a hierarchical structure, multiple levels) • Expectation Maximization (k-means is a variant of EM) • Latent Dirichlet Allocation (natural language processing) ……
  • 13. 13 SCRŠ Cluster Validity • For cluster analysis, the question is how to evaluate the “goodness” of the resulting clusters? • Then why do we want to evaluate them? – To avoid finding patterns in noise – To compare clustering algorithms – To determine the optimal number of clusters
  • 14. 14 SCRŠ Cluster Validity • Numerical measures: – External: Used to measure the extent to which cluster labels match externally supplied class labels. • Entropy – Internal: Used to measure the goodness of a clustering structure without respect to external information. • Sum of Squared Error (SSE) – Relative: Used to compare two different clusterings. • Often an external or internal measurement is used for this function, e.g., SSE or entropy • Visualization
  • 15. 15 SCRŠ Internal Measures: WSE and BSE • Cluster Cohesion: Measures how closely related are objects in a cluster – Example: SSE • Cluster Separation: Measure how distinct or well-separated a cluster is from other clusters • Example: Squared Error – Cohesion is measured by the within cluster sum of squares (SSE) – Separation is measured by the between cluster sum of squares – Where |Ci| is the size of cluster i     i Cx i i mxWSS 2 )(   i ii mmCBSS 2 )(
  • 16. 16 SCRŠ Internal Measures: WSE and BSE • Example: SSE – BSS + WSS = constant 1091 9)35.4(2)5.13(2 1)5.45()5.44()5.12()5.11( 22 2222    Total BSS WSS 1 2 3 4 5   m1 m2 m K=2 clusters: 10010 0)33(4 10)35()34()32()31( 2 2222    Total BSS WSSK=1 cluster:
  • 17. 17 SCRŠ Internal Measures: WSE and BSE • Can be used to estimate the number of clusters 2 5 10 15 20 25 30 0 1 2 3 4 5 6 7 8 9 10 KSSE5 10 15 -6 -4 -2 0 2 4 6 WSS
  • 18. 18 SCRŠ Internal Measures: Proximity graph measures • Cluster cohesion is the sum of the weight of all links within a cluster. • Cluster separation is the sum of the weights between nodes in the cluster and nodes outside the cluster. cohesion separation
  • 19. 19 SCRŠ Correlation between affinity matrix and incidence matrix • Given affinity distance matrix D = {d11,d12, …, dnn } Incidence matrix C= { c11, c12,…, cnn } from clustering • Correlation r between D and C is given by         n ji ij n ji ij n ji ijij ccdd ccdd r 1,1 2 _ 1,1 2 _ 1,1 __ )()( ))((
  • 20. 20 SCRŠ Correlation with Incidence matrix         n ji ij n ji ij n ji ijij ccdd ccdd r 1,1 2 _ 1,1 2 _ 1,1 __ )()( ))(( 0 0.2 0.4 0.6 0.8 1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 x y 0 0.2 0.4 0.6 0.8 1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 x y r = -0.9235 r = -0.5810
  • 21. 21 SCRŠ Visualization of similarity matrix • Order the similarity matrix with respect to cluster labels and inspect visually. 0 0.2 0.4 0.6 0.8 1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 x y Points Points 20 40 60 80 100 10 20 30 40 50 60 70 80 90 100 Similarity 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
  • 22. 22 SCRŠ • Clusters in random data are not so crisp Points Points 20 40 60 80 100 10 20 30 40 50 60 70 80 90 100 Similarity 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0 0.2 0.4 0.6 0.8 1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 x y Visualization of similarity matrix
  • 23. 23 SCRŠ Final Comment on Cluster Validity “The validation of clustering structures is the most difficult and frustrating part of cluster analysis. Without a strong effort in this direction, cluster analysis will remain a black art accessible only to those true believers who have experience and great courage.” Algorithms for Clustering Data, Jain and Dubes
  • 24. 24 SCRŠ Roadmap: Method • Tour of machine learning algorithms (1 session) • Feature engineering (1 session) – Feature selection - Yan • Supervised learning (4 sessions) – Regression models -Yan – SVM and kernel SVM - Yan – Tree-based models - Dario – Bayesian method - Xiaoyang – Ensemble models - Yan • Unsupervised learning (3 sessions) – K-means clustering – DBSCAN - Cheng – Mean shift – Hierarchical clustering - Kunal – Dimension reduction for data visualization - Yan • Deep learning (4 sessions) _ Neural network – From neural network to deep learning - Yan – Convolutional neural network – Train deep nets with open-source tools
  • 25. 25 SCRŠ Thank you Slides will be posted on slide share: http://www.slideshare.net/xuyangela