SlideShare a Scribd company logo
2
Most read
3
Most read
4
Most read
Grid based method & model based clustering method
 INTRODUCTION
 STING
 WAVECLUSTER
 CLIQUE-Clustering in QUEST
 FAST PROCESSING TIME
 The grid based clustering approach uses a multi
resolution grid data structure.
 The object space is quantized into finite number
of cells that form a grid structure.
 The major advantage of this method is fast
processing time.
 It is dependent only on the number of cells in
each dimension in the quantized space.
 Statistical information GRID.
 Spatial area is divided into rectangular cells
 Several levels of cells-at different levels of
resolution
 High level cell is partitioned into several
lower level cells.
 Statistical attributes are stored in cell.
(mean , maximum , minimum)
 Computation is query independent
 Parallel processing-supported.
 Data is processed in a single pass
 Quality depends on granuerily
Grid based method & model based clustering method
 A multi-resolution clustering approach which
applies wavelet transform to the feature space
 A wavelet transform is a signal processing
technique that decomposes a signal into different
frequency sub-band
 Both grid-based and density-based
 Input parameters:
 # of cells for each dimension
 The wavelet , and the # of application wavelet
transform.
Grid based method & model based clustering method
 Complexity O(N)
 Detect arbitrary shaped clusters at different
scales.
 Not sensitive to noise , not sensitive to input
order.
 Only applicable to low dimensional data.
CLIQUE can be considered as both density-
based and grid-based
1.It partitions each dimension into the same number
of equal length interval.
2.It partitions an m-dimensional data space into
non-overlapping rectangular units.
3.A unit is dense if the fraction of total data points
contained in the unit exceeds the input model
parameter.
4.A cluster is a maximal set of connected dense units
within a subspace.
 Attempt to optimize the fit between the data
and some mathematical model.
 ASSUMPTION:-data are generated by a
mixture of underlying portability distributes.
 TECHNIQUES:
 expectation-maximization
 Conceptual clustering
 Neural networks approach
 ITERATIVE REFINEMENT ALGORITHM-
used to find parameter estimates
EXTENSION OF K-MEANS
 Assigns an object to a cluster according to a
weight representing portability of
membership.
 Initial estimate of parameters
 Iteratively reassigns scores.
 A form of clustering in machine learning
 Produces a classification scheme for a set of
unlabeled objects.
 Finds characteristics description for each concept
 COBWEB
 A popular and simple method of incremental
conceptual learning.
 Creates a hierarchical clustering in the form of a
classification tree.
Animal
P(Co)=1.0
P(scales | Co)=0.25
Fish
P(C1)=0.25
P(scales|C1)=
1.0
Amphibian
P(C2)=0.25
P(moist|C2)=1.
0
Mammal/bird
P(C3)=0.5
P(hair|C3)=0.
5
Mammal
P(C4)=0.5
P(hair|C4)=1
.0
Bird
P(C5)=0.5
P(feathers|c5
)=1.0
 Represent each cluster as an exemplar , acting as
a “prototype” of the cluster.
 New objects are distributed to the cluster whose
exemplar is the most similar according to some
distance measure.
SELF ORGANIZING MAP
 Competitive learning
 Involves a hierarchical architecture of several
units
 Organization of units-forms a feature map
 Web document clustering.
FEATURE TRANSFORMATION METHODS
 PCA , SVD-Summarize data by creating linear
combinations of attributes.
 But do not remove any attributes ;
transformed attributes-complex to interpret
FEATURE SELECTION METHODS
 Most relevant of attributes with represent to
class labels
 Entropy analysis .

More Related Content

PPTX
Java swing
PPTX
Geographic Routing in WSN
PPT
Ll(1) Parser in Compilers
PPTX
MACHINE LEARNING - GENETIC ALGORITHM
PPTX
Predictive Analytics - An Overview
PPT
3.5 model based clustering
PPTX
Multilayer perceptron
PPT
3.2 partitioning methods
Java swing
Geographic Routing in WSN
Ll(1) Parser in Compilers
MACHINE LEARNING - GENETIC ALGORITHM
Predictive Analytics - An Overview
3.5 model based clustering
Multilayer perceptron
3.2 partitioning methods

What's hot (20)

PPT
K mean-clustering algorithm
PPTX
Raspberry Pi
PPT
1.8 discretization
PPTX
Image restoration and degradation model
PPTX
Physical design of io t
PDF
Region Splitting and Merging Technique For Image segmentation.
PPTX
IOT - Design Principles of Connected Devices
PPTX
Graph coloring using backtracking
PDF
Design principle of pattern recognition system and STATISTICAL PATTERN RECOGN...
PPTX
Data mining Measuring similarity and desimilarity
PPTX
Hadoop And Their Ecosystem ppt
PPTX
Mining single dimensional boolean association rules from transactional
PDF
Internet of Things - module 1
PPTX
Dynamic Itemset Counting
PDF
Hybrid wireless protocols
PPTX
Image compression models
PPTX
Structure of agents
PPSX
Fuzzy expert system
PPTX
Clustering in Data Mining
PPTX
Artificial neural network
K mean-clustering algorithm
Raspberry Pi
1.8 discretization
Image restoration and degradation model
Physical design of io t
Region Splitting and Merging Technique For Image segmentation.
IOT - Design Principles of Connected Devices
Graph coloring using backtracking
Design principle of pattern recognition system and STATISTICAL PATTERN RECOGN...
Data mining Measuring similarity and desimilarity
Hadoop And Their Ecosystem ppt
Mining single dimensional boolean association rules from transactional
Internet of Things - module 1
Dynamic Itemset Counting
Hybrid wireless protocols
Image compression models
Structure of agents
Fuzzy expert system
Clustering in Data Mining
Artificial neural network
Ad

Similar to Grid based method & model based clustering method (20)

PPTX
Data Mining: clustering and analysis
PPTX
Data Mining: clustering and analysis
PDF
Feature Subset Selection for High Dimensional Data Using Clustering Techniques
PDF
CLUSTERING IN DATA MINING.pdf
PDF
Chapter 5.pdf
PDF
Volume 2-issue-6-2143-2147
PDF
Volume 2-issue-6-2143-2147
PDF
An Efficient Clustering Method for Aggregation on Data Fragments
PDF
A survey on Efficient Enhanced K-Means Clustering Algorithm
PPT
ClustIII.ppt
PPT
dm_clustering2.ppt
PDF
Clustering Using Shared Reference Points Algorithm Based On a Sound Data Model
PDF
Data Mining: Cluster Analysis
PDF
Paper id 26201478
PDF
A Density Based Clustering Technique For Large Spatial Data Using Polygon App...
PDF
Ir3116271633
PDF
The improved k means with particle swarm optimization
PPTX
K- means clustering method based Data Mining of Network Shared Resources .pptx
PPTX
K- means clustering method based Data Mining of Network Shared Resources .pptx
PDF
A Study of Efficiency Improvements Technique for K-Means Algorithm
Data Mining: clustering and analysis
Data Mining: clustering and analysis
Feature Subset Selection for High Dimensional Data Using Clustering Techniques
CLUSTERING IN DATA MINING.pdf
Chapter 5.pdf
Volume 2-issue-6-2143-2147
Volume 2-issue-6-2143-2147
An Efficient Clustering Method for Aggregation on Data Fragments
A survey on Efficient Enhanced K-Means Clustering Algorithm
ClustIII.ppt
dm_clustering2.ppt
Clustering Using Shared Reference Points Algorithm Based On a Sound Data Model
Data Mining: Cluster Analysis
Paper id 26201478
A Density Based Clustering Technique For Large Spatial Data Using Polygon App...
Ir3116271633
The improved k means with particle swarm optimization
K- means clustering method based Data Mining of Network Shared Resources .pptx
K- means clustering method based Data Mining of Network Shared Resources .pptx
A Study of Efficiency Improvements Technique for K-Means Algorithm
Ad

More from rajshreemuthiah (20)

PPTX
PPTX
PPTX
PPTX
polymorphism
PPTX
solutions and understanding text analytics
PPTX
interface
PPTX
Testing &ampdebugging
PPTX
concurrency control
PPTX
Education
PPTX
Formal verification
PPTX
Transaction management
PPTX
Multi thread
PPTX
System testing
PPTX
software maintenance
PPTX
exception handling
PPTX
e governance
PPTX
recovery management
PPTX
Implementing polymorphism
PPSX
Buffer managements
PPTX
os linux
polymorphism
solutions and understanding text analytics
interface
Testing &ampdebugging
concurrency control
Education
Formal verification
Transaction management
Multi thread
System testing
software maintenance
exception handling
e governance
recovery management
Implementing polymorphism
Buffer managements
os linux

Recently uploaded (20)

PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
A Presentation on Artificial Intelligence
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Electronic commerce courselecture one. Pdf
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Cloud computing and distributed systems.
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
DOCX
The AUB Centre for AI in Media Proposal.docx
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Building Integrated photovoltaic BIPV_UPV.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
A Presentation on Artificial Intelligence
“AI and Expert System Decision Support & Business Intelligence Systems”
Bridging biosciences and deep learning for revolutionary discoveries: a compr...
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Diabetes mellitus diagnosis method based random forest with bat algorithm
Electronic commerce courselecture one. Pdf
Reach Out and Touch Someone: Haptics and Empathic Computing
Per capita expenditure prediction using model stacking based on satellite ima...
Encapsulation_ Review paper, used for researhc scholars
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Cloud computing and distributed systems.
Unlocking AI with Model Context Protocol (MCP)
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Understanding_Digital_Forensics_Presentation.pptx
Network Security Unit 5.pdf for BCA BBA.
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
The AUB Centre for AI in Media Proposal.docx

Grid based method & model based clustering method

  • 2.  INTRODUCTION  STING  WAVECLUSTER  CLIQUE-Clustering in QUEST  FAST PROCESSING TIME
  • 3.  The grid based clustering approach uses a multi resolution grid data structure.  The object space is quantized into finite number of cells that form a grid structure.  The major advantage of this method is fast processing time.  It is dependent only on the number of cells in each dimension in the quantized space.
  • 4.  Statistical information GRID.  Spatial area is divided into rectangular cells  Several levels of cells-at different levels of resolution  High level cell is partitioned into several lower level cells.  Statistical attributes are stored in cell. (mean , maximum , minimum)
  • 5.  Computation is query independent  Parallel processing-supported.  Data is processed in a single pass  Quality depends on granuerily
  • 7.  A multi-resolution clustering approach which applies wavelet transform to the feature space  A wavelet transform is a signal processing technique that decomposes a signal into different frequency sub-band  Both grid-based and density-based  Input parameters:  # of cells for each dimension  The wavelet , and the # of application wavelet transform.
  • 9.  Complexity O(N)  Detect arbitrary shaped clusters at different scales.  Not sensitive to noise , not sensitive to input order.  Only applicable to low dimensional data.
  • 10. CLIQUE can be considered as both density- based and grid-based 1.It partitions each dimension into the same number of equal length interval. 2.It partitions an m-dimensional data space into non-overlapping rectangular units. 3.A unit is dense if the fraction of total data points contained in the unit exceeds the input model parameter. 4.A cluster is a maximal set of connected dense units within a subspace.
  • 11.  Attempt to optimize the fit between the data and some mathematical model.  ASSUMPTION:-data are generated by a mixture of underlying portability distributes.  TECHNIQUES:  expectation-maximization  Conceptual clustering  Neural networks approach
  • 12.  ITERATIVE REFINEMENT ALGORITHM- used to find parameter estimates EXTENSION OF K-MEANS  Assigns an object to a cluster according to a weight representing portability of membership.  Initial estimate of parameters  Iteratively reassigns scores.
  • 13.  A form of clustering in machine learning  Produces a classification scheme for a set of unlabeled objects.  Finds characteristics description for each concept  COBWEB  A popular and simple method of incremental conceptual learning.  Creates a hierarchical clustering in the form of a classification tree.
  • 15.  Represent each cluster as an exemplar , acting as a “prototype” of the cluster.  New objects are distributed to the cluster whose exemplar is the most similar according to some distance measure. SELF ORGANIZING MAP  Competitive learning  Involves a hierarchical architecture of several units  Organization of units-forms a feature map  Web document clustering.
  • 16. FEATURE TRANSFORMATION METHODS  PCA , SVD-Summarize data by creating linear combinations of attributes.  But do not remove any attributes ; transformed attributes-complex to interpret FEATURE SELECTION METHODS  Most relevant of attributes with represent to class labels  Entropy analysis .