SlideShare a Scribd company logo
4
Most read
6
Most read
10
Most read
OPTICS: Ordering Points To
Identify the Clustering Structure
Presented By:
Rohit Paul
 Disadvantages of DBSCAN
 Requires two user inputs(Eps and MinPts)
 Unable to determine variable density cluster
 OPTICS:
 Able to get variable density cluster
 Mainly requires one inputs (i.e, MinPts)
 Eps can be considered as ‘infinite’
Idea
 Creates an augmented ordering of the database
representing its density-based clustering structure
 Help us gain a high level understanding of the way data
is structured
Observation
 For a constant MinPts value, density-based
clusters with higher density are completely
contained in density-connected sets with respect
to a lower density.
 Extend the DBSCAN algorithm such that several
distance parameters are processed at the same
time
OPTICS
 An infinite number of distance parameters eps’
which are smaller than a “generating distance”
eps (i.e. 0 <= eps’ <= eps).
 Order is stored in which the objects are
processed and the information which would be
used by an extended DBSCAN algorithm to
assign cluster memberships
 This information consists of only two values for
each object: the core-distance and a reachability-
distance
Terminology
 Core distance of an object p:
 The core-distance of an object p is simply the smallest
distance eps’ between p and an object in its e-
neighborhood such that p would be a core object with
respect to eps’ if this neighbor is contained in Ne(p).
 Reachability-distance object p w.r.t. object o:
 Reachability-distance of an object p with respect to
another object o is the smallest
distance such that p is directly
density-reachable from o if o is
a core object
Algorithm
 FOR i FROM 1 TO SetOfObjects.size DO
 IF NOT Object.Processed THEN
1. neighbors := SetOfObjects.neighbors(Object, e);
2. Object.Processed := TRUE;
3. Object.reachability_distance := UNDEFINED;
4. Object.setCoreDistance(neighbors, e, MinPts);
5. OrderedFile.write(Object);
6. IF Object.core_distance <> UNDEFINED THEN
 OrderSeeds.update(neighbors, Object);
 WHILE NOT orderSeeds.empty() DO
 Repeat Step 1, 2, 4, 5 and 6
 If reachability-distance of the current object
Object is larger than the clustering-distance eps’
 Object is not density-reachable from any of the objects
which are located before the current object in the
cluster-ordering.
 We look at the core-distance of Object and start a new
cluster if Object is a core object with respect to eps’
and MinPts; otherwise, Object is assigned to NOISE
 If reachability-distance of the current object is
smaller than eps’
 Can simply assign this object to the current cluster
because then it is density-reachable from a preceding
core object in the cluster-ordering.
Reachability plot insensitive to input
parameter
• The smaller the Eps
value, the more
objects have an
UNDEFINED
reachability-distance
• Lower values MinPts
reachability-plot looks
more jagged and
higher value
smoothen the curve.
Thank You

More Related Content

PDF
Optics ordering points to identify the clustering structure
PPT
Alpaydin - Chapter 2
PDF
Clustering: A Survey
PPT
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 5
PDF
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
PPTX
PPT
hierarchical_planning.ppt
PPTX
Dendral
Optics ordering points to identify the clustering structure
Alpaydin - Chapter 2
Clustering: A Survey
Data Mining: Concepts and Techniques (3rd ed.) — Chapter 5
Graph Signal Processing for Machine Learning A Review and New Perspectives - ...
hierarchical_planning.ppt
Dendral

What's hot (20)

PDF
Artificial Neural Network Lecture 6- Associative Memories & Discrete Hopfield...
PPTX
Artificial Intelligence Notes Unit 3
PPTX
Multilayer perceptron
PDF
16 2 predicate resolution
PPT
Iterative deepening search
PPTX
Hierarchical clustering.pptx
PPTX
Competitive Learning [Deep Learning And Nueral Networks].pptx
PPTX
introduction to machine learning
PDF
I. FSSP(Progression Planner) II. BSSP(Regression Planner
PPTX
Recent Progress on Object Detection_20170331
PDF
Machine Learning - Object Detection and Classification
PPT
Data cleaning-outlier-detection
PPTX
Moving object detection in video surveillance
PPT
Pattern Recognition
PPTX
Probabilistic Reasoning
PDF
Uncertainty Estimation in Deep Learning
PPTX
Reasoning in AI
PDF
Bayesian inference
PDF
Uncertainty in Deep Learning
Artificial Neural Network Lecture 6- Associative Memories & Discrete Hopfield...
Artificial Intelligence Notes Unit 3
Multilayer perceptron
16 2 predicate resolution
Iterative deepening search
Hierarchical clustering.pptx
Competitive Learning [Deep Learning And Nueral Networks].pptx
introduction to machine learning
I. FSSP(Progression Planner) II. BSSP(Regression Planner
Recent Progress on Object Detection_20170331
Machine Learning - Object Detection and Classification
Data cleaning-outlier-detection
Moving object detection in video surveillance
Pattern Recognition
Probabilistic Reasoning
Uncertainty Estimation in Deep Learning
Reasoning in AI
Bayesian inference
Uncertainty in Deep Learning
Ad

Similar to Optics (20)

PPTX
density based method and expectation maximization
PPTX
Could a Data Science Program use Data Science Insights?
PPTX
Density based clustering
PPT
instance bases k nearest neighbor algorithm.ppt
PPT
3.4 density and grid methods
PPT
[PPT]
PDF
A framework for outlier detection in
PDF
A study on rough set theory based
PDF
Data Mining: Cluster Analysis
PDF
Computer Vision: Visual Extent of an Object
PDF
Ijartes v1-i2-006
PPTX
Dbscan
PDF
Clustering Algorithms for Data Stream
PPTX
Clique and sting
DOCX
PDF
UNIT V.pdf
PDF
Convolution neural networks
PDF
Reduct generation for the incremental data using rough set theory
PDF
ML_Unit_IV_Clustering in Machine Learning.pdf
density based method and expectation maximization
Could a Data Science Program use Data Science Insights?
Density based clustering
instance bases k nearest neighbor algorithm.ppt
3.4 density and grid methods
[PPT]
A framework for outlier detection in
A study on rough set theory based
Data Mining: Cluster Analysis
Computer Vision: Visual Extent of an Object
Ijartes v1-i2-006
Dbscan
Clustering Algorithms for Data Stream
Clique and sting
UNIT V.pdf
Convolution neural networks
Reduct generation for the incremental data using rough set theory
ML_Unit_IV_Clustering in Machine Learning.pdf
Ad

Recently uploaded (20)

PPT
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
PPTX
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
PPTX
Business Acumen Training GuidePresentation.pptx
PPTX
climate analysis of Dhaka ,Banglades.pptx
PDF
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PDF
Clinical guidelines as a resource for EBP(1).pdf
PPT
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
PPTX
Database Infoormation System (DBIS).pptx
PPTX
STUDY DESIGN details- Lt Col Maksud (21).pptx
PPTX
Global journeys: estimating international migration
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
PPTX
Moving the Public Sector (Government) to a Digital Adoption
PPTX
Data_Analytics_and_PowerBI_Presentation.pptx
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
PDF
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf
Chapter 2 METAL FORMINGhhhhhhhjjjjmmmmmmmmm
advance b rammar.pptxfdgdfgdfsgdfgsdgfdfgdfgsdfgdfgdfg
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Business Acumen Training GuidePresentation.pptx
climate analysis of Dhaka ,Banglades.pptx
“Getting Started with Data Analytics Using R – Concepts, Tools & Case Studies”
Galatica Smart Energy Infrastructure Startup Pitch Deck
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Clinical guidelines as a resource for EBP(1).pdf
Chapter 3 METAL JOINING.pptnnnnnnnnnnnnn
Database Infoormation System (DBIS).pptx
STUDY DESIGN details- Lt Col Maksud (21).pptx
Global journeys: estimating international migration
IB Computer Science - Internal Assessment.pptx
05. PRACTICAL GUIDE TO MICROSOFT EXCEL.pptx
Moving the Public Sector (Government) to a Digital Adoption
Data_Analytics_and_PowerBI_Presentation.pptx
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
TRAFFIC-MANAGEMENT-AND-ACCIDENT-INVESTIGATION-WITH-DRIVING-PDF-FILE.pdf

Optics

  • 1. OPTICS: Ordering Points To Identify the Clustering Structure Presented By: Rohit Paul
  • 2.  Disadvantages of DBSCAN  Requires two user inputs(Eps and MinPts)  Unable to determine variable density cluster  OPTICS:  Able to get variable density cluster  Mainly requires one inputs (i.e, MinPts)  Eps can be considered as ‘infinite’
  • 3. Idea  Creates an augmented ordering of the database representing its density-based clustering structure  Help us gain a high level understanding of the way data is structured
  • 4. Observation  For a constant MinPts value, density-based clusters with higher density are completely contained in density-connected sets with respect to a lower density.  Extend the DBSCAN algorithm such that several distance parameters are processed at the same time
  • 5. OPTICS  An infinite number of distance parameters eps’ which are smaller than a “generating distance” eps (i.e. 0 <= eps’ <= eps).  Order is stored in which the objects are processed and the information which would be used by an extended DBSCAN algorithm to assign cluster memberships  This information consists of only two values for each object: the core-distance and a reachability- distance
  • 6. Terminology  Core distance of an object p:  The core-distance of an object p is simply the smallest distance eps’ between p and an object in its e- neighborhood such that p would be a core object with respect to eps’ if this neighbor is contained in Ne(p).  Reachability-distance object p w.r.t. object o:  Reachability-distance of an object p with respect to another object o is the smallest distance such that p is directly density-reachable from o if o is a core object
  • 7. Algorithm  FOR i FROM 1 TO SetOfObjects.size DO  IF NOT Object.Processed THEN 1. neighbors := SetOfObjects.neighbors(Object, e); 2. Object.Processed := TRUE; 3. Object.reachability_distance := UNDEFINED; 4. Object.setCoreDistance(neighbors, e, MinPts); 5. OrderedFile.write(Object); 6. IF Object.core_distance <> UNDEFINED THEN  OrderSeeds.update(neighbors, Object);  WHILE NOT orderSeeds.empty() DO  Repeat Step 1, 2, 4, 5 and 6
  • 8.  If reachability-distance of the current object Object is larger than the clustering-distance eps’  Object is not density-reachable from any of the objects which are located before the current object in the cluster-ordering.  We look at the core-distance of Object and start a new cluster if Object is a core object with respect to eps’ and MinPts; otherwise, Object is assigned to NOISE  If reachability-distance of the current object is smaller than eps’  Can simply assign this object to the current cluster because then it is density-reachable from a preceding core object in the cluster-ordering.
  • 9. Reachability plot insensitive to input parameter • The smaller the Eps value, the more objects have an UNDEFINED reachability-distance • Lower values MinPts reachability-plot looks more jagged and higher value smoothen the curve.