SlideShare a Scribd company logo
3
Most read
4
Most read
7
Most read
CLUSTER ANALYSIS

 PREPARED BY SABA KHAN
PRESENTED TO IMTIAZ ARIF
        ID 4640
What is Cluster Analysis?
     It is a descriptive analysis technique which groups
     objects (respondents, products, firms, variables,
     etc.) so that each object is similar to the other
     objects in the cluster and different from objects in
     all the other clusters.




2
What is Cluster Analysis?
 Cluster: a collection of data objects
   Similar to one another within the same cluster
   Dissimilar to the objects in other clusters


 Cluster analysis
   Finding similarities between data according to the
   characteristics found in the data and grouping
   similar data objects into clusters
When to use cluster analysis?
     The essence of all clustering approaches is the classification of
        data as suggested by “natural” groupings of the data themselves.
     Simply put when you desire the following then use
        Cluster analysis.
          Taxonomy development(segmentation)
          Data simplification
          Relationship identification
         Applications.
     It is used to segment the market in Marketing, used in
        social networking sites in making new groups based on
        users data, Flickr’s map of photos and other map sites
        use clustering to reduce the number of markers on a
        map.
4
    
Examples of Clustering Applications

 • Marketing: Help marketers discover distinct groups in their
customer bases, and then use this knowledge to develop
targeted marketing programs.
 • Land use: Identification of areas of similar land use in an
earth observation database.
 • Insurance: Identifying groups of motor insurance policy
holders with a high average claim cost.
 • City-planning: Identifying groups of houses according to
their house type, value, and geographical location.
 • Earth-quake studies: Observed earth quake epicenters
  should be clustered along continent faults
Assumptions for Cluster Analysis.
     Sufficient size is needed to ensure representativeness of
        the population and its underlying structure, particularly
        small groups within the population.
       Outliers can severely distort the representativeness of the
        results if they appear as structure (clusters) that are
        inconsistent with the research objectives
       Representativeness of the sample. The sample must
        represent the research question.
       Impact of multicollinearity. Input variables should be
        examined for substantial multicollinearity and if present:
       Reduce the variables to equal numbers in each set of
        correlated measures.


6
HOW TO DEFINE
CLUSTERS
   CLUSTER       CLUSTER
   A             B




             1

             2

             3
We will now go to SPSS for
     analysis.

      Retrieve judges.sav
      Analyze  classify  Hierarchical cluster
      All variables.




10

More Related Content

PPTX
Cluster analysis
PDF
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
PPTX
Strategies to achieve Sustainable Development
PPT
Business research method ch 1 zikmund_Research
PPTX
Introduction to data science
PPTX
Cluster analysis
PPTX
Deep learning
PPTX
Parametric tests
Cluster analysis
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Strategies to achieve Sustainable Development
Business research method ch 1 zikmund_Research
Introduction to data science
Cluster analysis
Deep learning
Parametric tests

What's hot (20)

PPTX
Factor analysis
PPTX
Discriminant analysis
PPTX
Multivariate data analysis
PPTX
Correlation and regression analysis
PPTX
Factor Analysis in Research
PPTX
discriminant analysis
PPT
Discriminant analysis
PDF
Logistic Ordinal Regression
PPTX
Introduction to principal component analysis (pca)
PPTX
Cluster Analysis
PDF
Multivariate Analysis
PPT
Multivariate Analysis Techniques
PPTX
Regression analysis
PPTX
Univariate & bivariate analysis
PPTX
PPT
Introduction to spss
PPTX
Descriptive statistics
PPT
Regression analysis ppt
DOCX
Estimation in statistics
Factor analysis
Discriminant analysis
Multivariate data analysis
Correlation and regression analysis
Factor Analysis in Research
discriminant analysis
Discriminant analysis
Logistic Ordinal Regression
Introduction to principal component analysis (pca)
Cluster Analysis
Multivariate Analysis
Multivariate Analysis Techniques
Regression analysis
Univariate & bivariate analysis
Introduction to spss
Descriptive statistics
Regression analysis ppt
Estimation in statistics
Ad

Viewers also liked (8)

PDF
Three case studies deploying cluster analysis
PPT
Human aspect of project
PPT
Clustering
PDF
Project delay and_cost_overrun-libre
PPTX
Time overruns
PPTX
Types of clustering and different types of clustering algorithms
PPTX
Clustering in Data Mining
PPT
Test of hypothesis
Three case studies deploying cluster analysis
Human aspect of project
Clustering
Project delay and_cost_overrun-libre
Time overruns
Types of clustering and different types of clustering algorithms
Clustering in Data Mining
Test of hypothesis
Ad

Similar to Cluster analysis (20)

PPTX
pratik meshram-Unit 5 (contemporary mkt r sch)
PPTX
QUALITY AND VALIDITY of cluster analysis in data minig
PDF
QUALITY AND VALIDITY OF CLUSTER ANALYSIS
PPTX
CLuster analysis presentation.pptx
DOCX
Cluster analysis (2).docx
PPTX
Program_Cluster_Analysis
PDF
It is a presentation on machine learning
PDF
EXPLORING DATA MINING TECHNIQUES AND ITS APPLICATIONS
PDF
EXPLORING DATA MINING TECHNIQUES AND ITS APPLICATIONS
PDF
4.Unit 4 ML Q&A.pdf machine learning qb
PPTX
Ahhbsdnfmfmfmdbshehwheheheheh3hehehehebq
PPTX
Hierarchical Clustering in Data Mining
PPTX
Artificial Intelligence Clustering lecture
PPTX
Clusteranalysis 121206234137-phpapp01
PPTX
Clusteranalysis
PPTX
Read first few slides cluster analysis
PPTX
PPTX
Presentation on K-Means Clustering
PPTX
For iiii year students of cse ML-UNIT-V.pptx
PDF
Unsupervised Learning in Machine Learning
pratik meshram-Unit 5 (contemporary mkt r sch)
QUALITY AND VALIDITY of cluster analysis in data minig
QUALITY AND VALIDITY OF CLUSTER ANALYSIS
CLuster analysis presentation.pptx
Cluster analysis (2).docx
Program_Cluster_Analysis
It is a presentation on machine learning
EXPLORING DATA MINING TECHNIQUES AND ITS APPLICATIONS
EXPLORING DATA MINING TECHNIQUES AND ITS APPLICATIONS
4.Unit 4 ML Q&A.pdf machine learning qb
Ahhbsdnfmfmfmdbshehwheheheheh3hehehehebq
Hierarchical Clustering in Data Mining
Artificial Intelligence Clustering lecture
Clusteranalysis 121206234137-phpapp01
Clusteranalysis
Read first few slides cluster analysis
Presentation on K-Means Clustering
For iiii year students of cse ML-UNIT-V.pptx
Unsupervised Learning in Machine Learning

More from saba khan (6)

PPTX
Training_Self Assessment Report
PPTX
PPTX
Regression analysis
PPTX
Logistic regression
PPTX
Correspondence analysis final
PPTX
Conjoint ppt final one
Training_Self Assessment Report
Regression analysis
Logistic regression
Correspondence analysis final
Conjoint ppt final one

Recently uploaded (20)

PPTX
MYSQL Presentation for SQL database connectivity
PPT
Teaching material agriculture food technology
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Encapsulation theory and applications.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Electronic commerce courselecture one. Pdf
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PPTX
Cloud computing and distributed systems.
PDF
Machine learning based COVID-19 study performance prediction
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Approach and Philosophy of On baking technology
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Modernizing your data center with Dell and AMD
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
MYSQL Presentation for SQL database connectivity
Teaching material agriculture food technology
Spectral efficient network and resource selection model in 5G networks
NewMind AI Monthly Chronicles - July 2025
Encapsulation theory and applications.pdf
NewMind AI Weekly Chronicles - August'25 Week I
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Electronic commerce courselecture one. Pdf
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Cloud computing and distributed systems.
Machine learning based COVID-19 study performance prediction
Reach Out and Touch Someone: Haptics and Empathic Computing
Approach and Philosophy of On baking technology
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Modernizing your data center with Dell and AMD
Encapsulation_ Review paper, used for researhc scholars
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx

Cluster analysis

  • 1. CLUSTER ANALYSIS PREPARED BY SABA KHAN PRESENTED TO IMTIAZ ARIF ID 4640
  • 2. What is Cluster Analysis?  It is a descriptive analysis technique which groups objects (respondents, products, firms, variables, etc.) so that each object is similar to the other objects in the cluster and different from objects in all the other clusters. 2
  • 3. What is Cluster Analysis?  Cluster: a collection of data objects  Similar to one another within the same cluster  Dissimilar to the objects in other clusters  Cluster analysis  Finding similarities between data according to the characteristics found in the data and grouping similar data objects into clusters
  • 4. When to use cluster analysis?  The essence of all clustering approaches is the classification of data as suggested by “natural” groupings of the data themselves.  Simply put when you desire the following then use Cluster analysis.  Taxonomy development(segmentation)  Data simplification  Relationship identification Applications.  It is used to segment the market in Marketing, used in social networking sites in making new groups based on users data, Flickr’s map of photos and other map sites use clustering to reduce the number of markers on a map. 4 
  • 5. Examples of Clustering Applications  • Marketing: Help marketers discover distinct groups in their customer bases, and then use this knowledge to develop targeted marketing programs.  • Land use: Identification of areas of similar land use in an earth observation database.  • Insurance: Identifying groups of motor insurance policy holders with a high average claim cost.  • City-planning: Identifying groups of houses according to their house type, value, and geographical location.  • Earth-quake studies: Observed earth quake epicenters should be clustered along continent faults
  • 6. Assumptions for Cluster Analysis.  Sufficient size is needed to ensure representativeness of the population and its underlying structure, particularly small groups within the population.  Outliers can severely distort the representativeness of the results if they appear as structure (clusters) that are inconsistent with the research objectives  Representativeness of the sample. The sample must represent the research question.  Impact of multicollinearity. Input variables should be examined for substantial multicollinearity and if present:  Reduce the variables to equal numbers in each set of correlated measures. 6
  • 7. HOW TO DEFINE CLUSTERS CLUSTER CLUSTER A B 1 2 3
  • 8. We will now go to SPSS for analysis. Retrieve judges.sav Analyze  classify  Hierarchical cluster All variables. 10