SlideShare a Scribd company logo
Cluster Analysis


By Rama Krishna Kompella
Introduction
• In order to go from data to information, to
  knowledge and to wisdom, we need to
  reduce the complexity of the data.
• Complexity can be reduced on
  – case level : cluster analysis
  – on variable level: factor analysis
Cluster Anlaysis
• Cluster analysis is another interdependence multivariate
  method
• As the name implies, the basic purpose of cluster
  analysis is to classify or segment objects (e.g., customers,
  products, market areas) into groups so that objects
  within each group are similar to one another on a variety
  of variables.
• Cluster analysis seeks to classify segments or objects
  such that there will be as much likeness within segments
  and as much difference between segments as possible.
T20 cluster analysis
What cluster analysis does
• Cluster analysis
  – generate groups which are similar
  – homogeneous within the group and as much as
    possible heterogeneous to other groups
  – data consists usually of objects or persons
  – segmentation based on more than two variables
• Examples for datasets used for cluster
  analysis:
  – socio-economic criteria: income, education,
    profession, age, number of children, size of city of
    residence ....
  – psychographic criteria: interest, life style,
    motivation, values, involvement
  – criteria linked to the buying behaviour: price
    range, type of media used, intensity of use, choice
    of retail outlet, fidelity, buyer/non-buyer, buying
    intensity
• Proximity Measures
  – Proximity measures are used to represent the
    nearness of two objects
  – relate objects with a high similarity to the same
    cluster and objects with low similarity to different
    clusters
  – differentiation of nominal-scaled and metric-
    scaled variables
  – the calculation of the distances measures is the
    basis of the cluster analysis
• Two phases:
  – Forming of clusters by the chosen data set –
    resulting in a new variable that identifies cluster
    members among the cases
  – Description of clusters by re-crossing with the
    data
Cluster Algorithm in agglomerative hierarchical clustering
   methods – seven steps to get clusters
2. each object is a independent cluster, n
3. two clusters with the lowest distance are merged to one
   cluster. reduce the number of clusters by 1 (n-1)
4. calculate the distance matrix between the new cluster
   and all remaining clusters
5. repeat step 2 and 3, (n-1) times until all objects form one
   reminding cluster
• Finally…
2.decide upon the number of clusters you want
  to keep (decision often based on the size of
  the clusters)
3.description of the clusters by means of the
  cluster forming variables
4.appellation of the clusters with catchy titles
• Consumers and Fair Trade Coffee
• 214 interviews of consumers of fair trade coffee
  (personal and telephone interviews)
• Cluster analysis in order to identify consumer
typologies
• Identification of 6 clusters
• Description of these clusters by further analysis:
  comparison of means, crosstabs etc.
• Description of Cluster 1 (11,6%): “self-oriented fair
  trade buyer” :
• Searches satisfaction by doing the good thing
• Is not altruistic
• Buys occasionally
• Sticks to his conventional coffee brand
• High level of formal education
• Frequently religious (catholic or protestant)
• Description of Cluster 2 (13,6%): “less ready to take
personal constraints”
• States that “fair trade coffee is hard to find”
• Feels responsible for fare development issues
• Believes that fair trade is efficient for developing
  countries
• Is less ready to go to special fair trade outlets
• Buys conventional coffee
• Likes the taste of fair trade coffee
• Description of clusters Cluster 3 (18,2%): “less
  engaged about fair trade” :
• Feels no personal responsibility with regard to
  development questions
• Doesn’t see the efficiency of the consumption of fair
  trade goods
• The only thing that can make him change is the
  influence of friends
• Is older then the average fair trade buyer and has
  less formal education
• Description of clusters: Cluster 4 (32,2%): “intensive
  buyer”
• Has abandoned conventional coffee brands
• Has started to buy fair trade quite a while ago (> 3 years)
• Shops frequently in fair trade stores (and not in organic
  retail)
• Is ready to act for fair development and talks to friends
  about it
• Relatively young, with low incomes and high educational
  values
• Description of clusters: Cluster 5 (18,7%):
  “valueoriented”
• Together with cluster 4 highly aware of development
  issues
• Ready to act and to constraint consumption habits
• Buys for altruistic reasons
• Highly involved in social / political action
• Most frequently women, highest household income
  among all clusters
• Own security is the basis for solidary action
• Description of clusters: Cluster 6 (5,6%): “does not like
  the taste of fair trade coffee”
• Lowest purchase intensity of all clusters
• Not willing to accept constraints in consumption habits
  or higher prices
• Most members of these group are attached to a
  conventional coffee brand
• Relatively high incomes, age within the average of all
  groups, lower level of formal education
• Less religious than other groups.
Cluster Analysis Applications in
        Marketing Research
• New-product research:
   – Clustering brands can help a firm examine its product offerings
     relative to competition. Brands in the same cluster often
     compete more fiercely with each other than with brands in
     other clusters.
• Test marketing:
   – Cluster analysis groups test cities into homogeneous clusters for
     test marketing purposes.
Cluster Analysis Applications in
        Marketing Research
• Buyer behavior:
   – Cluster analysis can be employed to identify similar groups of
     buyers who have similar choice criteria.
• Market segmentation:
   – Cluster analysis can develop distinct market segments on the
     basis of geographic, demographic, psychographic, and
     behavioristic variables.
Advantages
• no special scales of measurement necessary
• high persuasiveness and good assignment to realisable
  recommendations in practice
Disadvantages
• choice of cluster-forming variables often not based on
  theory but at random
• determination of the right number of clusters often time
  consuming – often decided upon arbitrarily
• high influence on the interpretation of the scientist,
  difficult to control (good documentation is needed)
Questions?

More Related Content

PPTX
Consumer research process
PPTX
Cb(models)
PPTX
Consumer Research - Consumer Behavior
PPTX
consumer behavior model
PPTX
Consumer behavior and retail research
PPT
Cluster And Dioxin Exposure / Prof. Jean Francois Viel
PDF
La comunicazione
PPTX
TCI 2014 Design for Clusters and Clusters for Design
Consumer research process
Cb(models)
Consumer Research - Consumer Behavior
consumer behavior model
Consumer behavior and retail research
Cluster And Dioxin Exposure / Prof. Jean Francois Viel
La comunicazione
TCI 2014 Design for Clusters and Clusters for Design

Similar to T20 cluster analysis (20)

PDF
Business analytics certification training in pune
PDF
Data science certification in pune
PDF
Online data science course
PDF
Data science course
PDF
Best data science courses
PDF
Business analytics course in bangalore
PDF
Data science course
PDF
business analytics course in delhi
PDF
Data science course
PDF
Best data science courses
PDF
Data science online course
PDF
business analytics course in bangalore
PDF
Data science course in delhi
PDF
Data science course
PDF
Best data science courses
PDF
Data science course
PDF
data analytics certification courses
PDF
Data science certification
PDF
Data science online course
PDF
Data science training in chennai
Business analytics certification training in pune
Data science certification in pune
Online data science course
Data science course
Best data science courses
Business analytics course in bangalore
Data science course
business analytics course in delhi
Data science course
Best data science courses
Data science online course
business analytics course in bangalore
Data science course in delhi
Data science course
Best data science courses
Data science course
data analytics certification courses
Data science certification
Data science online course
Data science training in chennai
Ad

More from kompellark (20)

PPT
T22 research report writing
PPT
Rubric assignment 2
PPT
Answers mid-term
PDF
Exam paper
PPT
T21 conjoint analysis
PPT
T19 factor analysis
PPT
T18 discriminant analysis
PPT
T17 correlation
PPT
T16 multiple regression
PPT
T15 ancova
PPT
T14 anova
PPT
T13 parametric tests
PPT
T11 types of tests
PPT
T15 ancova
PPT
T14 anova
PPT
T13 parametric tests
PPT
T12 non-parametric tests
PPT
T11 types of tests
PPT
T16 multiple regression
PPT
T10 statisitical analysis
T22 research report writing
Rubric assignment 2
Answers mid-term
Exam paper
T21 conjoint analysis
T19 factor analysis
T18 discriminant analysis
T17 correlation
T16 multiple regression
T15 ancova
T14 anova
T13 parametric tests
T11 types of tests
T15 ancova
T14 anova
T13 parametric tests
T12 non-parametric tests
T11 types of tests
T16 multiple regression
T10 statisitical analysis
Ad

Recently uploaded (20)

PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Encapsulation theory and applications.pdf
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PPTX
Machine Learning_overview_presentation.pptx
PDF
Approach and Philosophy of On baking technology
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPT
Teaching material agriculture food technology
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
Mobile App Security Testing_ A Comprehensive Guide.pdf
Encapsulation theory and applications.pdf
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Machine Learning_overview_presentation.pptx
Approach and Philosophy of On baking technology
NewMind AI Weekly Chronicles - August'25-Week II
MYSQL Presentation for SQL database connectivity
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Programs and apps: productivity, graphics, security and other tools
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Building Integrated photovoltaic BIPV_UPV.pdf
The AUB Centre for AI in Media Proposal.docx
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Chapter 3 Spatial Domain Image Processing.pdf
Teaching material agriculture food technology
Review of recent advances in non-invasive hemoglobin estimation
Reach Out and Touch Someone: Haptics and Empathic Computing

T20 cluster analysis

  • 1. Cluster Analysis By Rama Krishna Kompella
  • 2. Introduction • In order to go from data to information, to knowledge and to wisdom, we need to reduce the complexity of the data. • Complexity can be reduced on – case level : cluster analysis – on variable level: factor analysis
  • 3. Cluster Anlaysis • Cluster analysis is another interdependence multivariate method • As the name implies, the basic purpose of cluster analysis is to classify or segment objects (e.g., customers, products, market areas) into groups so that objects within each group are similar to one another on a variety of variables. • Cluster analysis seeks to classify segments or objects such that there will be as much likeness within segments and as much difference between segments as possible.
  • 5. What cluster analysis does • Cluster analysis – generate groups which are similar – homogeneous within the group and as much as possible heterogeneous to other groups – data consists usually of objects or persons – segmentation based on more than two variables
  • 6. • Examples for datasets used for cluster analysis: – socio-economic criteria: income, education, profession, age, number of children, size of city of residence .... – psychographic criteria: interest, life style, motivation, values, involvement – criteria linked to the buying behaviour: price range, type of media used, intensity of use, choice of retail outlet, fidelity, buyer/non-buyer, buying intensity
  • 7. • Proximity Measures – Proximity measures are used to represent the nearness of two objects – relate objects with a high similarity to the same cluster and objects with low similarity to different clusters – differentiation of nominal-scaled and metric- scaled variables – the calculation of the distances measures is the basis of the cluster analysis
  • 8. • Two phases: – Forming of clusters by the chosen data set – resulting in a new variable that identifies cluster members among the cases – Description of clusters by re-crossing with the data
  • 9. Cluster Algorithm in agglomerative hierarchical clustering methods – seven steps to get clusters 2. each object is a independent cluster, n 3. two clusters with the lowest distance are merged to one cluster. reduce the number of clusters by 1 (n-1) 4. calculate the distance matrix between the new cluster and all remaining clusters 5. repeat step 2 and 3, (n-1) times until all objects form one reminding cluster
  • 10. • Finally… 2.decide upon the number of clusters you want to keep (decision often based on the size of the clusters) 3.description of the clusters by means of the cluster forming variables 4.appellation of the clusters with catchy titles
  • 11. • Consumers and Fair Trade Coffee • 214 interviews of consumers of fair trade coffee (personal and telephone interviews) • Cluster analysis in order to identify consumer typologies • Identification of 6 clusters • Description of these clusters by further analysis: comparison of means, crosstabs etc.
  • 12. • Description of Cluster 1 (11,6%): “self-oriented fair trade buyer” : • Searches satisfaction by doing the good thing • Is not altruistic • Buys occasionally • Sticks to his conventional coffee brand • High level of formal education • Frequently religious (catholic or protestant)
  • 13. • Description of Cluster 2 (13,6%): “less ready to take personal constraints” • States that “fair trade coffee is hard to find” • Feels responsible for fare development issues • Believes that fair trade is efficient for developing countries • Is less ready to go to special fair trade outlets • Buys conventional coffee • Likes the taste of fair trade coffee
  • 14. • Description of clusters Cluster 3 (18,2%): “less engaged about fair trade” : • Feels no personal responsibility with regard to development questions • Doesn’t see the efficiency of the consumption of fair trade goods • The only thing that can make him change is the influence of friends • Is older then the average fair trade buyer and has less formal education
  • 15. • Description of clusters: Cluster 4 (32,2%): “intensive buyer” • Has abandoned conventional coffee brands • Has started to buy fair trade quite a while ago (> 3 years) • Shops frequently in fair trade stores (and not in organic retail) • Is ready to act for fair development and talks to friends about it • Relatively young, with low incomes and high educational values
  • 16. • Description of clusters: Cluster 5 (18,7%): “valueoriented” • Together with cluster 4 highly aware of development issues • Ready to act and to constraint consumption habits • Buys for altruistic reasons • Highly involved in social / political action • Most frequently women, highest household income among all clusters • Own security is the basis for solidary action
  • 17. • Description of clusters: Cluster 6 (5,6%): “does not like the taste of fair trade coffee” • Lowest purchase intensity of all clusters • Not willing to accept constraints in consumption habits or higher prices • Most members of these group are attached to a conventional coffee brand • Relatively high incomes, age within the average of all groups, lower level of formal education • Less religious than other groups.
  • 18. Cluster Analysis Applications in Marketing Research • New-product research: – Clustering brands can help a firm examine its product offerings relative to competition. Brands in the same cluster often compete more fiercely with each other than with brands in other clusters. • Test marketing: – Cluster analysis groups test cities into homogeneous clusters for test marketing purposes.
  • 19. Cluster Analysis Applications in Marketing Research • Buyer behavior: – Cluster analysis can be employed to identify similar groups of buyers who have similar choice criteria. • Market segmentation: – Cluster analysis can develop distinct market segments on the basis of geographic, demographic, psychographic, and behavioristic variables.
  • 20. Advantages • no special scales of measurement necessary • high persuasiveness and good assignment to realisable recommendations in practice Disadvantages • choice of cluster-forming variables often not based on theory but at random • determination of the right number of clusters often time consuming – often decided upon arbitrarily • high influence on the interpretation of the scientist, difficult to control (good documentation is needed)