11/5/2018 Mikael Kågebäck, Chalmers CSE 1
Credit: Aldebaran
Correlation Clustering: A Tale of Two Cultures
Erik Thiel, Morteza Chehreghani, Devdatt Dubhashi
Chalmers University of Technology, Sweden
Are Colours Universal?
WCS Stimulus Palette
Colour Partitions
• Ask humans subjects in a language group to
label tiles with colour terms, then aggregate
all results into a partition for that language.
• Take the CIELAB colour coordinates to define a
similarity between colour tiles and forma
partition based on these similarities.
World Colour Survey
Regier T, Kemp C, Kay P. 2015.
Word meanings across languages
support efficient communication.
In
The Handbook of Language
Emergence, ed. B MacWhinney, W
O’Grady
Correlation Clustering
• Input: Graph 𝐺 = 𝑉, 𝐸 and positive or
negative weights 𝑤 𝑒 , 𝑒 ∈ 𝐸
• Output: A clustering of the vertices to
maximize the sum of the weights of edges
within each cluster.
Corr clust-kiel
Difference from usual Clustering
• Weights can be positive or negative!
• Contentious what’s ”good” quality clustering
• But in correlation clustering there is
unambiguous objective
• The number of clusters need not be specified,
will emerge from the optimizing the objective.
Web scale clustering
Approximation Algorithms
• Bansal, Blum Chawla (2004): PTAS on
complete graphs
• Charikar Guruswami, Wirth (2005): APX hard
on general graphs
• Charikar et al (2005), Swamy (2004): 0.76
approximation
• Guruswami-Giotis (2006): PTAS with fixed no
of clusters
Exact Formulation
SDP Relaxation + Rounding
However …
• No implementation, no code …
• Doesn’t work in practice …
A Tale of Two Cultures
• Deep elegant theory
• “Polynomial time”
• No implementation
• No experiments on data
sets
• Does not work in
practice or scale
• Beamer/LaTeX
• Sometimes theory
• Linear or sub-linear
• Well engineered
implementation
• Extensive testing on
data sets
• Must work in practice,
scale to “Big Data”
• Powerpoint
Algorithms Theory Machine Learning
Non-Convex Relaxation
Tightness of Relaxation
• The non-convex relaxation is tight: no gap
between continuous and discrete problem,
simple proof by randomized rounding.
• In contrast SDP relaxation is not tight.
Corr clust-kiel
Corr clust-kiel
Non-convex Convergence Theory
• For a differentiable (but not necessarily
convex) function, the FW algorithm converges
in 𝑂(1/ 𝑇) steps.
• If the function is multilinear, then it converges
in 𝑂(1/𝑇) steps.
• Note that our correlation clustering objective
is indeed multilinear!
Combinatorial Search
Synthetic Data: Generative Model
• Planted model with k clusters and noise p
• With probability (1-p), high positive weight on
edge within a cluster and high negative weight
on edge across clusters, with probability p,
arbitrary weight
Corr clust-kiel
Corr clust-kiel
Corr clust-kiel
Sampling Block FW
Corr clust-kiel
Variance Reduction Technique
Variance reduction
Newsgroup data set
Colour Partitions
Summary
• Non-convex relaxation solved with Frank
Wolfe yields an algorithm with guarantees
that beats all other methods handily in both
runtime and quality.
• Combine theory and rigour of algorithms
research with engineering good
implementations and extensive testing on
data.
Happy Birthday Anand!

More Related Content

PDF
Meta Learning Shared Hierarchies
PPT
Ensemble Learning Featuring the Netflix Prize Competition and ...
PDF
Fluency-Guided Cross-Lingual Image Captioning
PPTX
A Non--convex optimization approach to Correlation Clustering
PDF
iiit delhi unsupervised pdf.pdf
PPT
multiarmed bandit.ppt
PDF
Clustering.pdf
PDF
Local vs. Global Models for Effort Estimation and Defect Prediction
Meta Learning Shared Hierarchies
Ensemble Learning Featuring the Netflix Prize Competition and ...
Fluency-Guided Cross-Lingual Image Captioning
A Non--convex optimization approach to Correlation Clustering
iiit delhi unsupervised pdf.pdf
multiarmed bandit.ppt
Clustering.pdf
Local vs. Global Models for Effort Estimation and Defect Prediction

Similar to Corr clust-kiel (20)

PPTX
Data minig.pptx
PPT
15857 cse422 unsupervised-learning
PDF
3MLChapter3ClusteringSlides23EN UC Coimbra PT
PDF
Chromatic Sparse Learning
PDF
[ML]-Unsupervised-learning_Unit2.ppt.pdf
PDF
clustering using different methods in .pdf
PDF
Unsupervised learning and clustering.pdf
PPTX
17- Kernels and Clustering.pptx
PPTX
Evaluation of programs codes using machine learning
PPTX
Unsupervised Learning.pptx
PPTX
Unsupervised%20Learninffffg (2).pptx. application
PDF
L13. Cluster Analysis
PPT
PPT
Clustering
PPTX
Generalization abstraction
PPTX
Cluster Analysis.pptx
PDF
C4.3.1
Data minig.pptx
15857 cse422 unsupervised-learning
3MLChapter3ClusteringSlides23EN UC Coimbra PT
Chromatic Sparse Learning
[ML]-Unsupervised-learning_Unit2.ppt.pdf
clustering using different methods in .pdf
Unsupervised learning and clustering.pdf
17- Kernels and Clustering.pptx
Evaluation of programs codes using machine learning
Unsupervised Learning.pptx
Unsupervised%20Learninffffg (2).pptx. application
L13. Cluster Analysis
Clustering
Generalization abstraction
Cluster Analysis.pptx
C4.3.1
Ad

More from Devdatt Dubhashi (7)

PPTX
PPTX
PPTX
Ai energy
PPTX
Ai finance
PPTX
AI: The New Electricity to Harness Our Digital Future
PPTX
PPTX
Ai energy
Ai finance
AI: The New Electricity to Harness Our Digital Future
Ad

Recently uploaded (20)

PDF
Navigating the Thai Supplements Landscape.pdf
PPTX
retention in jsjsksksksnbsndjddjdnFPD.pptx
PPTX
Statisticsccdxghbbnhhbvvvvvvvvvv. Dxcvvvhhbdzvbsdvvbbvv ccc
PPTX
The Data Security Envisioning Workshop provides a summary of an organization...
PPTX
eGramSWARAJ-PPT Training Module for beginners
PPTX
IMPACT OF LANDSLIDE.....................
PPTX
sac 451hinhgsgshssjsjsjheegdggeegegdggddgeg.pptx
PDF
Global Data and Analytics Market Outlook Report
PPT
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
PPT
PROJECT CYCLE MANAGEMENT FRAMEWORK (PCM).ppt
PPTX
FMIS 108 and AISlaudon_mis17_ppt_ch11.pptx
PPTX
recommendation Project PPT with details attached
PPTX
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
PPTX
MBA JAPAN: 2025 the University of Waseda
PPT
Image processing and pattern recognition 2.ppt
PDF
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
PPTX
Machine Learning and working of machine Learning
PPTX
CHAPTER-2-THE-ACCOUNTING-PROCESS-2-4.pptx
PPT
expt-design-lecture-12 hghhgfggjhjd (1).ppt
PPTX
Lesson-01intheselfoflifeofthekennyrogersoftheunderstandoftheunderstanded
Navigating the Thai Supplements Landscape.pdf
retention in jsjsksksksnbsndjddjdnFPD.pptx
Statisticsccdxghbbnhhbvvvvvvvvvv. Dxcvvvhhbdzvbsdvvbbvv ccc
The Data Security Envisioning Workshop provides a summary of an organization...
eGramSWARAJ-PPT Training Module for beginners
IMPACT OF LANDSLIDE.....................
sac 451hinhgsgshssjsjsjheegdggeegegdggddgeg.pptx
Global Data and Analytics Market Outlook Report
lectureusjsjdhdsjjshdshshddhdhddhhd1.ppt
PROJECT CYCLE MANAGEMENT FRAMEWORK (PCM).ppt
FMIS 108 and AISlaudon_mis17_ppt_ch11.pptx
recommendation Project PPT with details attached
DS-40-Pre-Engagement and Kickoff deck - v8.0.pptx
MBA JAPAN: 2025 the University of Waseda
Image processing and pattern recognition 2.ppt
Jean-Georges Perrin - Spark in Action, Second Edition (2020, Manning Publicat...
Machine Learning and working of machine Learning
CHAPTER-2-THE-ACCOUNTING-PROCESS-2-4.pptx
expt-design-lecture-12 hghhgfggjhjd (1).ppt
Lesson-01intheselfoflifeofthekennyrogersoftheunderstandoftheunderstanded

Corr clust-kiel

  • 1. 11/5/2018 Mikael Kågebäck, Chalmers CSE 1 Credit: Aldebaran Correlation Clustering: A Tale of Two Cultures Erik Thiel, Morteza Chehreghani, Devdatt Dubhashi Chalmers University of Technology, Sweden
  • 4. Colour Partitions • Ask humans subjects in a language group to label tiles with colour terms, then aggregate all results into a partition for that language. • Take the CIELAB colour coordinates to define a similarity between colour tiles and forma partition based on these similarities.
  • 6. Regier T, Kemp C, Kay P. 2015. Word meanings across languages support efficient communication. In The Handbook of Language Emergence, ed. B MacWhinney, W O’Grady
  • 7. Correlation Clustering • Input: Graph 𝐺 = 𝑉, 𝐸 and positive or negative weights 𝑤 𝑒 , 𝑒 ∈ 𝐸 • Output: A clustering of the vertices to maximize the sum of the weights of edges within each cluster.
  • 9. Difference from usual Clustering • Weights can be positive or negative! • Contentious what’s ”good” quality clustering • But in correlation clustering there is unambiguous objective • The number of clusters need not be specified, will emerge from the optimizing the objective.
  • 11. Approximation Algorithms • Bansal, Blum Chawla (2004): PTAS on complete graphs • Charikar Guruswami, Wirth (2005): APX hard on general graphs • Charikar et al (2005), Swamy (2004): 0.76 approximation • Guruswami-Giotis (2006): PTAS with fixed no of clusters
  • 13. SDP Relaxation + Rounding
  • 14. However … • No implementation, no code … • Doesn’t work in practice …
  • 15. A Tale of Two Cultures • Deep elegant theory • “Polynomial time” • No implementation • No experiments on data sets • Does not work in practice or scale • Beamer/LaTeX • Sometimes theory • Linear or sub-linear • Well engineered implementation • Extensive testing on data sets • Must work in practice, scale to “Big Data” • Powerpoint Algorithms Theory Machine Learning
  • 17. Tightness of Relaxation • The non-convex relaxation is tight: no gap between continuous and discrete problem, simple proof by randomized rounding. • In contrast SDP relaxation is not tight.
  • 20. Non-convex Convergence Theory • For a differentiable (but not necessarily convex) function, the FW algorithm converges in 𝑂(1/ 𝑇) steps. • If the function is multilinear, then it converges in 𝑂(1/𝑇) steps. • Note that our correlation clustering objective is indeed multilinear!
  • 22. Synthetic Data: Generative Model • Planted model with k clusters and noise p • With probability (1-p), high positive weight on edge within a cluster and high negative weight on edge across clusters, with probability p, arbitrary weight
  • 32. Summary • Non-convex relaxation solved with Frank Wolfe yields an algorithm with guarantees that beats all other methods handily in both runtime and quality. • Combine theory and rigour of algorithms research with engineering good implementations and extensive testing on data.