SlideShare a Scribd company logo
Simplifying Gaussian Mixture Models
Via Entropic Quantization
Frank Nielsen1 2 , Vincent Garcia1 , and Richard Nock3
1 Ecole Polytechnique (Paris, France)
Sony Computer Science Laboratories (Tokyo, Japan)
Universit´ des Antilles et de la Guyane (Guadeloupe, France)
e
2

3

28th august 2009

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

1 / 23
Introduction

Plan
1

Introduction
Mixture models
Problem
Mixture model simplification

2

Mixture model simplification
KLD and Bregman divergence
Sided BKMC
Symmetric BKMC
jMEF

3

Experiments
Quality measure and initialization
Sided BKMC
BKMC vs UTAC

4

Conclusion
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

2 / 23
Introduction

Mixture models

Mixture models

Mixture model is a powerful framework to estimate PDF
Mixture model f

n

f (x) =

αi fi (x)
i=1

where αi ≥ 0 denotes a weight with

n
i=1 αi

=1

If f is a Gaussian mixture model (GMM),
(x − µi )T Σ−1 (x − µi )
1
i
fi (x) =
exp −
2
(2π)d/2 |Σi |1/2
with µi mean and Σi covariance matrix

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

3 / 23
Introduction

Problem

Problem
2.5

2

1.5

1

0.5

0
−0.5

0

0.5

1

1.5

Density estimation using kernel-based Parzen estimator
Mixture models usually contain a lot of components
Estimation of statistical measures is computationally expensive
Need to reduce the number of components
Re-lear a simpler mixture model from dataset
Simplify the mixture model f
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

4 / 23
Introduction

Mixture model simplification

Mixture model simplification

Given a mixture model f of n components
n

f (x) =

αi fi (x)
i=1

Compute a mixture model g of m components
m

αj gj (x)

g (x) =
j=1

such as g is the best approximation of f

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

5 / 23
Mixture model simplification

Plan
1

Introduction
Mixture models
Problem
Mixture model simplification

2

Mixture model simplification
KLD and Bregman divergence
Sided BKMC
Symmetric BKMC
jMEF

3

Experiments
Quality measure and initialization
Sided BKMC
BKMC vs UTAC

4

Conclusion
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

6 / 23
Mixture model simplification

KLD and Bregman divergence

Relative entropy and Bregman divergence
The fundamental measure between statistical distributions is the
relative entropy, also called the Kullback-Leibler divergence
Given fi and fj two distributions, the KLD is given by
KLD(fi ||fj ) =

fi (x) log

fi (x)
dx
fj (x)

In the case of normal distriubtions
det Σj
1
1
KLD(fi ||fj ) = log
+ tr Σ−1 Σi
j
2
det Σi
2
1
d
+ (µj − µi )T Σ−1 (µj − µi ) −
j
2
2

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

7 / 23
Mixture model simplification

KLD and Bregman divergence

Relative entropy and Bregman divergence
Nomral distributions belong to the class of exponential families
Canonical form of exponential families
f (x) = exp

˜
˜
Θ, t(x) − F (Θ) + C (x)

Estimation of the KLD by computing the Bregman divergence defined
for the log normalizer F
˜ ˜
KLD(fi ||fj ) = DF (Θj ||Θi )
where
˜ ˜
˜
˜
˜
˜
˜
DF (Θj ||Θi ) = F (Θj ) − F (Θi ) − Θj − Θi , F (Θi )

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

8 / 23
Mixture model simplification

KLD and Bregman divergence

Relative entropy and Bregman divergence
For multivariate normal distributions
Sufficient statistics

1
t(x) = (x, − xx T )
2

Natural parameters
1
˜
Θ = (θ, Θ) = (Σ−1 µ, Σ−1 )
2
Log normalizer
1
d
1
˜
F (Θ) = tr(Θ−1 θθT ) − log det Θ + log π
4
2
2
˜
F (Θ) =

V. Garcia (X, Paris, France)

1 −1
1
1
Θ θ , − Θ−1 − (Θ−1 θ)(Θ−1 θ)T
2
2
4
Simplifying GMMs

28th august 2009

9 / 23
Mixture model simplification

Sided BKMC

Bregman k-means clustering
K-means clustering
Set of points
Initialize k centroids = k classes
Repetition until convergence
Repartition step (distance)
Computation of centroids (centers of mass)

Bregman K-means clustering
Set of distributions
Initialize k centroids (αi , gi ) = GMM with k components
Repetition until convergence
Repartition step (sided Bregman divergence)
Computation of centroids (sided centroids)
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

10 / 23
Mixture model simplification

Sided BKMC

Sided centroids

5 multivariate Gaussians
Right-centroid
Left-centroid
http://www.sonycsl.co.jp/person/nielsen/BNCj/
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

11 / 23
Mixture model simplification

Sided BKMC

Right-sided BKMC algorithm
1: Initialize the GMM g
2: repeat
3:
Compute the cluster C : the Gaussian fi belongs to cluster Cj if and only if

˜ ˜
˜ ˜
DF (Θi Θj ) < DF (Θi Θl ), ∀l ∈ [1, m]  {j}
4:

Compute the centroids: the weight and the natural parameters of the j-th
centroid (i.e. Gaussian gj ) are given by:
αj =

αi ,
i

The sum

i

θj =

i

αi θi
,
i αi

Θj =

i

αi Θi
i αi

is performed on i ∈ [1, m] such as fi ∈ Cj

5: until the cluster does not change between two iterations

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

12 / 23
Mixture model simplification

Sided BKMC

Left-sided BKMC algorithm
1: Initialize the GMM g
2: repeat
3:
Compute the cluster C : the Gaussian fi belongs to cluster Cj if and only if

˜ ˜
˜ ˜
DF (Θj Θi ) < DF (Θl Θi ), ∀l ∈ [1, m]  {j}
4:

Compute the centroids: the weight and the natural parameters of the j-th
centroid (i.e. Gaussian gj ) are given by:
αj =

αi ,

˜
Θj =

F −1

i

i

where
˜
F −1 (Θ) =

− Θ + θθT

−1

θ, −

αi
αj

˜
F Θi

1
Θ + θθT
2

−1

5: until the cluster does not change between two iterations
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

13 / 23
Mixture model simplification

Symmetric BKMC

Symmetric BKMC algorithm
Symmetric similarity measure can be required (e.g. CBIR)
Repartition step: Symmetric Bregman divergence
˜ ˜
SDF (Θp , Θq ) =

˜ ˜
˜ ˜
DF (Θq ||Θp ) + DF (Θp ||Θq )
2

Computation of symmetric centroid:
Compute right and left centroids (cr and cl )
The symmetric centroid cs belongs to the geodesic link joining cr and cl
cλ =

F −1 (λ F (cr ) + (1 − λ) F (cl ))

The symmetric centroid cs = cλ verifies
SDF (cλ , cr ) = SDF (cλ , cl ).

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

14 / 23
Mixture model simplification

jMEF

jMEF

jMEF : Java library for Mixture of Exponential Families
Create and manage MEF
Simplify MEF using BKMC
Available on line at www.lix.polytechnique.fr/∼nielsen/MEF

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

15 / 23
Experiments

Plan
1

Introduction
Mixture models
Problem
Mixture model simplification

2

Mixture model simplification
KLD and Bregman divergence
Sided BKMC
Symmetric BKMC
jMEF

3

Experiments
Quality measure and initialization
Sided BKMC
BKMC vs UTAC

4

Conclusion
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

16 / 23
Experiments

Quality measure and initialization

Quality measure and initialization
Simplification quality measure
KLD(f g ) (right-sided)
No closed-form expression
Draw 10,000 points to estimate this KLD (Monte-Carlo)
Initial GMM f
Learnt from an image
K-means on RGB pixels ⇒ 32 classes
EM algorithm ⇒ fi
Weights αi : proportion of pixels in each class

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

17 / 23
Experiments

Sided BKMC

Sided BKMC

Evolution of KLD(f g ) as a function of m
The simplification quality increases with m
Left-sided BKMC provides the best results
Right-sided BKMC provides the worst results
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

18 / 23
Experiments

BKMC vs UTAC

BKMC vs UTAC

UTAC algorithm based on sigma points + EM algorithm
BKMC provides better results than UTAC
BKMC is faster than UTAC: 20ms vs 100ms
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

19 / 23
Experiments

BKMC vs UTAC

Clustering-based image segmentation
Image

UTAC

BKMC

KLD=0.23

KLD=0.11

KLD=0.16
V. Garcia (X, Paris, France)

f

KLD=0.13

Simplifying GMMs

28th august 2009

20 / 23
Experiments

BKMC vs UTAC

Clustering-based image segmentation
Image

UTAC

BKMC

KLD=0.69

KLD=0.53

KLD=0.36
V. Garcia (X, Paris, France)

f

KLD=0.18

Simplifying GMMs

28th august 2009

21 / 23
Conclusion

Plan
1

Introduction
Mixture models
Problem
Mixture model simplification

2

Mixture model simplification
KLD and Bregman divergence
Sided BKMC
Symmetric BKMC
jMEF

3

Experiments
Quality measure and initialization
Sided BKMC
BKMC vs UTAC

4

Conclusion
V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

22 / 23
Conclusion

Conclusion

GMM simplification algorithm based on k-means and Bregman
divergence
BKMC is faster and provides better results than UTAC algorithm
BKMC extends to mixtures of exponential families
jMEF available on line at www.lix.polytechnique.fr/∼nielsen/MEF
Included features:
Create/manage mixtures of exponential families
BKMC algorithm
Hierarchical GMM (ACCV 2009)

V. Garcia (X, Paris, France)

Simplifying GMMs

28th august 2009

23 / 23

More Related Content

PDF
Context-Aware Recommender System Based on Boolean Matrix Factorisation
PDF
20320140501020
PDF
Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)
PDF
2-rankings of Graphs
PDF
The power and Arnoldi methods in an algebra of circulants
PDF
A first order hyperbolic framework for large strain computational computation...
PDF
Presentation_Tan
PDF
A study of the worst case ratio of a simple algorithm for simple assembly lin...
Context-Aware Recommender System Based on Boolean Matrix Factorisation
20320140501020
Quantum Machine Learning and QEM for Gaussian mixture models (Alessandro Luongo)
2-rankings of Graphs
The power and Arnoldi methods in an algebra of circulants
A first order hyperbolic framework for large strain computational computation...
Presentation_Tan
A study of the worst case ratio of a simple algorithm for simple assembly lin...

What's hot (19)

PDF
Path Contraction Faster than 2^n
PDF
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
PDF
Large strain computational solid dynamics: An upwind cell centred Finite Volu...
PDF
Pilot Optimization and Channel Estimation for Multiuser Massive MIMO Systems
PDF
An upwind cell centred Finite Volume Method for nearly incompressible explici...
PDF
Bagging-Clustering Methods to Forecast Time Series
PDF
Traffic flow modeling on road networks using Hamilton-Jacobi equations
PDF
Second order traffic flow models on networks
PDF
PPT
Pixel rf
PDF
Joint CSI Estimation, Beamforming and Scheduling Design for Wideband Massive ...
PDF
Webinar: Integrating timetabling and vehicle scheduling to analyze the trade-...
DOC
Transportation and assignment_problem
PPT
Craig-Bampton Method
PDF
New tools from the bandit literature to improve A/B Testing
PDF
CVPR2010: Advanced ITinCVPR in a Nutshell: part 6: Mixtures
PDF
New approaches for boosting to uniformity
PDF
A One-Pass Triclustering Approach: Is There any Room for Big Data?
PDF
Second order traffic flow models on networks
Path Contraction Faster than 2^n
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
Large strain computational solid dynamics: An upwind cell centred Finite Volu...
Pilot Optimization and Channel Estimation for Multiuser Massive MIMO Systems
An upwind cell centred Finite Volume Method for nearly incompressible explici...
Bagging-Clustering Methods to Forecast Time Series
Traffic flow modeling on road networks using Hamilton-Jacobi equations
Second order traffic flow models on networks
Pixel rf
Joint CSI Estimation, Beamforming and Scheduling Design for Wideband Massive ...
Webinar: Integrating timetabling and vehicle scheduling to analyze the trade-...
Transportation and assignment_problem
Craig-Bampton Method
New tools from the bandit literature to improve A/B Testing
CVPR2010: Advanced ITinCVPR in a Nutshell: part 6: Mixtures
New approaches for boosting to uniformity
A One-Pass Triclustering Approach: Is There any Room for Big Data?
Second order traffic flow models on networks
Ad

Viewers also liked (19)

PPTX
Presentation_NEW.PPTX
PDF
Apple
DOCX
Deadpool sound analysis
DOC
CBS INTRO 2015
PDF
Apple
PDF
KM4city, Il Valore degli #OpenData: Esperienze a confronto
PPTX
Cuentapedaggica1 d mayo
PPT
Balanced scorecard
PDF
Tutorial DropBox
PPTX
Franquicias exitosas guada
PPTX
Interacciones jajajajajajajjaajjajajajajjajajajajajaja
PDF
Ourense map
DOCX
Patrick Roman Resume
PPT
Sincope. Dott. Mauro Zanocchi
DOCX
COMPANY PROFILE
PPTX
Um dia na escola de meu filho
PPTX
Future of Work, Sharing Economy and Real Estate: Thoughts and Trends To Watch
PPT
Ujian hidup
PPTX
Power point presentations 12
Presentation_NEW.PPTX
Apple
Deadpool sound analysis
CBS INTRO 2015
Apple
KM4city, Il Valore degli #OpenData: Esperienze a confronto
Cuentapedaggica1 d mayo
Balanced scorecard
Tutorial DropBox
Franquicias exitosas guada
Interacciones jajajajajajajjaajjajajajajjajajajajajaja
Ourense map
Patrick Roman Resume
Sincope. Dott. Mauro Zanocchi
COMPANY PROFILE
Um dia na escola de meu filho
Future of Work, Sharing Economy and Real Estate: Thoughts and Trends To Watch
Ujian hidup
Power point presentations 12
Ad

Similar to Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPCO 2009) (20)

PDF
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
PDF
Generalized Nonlinear Models in R
PDF
Optimal L-shaped matrix reordering, aka graph's core-periphery
PDF
On learning statistical mixtures maximizing the complete likelihood
PDF
Approximation in Value Space using Aggregation, with Applications to POMDPs a...
PDF
MUMS Opening Workshop - An Overview of Reduced-Order Models and Emulators (ED...
PDF
Beck Workshop on Modelling and Simulation of Coal-fired Power Generation and ...
PDF
Introduction to logistic regression
PDF
Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...
PDF
Clustering lect
PDF
Inria Tech Talk - La classification de données complexes avec MASSICCC
PDF
Subquad multi ff
PDF
SIAM - Minisymposium on Guaranteed numerical algorithms
PDF
clusteringEng pour savoir le technique de classtering
PDF
QMC: Operator Splitting Workshop, Perturbed (accelerated) Proximal-Gradient A...
PDF
MUMS: Bayesian, Fiducial, and Frequentist Conference - Generalized Probabilis...
PDF
Number theoretic-rsa-chailos-new
PDF
Accelerating Pseudo-Marginal MCMC using Gaussian Processes
PDF
Bayesian Inference and Uncertainty Quantification for Inverse Problems
QMC Program: Trends and Advances in Monte Carlo Sampling Algorithms Workshop,...
Generalized Nonlinear Models in R
Optimal L-shaped matrix reordering, aka graph's core-periphery
On learning statistical mixtures maximizing the complete likelihood
Approximation in Value Space using Aggregation, with Applications to POMDPs a...
MUMS Opening Workshop - An Overview of Reduced-Order Models and Emulators (ED...
Beck Workshop on Modelling and Simulation of Coal-fired Power Generation and ...
Introduction to logistic regression
Subproblem-Tree Calibration: A Unified Approach to Max-Product Message Passin...
Clustering lect
Inria Tech Talk - La classification de données complexes avec MASSICCC
Subquad multi ff
SIAM - Minisymposium on Guaranteed numerical algorithms
clusteringEng pour savoir le technique de classtering
QMC: Operator Splitting Workshop, Perturbed (accelerated) Proximal-Gradient A...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Generalized Probabilis...
Number theoretic-rsa-chailos-new
Accelerating Pseudo-Marginal MCMC using Gaussian Processes
Bayesian Inference and Uncertainty Quantification for Inverse Problems

Recently uploaded (20)

PDF
Ôn tập tiếng anh trong kinh doanh nâng cao
PDF
Training And Development of Employee .pdf
PDF
pdfcoffee.com-opt-b1plus-sb-answers.pdfvi
PDF
Solara Labs: Empowering Health through Innovative Nutraceutical Solutions
PPTX
CkgxkgxydkydyldylydlydyldlyddolydyoyyU2.pptx
PPTX
Lecture (1)-Introduction.pptx business communication
PDF
Outsourced Audit & Assurance in USA Why Globus Finanza is Your Trusted Choice
PDF
Chapter 5_Foreign Exchange Market in .pdf
PPTX
The Marketing Journey - Tracey Phillips - Marketing Matters 7-2025.pptx
PDF
Reconciliation AND MEMORANDUM RECONCILATION
PDF
Unit 1 Cost Accounting - Cost sheet
PDF
DOC-20250806-WA0002._20250806_112011_0000.pdf
PDF
Katrina Stoneking: Shaking Up the Alcohol Beverage Industry
PDF
BsN 7th Sem Course GridNNNNNNNN CCN.pdf
PDF
A Brief Introduction About Julia Allison
PDF
How to Get Business Funding for Small Business Fast
PPTX
Dragon_Fruit_Cultivation_in Nepal ppt.pptx
PDF
Elevate Cleaning Efficiency Using Tallfly Hair Remover Roller Factory Expertise
PDF
MSPs in 10 Words - Created by US MSP Network
PPTX
Amazon (Business Studies) management studies
Ôn tập tiếng anh trong kinh doanh nâng cao
Training And Development of Employee .pdf
pdfcoffee.com-opt-b1plus-sb-answers.pdfvi
Solara Labs: Empowering Health through Innovative Nutraceutical Solutions
CkgxkgxydkydyldylydlydyldlyddolydyoyyU2.pptx
Lecture (1)-Introduction.pptx business communication
Outsourced Audit & Assurance in USA Why Globus Finanza is Your Trusted Choice
Chapter 5_Foreign Exchange Market in .pdf
The Marketing Journey - Tracey Phillips - Marketing Matters 7-2025.pptx
Reconciliation AND MEMORANDUM RECONCILATION
Unit 1 Cost Accounting - Cost sheet
DOC-20250806-WA0002._20250806_112011_0000.pdf
Katrina Stoneking: Shaking Up the Alcohol Beverage Industry
BsN 7th Sem Course GridNNNNNNNN CCN.pdf
A Brief Introduction About Julia Allison
How to Get Business Funding for Small Business Fast
Dragon_Fruit_Cultivation_in Nepal ppt.pptx
Elevate Cleaning Efficiency Using Tallfly Hair Remover Roller Factory Expertise
MSPs in 10 Words - Created by US MSP Network
Amazon (Business Studies) management studies

Simplifying Gaussian Mixture Models Via Entropic Quantization (EUSIPCO 2009)

  • 1. Simplifying Gaussian Mixture Models Via Entropic Quantization Frank Nielsen1 2 , Vincent Garcia1 , and Richard Nock3 1 Ecole Polytechnique (Paris, France) Sony Computer Science Laboratories (Tokyo, Japan) Universit´ des Antilles et de la Guyane (Guadeloupe, France) e 2 3 28th august 2009 V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 1 / 23
  • 2. Introduction Plan 1 Introduction Mixture models Problem Mixture model simplification 2 Mixture model simplification KLD and Bregman divergence Sided BKMC Symmetric BKMC jMEF 3 Experiments Quality measure and initialization Sided BKMC BKMC vs UTAC 4 Conclusion V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 2 / 23
  • 3. Introduction Mixture models Mixture models Mixture model is a powerful framework to estimate PDF Mixture model f n f (x) = αi fi (x) i=1 where αi ≥ 0 denotes a weight with n i=1 αi =1 If f is a Gaussian mixture model (GMM), (x − µi )T Σ−1 (x − µi ) 1 i fi (x) = exp − 2 (2π)d/2 |Σi |1/2 with µi mean and Σi covariance matrix V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 3 / 23
  • 4. Introduction Problem Problem 2.5 2 1.5 1 0.5 0 −0.5 0 0.5 1 1.5 Density estimation using kernel-based Parzen estimator Mixture models usually contain a lot of components Estimation of statistical measures is computationally expensive Need to reduce the number of components Re-lear a simpler mixture model from dataset Simplify the mixture model f V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 4 / 23
  • 5. Introduction Mixture model simplification Mixture model simplification Given a mixture model f of n components n f (x) = αi fi (x) i=1 Compute a mixture model g of m components m αj gj (x) g (x) = j=1 such as g is the best approximation of f V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 5 / 23
  • 6. Mixture model simplification Plan 1 Introduction Mixture models Problem Mixture model simplification 2 Mixture model simplification KLD and Bregman divergence Sided BKMC Symmetric BKMC jMEF 3 Experiments Quality measure and initialization Sided BKMC BKMC vs UTAC 4 Conclusion V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 6 / 23
  • 7. Mixture model simplification KLD and Bregman divergence Relative entropy and Bregman divergence The fundamental measure between statistical distributions is the relative entropy, also called the Kullback-Leibler divergence Given fi and fj two distributions, the KLD is given by KLD(fi ||fj ) = fi (x) log fi (x) dx fj (x) In the case of normal distriubtions det Σj 1 1 KLD(fi ||fj ) = log + tr Σ−1 Σi j 2 det Σi 2 1 d + (µj − µi )T Σ−1 (µj − µi ) − j 2 2 V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 7 / 23
  • 8. Mixture model simplification KLD and Bregman divergence Relative entropy and Bregman divergence Nomral distributions belong to the class of exponential families Canonical form of exponential families f (x) = exp ˜ ˜ Θ, t(x) − F (Θ) + C (x) Estimation of the KLD by computing the Bregman divergence defined for the log normalizer F ˜ ˜ KLD(fi ||fj ) = DF (Θj ||Θi ) where ˜ ˜ ˜ ˜ ˜ ˜ ˜ DF (Θj ||Θi ) = F (Θj ) − F (Θi ) − Θj − Θi , F (Θi ) V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 8 / 23
  • 9. Mixture model simplification KLD and Bregman divergence Relative entropy and Bregman divergence For multivariate normal distributions Sufficient statistics 1 t(x) = (x, − xx T ) 2 Natural parameters 1 ˜ Θ = (θ, Θ) = (Σ−1 µ, Σ−1 ) 2 Log normalizer 1 d 1 ˜ F (Θ) = tr(Θ−1 θθT ) − log det Θ + log π 4 2 2 ˜ F (Θ) = V. Garcia (X, Paris, France) 1 −1 1 1 Θ θ , − Θ−1 − (Θ−1 θ)(Θ−1 θ)T 2 2 4 Simplifying GMMs 28th august 2009 9 / 23
  • 10. Mixture model simplification Sided BKMC Bregman k-means clustering K-means clustering Set of points Initialize k centroids = k classes Repetition until convergence Repartition step (distance) Computation of centroids (centers of mass) Bregman K-means clustering Set of distributions Initialize k centroids (αi , gi ) = GMM with k components Repetition until convergence Repartition step (sided Bregman divergence) Computation of centroids (sided centroids) V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 10 / 23
  • 11. Mixture model simplification Sided BKMC Sided centroids 5 multivariate Gaussians Right-centroid Left-centroid http://www.sonycsl.co.jp/person/nielsen/BNCj/ V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 11 / 23
  • 12. Mixture model simplification Sided BKMC Right-sided BKMC algorithm 1: Initialize the GMM g 2: repeat 3: Compute the cluster C : the Gaussian fi belongs to cluster Cj if and only if ˜ ˜ ˜ ˜ DF (Θi Θj ) < DF (Θi Θl ), ∀l ∈ [1, m] {j} 4: Compute the centroids: the weight and the natural parameters of the j-th centroid (i.e. Gaussian gj ) are given by: αj = αi , i The sum i θj = i αi θi , i αi Θj = i αi Θi i αi is performed on i ∈ [1, m] such as fi ∈ Cj 5: until the cluster does not change between two iterations V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 12 / 23
  • 13. Mixture model simplification Sided BKMC Left-sided BKMC algorithm 1: Initialize the GMM g 2: repeat 3: Compute the cluster C : the Gaussian fi belongs to cluster Cj if and only if ˜ ˜ ˜ ˜ DF (Θj Θi ) < DF (Θl Θi ), ∀l ∈ [1, m] {j} 4: Compute the centroids: the weight and the natural parameters of the j-th centroid (i.e. Gaussian gj ) are given by: αj = αi , ˜ Θj = F −1 i i where ˜ F −1 (Θ) = − Θ + θθT −1 θ, − αi αj ˜ F Θi 1 Θ + θθT 2 −1 5: until the cluster does not change between two iterations V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 13 / 23
  • 14. Mixture model simplification Symmetric BKMC Symmetric BKMC algorithm Symmetric similarity measure can be required (e.g. CBIR) Repartition step: Symmetric Bregman divergence ˜ ˜ SDF (Θp , Θq ) = ˜ ˜ ˜ ˜ DF (Θq ||Θp ) + DF (Θp ||Θq ) 2 Computation of symmetric centroid: Compute right and left centroids (cr and cl ) The symmetric centroid cs belongs to the geodesic link joining cr and cl cλ = F −1 (λ F (cr ) + (1 − λ) F (cl )) The symmetric centroid cs = cλ verifies SDF (cλ , cr ) = SDF (cλ , cl ). V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 14 / 23
  • 15. Mixture model simplification jMEF jMEF jMEF : Java library for Mixture of Exponential Families Create and manage MEF Simplify MEF using BKMC Available on line at www.lix.polytechnique.fr/∼nielsen/MEF V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 15 / 23
  • 16. Experiments Plan 1 Introduction Mixture models Problem Mixture model simplification 2 Mixture model simplification KLD and Bregman divergence Sided BKMC Symmetric BKMC jMEF 3 Experiments Quality measure and initialization Sided BKMC BKMC vs UTAC 4 Conclusion V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 16 / 23
  • 17. Experiments Quality measure and initialization Quality measure and initialization Simplification quality measure KLD(f g ) (right-sided) No closed-form expression Draw 10,000 points to estimate this KLD (Monte-Carlo) Initial GMM f Learnt from an image K-means on RGB pixels ⇒ 32 classes EM algorithm ⇒ fi Weights αi : proportion of pixels in each class V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 17 / 23
  • 18. Experiments Sided BKMC Sided BKMC Evolution of KLD(f g ) as a function of m The simplification quality increases with m Left-sided BKMC provides the best results Right-sided BKMC provides the worst results V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 18 / 23
  • 19. Experiments BKMC vs UTAC BKMC vs UTAC UTAC algorithm based on sigma points + EM algorithm BKMC provides better results than UTAC BKMC is faster than UTAC: 20ms vs 100ms V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 19 / 23
  • 20. Experiments BKMC vs UTAC Clustering-based image segmentation Image UTAC BKMC KLD=0.23 KLD=0.11 KLD=0.16 V. Garcia (X, Paris, France) f KLD=0.13 Simplifying GMMs 28th august 2009 20 / 23
  • 21. Experiments BKMC vs UTAC Clustering-based image segmentation Image UTAC BKMC KLD=0.69 KLD=0.53 KLD=0.36 V. Garcia (X, Paris, France) f KLD=0.18 Simplifying GMMs 28th august 2009 21 / 23
  • 22. Conclusion Plan 1 Introduction Mixture models Problem Mixture model simplification 2 Mixture model simplification KLD and Bregman divergence Sided BKMC Symmetric BKMC jMEF 3 Experiments Quality measure and initialization Sided BKMC BKMC vs UTAC 4 Conclusion V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 22 / 23
  • 23. Conclusion Conclusion GMM simplification algorithm based on k-means and Bregman divergence BKMC is faster and provides better results than UTAC algorithm BKMC extends to mixtures of exponential families jMEF available on line at www.lix.polytechnique.fr/∼nielsen/MEF Included features: Create/manage mixtures of exponential families BKMC algorithm Hierarchical GMM (ACCV 2009) V. Garcia (X, Paris, France) Simplifying GMMs 28th august 2009 23 / 23