Spatially Coherent Latent Topic Model For Concurrent Object Segmentation and Classification

Spatially coherent latent topic model for concurrent object segmentation and classificationAuthors: Liangliang Cao, Li Fei-FeiPresenter: Shao-Chuan Wang

OutlineMotivationA Review on Graphical ModelsToday’s topic: the paperTheir Results

Motivation: Real world problem often full of “noises”Bags of words (local features)Spatial relationships of objects are ignored (has its limit)When classify a test image, what is its “subject” ?Flag?Banner?People?Sports field?From Prof. Fei-Fei’s ICCV09 tutorial slide

Generative vs Discriminative Generative model: model p(x, y) or p(x|y)p(y)Discriminative model: model p(y|x)0.10.05001020304050607010.50010203040506070x = dataFrom Prof. Antonio Torralba course slide

Naïve Bayesian model (c: class, w: visual words)Once we have learnt the distribution, for a query imageGenerative model: An exampleBayesianNetworkscw1wn…

Generative model: Another exampleMixture Gaussian ModelHow to infer from unlabeled data even if weknow the underlining probability distribution structure? ?

A graphical modelObject classcP(c)Inverse VarianceMeanγμP(γ|c)P(μ|c)Observed dataxP(x|μ,γ)Directed graph

Nodes represent variablesHiddenLinks show dependencies

Conditional distributions at each nodeInference of latent variablesExpectation maximization (EM)“Soft guess” latent variable first (E-step)Based on latent variable (assume it is correct), solve optimization problem (M-step)Markov-chain Monte Carlo (MCMC)

Use Gibbs sampling from the Posterior

Variational method/Variational Message Passing (VMP)

Algorithms that convert inference problems into optimization problems (Opper and Saad 2001; Wainwright and Jordan 2003)Image from Wikipedia

Back to the topic: the paperbag of wordsKey Ideas:Latent topics are spatially coherentGenerate topic distribution at the region levelOver-segmentation, then merge by same topicsAvoid obtaining regions larger than the objectsOne topic per regionCan recognize objects with occlusionoversegmentationDescribe a region:

Homogeneous Appearance ar: average of color or texture features

Concurrent segmentation and classificationSpatial Latent Topic ModelNotation:Image IdRegion r = {1,2,…,Rd}Latent topic zr= {1,2,…,K}appearance ar = {1,2,…,A}visual words wr = (wr1,wr2,…, wrMr); wr1 = {1,2,…,W}P(zr |θd): topic probability (Multinomial distribution) parameterized by θdP(θd|λ): Dirichlet prior of θd, parameterized by λα, β: parameters describing the probability of generating appearance and visual words given topic

Spatial Latent Topic Model (Unsupervised)MultinomialDirichletpriorMaximize Log-likelihoodan optimization problem: close-formed solution is intractable

Variaitional Message Passing (Winn 2005)Coupling hidden variables θ, α, β makes the maximization intractableInstead, maximize the lower bound of L Goal: Find a tractable Q(H) that closely approximates the true posterior distribution P(H|V) (equality holds for any distribution Q)←Or equivalently, minimize KL(Q||P)

Variaitional Message Passing (Winn 2005)Further factorization assumptions (Jordan et al., 1999; Jaakkola, 2001; Parisi, 1988) (restrict the family of distributions Q)Entropy term=Where,

Variaitional Message Passing (Winn 2005)Eqn. (6) in the paperBayesian networks representationMarkov blanket:

Spatial Latent Topic Model (Supervised)Now it becomes C x K matrix, i.e. θ depends on observed cFor a query image,Id , find its most probable category c:

Spatially Coherent Latent Topic Model For Concurrent Object Segmentation and Classification

More Related Content

What's hot (20)

Similar to Spatially Coherent Latent Topic Model For Concurrent Object Segmentation and Classification (20)

More from Shao-Chuan Wang (10)

Recently uploaded (20)

Spatially Coherent Latent Topic Model For Concurrent Object Segmentation and Classification