SlideShare a Scribd company logo
Spatially coherent latent topic model for concurrent object segmentation and classificationAuthors: Liangliang Cao, Li Fei-FeiPresenter: Shao-Chuan Wang
OutlineMotivationA Review on Graphical ModelsToday’s topic: the paperTheir Results
Motivation: Real world problem often full of “noises”Bags of words (local features)Spatial relationships of objects are ignored (has its limit)When classify a test image, what is its “subject” ?Flag?Banner?People?Sports field?From Prof. Fei-Fei’s ICCV09 tutorial slide
OutlineMotivationA Review on Graphical ModelsToday’s topic: the paperTheir Results
Generative vs Discriminative Generative model: model p(x, y) or p(x|y)p(y)Discriminative model: model p(y|x)0.10.05001020304050607010.50010203040506070x = dataFrom Prof. Antonio Torralba course slide
Naïve Bayesian model (c: class, w: visual words)Once we have learnt the distribution, for a query imageGenerative model: An exampleBayesianNetworkscw1wn…
Generative model: Another exampleMixture Gaussian ModelHow to infer from unlabeled data even if weknow the underlining  probability distribution structure? ?
A graphical modelObject classcP(c)Inverse VarianceMeanγμP(γ|c)P(μ|c)Observed dataxP(x|μ,γ)Directed graph
Nodes represent variablesHiddenLinks show dependencies
Conditional distributions   at each nodeInference of latent variablesExpectation maximization (EM)“Soft guess” latent variable first (E-step)Based on latent variable (assume it is correct), solve optimization problem (M-step)Markov-chain Monte Carlo (MCMC)
Use Gibbs sampling from the Posterior
Slow to converge
Variational method/Variational Message Passing (VMP)
Algorithms that convert inference problems into optimization problems (Opper and Saad 2001; Wainwright and Jordan 2003)Image from Wikipedia
OutlineMotivationA Review on Graphical ModelsToday’s topic: the paperTheir Results
Back to the topic: the paperbag of wordsKey Ideas:Latent topics are spatially coherentGenerate topic distribution at the region levelOver-segmentation, then merge by same topicsAvoid obtaining regions larger than the objectsOne topic per regionCan recognize objects with occlusionoversegmentationDescribe a region:
Homogeneous Appearance ar: average of color or texture features
SIFT-based visual words: wr
Concurrent segmentation and classificationSpatial Latent Topic ModelNotation:Image IdRegion r = {1,2,…,Rd}Latent topic zr= {1,2,…,K}appearance ar = {1,2,…,A}visual words wr = (wr1,wr2,…, wrMr); wr1 = {1,2,…,W}P(zr |θd): topic probability (Multinomial distribution) parameterized by θdP(θd|λ): Dirichlet prior of θd, parameterized by λα, β: parameters describing the probability of generating appearance and visual words given topic
Spatial Latent Topic Model (Unsupervised)MultinomialDirichletpriorMaximize Log-likelihoodan optimization problem: close-formed solution is intractable
Variaitional Message Passing (Winn 2005)Coupling hidden variables θ, α, β makes the maximization intractableInstead, maximize the lower bound of L Goal: Find a tractable Q(H) that closely approximates the true posterior distribution P(H|V)          (equality holds for any distribution Q)←Or equivalently, minimize KL(Q||P)
Variaitional Message Passing (Winn 2005)Further factorization assumptions (Jordan et al., 1999; Jaakkola, 2001; Parisi, 1988) (restrict the family of distributions Q)Entropy term=Where,
Variaitional Message Passing (Winn 2005)Eqn. (6) in the paperBayesian networks representationMarkov blanket:
Spatial Latent Topic Model (Supervised)Now it becomes C x K matrix, i.e. θ depends on observed cFor a query image,Id , find its most probable category c:

More Related Content

PDF
Random Forest for Big Data
PDF
Deep Learning Opening Workshop - Horseshoe Regularization for Machine Learnin...
PDF
An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...
PPT
16 17 bag_words
PDF
Pattern learning and recognition on statistical manifolds: An information-geo...
PDF
bag-of-words models
PDF
About functional SIR
PPTX
Generalization abstraction
Random Forest for Big Data
Deep Learning Opening Workshop - Horseshoe Regularization for Machine Learnin...
An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...
16 17 bag_words
Pattern learning and recognition on statistical manifolds: An information-geo...
bag-of-words models
About functional SIR
Generalization abstraction

What's hot (20)

PDF
Chris Dyer - 2017 - Neural MT Workshop Invited Talk: The Neural Noisy Channel...
PPSX
Prototype-based models in machine learning
PPT
Cvpr2007 object category recognition p1 - bag of words models
PDF
MLIP - Chapter 5 - Detection, Segmentation, Captioning
PDF
Kernel methods and variable selection for exploratory analysis and multi-omic...
PDF
Chapter 1 - Introduction
PDF
Lec15 graph laplacian embedding
PDF
MLIP - Chapter 2 - Preliminaries to deep learning
PDF
Multimodal Residual Networks for Visual QA
PPT
Machine learning by Dr. Vivek Vijay and Dr. Sandeep Yadav
PDF
About functional SIR
PDF
Subspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
PDF
MLIP - Chapter 3 - Introduction to deep learning
PPT
Constellation Models and Unsupervised Learning for Object Class Recognition
PDF
A pixel to-pixel segmentation method of DILD without masks using CNN and perl...
PPT
CS364 Artificial Intelligence Machine Learning
PPT
PPTX
06 cv mil_learning_and_inference
PDF
(DL輪読)Matching Networks for One Shot Learning
PDF
4 avrachenkov
Chris Dyer - 2017 - Neural MT Workshop Invited Talk: The Neural Noisy Channel...
Prototype-based models in machine learning
Cvpr2007 object category recognition p1 - bag of words models
MLIP - Chapter 5 - Detection, Segmentation, Captioning
Kernel methods and variable selection for exploratory analysis and multi-omic...
Chapter 1 - Introduction
Lec15 graph laplacian embedding
MLIP - Chapter 2 - Preliminaries to deep learning
Multimodal Residual Networks for Visual QA
Machine learning by Dr. Vivek Vijay and Dr. Sandeep Yadav
About functional SIR
Subspace Indexing on Grassmannian Manifold for Large Scale Visual Identification
MLIP - Chapter 3 - Introduction to deep learning
Constellation Models and Unsupervised Learning for Object Class Recognition
A pixel to-pixel segmentation method of DILD without masks using CNN and perl...
CS364 Artificial Intelligence Machine Learning
06 cv mil_learning_and_inference
(DL輪読)Matching Networks for One Shot Learning
4 avrachenkov
Ad

Similar to Spatially Coherent Latent Topic Model For Concurrent Object Segmentation and Classification (20)

ODP
Topic Modeling
PDF
Lecture14 xing fei-fei
PPTX
Tdm probabilistic models (part 2)
PPTX
Multi-modal sources for predictive modeling using deep learning
PDF
Probabilistic Topic models
PDF
The state of the art in integrating machine learning into visual analytics
PDF
Lec 1-2 ssdsdffffsssssfsdfsdfstGenAI.pdf
PDF
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
PDF
Lecture20 xing
PPTX
Recommenders, Topics, and Text
PPTX
Text mining meets neural nets
PPTX
Neural motifs scene graph parsing with global context
PDF
群衆の知を引き出すための機械学習(第4回ステアラボ人工知能セミナー)
PDF
Survey of Generative Clustering Models 2008
PDF
Mlj 2013 itm
PDF
(Hierarchical) Topic Modeling_Yueshen Xu
PPTX
UP-STAT 2015 Abstract Presentation - Statistical and Machine Learning Methods...
PDF
PDF
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
PDF
Leveraging high level and low-level features for multimedia event detection.2...
Topic Modeling
Lecture14 xing fei-fei
Tdm probabilistic models (part 2)
Multi-modal sources for predictive modeling using deep learning
Probabilistic Topic models
The state of the art in integrating machine learning into visual analytics
Lec 1-2 ssdsdffffsssssfsdfsdfstGenAI.pdf
Slides: Concurrent Inference of Topic Models and Distributed Vector Represent...
Lecture20 xing
Recommenders, Topics, and Text
Text mining meets neural nets
Neural motifs scene graph parsing with global context
群衆の知を引き出すための機械学習(第4回ステアラボ人工知能セミナー)
Survey of Generative Clustering Models 2008
Mlj 2013 itm
(Hierarchical) Topic Modeling_Yueshen Xu
UP-STAT 2015 Abstract Presentation - Statistical and Machine Learning Methods...
The Search for a New Visual Search Beyond Language - StampedeCon AI Summit 2017
Leveraging high level and low-level features for multimedia event detection.2...
Ad

More from Shao-Chuan Wang (10)

PPTX
Book Cover Recognition
PPTX
Introduction to Machine Learning
PPTX
Beyond The Euclidean Distance: Creating effective visual codebooks using the ...
PPTX
Self Taught Learning
PDF
A Friendly Guide To Sparse Coding
PPTX
An Exemplar Model For Learning Object Classes
PPTX
Evaluation Of Color Descriptors For Object And Scene
PPTX
Support Vector Machine
PPTX
About Python
PPTX
Image Classification And Support Vector Machine
Book Cover Recognition
Introduction to Machine Learning
Beyond The Euclidean Distance: Creating effective visual codebooks using the ...
Self Taught Learning
A Friendly Guide To Sparse Coding
An Exemplar Model For Learning Object Classes
Evaluation Of Color Descriptors For Object And Scene
Support Vector Machine
About Python
Image Classification And Support Vector Machine

Recently uploaded (20)

PDF
RMMM.pdf make it easy to upload and study
PDF
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PPTX
Lesson notes of climatology university.
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PDF
Yogi Goddess Pres Conference Studio Updates
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PPTX
Cell Structure & Organelles in detailed.
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
RMMM.pdf make it easy to upload and study
RTP_AR_KS1_Tutor's Guide_English [FOR REPRODUCTION].pdf
STATICS OF THE RIGID BODIES Hibbelers.pdf
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Lesson notes of climatology university.
2.FourierTransform-ShortQuestionswithAnswers.pdf
Yogi Goddess Pres Conference Studio Updates
Abdominal Access Techniques with Prof. Dr. R K Mishra
O5-L3 Freight Transport Ops (International) V1.pdf
Cell Structure & Organelles in detailed.
Final Presentation General Medicine 03-08-2024.pptx
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
Final Presentation General Medicine 03-08-2024.pptx
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Microbial disease of the cardiovascular and lymphatic systems
Introduction-to-Literarature-and-Literary-Studies-week-Prelim-coverage.pptx
Supply Chain Operations Speaking Notes -ICLT Program
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf

Spatially Coherent Latent Topic Model For Concurrent Object Segmentation and Classification

  • 1. Spatially coherent latent topic model for concurrent object segmentation and classificationAuthors: Liangliang Cao, Li Fei-FeiPresenter: Shao-Chuan Wang
  • 2. OutlineMotivationA Review on Graphical ModelsToday’s topic: the paperTheir Results
  • 3. Motivation: Real world problem often full of “noises”Bags of words (local features)Spatial relationships of objects are ignored (has its limit)When classify a test image, what is its “subject” ?Flag?Banner?People?Sports field?From Prof. Fei-Fei’s ICCV09 tutorial slide
  • 4. OutlineMotivationA Review on Graphical ModelsToday’s topic: the paperTheir Results
  • 5. Generative vs Discriminative Generative model: model p(x, y) or p(x|y)p(y)Discriminative model: model p(y|x)0.10.05001020304050607010.50010203040506070x = dataFrom Prof. Antonio Torralba course slide
  • 6. Naïve Bayesian model (c: class, w: visual words)Once we have learnt the distribution, for a query imageGenerative model: An exampleBayesianNetworkscw1wn…
  • 7. Generative model: Another exampleMixture Gaussian ModelHow to infer from unlabeled data even if weknow the underlining probability distribution structure? ?
  • 8. A graphical modelObject classcP(c)Inverse VarianceMeanγμP(γ|c)P(μ|c)Observed dataxP(x|μ,γ)Directed graph
  • 10. Conditional distributions at each nodeInference of latent variablesExpectation maximization (EM)“Soft guess” latent variable first (E-step)Based on latent variable (assume it is correct), solve optimization problem (M-step)Markov-chain Monte Carlo (MCMC)
  • 11. Use Gibbs sampling from the Posterior
  • 14. Algorithms that convert inference problems into optimization problems (Opper and Saad 2001; Wainwright and Jordan 2003)Image from Wikipedia
  • 15. OutlineMotivationA Review on Graphical ModelsToday’s topic: the paperTheir Results
  • 16. Back to the topic: the paperbag of wordsKey Ideas:Latent topics are spatially coherentGenerate topic distribution at the region levelOver-segmentation, then merge by same topicsAvoid obtaining regions larger than the objectsOne topic per regionCan recognize objects with occlusionoversegmentationDescribe a region:
  • 17. Homogeneous Appearance ar: average of color or texture features
  • 19. Concurrent segmentation and classificationSpatial Latent Topic ModelNotation:Image IdRegion r = {1,2,…,Rd}Latent topic zr= {1,2,…,K}appearance ar = {1,2,…,A}visual words wr = (wr1,wr2,…, wrMr); wr1 = {1,2,…,W}P(zr |θd): topic probability (Multinomial distribution) parameterized by θdP(θd|λ): Dirichlet prior of θd, parameterized by λα, β: parameters describing the probability of generating appearance and visual words given topic
  • 20. Spatial Latent Topic Model (Unsupervised)MultinomialDirichletpriorMaximize Log-likelihoodan optimization problem: close-formed solution is intractable
  • 21. Variaitional Message Passing (Winn 2005)Coupling hidden variables θ, α, β makes the maximization intractableInstead, maximize the lower bound of L Goal: Find a tractable Q(H) that closely approximates the true posterior distribution P(H|V) (equality holds for any distribution Q)←Or equivalently, minimize KL(Q||P)
  • 22. Variaitional Message Passing (Winn 2005)Further factorization assumptions (Jordan et al., 1999; Jaakkola, 2001; Parisi, 1988) (restrict the family of distributions Q)Entropy term=Where,
  • 23. Variaitional Message Passing (Winn 2005)Eqn. (6) in the paperBayesian networks representationMarkov blanket:
  • 24. Spatial Latent Topic Model (Supervised)Now it becomes C x K matrix, i.e. θ depends on observed cFor a query image,Id , find its most probable category c:
  • 25. ProcessTraining stepmaximize total likelihood of training images, subject λ, α, θ and zrThe learned λ, α are fixedTesting phase, for a query Image IdEstimate its θd and zrFor classification task, find its most probable latent topics as its categoryFor segmentation task, for the same zr, merge it.(3)
  • 26. OutlineMotivationA Review on Graphical ModelsToday’s topic: the paperTheir Results
  • 28. Experimental ResultsSupervised segmentationDataset13 classes of nature scenes# of training images: 100# of topics: 60# of categories: 13
  • 29. Experimental ResultsSupervised classificationDataset28 classes from Caltech 101# of training images: 30# of test images: 30# of topics in category: 28# of topics in clutter: 346 background classes are left unlabeled
  • 31. Variaitional Message PassingFollowing this framework, and use the graphical model provided by this paper: