SlideShare a Scribd company logo
ICCV2013 reading
2014.3.28
Akisato Kimura (@_akisato)
Paper to read
(Presented at ICCV2013)
Problem dealing with in this paper
• Learning using privileged information (LUPI)
– Training
• Feature vectors : 𝑋𝑋 = 𝑥𝑥1, … , 𝑥𝑥𝑁𝑁 , 𝑥𝑥𝑖𝑖 ∈ ℝ𝑑𝑑
• Label annotation : 𝑌𝑌 = 𝑦𝑦1, … , 𝑦𝑦𝑁𝑁 , 𝑦𝑦𝑖𝑖 ∈ ℕ
• Additional information : 𝑋𝑋∗ = 𝑥𝑥1
∗
, … , 𝑥𝑥𝑁𝑁
∗
, 𝑥𝑥𝑖𝑖
∗
∈ ℝ𝑑𝑑∗
– Testing
• Prediction function : 𝑓𝑓: ℝ𝑑𝑑 → ℕ
• No additional information required
Privileged information??
• Applicable to several scenarios in CV
Formulation
• Generic supervised binary classification
– Training
• Feature vectors : 𝑋𝑋 = 𝑥𝑥1, … , 𝑥𝑥𝑁𝑁 , 𝑥𝑥𝑖𝑖 ∈ ℝ𝑑𝑑
• Label annotation : 𝑌𝑌 = 𝑦𝑦1, … , 𝑦𝑦𝑁𝑁 , 𝑦𝑦𝑖𝑖 ∈ {+1, −1}
• Additional information : 𝑋𝑋∗ = 𝑥𝑥1
∗
, … , 𝑥𝑥𝑁𝑁
∗
, 𝑥𝑥𝑖𝑖
∗
∈ ℝ𝑑𝑑∗
– Testing
• Prediction function : 𝑓𝑓: ℝ𝑑𝑑 → ℝ
• No additional information required
Key idea
• Privileged information allow us to distinguish
between easy and hard examples
– If the privileged data is easy to classify, then the
original data would also be easy to classify.
– … under the assumption that the privileged data is
similarly informative about the problem at hand.
Linear SVM
• Ordinary convergence rate = 𝑂𝑂(𝑁𝑁−1/2
)
• It improves to 𝑂𝑂(𝑁𝑁−1
)
– if we knew the optimal slack values 𝜉𝜉𝑖𝑖 in advance
(OracleSVM [Vapnik+ 2009])
min
𝑤𝑤∈ℝ𝑑𝑑,𝑏𝑏∈ℝ,𝜉𝜉𝑖𝑖∈ℝ
Slack variables in SVM
• Slack variables tell us which training examples
are easy / hard to classify
– 𝜉𝜉𝑖𝑖 = 0 → easy
– 𝜉𝜉𝑖𝑖 ≫ 0 → hard
min
𝑤𝑤∈ℝ𝑑𝑑,𝑏𝑏∈ℝ,𝜉𝜉𝑖𝑖∈ℝ
SVM+
• A 1st model for LUPI
– Use privileged data as a proxy to the oracle
– Parameterize 𝜉𝜉𝑖𝑖 = 𝑤𝑤∗, 𝑥𝑥𝑖𝑖
∗
+ 𝑏𝑏∗
[Vapnik+ NN2009, NIPS2010]
Why should SVM+ be improved?
• Cannot be solved by popular SVM packages
– Although good optimization algorithms were
derived [Pechyony+ 2011], they work only with the dual.
Learning to rank setup instead
• Underlying idea is the same
• Using the privileged data to identify easy /
hard-to-separate sample pairs
– Instead of using it to identify easy / hard-to-
classify samples
SVMrank
• Slack variables tell us which training example
pairs are easy / hard / impossible to separate
[Joachims KDD2002]
Proposed method: Rank transfer
• The strategy is similar to SVM+, but indirect.
1. SVMrank on 𝑋𝑋∗ (The ranking function 𝑓𝑓∗)
2. Margins 𝜌𝜌𝑖𝑖𝑖𝑖 = 𝑓𝑓∗
𝑥𝑥𝑖𝑖
∗
− 𝑓𝑓∗
(𝑥𝑥𝑗𝑗
∗
) ∀𝑖𝑖, 𝑗𝑗 𝑦𝑦𝑖𝑖 > 𝑦𝑦𝑗𝑗
• 𝜌𝜌𝑖𝑖𝑖𝑖 ≫ 0 : easy, 𝜌𝜌𝑖𝑖𝑖𝑖 ≈ 0 : hard, 𝜌𝜌𝑖𝑖𝑖𝑖 < 0 : impossible
3. SVMrank on 𝑋𝑋 with data-dependent margins
Intuition
• If it was difficult to correctly rank a pair on 𝑋𝑋∗
,
also it will also be difficult on 𝑋𝑋
1. Pairs (𝑖𝑖, 𝑗𝑗) with small margins 𝜌𝜌𝑖𝑖𝑖𝑖 have more
limited influence on 𝑤𝑤
2. Incorrectly ranked pairs are ignored.
1.
2.
Why not Rank transfer?
• We can use standard SVM packages!
– For the SVMrank on 𝑋𝑋∗ this is clear.
– For the SVMrank on 𝑋𝑋 we need variable
transformations
Experiments
• 4 different types of privileged information
– All of those can be handled in a unified framework.
• 4 different methods to be compared
– SVM, SVMrank, SVM+, Rank transfer
• Evaluation metric = Average Precision
(1) Attributes as privileged info
• Animals with Attributes Dataset
– 10 species ( = classes), 85 properties ( = attributes)
• Features: 2000-dim SURF
• Privileged: 85-dim predicted attributes
[Lampert+ PAMI2014]
• Learn 1-vs-1 classifiers with 100 training
samples
(1) Results
• Rank transfer is the best.
ICCV2013 reading: Learning to rank using privileged information
(2) Bounding box as privileged info
• Fine-grained setup on ILSVRC2012
– 17 classes with variety of snakes
• Features: 4096-dim Fisher vector from the
whole images
• Privileged: 4096-dim Fisher vector from the
bounding box regions
• Learn 1-vs-rest classifiers
(2) Results
• SVM+ is the best, ranking strategies do not
seem suitable for this setup.
(3) Texts as privileged info
• IsraelImages dataset [Bekkerman+ CVPR2007]
– 11 classes, 1800 images with a textual description
up to 18 words
• Features: 4096-dim Fisher vectors
• Privileged: BoWs from the texts
• Learn 1-vs-1 classifiers
Desert Trees
(3) Results
• Reference (privileged only) is the best
• All the others produce almost the same.
– Note that, high accuracy in the privileged space
does not necessarily mean that the privileged
information is helpful for the target task.
(4) Rationales as privileged info
• Hot or Not dataset [Donahue+ ICCV2011]
• Features: 500-dim densely sampled SIFT from
the whole image
• Privileged: 500-dim densely sampled SIFT
from the rationales
(4) Results
• Reference is the best.
• Rank transfer performs better for male class.
• Hard to draw a conclusion.
Appendix: Margin transfer
• One possible alternative to Rank transfer
But not so good…
Last words
• The idea is nice, easy to use.
• More privileged information, better
performance? --- needs discussions
• Which types of privileged information are
suitable? --- unknown

More Related Content

PPTX
Ml9 introduction to-unsupervised_learning_and_clustering_methods
PPTX
Ot regularization and_gradient_descent
PPTX
Ml6 decision trees
PPTX
Machine learning and_nlp
PPTX
Ml7 bagging
PDF
Boosted tree
PPTX
07 learning
PDF
Machine Learning Lecture 2 Basics
Ml9 introduction to-unsupervised_learning_and_clustering_methods
Ot regularization and_gradient_descent
Ml6 decision trees
Machine learning and_nlp
Ml7 bagging
Boosted tree
07 learning
Machine Learning Lecture 2 Basics

What's hot (20)

PPTX
Deep learning with TensorFlow
PDF
Face recognition and deep learning โดย ดร. สรรพฤทธิ์ มฤคทัต NECTEC
PPTX
Machine Learning - Dataset Preparation
PPTX
Introduction to Deep learning and H2O for beginner's
PDF
Lecture 8: Decision Trees & k-Nearest Neighbors
PPTX
Decision Trees
PDF
Introduction to Boosted Trees by Tianqi Chen
PPTX
06 image features
PPTX
Matrix decomposition and_applications_to_nlp
PDF
ID3 Algorithm & ROC Analysis
PPTX
Support vector machine
PPTX
Svm algorithm
PPT
002.decision trees
PPTX
02 image processing
PDF
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
PDF
Deep learning and image analytics using Python by Dr Sanparit
PPTX
Ml10 dimensionality reduction-and_advanced_topics
PPTX
Ppt shuai
PPTX
Basics of Machine Learning
Deep learning with TensorFlow
Face recognition and deep learning โดย ดร. สรรพฤทธิ์ มฤคทัต NECTEC
Machine Learning - Dataset Preparation
Introduction to Deep learning and H2O for beginner's
Lecture 8: Decision Trees & k-Nearest Neighbors
Decision Trees
Introduction to Boosted Trees by Tianqi Chen
06 image features
Matrix decomposition and_applications_to_nlp
ID3 Algorithm & ROC Analysis
Support vector machine
Svm algorithm
002.decision trees
02 image processing
Sergei Vassilvitskii, Research Scientist, Google at MLconf NYC - 4/15/16
Deep learning and image analytics using Python by Dr Sanparit
Ml10 dimensionality reduction-and_advanced_topics
Ppt shuai
Basics of Machine Learning
Ad

Viewers also liked (7)

PDF
Linear classifier
PDF
NIPS2015 reading - Learning visual biases from human imagination
PDF
NIPS2014 reading - Top rank optimization in linear time
PDF
多変量解析の一般化
PDF
CVPR2016 reading - 特徴量学習とクロスモーダル転移について
PPTX
第1回ステアラボ人工知能セミナー(オープニング)
PPTX
深層学習による自然言語処理の研究動向
Linear classifier
NIPS2015 reading - Learning visual biases from human imagination
NIPS2014 reading - Top rank optimization in linear time
多変量解析の一般化
CVPR2016 reading - 特徴量学習とクロスモーダル転移について
第1回ステアラボ人工知能セミナー(オープニング)
深層学習による自然言語処理の研究動向
Ad

Similar to ICCV2013 reading: Learning to rank using privileged information (20)

PPTX
Support Vector Machines USING MACHINE LEARNING HOW IT WORKS
PPTX
EDAB - Support Vector Machines Module - 6..pptx
PDF
Data Science - Part IX - Support Vector Machine
PPTX
Machine learning interviews day2
DOC
Introduction to Support Vector Machines
PDF
LUPI (Learning Using Privileged Information)
PPTX
support vector machine 1.pptx
PDF
Single to multiple kernel learning with four popular svm kernels (survey)
PPT
Support Vector Machines
PPTX
SVM[Support vector Machine] Machine learning
PPTX
Statistical Machine Learning unit4 lecture notes
PPT
2.6 support vector machines and associative classifiers revised
PPTX
Classification-Support Vector Machines.pptx
PDF
Understanding the Machine Learning Algorithms
PDF
Opinion mining framework using proposed RB-bayes model for text classication
PPTX
Support Vector Machine topic of machine learning.pptx
PDF
properties, application and issues of support vector machine
PPTX
Module 3 -Support Vector Machines data mining
PDF
Support Vector Machine Optimal Kernel Selection
PPT
Introduction to Support Vector Machine 221 CMU.ppt
Support Vector Machines USING MACHINE LEARNING HOW IT WORKS
EDAB - Support Vector Machines Module - 6..pptx
Data Science - Part IX - Support Vector Machine
Machine learning interviews day2
Introduction to Support Vector Machines
LUPI (Learning Using Privileged Information)
support vector machine 1.pptx
Single to multiple kernel learning with four popular svm kernels (survey)
Support Vector Machines
SVM[Support vector Machine] Machine learning
Statistical Machine Learning unit4 lecture notes
2.6 support vector machines and associative classifiers revised
Classification-Support Vector Machines.pptx
Understanding the Machine Learning Algorithms
Opinion mining framework using proposed RB-bayes model for text classication
Support Vector Machine topic of machine learning.pptx
properties, application and issues of support vector machine
Module 3 -Support Vector Machines data mining
Support Vector Machine Optimal Kernel Selection
Introduction to Support Vector Machine 221 CMU.ppt

More from Akisato Kimura (20)

PPTX
Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...
PPTX
Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...
PDF
CVPR2015 reading "Global refinement of random forest"
PDF
CVPR2015 reading "Understainding image virality" (in Japanese)
PDF
Computational models of human visual attention driven by auditory cues
PDF
CVPR2014 reading "Reconstructing storyline graphs for image recommendation fr...
PDF
ACMMM 2013 reading: Large-scale visual sentiment ontology and detectors using...
PDF
IJCAI13 Paper review: Large-scale spectral clustering on graphs
PDF
関西CVPR勉強会 2012.10.28
PDF
関西CVPR勉強会 2012.7.29
PDF
ICWSM12 Brief Review
PDF
関西CVPRML勉強会 2012.2.18 (一般物体認識 - データセット)
PDF
関西CVPRML勉強会(特定物体認識) 2012.1.14
PDF
人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -
PDF
IBIS2011 企画セッション「CV/PRで独自の進化を遂げる学習・最適化技術」 趣旨説明
PDF
立命館大学 AMLコロキウム 2011.10.20
PDF
広島画像情報学セミナ 2011.9.16
PDF
関西CVPRML勉強会 2011.9.23
PDF
関西CVPRML 2011.8.27
PDF
CVPR2011祭り 発表スライド
Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...
Paper reading - Dropout as a Bayesian Approximation: Representing Model Uncer...
CVPR2015 reading "Global refinement of random forest"
CVPR2015 reading "Understainding image virality" (in Japanese)
Computational models of human visual attention driven by auditory cues
CVPR2014 reading "Reconstructing storyline graphs for image recommendation fr...
ACMMM 2013 reading: Large-scale visual sentiment ontology and detectors using...
IJCAI13 Paper review: Large-scale spectral clustering on graphs
関西CVPR勉強会 2012.10.28
関西CVPR勉強会 2012.7.29
ICWSM12 Brief Review
関西CVPRML勉強会 2012.2.18 (一般物体認識 - データセット)
関西CVPRML勉強会(特定物体認識) 2012.1.14
人間の視覚的注意を予測するモデル - 動的ベイジアンネットワークに基づく 最新のアプローチ -
IBIS2011 企画セッション「CV/PRで独自の進化を遂げる学習・最適化技術」 趣旨説明
立命館大学 AMLコロキウム 2011.10.20
広島画像情報学セミナ 2011.9.16
関西CVPRML勉強会 2011.9.23
関西CVPRML 2011.8.27
CVPR2011祭り 発表スライド

Recently uploaded (20)

PDF
Electronic commerce courselecture one. Pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PPTX
Cloud computing and distributed systems.
PDF
Machine learning based COVID-19 study performance prediction
PDF
Modernizing your data center with Dell and AMD
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPT
Teaching material agriculture food technology
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Empathic Computing: Creating Shared Understanding
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Big Data Technologies - Introduction.pptx
Electronic commerce courselecture one. Pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Digital-Transformation-Roadmap-for-Companies.pptx
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Cloud computing and distributed systems.
Machine learning based COVID-19 study performance prediction
Modernizing your data center with Dell and AMD
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
Teaching material agriculture food technology
Advanced methodologies resolving dimensionality complications for autism neur...
Empathic Computing: Creating Shared Understanding
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Unlocking AI with Model Context Protocol (MCP)
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
MYSQL Presentation for SQL database connectivity
Chapter 3 Spatial Domain Image Processing.pdf
Diabetes mellitus diagnosis method based random forest with bat algorithm
Big Data Technologies - Introduction.pptx

ICCV2013 reading: Learning to rank using privileged information

  • 3. Problem dealing with in this paper • Learning using privileged information (LUPI) – Training • Feature vectors : 𝑋𝑋 = 𝑥𝑥1, … , 𝑥𝑥𝑁𝑁 , 𝑥𝑥𝑖𝑖 ∈ ℝ𝑑𝑑 • Label annotation : 𝑌𝑌 = 𝑦𝑦1, … , 𝑦𝑦𝑁𝑁 , 𝑦𝑦𝑖𝑖 ∈ ℕ • Additional information : 𝑋𝑋∗ = 𝑥𝑥1 ∗ , … , 𝑥𝑥𝑁𝑁 ∗ , 𝑥𝑥𝑖𝑖 ∗ ∈ ℝ𝑑𝑑∗ – Testing • Prediction function : 𝑓𝑓: ℝ𝑑𝑑 → ℕ • No additional information required
  • 4. Privileged information?? • Applicable to several scenarios in CV
  • 5. Formulation • Generic supervised binary classification – Training • Feature vectors : 𝑋𝑋 = 𝑥𝑥1, … , 𝑥𝑥𝑁𝑁 , 𝑥𝑥𝑖𝑖 ∈ ℝ𝑑𝑑 • Label annotation : 𝑌𝑌 = 𝑦𝑦1, … , 𝑦𝑦𝑁𝑁 , 𝑦𝑦𝑖𝑖 ∈ {+1, −1} • Additional information : 𝑋𝑋∗ = 𝑥𝑥1 ∗ , … , 𝑥𝑥𝑁𝑁 ∗ , 𝑥𝑥𝑖𝑖 ∗ ∈ ℝ𝑑𝑑∗ – Testing • Prediction function : 𝑓𝑓: ℝ𝑑𝑑 → ℝ • No additional information required
  • 6. Key idea • Privileged information allow us to distinguish between easy and hard examples – If the privileged data is easy to classify, then the original data would also be easy to classify. – … under the assumption that the privileged data is similarly informative about the problem at hand.
  • 7. Linear SVM • Ordinary convergence rate = 𝑂𝑂(𝑁𝑁−1/2 ) • It improves to 𝑂𝑂(𝑁𝑁−1 ) – if we knew the optimal slack values 𝜉𝜉𝑖𝑖 in advance (OracleSVM [Vapnik+ 2009]) min 𝑤𝑤∈ℝ𝑑𝑑,𝑏𝑏∈ℝ,𝜉𝜉𝑖𝑖∈ℝ
  • 8. Slack variables in SVM • Slack variables tell us which training examples are easy / hard to classify – 𝜉𝜉𝑖𝑖 = 0 → easy – 𝜉𝜉𝑖𝑖 ≫ 0 → hard min 𝑤𝑤∈ℝ𝑑𝑑,𝑏𝑏∈ℝ,𝜉𝜉𝑖𝑖∈ℝ
  • 9. SVM+ • A 1st model for LUPI – Use privileged data as a proxy to the oracle – Parameterize 𝜉𝜉𝑖𝑖 = 𝑤𝑤∗, 𝑥𝑥𝑖𝑖 ∗ + 𝑏𝑏∗ [Vapnik+ NN2009, NIPS2010]
  • 10. Why should SVM+ be improved? • Cannot be solved by popular SVM packages – Although good optimization algorithms were derived [Pechyony+ 2011], they work only with the dual.
  • 11. Learning to rank setup instead • Underlying idea is the same • Using the privileged data to identify easy / hard-to-separate sample pairs – Instead of using it to identify easy / hard-to- classify samples
  • 12. SVMrank • Slack variables tell us which training example pairs are easy / hard / impossible to separate [Joachims KDD2002]
  • 13. Proposed method: Rank transfer • The strategy is similar to SVM+, but indirect. 1. SVMrank on 𝑋𝑋∗ (The ranking function 𝑓𝑓∗) 2. Margins 𝜌𝜌𝑖𝑖𝑖𝑖 = 𝑓𝑓∗ 𝑥𝑥𝑖𝑖 ∗ − 𝑓𝑓∗ (𝑥𝑥𝑗𝑗 ∗ ) ∀𝑖𝑖, 𝑗𝑗 𝑦𝑦𝑖𝑖 > 𝑦𝑦𝑗𝑗 • 𝜌𝜌𝑖𝑖𝑖𝑖 ≫ 0 : easy, 𝜌𝜌𝑖𝑖𝑖𝑖 ≈ 0 : hard, 𝜌𝜌𝑖𝑖𝑖𝑖 < 0 : impossible 3. SVMrank on 𝑋𝑋 with data-dependent margins
  • 14. Intuition • If it was difficult to correctly rank a pair on 𝑋𝑋∗ , also it will also be difficult on 𝑋𝑋 1. Pairs (𝑖𝑖, 𝑗𝑗) with small margins 𝜌𝜌𝑖𝑖𝑖𝑖 have more limited influence on 𝑤𝑤 2. Incorrectly ranked pairs are ignored. 1. 2.
  • 15. Why not Rank transfer? • We can use standard SVM packages! – For the SVMrank on 𝑋𝑋∗ this is clear. – For the SVMrank on 𝑋𝑋 we need variable transformations
  • 16. Experiments • 4 different types of privileged information – All of those can be handled in a unified framework. • 4 different methods to be compared – SVM, SVMrank, SVM+, Rank transfer • Evaluation metric = Average Precision
  • 17. (1) Attributes as privileged info • Animals with Attributes Dataset – 10 species ( = classes), 85 properties ( = attributes) • Features: 2000-dim SURF • Privileged: 85-dim predicted attributes [Lampert+ PAMI2014] • Learn 1-vs-1 classifiers with 100 training samples
  • 18. (1) Results • Rank transfer is the best.
  • 20. (2) Bounding box as privileged info • Fine-grained setup on ILSVRC2012 – 17 classes with variety of snakes • Features: 4096-dim Fisher vector from the whole images • Privileged: 4096-dim Fisher vector from the bounding box regions • Learn 1-vs-rest classifiers
  • 21. (2) Results • SVM+ is the best, ranking strategies do not seem suitable for this setup.
  • 22. (3) Texts as privileged info • IsraelImages dataset [Bekkerman+ CVPR2007] – 11 classes, 1800 images with a textual description up to 18 words • Features: 4096-dim Fisher vectors • Privileged: BoWs from the texts • Learn 1-vs-1 classifiers Desert Trees
  • 23. (3) Results • Reference (privileged only) is the best • All the others produce almost the same. – Note that, high accuracy in the privileged space does not necessarily mean that the privileged information is helpful for the target task.
  • 24. (4) Rationales as privileged info • Hot or Not dataset [Donahue+ ICCV2011] • Features: 500-dim densely sampled SIFT from the whole image • Privileged: 500-dim densely sampled SIFT from the rationales
  • 25. (4) Results • Reference is the best. • Rank transfer performs better for male class. • Hard to draw a conclusion.
  • 26. Appendix: Margin transfer • One possible alternative to Rank transfer
  • 27. But not so good…
  • 28. Last words • The idea is nice, easy to use. • More privileged information, better performance? --- needs discussions • Which types of privileged information are suitable? --- unknown