SlideShare a Scribd company logo
Deep Learning &
Feature Learning
Methods for Vision



    ’
Tutorial Overview
Overview
•
    –


•
    –
    –
    –


•
Existing Recognition Approach




•

•
Motivation
•

•
    –


•
What Limits Current Performance?
•
    –

•
Hand-Crafted Features
•       β
    –

•
    –

•



Mid-Level Representations
•




“   ”


•



•                   
Why Learn Features?

•

•
    –
    –
    –


•
    –
    –
Why Hierarchy?


     fi
Hierarchies in Vision
•
    –

•
    –
Hierarchies in Vision
•

•
Learning a Hierarchy
    of Feature Extractors
•


•           
•




•
Multistage Hubel-Wiesel Architecture

•
•
•



•
•
•
•
Classic Approach to Training

•
    –
    –
    –



•
    –

    –
Deep Learning

•

•

•

•
Single Layer Architecture
Example Feature Learning Architectures
SIFT Descriptor
Spatial Pyramid Matching
Filtering

•
    –
Filtering

•
    –
    –
        


                .
                .
                .
Translation Equivariance

•              
    –
    –
Filtering

•
    –
    –
Filtering

•
    –
    –
    –
    –
Normalization

•
    •
Normalization
•
    –                   
    –
Normalization
•
    –
    –
Role of Normalization
•

    – “      ”
    –
    –


•
                       |.|1   |.|1   |.|1   |.|1
Pooling
•
    –
    –
    –
Role of Pooling
•
    –
    –
Role of Pooling

•
    •
    •
    •
Unsupervised Learning

•
•



•
    –
    –
    –
Auto-Encoder
Auto-Encoder Example 1
•




          σ(WTz)      σ(Wx)
    σ                            σ
Auto-Encoder Example 2
•




        Dz        σ(Wx)
                             σ
Auto-Encoder Example 2
•




        Dz        σ(Wx)
                             σ
Taxonomy of Approaches

•
    –
    –
    –

•
    –
    –
•
    –
Stacked Auto-Encoders
At Test Time

•
•

•




•
Information Flow in Vision Models

•

•

    –
    –



•
    –
Deep Boltzmann Machines
Why is Top-Down important?
•

•

•
Multi-Scale Models
•
    •
    •




        HOG Pyramid
Hierarchical Model
•




    Input Image/ Features    Input Image/ Features
Multi-scale        vs   Hierarchical




 Feature Pyramid          Input Image/ Features
Structure Spectrum
•
    –
    –
    –




•
    –
    –
Structure Spectrum
•
    –
    –
Structure Spectrum
•


•
Structure Spectrum

•
    –
    –
Structure Spectrum

•
•

•
Structure Spectrum

•

    –
    –
Structure Spectrum
•
    –
•
    –
•
Structure Spectrum
•
    –


    –

    –
Structure Spectrum

•
    –

    –

    –
Performance of Deep Learning
•
•
    –
        •
    –
        •
•
    –
•
    –

•
Summary

•
    –


•
•

•
Further Resources

•
•

•

•

    –
•
P01 introduction cvpr2012 deep learning methods for vision
References
•
•


•


•
•
•
•

 •
•
•
•
References
•
•

•
•


•

•
•


•
References
•
•


•
•

•
•


•
•
References
•
•

•
•
•

•
•

•
•
References
•
•

•
•


•

•
•
References
•
•

•
•

•
•

•

•
•

•
References
•
•


•
•

•


•
•

•
•
References

•
•
•




•

More Related Content

PDF
India digital-future-in-focus-2013
PPT
Ellig Costs And Consequences Of Telecom Regulation Feb 2005
PPTX
Limbic system and memory
PPTX
Digital carrier modulation
PPTX
Hart - Highway Addressable Remote Transducer Protocol
PPTX
Cognitive Radio Spectrum Sensing 1586 ppt
PPTX
Hart protocol
PDF
India digital-future-in-focus-2013
Ellig Costs And Consequences Of Telecom Regulation Feb 2005
Limbic system and memory
Digital carrier modulation
Hart - Highway Addressable Remote Transducer Protocol
Cognitive Radio Spectrum Sensing 1586 ppt
Hart protocol

Similar to P01 introduction cvpr2012 deep learning methods for vision (20)

PDF
From Pixels to Understanding: Deep Learning's Impact on Image Classification ...
PDF
Icml2012 learning hierarchies of invariant features
PDF
Trade-off between recognition an reconstruction: Application of Robotics Visi...
PDF
Journal_IEEE_2023.pdf
PPT
Fcv learn fergus
PDF
Easy to learn deep learning guide - elementry
PPTX
Face recognition using artificial neural network
PDF
Fcv cross hebert
PDF
Fcv learn le_cun
PDF
MIRU2014 SLAC
PDF
P04 restricted boltzmann machines cvpr2012 deep learning methods for vision
PPTX
Conventional Neural Networks and compute
PPTX
Unit 4 Object Recognition and Classification.pptx
PDF
Book study of jilid 1bbDeep-Learning.pdf
PPTX
Deep Learning: Towards General Artificial Intelligence
PPTX
Deep Learning in Computer Vision
PDF
Quoc Le, Stanford & Google - Tera Scale Deep Learning
PPTX
Neural Networks and Deep Learning Basics
PPTX
Deep learning
PDF
3234150
From Pixels to Understanding: Deep Learning's Impact on Image Classification ...
Icml2012 learning hierarchies of invariant features
Trade-off between recognition an reconstruction: Application of Robotics Visi...
Journal_IEEE_2023.pdf
Fcv learn fergus
Easy to learn deep learning guide - elementry
Face recognition using artificial neural network
Fcv cross hebert
Fcv learn le_cun
MIRU2014 SLAC
P04 restricted boltzmann machines cvpr2012 deep learning methods for vision
Conventional Neural Networks and compute
Unit 4 Object Recognition and Classification.pptx
Book study of jilid 1bbDeep-Learning.pdf
Deep Learning: Towards General Artificial Intelligence
Deep Learning in Computer Vision
Quoc Le, Stanford & Google - Tera Scale Deep Learning
Neural Networks and Deep Learning Basics
Deep learning
3234150
Ad

More from zukun (20)

PDF
My lyn tutorial 2009
PDF
ETHZ CV2012: Tutorial openCV
PDF
ETHZ CV2012: Information
PDF
Siwei lyu: natural image statistics
PDF
Lecture9 camera calibration
PDF
Brunelli 2008: template matching techniques in computer vision
PDF
Modern features-part-4-evaluation
PDF
Modern features-part-3-software
PDF
Modern features-part-2-descriptors
PDF
Modern features-part-1-detectors
PDF
Modern features-part-0-intro
PDF
Lecture 02 internet video search
PDF
Lecture 01 internet video search
PDF
Lecture 03 internet video search
PDF
Icml2012 tutorial representation_learning
PPT
Advances in discrete energy minimisation for computer vision
PDF
Gephi tutorial: quick start
PDF
EM algorithm and its application in probabilistic latent semantic analysis
PDF
Object recognition with pictorial structures
PDF
Iccv2011 learning spatiotemporal graphs of human activities
My lyn tutorial 2009
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Information
Siwei lyu: natural image statistics
Lecture9 camera calibration
Brunelli 2008: template matching techniques in computer vision
Modern features-part-4-evaluation
Modern features-part-3-software
Modern features-part-2-descriptors
Modern features-part-1-detectors
Modern features-part-0-intro
Lecture 02 internet video search
Lecture 01 internet video search
Lecture 03 internet video search
Icml2012 tutorial representation_learning
Advances in discrete energy minimisation for computer vision
Gephi tutorial: quick start
EM algorithm and its application in probabilistic latent semantic analysis
Object recognition with pictorial structures
Iccv2011 learning spatiotemporal graphs of human activities
Ad

Recently uploaded (20)

PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
Complications of Minimal Access Surgery at WLH
PDF
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
Sports Quiz easy sports quiz sports quiz
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PDF
01-Introduction-to-Information-Management.pdf
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PPTX
master seminar digital applications in india
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PPTX
Lesson notes of climatology university.
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
VCE English Exam - Section C Student Revision Booklet
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PDF
Microbial disease of the cardiovascular and lymphatic systems
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Abdominal Access Techniques with Prof. Dr. R K Mishra
Complications of Minimal Access Surgery at WLH
Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf
Microbial diseases, their pathogenesis and prophylaxis
Sports Quiz easy sports quiz sports quiz
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
01-Introduction-to-Information-Management.pdf
Pharmacology of Heart Failure /Pharmacotherapy of CHF
master seminar digital applications in india
2.FourierTransform-ShortQuestionswithAnswers.pdf
Lesson notes of climatology university.
PPH.pptx obstetrics and gynecology in nursing
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
VCE English Exam - Section C Student Revision Booklet
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
Microbial disease of the cardiovascular and lymphatic systems
Renaissance Architecture: A Journey from Faith to Humanism

P01 introduction cvpr2012 deep learning methods for vision

Editor's Notes

  • #11: All I am going to say about Neuroscience, although techniques do have strong connections.
  • #14: Make clear that classic methods, e.g.convnets are purely supervised.
  • #15: Need to bring outdiffereceswrt to existing ML stuff, mainly unsupervised learning part. Make use of unlabaled data (lots of it).
  • #16: Restructure to bigger emphasis on unsupervised.Make clear that classic methods, e.g.convnets are purely supervised.
  • #18: Winder and Brown paper. Slightly smoothed view of things.
  • #19: Selection instead of normalization?
  • #20: Note pooling is across space, not across Gabor channelNormalization is really nonlinear (small elements not rescaled)
  • #21: Non-maximal suppression across VW. Like an L-InfnormalizationMax = k-means
  • #32: Graph not clear. Explain better. Y-axis is change in value
  • #33: Mention Leonardis & Fidler paper
  • #34: Too far for labels to trickle down (vanishing gradients)Only information from layer below.Input is supervision.
  • #37: Add overall energy
  • #42: Not separate operations Do it at the same
  • #43: Chriswilliams oral link
  • #44: Occlusion mask: bootom right quad for sofa interpretationCan’t decide locally If you knew solution, would know what features to extract.
  • #46: DPM is shape hierarchical HOG templates
  • #47: DPM is shape hierarchical HOG templates
  • #48: Song Chun ‘s clock