P01 introduction cvpr2012 deep learning methods for vision

Download as PPTX, PDF

3 likes876 views

The document discusses deep learning and feature learning methods for computer vision. It provides an overview of existing recognition approaches, their limitations, and how learning hierarchical features from data can overcome these limitations. Deep learning methods like convolutional neural networks learn multiple levels of representation by building complex features from simpler ones in a hierarchical manner similar to the human visual system.

Education Technology

Deep Learning &
Feature Learning
Methods for Vision

’

Tutorial Overview

Overview
•
–

•
–
–
–

•

Existing Recognition Approach

•

•

Motivation
•

•
–

•

What Limits Current Performance?
•
–

•

Hand-Crafted Features
• β
–

•
–

•



Mid-Level Representations
•

“ ”

•

• 

Why Learn Features?

•

•
–
–
–

•
–
–

Why Hierarchy?

ﬁ

Hierarchies in Vision
•
–

•
–

Hierarchies in Vision
•

•

Learning a Hierarchy
of Feature Extractors
•

• 
•

•

Multistage Hubel-Wiesel Architecture

•
•
•

•
•
•
•

Classic Approach to Training

•
–
–
–

•
–

–

Deep Learning

•

•

•

•

Single Layer Architecture

Example Feature Learning Architectures

SIFT Descriptor

Spatial Pyramid Matching

Filtering

•
–

Filtering

•
–
–


.
.
.

Translation Equivariance

• 
–
–

Filtering

•
–
–

Filtering

•
–
–
–
–

Normalization

•
•

Normalization
•
– 
–

Normalization
•
–
–

Role of Normalization
•

– “ ”
–
–

•
|.|1 |.|1 |.|1 |.|1

Pooling
•
–
–
–

Role of Pooling
•
–
–

Role of Pooling

•
•
•
•

Unsupervised Learning

•
•

•
–
–
–

Auto-Encoder

Auto-Encoder Example 1
•

σ(WTz) σ(Wx)
σ σ

Auto-Encoder Example 2
•

Dz σ(Wx)
σ

Auto-Encoder Example 2
•

Dz σ(Wx)
σ

Taxonomy of Approaches

•
–
–
–

•
–
–
•
–

Stacked Auto-Encoders

At Test Time

•
•

•

•

Information Flow in Vision Models

•

•

–
–

•
–

Deep Boltzmann Machines

Why is Top-Down important?
•

•

•

Multi-Scale Models
•
•
•

HOG Pyramid

Hierarchical Model
•

Input Image/ Features Input Image/ Features

Multi-scale vs Hierarchical

Feature Pyramid Input Image/ Features

Structure Spectrum
•
–
–
–

•
–
–

Structure Spectrum
•
–
–

Structure Spectrum
•

•

Structure Spectrum

•
–
–

Structure Spectrum

•
•

•

Structure Spectrum

•

–
–

Structure Spectrum
•
–
•
–
•

Structure Spectrum
•
–

–

–

Structure Spectrum

•
–

–

–

Performance of Deep Learning
•
•
–
•
–
•
•
–
•
–

•

Summary

•
–

•
•

•

Further Resources

•
•

•

•

–
•

P01 introduction cvpr2012 deep learning methods for vision

References
•
•

•

•
•
•
•

•
•
•
•

References
•
•

•
•

•

•
•

•

References
•
•

•
•

•
•

•
•

References
•
•

•
•
•

•
•

•
•

References
•
•

•
•

•

•
•

References
•
•

•
•

•
•

•

•
•

•

References
•
•

•
•

•

•
•

•
•

References

•
•
•

•

Ad

Recommended

PDF

India digital-future-in-focus-2013

PPT

Ellig Costs And Consequences Of Telecom Regulation Feb 2005

Mercatus Center

PPTX

Limbic system and memory

Caleb Tinashe Munikwa

PPTX

Digital carrier modulation

PPTX

Hart - Highway Addressable Remote Transducer Protocol

Vasanthan Ravichandran

PPTX

Cognitive Radio Spectrum Sensing 1586 ppt

PPTX

Hart protocol

PDF

mscthesis

Miquel Perelló Nieto

PDF

From Pixels to Understanding: Deep Learning's Impact on Image Classification ...

PDF

Icml2012 learning hierarchies of invariant features

PDF

Trade-off between recognition an reconstruction: Application of Robotics Visi...

PDF

Journal_IEEE_2023.pdf

Guillermo Medina Zegarra

PPT

Fcv learn fergus

PDF

Easy to learn deep learning guide - elementry

PPTX

Face recognition using artificial neural network

PDF

Fcv cross hebert

PDF

Fcv learn le_cun

PDF

MIRU2014 SLAC

PDF

P04 restricted boltzmann machines cvpr2012 deep learning methods for vision

PPTX

Conventional Neural Networks and compute

PPTX

Unit 4 Object Recognition and Classification.pptx

PDF

Book study of jilid 1bbDeep-Learning.pdf

ArdiFahruriyannur1

PPTX

Deep Learning: Towards General Artificial Intelligence

Rukshan Batuwita

PPTX

Deep Learning in Computer Vision

PDF

Quoc Le, Stanford & Google - Tera Scale Deep Learning

PPTX

Neural Networks and Deep Learning Basics

PPTX

Deep learning

PDF

3234150

PDF

My lyn tutorial 2009

PDF

ETHZ CV2012: Tutorial openCV

More Related Content

PDF

India digital-future-in-focus-2013

PPT

Ellig Costs And Consequences Of Telecom Regulation Feb 2005

Mercatus Center

PPTX

Limbic system and memory

Caleb Tinashe Munikwa

PPTX

Digital carrier modulation

PPTX

Hart - Highway Addressable Remote Transducer Protocol

Vasanthan Ravichandran

PPTX

Cognitive Radio Spectrum Sensing 1586 ppt

PPTX

Hart protocol

PDF

mscthesis

Miquel Perelló Nieto

India digital-future-in-focus-2013

Ellig Costs And Consequences Of Telecom Regulation Feb 2005

Mercatus Center

Limbic system and memory

Caleb Tinashe Munikwa

Digital carrier modulation

Hart - Highway Addressable Remote Transducer Protocol

Vasanthan Ravichandran

Cognitive Radio Spectrum Sensing 1586 ppt

Hart protocol

mscthesis

Miquel Perelló Nieto

Similar to P01 introduction cvpr2012 deep learning methods for vision (20)

PDF

From Pixels to Understanding: Deep Learning's Impact on Image Classification ...

PDF

Icml2012 learning hierarchies of invariant features

PDF

Trade-off between recognition an reconstruction: Application of Robotics Visi...

PDF

Journal_IEEE_2023.pdf

Guillermo Medina Zegarra

PPT

Fcv learn fergus

PDF

Easy to learn deep learning guide - elementry

PPTX

Face recognition using artificial neural network

PDF

Fcv cross hebert

PDF

Fcv learn le_cun

PDF

MIRU2014 SLAC

PDF

P04 restricted boltzmann machines cvpr2012 deep learning methods for vision

PPTX

Conventional Neural Networks and compute

PPTX

Unit 4 Object Recognition and Classification.pptx

PDF

Book study of jilid 1bbDeep-Learning.pdf

ArdiFahruriyannur1

PPTX

Deep Learning: Towards General Artificial Intelligence

Rukshan Batuwita

PPTX

Deep Learning in Computer Vision

PDF

Quoc Le, Stanford & Google - Tera Scale Deep Learning

PPTX

Neural Networks and Deep Learning Basics

PPTX

Deep learning

PDF

3234150

From Pixels to Understanding: Deep Learning's Impact on Image Classification ...

Icml2012 learning hierarchies of invariant features

Trade-off between recognition an reconstruction: Application of Robotics Visi...

Journal_IEEE_2023.pdf

Guillermo Medina Zegarra

Fcv learn fergus

Easy to learn deep learning guide - elementry

Face recognition using artificial neural network

Fcv cross hebert

Fcv learn le_cun

MIRU2014 SLAC

P04 restricted boltzmann machines cvpr2012 deep learning methods for vision

Conventional Neural Networks and compute

Unit 4 Object Recognition and Classification.pptx

Book study of jilid 1bbDeep-Learning.pdf

ArdiFahruriyannur1

Deep Learning: Towards General Artificial Intelligence

Rukshan Batuwita

Deep Learning in Computer Vision

Quoc Le, Stanford & Google - Tera Scale Deep Learning

Neural Networks and Deep Learning Basics

Deep learning

3234150

Ad

More from zukun (20)

PDF

My lyn tutorial 2009

PDF

ETHZ CV2012: Tutorial openCV

PDF

ETHZ CV2012: Information

PDF

Siwei lyu: natural image statistics

PDF

Lecture9 camera calibration

PDF

Brunelli 2008: template matching techniques in computer vision

PDF

Modern features-part-4-evaluation

PDF

Modern features-part-3-software

PDF

Modern features-part-2-descriptors

PDF

Modern features-part-1-detectors

PDF

Modern features-part-0-intro

PDF

Lecture 02 internet video search

PDF

Lecture 01 internet video search

PDF

Lecture 03 internet video search

PDF

Icml2012 tutorial representation_learning

PPT

Advances in discrete energy minimisation for computer vision

PDF

Gephi tutorial: quick start

PDF

EM algorithm and its application in probabilistic latent semantic analysis

PDF

Object recognition with pictorial structures

PDF

Iccv2011 learning spatiotemporal graphs of human activities

My lyn tutorial 2009

ETHZ CV2012: Tutorial openCV

ETHZ CV2012: Information

Siwei lyu: natural image statistics

Lecture9 camera calibration

Brunelli 2008: template matching techniques in computer vision

Modern features-part-4-evaluation

Modern features-part-3-software

Modern features-part-2-descriptors

Modern features-part-1-detectors

Modern features-part-0-intro

Lecture 02 internet video search

Lecture 01 internet video search

Lecture 03 internet video search

Icml2012 tutorial representation_learning

Advances in discrete energy minimisation for computer vision

Gephi tutorial: quick start

EM algorithm and its application in probabilistic latent semantic analysis

Object recognition with pictorial structures

Iccv2011 learning spatiotemporal graphs of human activities

Ad

Recently uploaded (20)

PPTX

school management -TNTEU- B.Ed., Semester II Unit 1.pptx

NihumathunnisaH

PDF

Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf

DrMONISHAphysio

PDF

Abdominal Access Techniques with Prof. Dr. R K Mishra

PDF

Complications of Minimal Access Surgery at WLH

PDF

Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf

sreejithcareers

PPTX

Microbial diseases, their pathogenesis and prophylaxis

DevlinaSengupta

PDF

Sports Quiz easy sports quiz sports quiz

PDF

Chapter 2 Heredity, Prenatal Development, and Birth.pdf

MarjorieLopezTiu

PDF

ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx

PDF

01-Introduction-to-Information-Management.pdf

PPTX

Pharmacology of Heart Failure /Pharmacotherapy of CHF

Rajshri Ghogare

PPTX

master seminar digital applications in india

PDF

2.FourierTransform-ShortQuestionswithAnswers.pdf

PPTX

Lesson notes of climatology university.

PPTX

PPH.pptx obstetrics and gynecology in nursing

SrideviDevaraj5

PDF

Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape

PDF

VCE English Exam - Section C Student Revision Booklet

PPTX

Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx

chetansingh379583

PDF

Microbial disease of the cardiovascular and lymphatic systems

PPTX

Renaissance Architecture: A Journey from Faith to Humanism

school management -TNTEU- B.Ed., Semester II Unit 1.pptx

NihumathunnisaH

Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf

DrMONISHAphysio

Abdominal Access Techniques with Prof. Dr. R K Mishra

Complications of Minimal Access Surgery at WLH

Saundersa Comprehensive Review for the NCLEX-RN Examination.pdf

sreejithcareers

Microbial diseases, their pathogenesis and prophylaxis

DevlinaSengupta

Sports Quiz easy sports quiz sports quiz

Chapter 2 Heredity, Prenatal Development, and Birth.pdf

MarjorieLopezTiu

ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx

01-Introduction-to-Information-Management.pdf

Pharmacology of Heart Failure /Pharmacotherapy of CHF

Rajshri Ghogare

master seminar digital applications in india

2.FourierTransform-ShortQuestionswithAnswers.pdf

Lesson notes of climatology university.

PPH.pptx obstetrics and gynecology in nursing

SrideviDevaraj5

Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape

VCE English Exam - Section C Student Revision Booklet

Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx

chetansingh379583

Microbial disease of the cardiovascular and lymphatic systems

Renaissance Architecture: A Journey from Faith to Humanism

P01 introduction cvpr2012 deep learning methods for vision

1. Deep Learning & Feature Learning Methods for Vision ’

2. Tutorial Overview

3. Overview • – • – – – •

4. Existing Recognition Approach • •

5. Motivation • • – •

6. What Limits Current Performance? • – •

7. Hand-Crafted Features • β – • – • 

8. Mid-Level Representations • “ ” • • 

9. Why Learn Features? • • – – – • – –

10. Why Hierarchy? ﬁ

11. Hierarchies in Vision • – • –

12. Hierarchies in Vision • •

13. Learning a Hierarchy of Feature Extractors • •  • •

14. Multistage Hubel-Wiesel Architecture • • • • • • •

15. Classic Approach to Training • – – – • – –

16. Deep Learning • • • •

17. Single Layer Architecture

18. Example Feature Learning Architectures

19. SIFT Descriptor

20. Spatial Pyramid Matching

21. Filtering • –

22. Filtering • – –  . . .

23. Translation Equivariance •  – –

24. Filtering • – –

25. Filtering • – – – –

26. Normalization • •

27. Normalization • –  –

28. Normalization • – –

29. Role of Normalization • – “ ” – – • |.|1 |.|1 |.|1 |.|1

30. Pooling • – – –

31. Role of Pooling • – –

32. Role of Pooling • • • •

33. Unsupervised Learning • • • – – –

34. Auto-Encoder

35. Auto-Encoder Example 1 • σ(WTz) σ(Wx) σ σ

36. Auto-Encoder Example 2 • Dz σ(Wx) σ

37. Auto-Encoder Example 2 • Dz σ(Wx) σ

38. Taxonomy of Approaches • – – – • – – • –

39. Stacked Auto-Encoders

40. At Test Time • • • •

41. Information Flow in Vision Models • • – – • –

42. Deep Boltzmann Machines

43. Why is Top-Down important? • • •

44. Multi-Scale Models • • • HOG Pyramid

45. Hierarchical Model • Input Image/ Features Input Image/ Features

46. Multi-scale vs Hierarchical Feature Pyramid Input Image/ Features

47. Structure Spectrum • – – – • – –

48. Structure Spectrum • – –

49. Structure Spectrum • •

50. Structure Spectrum • – –

51. Structure Spectrum • • •

52. Structure Spectrum • – –

53. Structure Spectrum • – • – •

54. Structure Spectrum • – – –

55. Structure Spectrum • – – –

56. Performance of Deep Learning • • – • – • • – • – •

57. Summary • – • • •

58. Further Resources • • • • – •

60. References • • • • • • • • • • •

61. References • • • • • • • •

62. References • • • • • • • •

63. References • • • • • • • • •

64. References • • • • • • •

65. References • • • • • • • • • •

66. References • • • • • • • • •

67. References • • • •

Editor's Notes

#11: All I am going to say about Neuroscience, although techniques do have strong connections.
#14: Make clear that classic methods, e.g.convnets are purely supervised.
#15: Need to bring outdiffereceswrt to existing ML stuff, mainly unsupervised learning part. Make use of unlabaled data (lots of it).
#16: Restructure to bigger emphasis on unsupervised.Make clear that classic methods, e.g.convnets are purely supervised.
#18: Winder and Brown paper. Slightly smoothed view of things.
#19: Selection instead of normalization?
#20: Note pooling is across space, not across Gabor channelNormalization is really nonlinear (small elements not rescaled)
#21: Non-maximal suppression across VW. Like an L-InfnormalizationMax = k-means
#32: Graph not clear. Explain better. Y-axis is change in value
#33: Mention Leonardis & Fidler paper
#34: Too far for labels to trickle down (vanishing gradients)Only information from layer below.Input is supervision.
#37: Add overall energy
#42: Not separate operations Do it at the same
#43: Chriswilliams oral link
#44: Occlusion mask: bootom right quad for sofa interpretationCan’t decide locally If you knew solution, would know what features to extract.
#46: DPM is shape hierarchical HOG templates
#47: DPM is shape hierarchical HOG templates
#48: Song Chun ‘s clock