SlideShare a Scribd company logo
Day 1 Lecture 3
Deep Networks
Elisa Sayrol
[course site]
From Neurons to Convolutional Neural Networks
Figures Credit: Hugo Laroche NN course
2
From Neurons to Convolutional Neural Networks
Figure Credit: Hugo Laroche NN course
Hidden pre-activation
Hidden activation
Output activation
g(x) activation function:
sigmoid:
tangh:
ReLU:
o(x) output activation function:
Softmax:
3
From Neurons to Convolutional Neural Networks
L Hidden Layers
Hidden pre-activation (k>0)
Hidden activation (k=1,…L)
Output activation (k=L+1)
Slide Credit: Hugo Laroche NN course 4
From Neurons to Convolutional Neural Networks
What if the input is all the
pixels within an image?
5
From Neurons to Convolutional Neural Networks
For a 200x200 image,
we have 4x104
neurons
each one with 4x104
inputs, that is 16x108
parameters, only for one
layer!!!
Figure Credit: Ranzatto 6
From Neurons to Convolutional Neural Networks
For a 200x200 image, we
have 4x104
neurons each one
with 10x10 “local
connections” (also called
receptive field) inputs, that is
4x106
What else can we do to
reduce the number of
parameters?
Figure Credit: Ranzatto 7
From Neurons to Convolutional Neural Networks
Translation invariance: we can use same
parameters to capture a specific “feature” in any
area of the image. We can try different sets of
parameters to capture different features.
These operations are equivalent to perform
convolutions with different filters.
Ex: With100 different filters (or feature extractors)
of size 10x10, the number of parameters is 104
That is why they are called Convolutional
Neural Networks, ( ConvNets or CNNs)
Figure Credit: Ranzatto 8
From Neurons to Convolutional Neural Networks
…and don’t forget the activation function!
Figure Credit: Ranzatto
ReLu PReLu
9
From Neurons to Convolutional Neural Networks
Most ConvNets use Pooling
(or subsampling) to reduce
dimensionality and provide
invariance to small local
changes.
Pooling options:
• Max
• Average
• Stochastic pooling
Figure Credit: Ranzatto 10
From Neurons to Convolutional Neural Networks
Padding (P): When doing the
convolution in the borders, you may
add values to compute the
convolution.
When the values are zero, that is
quite common, the technique is called
zero-padding.
When padding is not used the output
size is reduced.
FxF=3x3
11
From Neurons to Convolutional Neural Networks
Padding (P): When doing the
convolution in the borders, you may
add values to compute the
convolution.
When the values are zero, that is
quite common, the technique is called
zero-padding.
When padding is not used the output
size is reduced.
FxF=5x5
12
From Neurons to Convolutional Neural Networks
Stride (S): When doing the
convolution or another operation, like
pooling, we may decide to slide not
pixel by pixel but every 2 or more
pixels. The number of pixels that we
skip is the value of the stride.
It might be used to reduce the
dimensionality of the output
13
From Neurons to Convolutional Neural Networks
Example: Most convnets contain several convolutional layers, interspersed with
pooling layers, and followed by a small number of fully connected layers
A layer is characterized by its width, height and depth (that is, the number of
filters used to generate the feature maps)
An architecture is characterized by the number of layers
LeNet-5 From Lecun ´98
14

More Related Content

PDF
Deep Learning for Computer Vision: Data Augmentation (UPC 2016)
PDF
Deep Learning for Computer Vision: Backward Propagation (UPC 2016)
PDF
Deep Learning for Computer Vision: Unsupervised Learning (UPC 2016)
PDF
Deep Learning for Computer Vision: Attention Models (UPC 2016)
PDF
Deep Learning for Computer Vision: Visualization (UPC 2016)
PDF
Unsupervised Learning (D2L6 2017 UPC Deep Learning for Computer Vision)
PDF
Deep Learning for Computer Vision: Memory usage and computational considerati...
PDF
Unsupervised Deep Learning (D2L1 Insight@DCU Machine Learning Workshop 2017)
Deep Learning for Computer Vision: Data Augmentation (UPC 2016)
Deep Learning for Computer Vision: Backward Propagation (UPC 2016)
Deep Learning for Computer Vision: Unsupervised Learning (UPC 2016)
Deep Learning for Computer Vision: Attention Models (UPC 2016)
Deep Learning for Computer Vision: Visualization (UPC 2016)
Unsupervised Learning (D2L6 2017 UPC Deep Learning for Computer Vision)
Deep Learning for Computer Vision: Memory usage and computational considerati...
Unsupervised Deep Learning (D2L1 Insight@DCU Machine Learning Workshop 2017)

What's hot (20)

PDF
Convolutional Neural Networks (D1L3 2017 UPC Deep Learning for Computer Vision)
PDF
Deep Learning for Computer Vision: Transfer Learning and Domain Adaptation (U...
PDF
Joint unsupervised learning of deep representations and image clusters
PDF
Optimizing Deep Networks (D1L6 Insight@DCU Machine Learning Workshop 2017)
PDF
Deep Learning for Computer Vision: Segmentation (UPC 2016)
PDF
Optimization for Deep Networks (D2L1 2017 UPC Deep Learning for Computer Vision)
PDF
Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017
PDF
Generative Models and Adversarial Training (D2L3 Insight@DCU Machine Learning...
PDF
Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...
PDF
Convolutional Neural Networks (DLAI D5L1 2017 UPC Deep Learning for Artificia...
PDF
Deep Neural Networks (D1L2 Insight@DCU Machine Learning Workshop 2017)
PDF
Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...
PPTX
Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...
PDF
Attention Models (D3L6 2017 UPC Deep Learning for Computer Vision)
PDF
Convolutional Neural Networks - Veronica Vilaplana - UPC Barcelona 2018
PDF
Recurrent Instance Segmentation (UPC Reading Group)
PDF
Deep Learning for Computer Vision: Saliency Prediction (UPC 2016)
PDF
Recurrent Neural Networks (D2L2 2017 UPC Deep Learning for Computer Vision)
PDF
Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...
PPTX
Image classification using CNN
Convolutional Neural Networks (D1L3 2017 UPC Deep Learning for Computer Vision)
Deep Learning for Computer Vision: Transfer Learning and Domain Adaptation (U...
Joint unsupervised learning of deep representations and image clusters
Optimizing Deep Networks (D1L6 Insight@DCU Machine Learning Workshop 2017)
Deep Learning for Computer Vision: Segmentation (UPC 2016)
Optimization for Deep Networks (D2L1 2017 UPC Deep Learning for Computer Vision)
Deep 3D Visual Analysis - Javier Ruiz-Hidalgo - UPC Barcelona 2017
Generative Models and Adversarial Training (D2L3 Insight@DCU Machine Learning...
Training Deep Networks with Backprop (D1L4 Insight@DCU Machine Learning Works...
Convolutional Neural Networks (DLAI D5L1 2017 UPC Deep Learning for Artificia...
Deep Neural Networks (D1L2 Insight@DCU Machine Learning Workshop 2017)
Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...
Multilayer Perceptron (DLAI D1L2 2017 UPC Deep Learning for Artificial Intell...
Attention Models (D3L6 2017 UPC Deep Learning for Computer Vision)
Convolutional Neural Networks - Veronica Vilaplana - UPC Barcelona 2018
Recurrent Instance Segmentation (UPC Reading Group)
Deep Learning for Computer Vision: Saliency Prediction (UPC 2016)
Recurrent Neural Networks (D2L2 2017 UPC Deep Learning for Computer Vision)
Transfer Learning and Domain Adaptation (D2L3 2017 UPC Deep Learning for Comp...
Image classification using CNN
Ad

Similar to Deep Learning for Computer Vision: Deep Networks (UPC 2016) (20)

PPTX
Neural networks and deep learning
PDF
JAISTサマースクール2016「脳を知るための理論」講義04 Neural Networks and Neuroscience
PDF
convolutional_neural_networks in deep learning
PPTX
Mc Culloch Pitts Neuron
PPTX
4.2 Neural Networks Overviewwwwwwww.pptx
PDF
M7 - Neural Networks in machine learning.pdf
PDF
Convolutional Neural Networks (CNN)
PPTX
Machine Learning - Neural Networks - Perceptron
PPTX
Machine Learning - Introduction to Neural Networks
PPT
Machine learning by using python lesson 2 Neural Networks By Professor Lili S...
PDF
Introduction to Neural Networks in Tensorflow
PPTX
Java and Deep Learning (Introduction)
PPTX
Keras on tensorflow in R & Python
PDF
Artificial neural network paper
PPTX
Introduction to convolutional networks .pptx
PPT
Cnn method
PDF
Convolution neural networks
PDF
How machine learning is changing the world
PPS
Neural Networks Ver1
PPT
tutorial.ppt
Neural networks and deep learning
JAISTサマースクール2016「脳を知るための理論」講義04 Neural Networks and Neuroscience
convolutional_neural_networks in deep learning
Mc Culloch Pitts Neuron
4.2 Neural Networks Overviewwwwwwww.pptx
M7 - Neural Networks in machine learning.pdf
Convolutional Neural Networks (CNN)
Machine Learning - Neural Networks - Perceptron
Machine Learning - Introduction to Neural Networks
Machine learning by using python lesson 2 Neural Networks By Professor Lili S...
Introduction to Neural Networks in Tensorflow
Java and Deep Learning (Introduction)
Keras on tensorflow in R & Python
Artificial neural network paper
Introduction to convolutional networks .pptx
Cnn method
Convolution neural networks
How machine learning is changing the world
Neural Networks Ver1
tutorial.ppt
Ad

More from Universitat Politècnica de Catalunya (20)

PDF
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
PDF
Deep Generative Learning for All
PDF
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
PDF
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
PDF
The Transformer - Xavier Giró - UPC Barcelona 2021
PDF
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
PDF
Open challenges in sign language translation and production
PPTX
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
PPTX
Discovery and Learning of Navigation Goals from Pixels in Minecraft
PDF
Learn2Sign : Sign language recognition and translation using human keypoint e...
PDF
Intepretability / Explainable AI for Deep Neural Networks
PDF
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
PDF
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
PDF
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
PDF
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
PDF
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
PDF
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
PDF
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
PDF
Curriculum Learning for Recurrent Video Object Segmentation
PDF
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All
The Transformer in Vision | Xavier Giro | Master in Computer Vision Barcelona...
Towards Sign Language Translation & Production | Xavier Giro-i-Nieto
The Transformer - Xavier Giró - UPC Barcelona 2021
Learning Representations for Sign Language Videos - Xavier Giro - NIST TRECVI...
Open challenges in sign language translation and production
Generation of Synthetic Referring Expressions for Object Segmentation in Videos
Discovery and Learning of Navigation Goals from Pixels in Minecraft
Learn2Sign : Sign language recognition and translation using human keypoint e...
Intepretability / Explainable AI for Deep Neural Networks
Convolutional Neural Networks - Xavier Giro - UPC TelecomBCN Barcelona 2020
Self-Supervised Audio-Visual Learning - Xavier Giro - UPC TelecomBCN Barcelon...
Attention for Deep Learning - Xavier Giro - UPC TelecomBCN Barcelona 2020
Generative Adversarial Networks GAN - Xavier Giro - UPC TelecomBCN Barcelona ...
Q-Learning with a Neural Network - Xavier Giró - UPC Barcelona 2020
Language and Vision with Deep Learning - Xavier Giró - ACM ICMR 2020 (Tutorial)
Image Segmentation with Deep Learning - Xavier Giro & Carles Ventura - ISSonD...
Curriculum Learning for Recurrent Video Object Segmentation
Deep Self-supervised Learning for All - Xavier Giro - X-Europe 2020

Recently uploaded (20)

PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
PPTX
Sustainable Sites - Green Building Construction
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PDF
composite construction of structures.pdf
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PDF
Well-logging-methods_new................
DOCX
573137875-Attendance-Management-System-original
PPTX
Construction Project Organization Group 2.pptx
PPTX
Internet of Things (IOT) - A guide to understanding
PPTX
OOP with Java - Java Introduction (Basics)
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PDF
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
PDF
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
PPTX
CH1 Production IntroductoryConcepts.pptx
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
web development for engineering and engineering
PPTX
bas. eng. economics group 4 presentation 1.pptx
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
Sustainable Sites - Green Building Construction
UNIT-1 - COAL BASED THERMAL POWER PLANTS
composite construction of structures.pdf
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
Well-logging-methods_new................
573137875-Attendance-Management-System-original
Construction Project Organization Group 2.pptx
Internet of Things (IOT) - A guide to understanding
OOP with Java - Java Introduction (Basics)
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
TFEC-4-2020-Design-Guide-for-Timber-Roof-Trusses.pdf
keyrequirementskkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkkk
CH1 Production IntroductoryConcepts.pptx
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
web development for engineering and engineering
bas. eng. economics group 4 presentation 1.pptx

Deep Learning for Computer Vision: Deep Networks (UPC 2016)

  • 1. Day 1 Lecture 3 Deep Networks Elisa Sayrol [course site]
  • 2. From Neurons to Convolutional Neural Networks Figures Credit: Hugo Laroche NN course 2
  • 3. From Neurons to Convolutional Neural Networks Figure Credit: Hugo Laroche NN course Hidden pre-activation Hidden activation Output activation g(x) activation function: sigmoid: tangh: ReLU: o(x) output activation function: Softmax: 3
  • 4. From Neurons to Convolutional Neural Networks L Hidden Layers Hidden pre-activation (k>0) Hidden activation (k=1,…L) Output activation (k=L+1) Slide Credit: Hugo Laroche NN course 4
  • 5. From Neurons to Convolutional Neural Networks What if the input is all the pixels within an image? 5
  • 6. From Neurons to Convolutional Neural Networks For a 200x200 image, we have 4x104 neurons each one with 4x104 inputs, that is 16x108 parameters, only for one layer!!! Figure Credit: Ranzatto 6
  • 7. From Neurons to Convolutional Neural Networks For a 200x200 image, we have 4x104 neurons each one with 10x10 “local connections” (also called receptive field) inputs, that is 4x106 What else can we do to reduce the number of parameters? Figure Credit: Ranzatto 7
  • 8. From Neurons to Convolutional Neural Networks Translation invariance: we can use same parameters to capture a specific “feature” in any area of the image. We can try different sets of parameters to capture different features. These operations are equivalent to perform convolutions with different filters. Ex: With100 different filters (or feature extractors) of size 10x10, the number of parameters is 104 That is why they are called Convolutional Neural Networks, ( ConvNets or CNNs) Figure Credit: Ranzatto 8
  • 9. From Neurons to Convolutional Neural Networks …and don’t forget the activation function! Figure Credit: Ranzatto ReLu PReLu 9
  • 10. From Neurons to Convolutional Neural Networks Most ConvNets use Pooling (or subsampling) to reduce dimensionality and provide invariance to small local changes. Pooling options: • Max • Average • Stochastic pooling Figure Credit: Ranzatto 10
  • 11. From Neurons to Convolutional Neural Networks Padding (P): When doing the convolution in the borders, you may add values to compute the convolution. When the values are zero, that is quite common, the technique is called zero-padding. When padding is not used the output size is reduced. FxF=3x3 11
  • 12. From Neurons to Convolutional Neural Networks Padding (P): When doing the convolution in the borders, you may add values to compute the convolution. When the values are zero, that is quite common, the technique is called zero-padding. When padding is not used the output size is reduced. FxF=5x5 12
  • 13. From Neurons to Convolutional Neural Networks Stride (S): When doing the convolution or another operation, like pooling, we may decide to slide not pixel by pixel but every 2 or more pixels. The number of pixels that we skip is the value of the stride. It might be used to reduce the dimensionality of the output 13
  • 14. From Neurons to Convolutional Neural Networks Example: Most convnets contain several convolutional layers, interspersed with pooling layers, and followed by a small number of fully connected layers A layer is characterized by its width, height and depth (that is, the number of filters used to generate the feature maps) An architecture is characterized by the number of layers LeNet-5 From Lecun ´98 14