SlideShare a Scribd company logo
Making Convolutional Networks
Shift-Invariant Again
Hyeongmin Lee
Image and Video Pattern Recognition LAB
Electrical and Electronic Engineering Dept, Yonsei University
5th Semester
PR-252
PR-252: Making Convolutional Networks Shift-Invariant Again
What is shift-invariancy??
What is shift-invariancy??
 Shift-variant??
Change in performance??
What is shift-invariancy??
 Shift-variant??
Aliasing
Aliasing
 Fourier Transform & Frequency Domain
𝑡𝑡 Ω
𝑋𝑋 Ω = �
−∞
∞
𝑥𝑥 𝑡𝑡 𝑒𝑒−𝑗𝑗Ω𝑡𝑡 𝑑𝑑𝑑𝑑
Aliasing
 Down-sampling in frequency domain
𝑇𝑇 𝑥𝑥 𝑡𝑡 = �
−∞
∞
𝑥𝑥 𝑡𝑡 𝑒𝑒−𝑗𝑗Ω𝑡𝑡 𝑑𝑑𝑑𝑑 = 𝑋𝑋(Ω)
𝑇𝑇 𝑥𝑥 𝑎𝑎𝑡𝑡 = �
−∞
∞
𝑥𝑥 𝑎𝑎𝑡𝑡 𝑒𝑒−𝑗𝑗Ω𝑡𝑡
𝑑𝑑𝑑𝑑 =
1
|𝑎𝑎|
�
−∞
∞
𝑥𝑥 𝑡𝑡′ 𝑒𝑒−𝑗𝑗
Ω
𝑎𝑎
𝑡𝑡′
𝑑𝑑𝑑𝑑′
𝑇𝑇 𝑥𝑥 𝑎𝑎𝑡𝑡 =
1
|𝑎𝑎|
𝑋𝑋(Ω/𝑎𝑎)
Aliasing
 Down-sampling in frequency domain
𝑡𝑡 Ω
𝑡𝑡 Ω
Aliasing
 Discrete Signal in Frequency Domain (Discrete Time Fourier Transform)
𝜔𝜔𝑛𝑛 2𝜋𝜋−2𝜋𝜋
𝑋𝑋 𝜔𝜔 = �
𝑛𝑛=−∞
∞
𝑥𝑥 𝑛𝑛 𝑒𝑒−𝑗𝑗𝑗𝑗𝑗𝑗
𝑋𝑋 𝜔𝜔 + 2𝜋𝜋 = �
𝑛𝑛=−∞
∞
𝑥𝑥 𝑛𝑛 𝑒𝑒−𝑗𝑗𝑗𝑗𝑗𝑗 𝑒𝑒−𝑗𝑗𝑗𝑗𝑗𝑗𝑗 = �
𝑛𝑛=−∞
∞
𝑥𝑥 𝑛𝑛 𝑒𝑒−𝑗𝑗𝑗𝑗𝑗𝑗 = 𝑋𝑋(𝜔𝜔)
Aliasing
 Discrete Down-Sampling in Frequency Domain
𝑓𝑓
𝑓𝑓
𝑛𝑛 2𝜋𝜋−2𝜋𝜋
𝑛𝑛
2𝜋𝜋−2𝜋𝜋
Aliasing Aliasing
Aliasing
 Aliasing (Example)
Aliasing in CNN
 Max pooling
 Average pooling
 Strided convolution
0 0 1 1 0 0 1 1 0 1 0 1
0 1 1 0 0 1 1 0 1 1 1 1
Max-pooling
Max-pooling
Anti-aliasing
Anti-Aliasing
 Shift Invariancy & Shift Equivariance
• Shift Equivariance
• Shift Invariancy
Shift-Equivariant  Shift-Invariant
Anti-Aliasing
 Anti-aliasing
𝑓𝑓2𝜋𝜋−2𝜋𝜋
Low Pass Filtering
(Blurring)
𝑓𝑓2𝜋𝜋−2𝜋𝜋
𝑓𝑓2𝜋𝜋−2𝜋𝜋
Sampling
Anti-Aliasing
 Anti-aliasing for max pooling
0 0 1 1 0 0 1 1 0 1 0 10 1 1 1 0 1 1 1
Max Sampling
0 1 1 0 0 1 1 0 1 1 1 11 1 1 0 1 1 1 0
Shift-Equivariant!!
Anti-Aliasing
 Anti-aliasing for max pooling
0 1 1 1 0 1 1 1 0.5 1 0.5 10.5 1 1 0.5 0.5 1 1 0.5
Blurring Subsampling
1 1 1 0 1 1 1 0 1 0.5 1 0.51 1 0.5 0.5 1 1 0.5 0.5
Max
Anti-Aliasing
 Anti-aliasing for various sampling operations in CNN
• MaxPool
• AveragePool
• StrideConv
Anti-Aliasing
 Down-sampling Kernels
Results
Results
 Improvement in Consistency
Results
 Improvement in Accuracy
Results
 Improvement in Accuracy
Results
 Improvement in Image Translation
Thank You!

More Related Content

PDF
MobileNet - PR044
PPTX
[PR12] PR-050: Convolutional LSTM Network: A Machine Learning Approach for Pr...
PDF
Small Deep-Neural-Networks: Their Advantages and Their Design
PDF
PR-284: End-to-End Object Detection with Transformers(DETR)
PPTX
U-Net (1).pptx
PPTX
Deep neural networks
PPTX
Batch normalization presentation
PPTX
Pattern recognition binoy 05-naive bayes classifier
MobileNet - PR044
[PR12] PR-050: Convolutional LSTM Network: A Machine Learning Approach for Pr...
Small Deep-Neural-Networks: Their Advantages and Their Design
PR-284: End-to-End Object Detection with Transformers(DETR)
U-Net (1).pptx
Deep neural networks
Batch normalization presentation
Pattern recognition binoy 05-naive bayes classifier

What's hot (20)

PDF
PR-132: SSD: Single Shot MultiBox Detector
PPTX
Denoising autoencoder by Harish.R
PPT
Deep Learning
PDF
PR-108: MobileNetV2: Inverted Residuals and Linear Bottlenecks
PPTX
Deep learning lecture - part 1 (basics, CNN)
PPTX
Machine Learning - Convolutional Neural Network
PDF
Neural Networks: Multilayer Perceptron
PDF
Autoencoder
ODP
Simple Introduction to AutoEncoder
PPTX
Markov chain
PPT
Supervised Learning
PPTX
Variational Autoencoder Tutorial
PDF
“Introduction to DNN Model Compression Techniques,” a Presentation from Xailient
PDF
Recurrent neural networks
PPT
Adaline madaline
PPTX
Digital Image Fundamentals - II
PDF
Neural Networks: Self-Organizing Maps (SOM)
PPTX
Optimization/Gradient Descent
PPTX
Autoencoder
PDF
Autoencoders Tutorial | Autoencoders In Deep Learning | Tensorflow Training |...
PR-132: SSD: Single Shot MultiBox Detector
Denoising autoencoder by Harish.R
Deep Learning
PR-108: MobileNetV2: Inverted Residuals and Linear Bottlenecks
Deep learning lecture - part 1 (basics, CNN)
Machine Learning - Convolutional Neural Network
Neural Networks: Multilayer Perceptron
Autoencoder
Simple Introduction to AutoEncoder
Markov chain
Supervised Learning
Variational Autoencoder Tutorial
“Introduction to DNN Model Compression Techniques,” a Presentation from Xailient
Recurrent neural networks
Adaline madaline
Digital Image Fundamentals - II
Neural Networks: Self-Organizing Maps (SOM)
Optimization/Gradient Descent
Autoencoder
Autoencoders Tutorial | Autoencoders In Deep Learning | Tensorflow Training |...
Ad

Similar to PR-252: Making Convolutional Networks Shift-Invariant Again (20)

PDF
[PR12] Making Convolutional Networks Shift-Invariant Again
KEY
Playing Go with Clojure
PPTX
Alias-Free GAN(styleGAN3).pptx
PDF
An Introduction to HDTV Principles-Part 3
PPTX
SignalDecompositionTheory.pptx
PPTX
Av 738- Adaptive Filtering - Background Material
PDF
Dsp book ch15
PPTX
Av 738- Adaptive Filtering - Wiener Filters[wk 3]
PPTX
2014.10.dartmouth
PDF
Signal, Sampling and signal quantization
PDF
a simple multi-utility miniature device used as test tube to 3D cell culture ...
PPTX
SENSING WITH CHAOS
PPTX
Acoustic echo cancellation
PPTX
Introduction to equalization
PPT
6 weeks summer training in hfss,ludhiana
PPT
data mining training in chennai
PPT
vlsi training in pune
PPT
6months industrial training in hfss, jalandhar
PPT
vlsi training in chennai
PPT
Hsff training in bangalore
[PR12] Making Convolutional Networks Shift-Invariant Again
Playing Go with Clojure
Alias-Free GAN(styleGAN3).pptx
An Introduction to HDTV Principles-Part 3
SignalDecompositionTheory.pptx
Av 738- Adaptive Filtering - Background Material
Dsp book ch15
Av 738- Adaptive Filtering - Wiener Filters[wk 3]
2014.10.dartmouth
Signal, Sampling and signal quantization
a simple multi-utility miniature device used as test tube to 3D cell culture ...
SENSING WITH CHAOS
Acoustic echo cancellation
Introduction to equalization
6 weeks summer training in hfss,ludhiana
data mining training in chennai
vlsi training in pune
6months industrial training in hfss, jalandhar
vlsi training in chennai
Hsff training in bangalore
Ad

More from Hyeongmin Lee (20)

PDF
PR-455: CoTracker: It is Better to Track Together
PDF
PR-430: CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retri...
PDF
PR-420: Scalable Model Compression by Entropy Penalized Reparameterization
PDF
PR-409: Denoising Diffusion Probabilistic Models
PDF
PR-395: Variational Image Compression with a Scale Hyperprior
PDF
PR-386: Light Field Networks: Neural Scene Representations with Single-Evalua...
PDF
PR-376: Softmax Splatting for Video Frame Interpolation
PDF
PR-365: Fast object detection in compressed video
PDF
PR-340: DVC: An End-to-end Deep Video Compression Framework
PDF
PR-328: End-to-End Optimized Image Compression
PDF
PR-315: Taming Transformers for High-Resolution Image Synthesis
PDF
PR-302: NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
PDF
PR-278: RAFT: Recurrent All-Pairs Field Transforms for Optical Flow
PDF
PDF
PR-240: Modulating Image Restoration with Continual Levels via Adaptive Featu...
PDF
PR-228: Geonet: Unsupervised learning of dense depth, optical flow and camera...
PDF
PR-214: FlowNet: Learning Optical Flow with Convolutional Networks
PPTX
Latest Frame interpolation Algorithms
PPTX
[Paper Review] Temporal Generative Adversarial Nets with Singular Value Clipping
PPTX
[Paper Review] A Middlebury Benchmark & Context-Aware Synthesis for Video Fra...
PR-455: CoTracker: It is Better to Track Together
PR-430: CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retri...
PR-420: Scalable Model Compression by Entropy Penalized Reparameterization
PR-409: Denoising Diffusion Probabilistic Models
PR-395: Variational Image Compression with a Scale Hyperprior
PR-386: Light Field Networks: Neural Scene Representations with Single-Evalua...
PR-376: Softmax Splatting for Video Frame Interpolation
PR-365: Fast object detection in compressed video
PR-340: DVC: An End-to-end Deep Video Compression Framework
PR-328: End-to-End Optimized Image Compression
PR-315: Taming Transformers for High-Resolution Image Synthesis
PR-302: NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis
PR-278: RAFT: Recurrent All-Pairs Field Transforms for Optical Flow
PR-240: Modulating Image Restoration with Continual Levels via Adaptive Featu...
PR-228: Geonet: Unsupervised learning of dense depth, optical flow and camera...
PR-214: FlowNet: Learning Optical Flow with Convolutional Networks
Latest Frame interpolation Algorithms
[Paper Review] Temporal Generative Adversarial Nets with Singular Value Clipping
[Paper Review] A Middlebury Benchmark & Context-Aware Synthesis for Video Fra...

Recently uploaded (20)

PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PPTX
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PPT
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
PPTX
Geodesy 1.pptx...............................................
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PPTX
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
PDF
Embodied AI: Ushering in the Next Era of Intelligent Systems
PPTX
web development for engineering and engineering
PDF
Operating System & Kernel Study Guide-1 - converted.pdf
PPTX
CH1 Production IntroductoryConcepts.pptx
PPT
Project quality management in manufacturing
PPTX
Lecture Notes Electrical Wiring System Components
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPT
Mechanical Engineering MATERIALS Selection
PPTX
Internet of Things (IOT) - A guide to understanding
DOCX
573137875-Attendance-Management-System-original
PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
Automation-in-Manufacturing-Chapter-Introduction.pdf
MET 305 2019 SCHEME MODULE 2 COMPLETE.pptx
UNIT-1 - COAL BASED THERMAL POWER PLANTS
Foundation to blockchain - A guide to Blockchain Tech
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
CRASH COURSE IN ALTERNATIVE PLUMBING CLASS
Geodesy 1.pptx...............................................
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
KTU 2019 -S7-MCN 401 MODULE 2-VINAY.pptx
Embodied AI: Ushering in the Next Era of Intelligent Systems
web development for engineering and engineering
Operating System & Kernel Study Guide-1 - converted.pdf
CH1 Production IntroductoryConcepts.pptx
Project quality management in manufacturing
Lecture Notes Electrical Wiring System Components
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Mechanical Engineering MATERIALS Selection
Internet of Things (IOT) - A guide to understanding
573137875-Attendance-Management-System-original
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...

PR-252: Making Convolutional Networks Shift-Invariant Again