SlideShare a Scribd company logo
Rethinking Data Augmentation
for Image Super-resolution
: A Comprehensive Analysis and a New Strategy; CutBlur & MoA
Jaejun Yoo*, Namhyuk Ahn*, and Kyung-Ah Sohn
Motivation
* Various levels of vision tasks
High-level: Semantic recognition
(e.g., classification, object detection)
Mid-level: Super-pixel
(e.g., segmentation)
Low-level: Pixels, Edges, Colors
(e.g., super-resolution, denoising)
?????
MixUp, Cutout, CutMix …
Analysis on existing DA methods
“Sharp transitions, mixed image contents or losing the relationships of pixels can
degrade SR performance.”
e.g., Cutout fails (discarding pixels) and every feature method fails (manipulation).
Training curves when applied feature DA’s
Analysis on existing DA methods
• DA methods in pixel space bring
some improvements when applied
very carefully.
Analysis on existing DA methods
• DA methods in pixel space bring
some improvements when applied
very carefully.
• Cutout:
Original setting (drop 25% of of pixels in a
rectangular shape) significantly degrades the
performance because it erases spatial information
too much. However, erasing tiny amount of pixels
(0.1% random pixels) boosts the performance (2~3
pixels of 48x48 input patch)
Cutout
Analysis on existing DA methods
• DA methods in pixel space bring
some improvements when applied
very carefully.
• Mixup & CutMix:
Improvements of using CutMix are marginal. We
suspect this happens because CutMix generates a
drastic sharp transition between two different
images.
Improvements of using Mixup is better than
CutMix but it still generates unrealistic image and
affects to the image structure.
Mixup CutMix
Analysis on existing DA methods
• DA methods in pixel space bring
some improvements when applied
very carefully.
• CutMixup:
To verify our hypothesis, we combine benefits of
Mixup and CutMix; CutMixup. CutMixup
provides various boundary cases while minimizes
the sharp transition by retaining partial cues as
Mixup does.
CutMixup
Analysis on existing DA methods
• DA methods in pixel space bring
some improvements when applied
very carefully.
• Blend & RGB permutation:
To push further, we tried a constant blending and
RGB channel permutation, which turn out to be
very simple but effective strategies showing big
performance enhancement (dB).
Note that both methods do not incur any structure
modification to an image.
BlendRGB perm.
CutBlur
CutBlur
• What does the model learn from CutBlur?
• CutBlur prevents the SR model from over-sharpening an image and helps it to super-resolve only the
necessary region.
Super-resolution results of a model (EDSR) trained without CutBlur and its error residual (Δ)
Error residual (Δ)Output
CutBlur
• What does the model learn from CutBlur?
• CutBlur prevents the SR model from over-sharpening an image and helps it to super-resolve only the
necessary region.
Super-resolution results of a model (EDSR) trained CutBlur and its error residual (Δ)
Error residual (Δ)Output
with
Improved generalization: over-sharpening
• Super-resolution (SR)
• Trained on ×4 scale factor dataset and tested on different scale factor (×2)
Improved generalization: over-smoothing
• Denoising
• Trained on severe noise (! = 70) & tested on mild noise (! = 30).
Improved generalization: over-removal
• JPEG artifact removal
• Trained on a mild compression rate & tested on a severe compression rate
Mixture of Augmentation (MoA)
• During the training phase …
• Randomly select single augmentation at
every step. (among the curated DA list)
• Apply it!
Comparison on diverse benchmark models and datasets
• SRCNN (0.07M) – ECCV’14, CARN (1.14M) – ECCV’18, RCAN (15.6M) – ECCV’18, EDSR (43.1M) – CVPRW’17
• DIV2K (synthetic), RealSR (real-world)
• Our method shows consistent improvement for different models (parameters) and
datasets (different environments and size):
Code: https://github.com/clovaai/cutblur
Paper: https://arxiv.org/abs/2004.00448
QR code for
code & paper
For more details,
please visit our website:

More Related Content

PDF
Modeling uncertainty in deep learning
PDF
PRML 2.3.2-2.3.4 ガウス分布
PDF
PRML第3章_3.3-3.4
PDF
Somの分かり易い解説
PDF
[DL Hacks]Self-Attention Generative Adversarial Networks
PDF
[Tokyor08] Rによるデータサイエンス 第2部 第3章 対応分析
PDF
PRML上巻勉強会 at 東京大学 資料 第2章2.3.3 〜 2.5.2
PPTX
Review SRGAN
Modeling uncertainty in deep learning
PRML 2.3.2-2.3.4 ガウス分布
PRML第3章_3.3-3.4
Somの分かり易い解説
[DL Hacks]Self-Attention Generative Adversarial Networks
[Tokyor08] Rによるデータサイエンス 第2部 第3章 対応分析
PRML上巻勉強会 at 東京大学 資料 第2章2.3.3 〜 2.5.2
Review SRGAN

What's hot (20)

PDF
PPTX
エンジニアのための機械学習の基礎
PDF
Linear algebra
PDF
PRML 1.6 情報理論
PDF
4 データ間の距離と類似度
PDF
PRML輪読#2
PDF
Dropout as a Bayesian Approximation
PDF
Discovering_govering_equations_from_data_by_sparse_identification_of_nonlinea...
PDF
基礎からのベイズ統計学第5章
PPTX
PRML 4.4-4.5.2 ラプラス近似
PDF
Arima Forecasting - Presentation by Sera Cresta, Nora Alosaimi and Puneet Mahana
PDF
PythonによるCVアルゴリズム実装
PDF
K Means Clustering Algorithm | K Means Example in Python | Machine Learning A...
PDF
Image Restoration for 3D Computer Vision
DOCX
Rによるprincomp関数を使わない主成分分析
PDF
【輪読】Taking the Human Out of the Loop, section 8
PPTX
ベイズ深層学習5章 ニューラルネットワークのベイズ推論 Bayesian deep learning
PDF
SSII2020 [OS2-03] 深層学習における半教師あり学習の最新動向
PDF
Siamese neural networks for one shot image recognition paper explained
PDF
第8章 ガウス過程回帰による異常検知
エンジニアのための機械学習の基礎
Linear algebra
PRML 1.6 情報理論
4 データ間の距離と類似度
PRML輪読#2
Dropout as a Bayesian Approximation
Discovering_govering_equations_from_data_by_sparse_identification_of_nonlinea...
基礎からのベイズ統計学第5章
PRML 4.4-4.5.2 ラプラス近似
Arima Forecasting - Presentation by Sera Cresta, Nora Alosaimi and Puneet Mahana
PythonによるCVアルゴリズム実装
K Means Clustering Algorithm | K Means Example in Python | Machine Learning A...
Image Restoration for 3D Computer Vision
Rによるprincomp関数を使わない主成分分析
【輪読】Taking the Human Out of the Loop, section 8
ベイズ深層学習5章 ニューラルネットワークのベイズ推論 Bayesian deep learning
SSII2020 [OS2-03] 深層学習における半教師あり学習の最新動向
Siamese neural networks for one shot image recognition paper explained
第8章 ガウス過程回帰による異常検知
Ad

Similar to Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Analysis and a New Strategy (20)

PDF
Survey on Single image Super Resolution Techniques
PDF
Survey on Single image Super Resolution Techniques
PDF
Image resolution enhancement using blind technique
PDF
Image resolution enhancement using blind technique
PDF
[CVPR2020] Simple but effective image enhancement techniques
PDF
Analysis of Various Single Frame Super Resolution Techniques for better PSNR
PDF
A Novel and Robust Wavelet based Super Resolution Reconstruction of Low Resol...
PDF
A REGULARIZED ROBUST SUPER-RESOLUTION APPROACH FORALIASED IMAGES AND LOW RESO...
PDF
IRJET- Exploring Image Super Resolution Techniques
PPTX
CutMix: Regularization Strategy to Train Strong Classifiers with Localizable ...
PDF
Asilomar09 compressive superres
PDF
Effective Pixel Interpolation for Image Super Resolution
PDF
Effective Pixel Interpolation for Image Super Resolution
PDF
27 robust super resolution for 276-282
PDF
Framework on Retrieval of Hypermedia Data using Data mining Technique
PPTX
Super Resolution
PDF
Pixel Recursive Super Resolution. Google Brain
PDF
Learning Based Single Frame Image Super-resolution Using Fast Discrete Curvel...
PDF
SINGLE IMAGE SUPER RESOLUTION: A COMPARATIVE STUDY
PPT
Image enhancement ppt nal2
Survey on Single image Super Resolution Techniques
Survey on Single image Super Resolution Techniques
Image resolution enhancement using blind technique
Image resolution enhancement using blind technique
[CVPR2020] Simple but effective image enhancement techniques
Analysis of Various Single Frame Super Resolution Techniques for better PSNR
A Novel and Robust Wavelet based Super Resolution Reconstruction of Low Resol...
A REGULARIZED ROBUST SUPER-RESOLUTION APPROACH FORALIASED IMAGES AND LOW RESO...
IRJET- Exploring Image Super Resolution Techniques
CutMix: Regularization Strategy to Train Strong Classifiers with Localizable ...
Asilomar09 compressive superres
Effective Pixel Interpolation for Image Super Resolution
Effective Pixel Interpolation for Image Super Resolution
27 robust super resolution for 276-282
Framework on Retrieval of Hypermedia Data using Data mining Technique
Super Resolution
Pixel Recursive Super Resolution. Google Brain
Learning Based Single Frame Image Super-resolution Using Fast Discrete Curvel...
SINGLE IMAGE SUPER RESOLUTION: A COMPARATIVE STUDY
Image enhancement ppt nal2
Ad

More from JaeJun Yoo (13)

PDF
[PR12] Generative Models as Distributions of Functions
PDF
Super resolution in deep learning era - Jaejun Yoo
PDF
A beginner's guide to Style Transfer and recent trends
PDF
[PR12] Spectral Normalization for Generative Adversarial Networks
PDF
Introduction to ambient GAN
PDF
[PR12] categorical reparameterization with gumbel softmax
PDF
[PR12] understanding deep learning requires rethinking generalization
PDF
[PR12] Capsule Networks - Jaejun Yoo
PPTX
[PR12] Inception and Xception - Jaejun Yoo
PDF
[PR12] PixelRNN- Jaejun Yoo
PPTX
[Pr12] dann jaejun yoo
PDF
Variants of GANs - Jaejun Yoo
PDF
[PR12] intro. to gans jaejun yoo
[PR12] Generative Models as Distributions of Functions
Super resolution in deep learning era - Jaejun Yoo
A beginner's guide to Style Transfer and recent trends
[PR12] Spectral Normalization for Generative Adversarial Networks
Introduction to ambient GAN
[PR12] categorical reparameterization with gumbel softmax
[PR12] understanding deep learning requires rethinking generalization
[PR12] Capsule Networks - Jaejun Yoo
[PR12] Inception and Xception - Jaejun Yoo
[PR12] PixelRNN- Jaejun Yoo
[Pr12] dann jaejun yoo
Variants of GANs - Jaejun Yoo
[PR12] intro. to gans jaejun yoo

Recently uploaded (20)

PDF
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PDF
PPT on Performance Review to get promotions
PDF
Well-logging-methods_new................
PPTX
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
PPTX
Foundation to blockchain - A guide to Blockchain Tech
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
PPTX
Construction Project Organization Group 2.pptx
PPTX
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
PPT
Mechanical Engineering MATERIALS Selection
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
PPTX
bas. eng. economics group 4 presentation 1.pptx
PPTX
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PDF
R24 SURVEYING LAB MANUAL for civil enggi
DOCX
573137875-Attendance-Management-System-original
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PPTX
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
PPTX
additive manufacturing of ss316l using mig welding
The CXO Playbook 2025 – Future-Ready Strategies for C-Suite Leaders Cerebrai...
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PPT on Performance Review to get promotions
Well-logging-methods_new................
Recipes for Real Time Voice AI WebRTC, SLMs and Open Source Software.pptx
Foundation to blockchain - A guide to Blockchain Tech
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
Construction Project Organization Group 2.pptx
CARTOGRAPHY AND GEOINFORMATION VISUALIZATION chapter1 NPTE (2).pptx
Mechanical Engineering MATERIALS Selection
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
bas. eng. economics group 4 presentation 1.pptx
Infosys Presentation by1.Riyan Bagwan 2.Samadhan Naiknavare 3.Gaurav Shinde 4...
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
R24 SURVEYING LAB MANUAL for civil enggi
573137875-Attendance-Management-System-original
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
MCN 401 KTU-2019-PPE KITS-MODULE 2.pptx
additive manufacturing of ss316l using mig welding

Rethinking Data Augmentation for Image Super-resolution: A Comprehensive Analysis and a New Strategy

  • 1. Rethinking Data Augmentation for Image Super-resolution : A Comprehensive Analysis and a New Strategy; CutBlur & MoA Jaejun Yoo*, Namhyuk Ahn*, and Kyung-Ah Sohn
  • 2. Motivation * Various levels of vision tasks High-level: Semantic recognition (e.g., classification, object detection) Mid-level: Super-pixel (e.g., segmentation) Low-level: Pixels, Edges, Colors (e.g., super-resolution, denoising) ????? MixUp, Cutout, CutMix …
  • 3. Analysis on existing DA methods “Sharp transitions, mixed image contents or losing the relationships of pixels can degrade SR performance.” e.g., Cutout fails (discarding pixels) and every feature method fails (manipulation). Training curves when applied feature DA’s
  • 4. Analysis on existing DA methods • DA methods in pixel space bring some improvements when applied very carefully.
  • 5. Analysis on existing DA methods • DA methods in pixel space bring some improvements when applied very carefully. • Cutout: Original setting (drop 25% of of pixels in a rectangular shape) significantly degrades the performance because it erases spatial information too much. However, erasing tiny amount of pixels (0.1% random pixels) boosts the performance (2~3 pixels of 48x48 input patch) Cutout
  • 6. Analysis on existing DA methods • DA methods in pixel space bring some improvements when applied very carefully. • Mixup & CutMix: Improvements of using CutMix are marginal. We suspect this happens because CutMix generates a drastic sharp transition between two different images. Improvements of using Mixup is better than CutMix but it still generates unrealistic image and affects to the image structure. Mixup CutMix
  • 7. Analysis on existing DA methods • DA methods in pixel space bring some improvements when applied very carefully. • CutMixup: To verify our hypothesis, we combine benefits of Mixup and CutMix; CutMixup. CutMixup provides various boundary cases while minimizes the sharp transition by retaining partial cues as Mixup does. CutMixup
  • 8. Analysis on existing DA methods • DA methods in pixel space bring some improvements when applied very carefully. • Blend & RGB permutation: To push further, we tried a constant blending and RGB channel permutation, which turn out to be very simple but effective strategies showing big performance enhancement (dB). Note that both methods do not incur any structure modification to an image. BlendRGB perm.
  • 10. CutBlur • What does the model learn from CutBlur? • CutBlur prevents the SR model from over-sharpening an image and helps it to super-resolve only the necessary region. Super-resolution results of a model (EDSR) trained without CutBlur and its error residual (Δ) Error residual (Δ)Output
  • 11. CutBlur • What does the model learn from CutBlur? • CutBlur prevents the SR model from over-sharpening an image and helps it to super-resolve only the necessary region. Super-resolution results of a model (EDSR) trained CutBlur and its error residual (Δ) Error residual (Δ)Output with
  • 12. Improved generalization: over-sharpening • Super-resolution (SR) • Trained on ×4 scale factor dataset and tested on different scale factor (×2)
  • 13. Improved generalization: over-smoothing • Denoising • Trained on severe noise (! = 70) & tested on mild noise (! = 30).
  • 14. Improved generalization: over-removal • JPEG artifact removal • Trained on a mild compression rate & tested on a severe compression rate
  • 15. Mixture of Augmentation (MoA) • During the training phase … • Randomly select single augmentation at every step. (among the curated DA list) • Apply it!
  • 16. Comparison on diverse benchmark models and datasets • SRCNN (0.07M) – ECCV’14, CARN (1.14M) – ECCV’18, RCAN (15.6M) – ECCV’18, EDSR (43.1M) – CVPRW’17 • DIV2K (synthetic), RealSR (real-world) • Our method shows consistent improvement for different models (parameters) and datasets (different environments and size):