SlideShare a Scribd company logo
Adaptive Consistency Regularization for
Semi-Supervised Transfer Learning
Abuduweili et al. (CVPR 2021)
Dongmin Choi
Yonsei University Translational Artificial Intelligence Lab
Introduction
Semi-Supervised Learning (SSL)
• Effectively leveraging both labeled and unlabeled data
• Three main approaches:
1) consistency based regularization
2) entropy minimization
3) pseudo label
Introduction
Transfer Learning
• The powerful pre-trained model
1) excellent transferability
2) generalization capacity
• Zhou et al.
1) the benefit of SSL are smaller when trained from a pre-trained model
2) combining SSL and transfer learning can solve the domain gap
[Zhou et al, When Semi-Supervised Learning Meets Transfer Learning: Training Strategies, Models and Datasets, arXiv 2018]
Introduction
A Semi-Supervised Transfer Learning Framework
• Extend consistency regularization in SSL to adapt the
inductive transfer learning
• Two essential components:
1) Adaptive Knowledge Consistency (AKC)
- transfer knowledge from the pre-trained model
2) Adaptive Representation Consistency (ARC)
- utilize unlabeled examples to adjust the representation
Related Work
Domain Adaptation
• Tackle the sample selection bias btw the training and test data
• Generate domain invariant representation over the training set
• 내용 추가 필요
Related Work
Semi-Supervised Learning
• Consistency based regularization
- hypothesis : the decision boundary should not pass through high-
density areas
→ two close inputs are expected to have the same label
[Engelen et al, A survey on semi-supervised learning, Machine Learning 2020
Related Work
Semi-Supervised Learning
• П-model
[Laine, Temporal Ensembling for Semi-Supervised Learning, ICLR 2017
Targets can be noisy
prior network evaluations
Related Work
Semi-Supervised Learning
• Mean Teacher
[Tarvainen, Mean teachers are better role models: Weight-averaged consistency targets improve semi-supervised deep learning results, NIPS 2017
Averages model weights instead of label predictions
Related Work
Semi-Supervised Learning
• FixMatch
[Sohn et al., FixMatch: Simplifying Semi-Supervised Learning with Consistency and Confidence, NeurIPS 2020
Consistency regularization + Pseudo labeling
The Proposed Framework
The Proposed Framework
𝑫𝒕
𝒍
𝑫𝒕
𝒖
𝑭𝜽𝟎
𝑭𝜽
𝑮𝜽𝟎
𝑮𝜽
𝜃∗
, 𝜙∗
= arg min
𝜃,𝜙
∑𝑖=1
𝑛
𝐿CE 𝜃, 𝜙; 𝑥𝑙
𝑖
+ 𝑅 𝜃
The Proposed Framework
𝑅𝐾 =
1
𝐵𝑙 + 𝐵𝑢
෍
𝑥𝑖∈𝐿∪𝑈
𝑤K
𝑖
KL 𝐹𝜃0 𝑥𝑖
, 𝐹𝜃 𝑥𝑖
1. Adaptive Knowledge Consistency (AKC)
The Proposed Framework
𝑅𝐾 =
1
𝐵𝑙 + 𝐵𝑢
෍
𝑥𝑖∈𝐿∪𝑈
𝒘𝐊
𝒊
KL 𝐹𝜃0 𝑥𝑖
, 𝐹𝜃 𝑥𝑖
1. Adaptive Knowledge Consistency (AKC)
Sample importance 𝒘𝐊
𝒊
= 𝐈 𝐇 𝐩𝒔
𝒊
≤ 𝝐𝐊
- An entropy function H p𝑠
𝑖 = − ∑𝑗=1
𝐶𝑠
p𝑠,𝑗
𝑖
log p𝑠,𝑗
𝑖
- I : a hard entropy-gate function (calculated entropy → binary sample importance)
The Proposed Framework
2. Adaptive Representation Consistency (ARC)
Maximum Mean Discrepancies (MMD)
to measure the distance
(Let’s skip the details!)
The Proposed Framework
Summarization of the Framework
𝐿 𝜃, 𝜙 =
1
𝑛
෍
𝑖=1
𝑛
𝐿CE 𝜃, 𝜙; 𝑥𝑙
𝑖
+ 𝜆S𝐿S 𝑥𝑢
𝑖
+ 𝜆K𝑅K 𝑥𝑙
𝑖
, 𝑥𝑢
𝑖
+ 𝜆R𝐿R 𝑥𝑙
𝑖
, 𝑥𝑢
𝑖
1
2
3
4
1 2 3 4
Experiments
Results on CUB-200-2011
Experiments
Results on MURA
Experiments
Results on CIFAR-10
Experiments
Results on CIFAR-10
Experiments
The actual sample selected ratio in ARC and AKC
Near 0.9
- exclude hard samples
Experiments
In Fully Supervised Transfer Learning
Conclusion
Two regularization methods : AKC and ARC
• Competitive among S.O.T.A SSL methods
• Best performance among several baseline methods on various
transfer learning benchmarks
• Can be used for more general transfer learning and (semi-)
supervised learning frameworks
Thank you

More Related Content

PDF
Learning loss for active learning
PDF
Naver learning to rank question answer pairs using hrde-ltc
PDF
ViT (Vision Transformer) Review [CDM]
PDF
Icml2018 naver review
PDF
Relational knowledge distillation
PDF
Review : Prototype Mixture Models for Few-shot Semantic Segmentation
PDF
Review : Rethinking Pre-training and Self-training
PDF
Review: Incremental Few-shot Instance Segmentation [CDM]
Learning loss for active learning
Naver learning to rank question answer pairs using hrde-ltc
ViT (Vision Transformer) Review [CDM]
Icml2018 naver review
Relational knowledge distillation
Review : Prototype Mixture Models for Few-shot Semantic Segmentation
Review : Rethinking Pre-training and Self-training
Review: Incremental Few-shot Instance Segmentation [CDM]

What's hot (20)

PDF
Generative Models for General Audiences
PDF
Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]
PDF
Paraphrasing complex network
PDF
Test for AI model
PDF
[poster] A Compare-Aggregate Model with Latent Clustering for Answer Selection
PPTX
Explainable AI
PDF
Tutorial on Deep Generative Models
PDF
Network recasting
PDF
Backbone can not be trained at once rolling back to pre trained network for p...
PDF
Overview of Machine Learning for Molecules and Materials Workshop @ NIPS2017
PDF
Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...
PDF
How much position information do convolutional neural networks encode? review...
PDF
Comparing Incremental Learning Strategies for Convolutional Neural Networks
PDF
Dual Learning for Machine Translation (NIPS 2016)
PDF
MaxEnt (Loglinear) Models - Overview
PDF
Knowledge distillation deeplab
PDF
Model-Based Reinforcement Learning @NIPS2017
PDF
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
PPTX
One shot learning
PDF
[Introduction] Neural Network-Based Abstract Generation for Opinions and Argu...
Generative Models for General Audiences
Review : Multi-Domain Image Completion for Random Missing Input Data [cdm]
Paraphrasing complex network
Test for AI model
[poster] A Compare-Aggregate Model with Latent Clustering for Answer Selection
Explainable AI
Tutorial on Deep Generative Models
Network recasting
Backbone can not be trained at once rolling back to pre trained network for p...
Overview of Machine Learning for Molecules and Materials Workshop @ NIPS2017
Challenging Common Assumptions in the Unsupervised Learning of Disentangled R...
How much position information do convolutional neural networks encode? review...
Comparing Incremental Learning Strategies for Convolutional Neural Networks
Dual Learning for Machine Translation (NIPS 2016)
MaxEnt (Loglinear) Models - Overview
Knowledge distillation deeplab
Model-Based Reinforcement Learning @NIPS2017
Deep Learning for Computer Vision: A comparision between Convolutional Neural...
One shot learning
[Introduction] Neural Network-Based Abstract Generation for Opinions and Argu...
Ad

Similar to Review : Adaptive Consistency Regularization for Semi-Supervised Transfer Learning (20)

PDF
(SURVEY) Semi Supervised Learning
PPTX
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
PDF
Neural Semi-supervised Learning under Domain Shift
PPTX
End to-end semi-supervised object detection with soft teacher ver.1.0
PDF
M4L18 Unsupervised and Semi-Supervised Learning - Slides v2.pdf
PPTX
Semi supervised learning machine learning made simple
PPTX
Self-ensembling for visual domain adpation
PDF
PhD Defense Slides
PPTX
Predicting More from Less: Synergies of Learning
PDF
Preliminary Exam Slides
PDF
Meta Pseudo Label - InsideAIML
PDF
FixMatch:simplifying semi supervised learning with consistency and confidence
PDF
On Semi-Supervised Learning and Beyond
PDF
Distribution Aligning Refinery of Pseudo-label for Imbalanced Semi-supervised...
PPTX
PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
PDF
Lifelong / Incremental Deep Learning - Ramon Morros - UPC Barcelona 2018
PDF
Life-long / Incremental Learning (DLAI D6L1 2017 UPC Deep Learning for Artifi...
PPTX
Semi-supervised Learning Survey - 20 years of evaluation
PDF
Icann2018ppt final
PPT
Semi-supervised Learning
(SURVEY) Semi Supervised Learning
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
Neural Semi-supervised Learning under Domain Shift
End to-end semi-supervised object detection with soft teacher ver.1.0
M4L18 Unsupervised and Semi-Supervised Learning - Slides v2.pdf
Semi supervised learning machine learning made simple
Self-ensembling for visual domain adpation
PhD Defense Slides
Predicting More from Less: Synergies of Learning
Preliminary Exam Slides
Meta Pseudo Label - InsideAIML
FixMatch:simplifying semi supervised learning with consistency and confidence
On Semi-Supervised Learning and Beyond
Distribution Aligning Refinery of Pseudo-label for Imbalanced Semi-supervised...
PR-343: Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision
Lifelong / Incremental Deep Learning - Ramon Morros - UPC Barcelona 2018
Life-long / Incremental Learning (DLAI D6L1 2017 UPC Deep Learning for Artifi...
Semi-supervised Learning Survey - 20 years of evaluation
Icann2018ppt final
Semi-supervised Learning
Ad

More from Dongmin Choi (17)

PDF
[Review] BoxInst: High-Performance Instance Segmentation with Box Annotations...
PDF
Review: You Only Look One-level Feature
PDF
Transformer in Computer Vision
PDF
YolactEdge Review [cdm]
PDF
Review : Inter-slice Context Residual Learning for 3D Medical Image Segmentation
PDF
Deformable DETR Review [CDM]
PDF
Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...
PDF
Review : Structure Boundary Preserving Segmentation
for Medical Image with Am...
PDF
Pyradiomics Customization [CDM]
PDF
Seeing What a GAN Cannot Generate [cdm]
PDF
Neural network pruning with residual connections and limited-data review [cdm]
PDF
Network Deconvolution review [cdm]
PDF
Objects as points (CenterNet) review [CDM]
PDF
Augmix review [cdm]
PDF
Bag of tricks for image classification with convolutional neural networks r...
PPTX
ICCV 2019 REVIEW [CDM]
PDF
[Review] High-performance medicine: the convergence of human and artificial i...
[Review] BoxInst: High-Performance Instance Segmentation with Box Annotations...
Review: You Only Look One-level Feature
Transformer in Computer Vision
YolactEdge Review [cdm]
Review : Inter-slice Context Residual Learning for 3D Medical Image Segmentation
Deformable DETR Review [CDM]
Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...
Review : Structure Boundary Preserving Segmentation
for Medical Image with Am...
Pyradiomics Customization [CDM]
Seeing What a GAN Cannot Generate [cdm]
Neural network pruning with residual connections and limited-data review [cdm]
Network Deconvolution review [cdm]
Objects as points (CenterNet) review [CDM]
Augmix review [cdm]
Bag of tricks for image classification with convolutional neural networks r...
ICCV 2019 REVIEW [CDM]
[Review] High-performance medicine: the convergence of human and artificial i...

Recently uploaded (20)

PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
KodekX | Application Modernization Development
PDF
Encapsulation theory and applications.pdf
PDF
Network Security Unit 5.pdf for BCA BBA.
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PPTX
Cloud computing and distributed systems.
PPTX
Spectroscopy.pptx food analysis technology
PPTX
Big Data Technologies - Introduction.pptx
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Per capita expenditure prediction using model stacking based on satellite ima...
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Building Integrated photovoltaic BIPV_UPV.pdf
KodekX | Application Modernization Development
Encapsulation theory and applications.pdf
Network Security Unit 5.pdf for BCA BBA.
Understanding_Digital_Forensics_Presentation.pptx
Cloud computing and distributed systems.
Spectroscopy.pptx food analysis technology
Big Data Technologies - Introduction.pptx
Review of recent advances in non-invasive hemoglobin estimation
Spectral efficient network and resource selection model in 5G networks
Unlocking AI with Model Context Protocol (MCP)
Reach Out and Touch Someone: Haptics and Empathic Computing
MYSQL Presentation for SQL database connectivity
Chapter 3 Spatial Domain Image Processing.pdf
MIND Revenue Release Quarter 2 2025 Press Release
Advanced methodologies resolving dimensionality complications for autism neur...

Review : Adaptive Consistency Regularization for Semi-Supervised Transfer Learning