Review : Dense Prediction Tasks for Self-SL
Sungchul Kim
2022. 01. 06
Contents
▪ Introduction
▪ DenseCL (CVPR 2021), PixPro (CVPR 2021), SCRL (CVPR 2021)
▪ DetCon (ICCV 2021)
▪ MaskCo (ICCV 2021)
▪ ReSim (ICCV 2021)
▪ RegionCL (arXiv:2111.12309)
▪ SoCo (NeurIPS 2021)
Introduction
Self-Supervised Learning
▪ Learns representations from unlabeled data
• Pretext task
• Contrastive SSL vs. non-contrastive SSL
• Instance discrimination vs. dense prediction
• Downstream task: classification, detection, segmentation, …
▪ Instance Discrimination (image-level comparison)
• treats each image in the training set as its own class
SupCon : https://arxiv.org/pdf/2004.11362.pdf
SimSiam : https://arxiv.org/pdf/2011.10566.pdf
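The instance-discrimination idea above can be sketched with an InfoNCE-style loss: the embedding of one augmented view of image i should match the embedding of the other view of the same image and no other image in the batch. This is a minimal illustration, not the implementation from any of the cited papers; the function names, the temperature of 0.2, and the use of in-batch negatives are assumptions made for the example, and embeddings are assumed to be non-zero vectors.

```python
import math


def l2_normalize(v):
    """Scale a vector to unit length (assumes a non-zero vector)."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def info_nce(query, keys, positive_index, temperature=0.2):
    """InfoNCE loss for one query against a list of keys.

    Exactly one key (at positive_index) is the positive; the rest act as
    negatives. The temperature value is an illustrative assumption.
    """
    q = l2_normalize(query)
    logits = [dot(q, l2_normalize(k)) / temperature for k in keys]
    # Numerically stable log-sum-exp over all logits.
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[positive_index]


def instance_discrimination_loss(view_a, view_b, temperature=0.2):
    """Each image is its own class: view_a[i] should match view_b[i] only."""
    losses = [info_nce(q, view_b, i, temperature) for i, q in enumerate(view_a)]
    return sum(losses) / len(losses)


if __name__ == "__main__":
    # Two images, two augmented views each (toy 2-D embeddings).
    view_a = [[1.0, 0.0], [0.0, 1.0]]
    view_b = [[0.9, 0.1], [0.1, 0.9]]
    print(instance_discrimination_loss(view_a, view_b))
```

The loss is small when each image's two views agree and large when a view is closer to a different image, which is what "one class per image" amounts to in practice.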
▪ Dense Prediction (pixel-level representation)
• compares individual pixels or regions to measure how closely they are related
DenseCL : https://arxiv.org/pdf/2011.09157.pdf
MaskCo : https://arxiv.org/pdf/2108.07954.pdf
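A pixel-level comparison of this kind can be sketched as follows. The example is loosely inspired by the correspondence idea in DenseCL (each location in one view is paired with its most similar location in the other view) but is a simplified illustration under stated assumptions, not the method of any cited paper: feature maps are flattened to lists of non-zero pixel vectors, the same features are used for matching and for the loss, `negatives` is a hypothetical list of feature vectors standing in for other images, and the temperature of 0.2 is arbitrary.

```python
import math


def normalize(v):
    """Scale a vector to unit length (assumes a non-zero vector)."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]


def cosine(a, b):
    return sum(x * y for x, y in zip(normalize(a), normalize(b)))


def match_pixels(feats_a, feats_b):
    """For each pixel vector in view A, return the index of the most
    similar pixel vector in view B (cosine similarity)."""
    return [
        max(range(len(feats_b)), key=lambda j: cosine(fa, feats_b[j]))
        for fa in feats_a
    ]


def dense_contrastive_loss(feats_a, feats_b, negatives, temperature=0.2):
    """Average InfoNCE-style loss over pixels.

    The positive for each pixel in view A is its matched pixel in view B;
    `negatives` is a hypothetical pool of vectors from other images.
    """
    matches = match_pixels(feats_a, feats_b)
    total = 0.0
    for fa, j in zip(feats_a, matches):
        pos = cosine(fa, feats_b[j]) / temperature
        logits = [pos] + [cosine(fa, n) / temperature for n in negatives]
        # Numerically stable log-sum-exp over positive and negatives.
        m = max(logits)
        total += m + math.log(sum(math.exp(z - m) for z in logits)) - pos
    return total / len(feats_a)


if __name__ == "__main__":
    # View B is a "flipped" version of view A: pixel order is reversed.
    feats_a = [[1.0, 0.0], [0.0, 1.0]]
    feats_b = [[0.0, 1.0], [1.0, 0.0]]
    negatives = [[-1.0, 0.0], [0.0, -1.0]]
    print(match_pixels(feats_a, feats_b))
    print(dense_contrastive_loss(feats_a, feats_b, negatives))
```

Unlike the image-level case, the unit being compared here is a spatial location, so the loss depends on which pixels or regions correspond between the two views rather than on a single pooled vector per image.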
DenseCL (CVPR 2021), PixPro (CVPR 2021), SCRL (CVPR 2021)
DenseCL : https://arxiv.org/pdf/2011.09157.pdf
PixPro : https://arxiv.org/pdf/2011.10043.pdf
SCRL : https://arxiv.org/pdf/2103.06122.pdf
Figure panels: BYOL, SCRL
DetCon (ICCV 2021)
https://arxiv.org/pdf/2103.10957.pdf
MaskCo (ICCV 2021)
https://arxiv.org/pdf/2108.07954.pdf
ReSim (ICCV 2021)
https://arxiv.org/pdf/2103.12902.pdf
Result tables: Pascal VOC, MS COCO
RegionCL (arXiv:2111.12309)
https://arxiv.org/pdf/2111.12309.pdf
SoCo (NeurIPS 2021)
https://arxiv.org/pdf/2106.02637.pdf
