SlideShare a Scribd company logo
DA-CIL: Towards Domain Adaptive Class-Incremental
3D Object Detection
Ziyuan Zhao1,2, Mingxi Xu1,3, Peisheng Qian1, Ramanpreet Singh Pahwa1,2, and Richard Chang1
Presenter: Zhao Ziyuan
1 Institute for Infocomm Research (I2R), A*STAR, Singapore
2 Artificial Intelligence Analytics & Informatics (AI3), A*STAR, Singapore
3 Nanyang Technological University, Singapore
Paper ID: 916
2
Introduction – Catastrophic Forgetting & Domain Shift
- Deep learning-based 3D object detection has received considerable attention in 3D point clouds.
- Catastrophic forgetting: the performance of DL models on old classes tends to decrease substantially
when trained on novel classes.
- Domain Shift: DL models trained on one domain (i.e., source domain) always suffer tremendous
performance degradation when evaluated on another domain (i.e., target domain).
3
Motivation – Domain Adaptive Class-incremental Learning
- Existing CIL and UDA methods are designed for addressing different challenges separately, ignoring
that the catastrophic forgetting and domain shift problems concurrently exist in 3D object detection
on point clouds of real-world environments.
- We aim to leverage labeled old classes on the source domain and labeled new classes on the
target domain to close the domain shift and adapt to new classes without forgetting old classes on the
target domain in DA-CIL.
4
Method – Transformation-consistent Meta-hallucination
- We proposed a new CIL paradigm to enable Domain Adaptive Class-Incremental Learning for 3D object
detection.
- Dual-domain copy-paste (CP) address both in-domain data scarcity and cross-domain distribution
shift.
- Dual-teacher knowledge distillation (KD) enforces multi-level consistency regularization (MLC).
5
Method (1) – Dual-Domain Copy-Paste
- To relieve data scarcity and reduce the domain gap at the data level, we extensively leverage copy-
paste (CP) augmentation techniques for creating cross-domain and in-domain point clouds.
6
Method (2) – Multi-Level Consistency
- To facilitate the dual-teacher knowledge transfer, we propose multi-level consistency regularization from
two aspects.
- Statistics-Level (SC) Consistency ( Batch Normalization parameters alignment)
- Bounding Box-Level Consistency ( center-, class and size-level consistency loss)
Na Zhao, Tat-Seng Chua, and Gim Hee Lee. Sess: Self-ensembling semi-supervised 3d object detection. CVPR, pp.11079–11087, 2020.
7
Method (3) – Dual-Teacher Training
- Cross-domain Teacher: The student model learns the underlying knowledge in base classes from a
cross-domain teacher via distillation loss (square of Euclidean distance between the classification
logits of different bounding boxes).
- In-domain Teacher: Meanwhile, the EMA in-domain teacher helps student model capture structure and
semantic invariant information in objects with consistency loss
- The mixed labels (pseudo base + real novel) are transformed by the same augmentation step that is
applied on the augmented source domain to compute a supervised loss with the backbone VoteNet
Charles R Qi, Or Litany, Kaiming He, and Leonidas J Guibas. Deep hough voting for 3d object detection in point clouds. CVPR, pp. 9277–9286, 2019.
8
Experimental Results
- Dataset
- We evaluate our method on 3D object detection datasets, ScanNet (source) and SUN RGB-D
(target).
- 5 categories (bathtub, bed, bookshelf, chair, desk) were selected as base classes.
- 5 additional categories (dresser, nightstand, sofa, table, toilet) in SUN RGB-D were selected as
novel classes in the target domain.
ScanNet dataset SUN RGB-D dataset
9
Experimental Results (1) – Comparison with different methods
- For class-incremental learning, we evaluated 2 naive transfer learning baselines Freeze-and-add,
and Fine-tune. We also used SDCoT as a strong baseline for CIL.
- For unsupervised domain adaptation, We implemented one recent popular self-ensembling method,
mean teacher (MT).
- Our method outperforms many methods by a clear margin in CIL under domain shift.
Na Zhao, Tat-Seng Chua, and Gim Hee Lee. Sess: Self-ensembling semi-supervised 3d object detection. CVPR, pp.11079–11087, 2020.
10
Experimental Results (2) – Comparison with different augmentation techniques
- We implemented two popular augmentation techniques for comparison, Mix3D and CutMix.
- Mix3D: combines 2 point clouds into a mixed point cloud via concatenation, which results in
excessive overlaps in the mixed point clouds
- CutMix: randomly replaces patches of point clouds with patches from other point clouds, which
unnecessarily introduces extra context from the source domain.
- Our method outperforms the two augmentation techniques.
11
Experimental Results (3) – Ablation Analysis
- Without cross-domain copy-paste, the model achieves a lower result in base classes, which proves
the effectiveness of cross-domain copy-paste augmentation in base class recognition.
- Similarly, it can be inferred from results without in-domain augmentation that in-domain copy-paste
can enhance the model performance in novel classes.
- Moreover, statistics-level consistency provides around 0.8% performance increment in all classes.
12
Experimental Results (4) – Qualitative Comparison (good cases)
- Our method can accurately detect both
old and novel classes in the target
domain, overcoming the domain shift in
old classes.
13
Experimental Results (4) – Qualitative Comparison (failure cases)
- Fail to detect the bookshelf in the point clouds.
The failure is due to the loss of the geometric
structure of the bookshelf in the point cloud data.
- The desk object is misclassified as a table,
which is likely due to geometric similarities
between the 2 classes.
- The bounding box of the desk object deviates
from the ground-truth, which can be attributed to
the large size and irregular shape of the object.
14
Conclusions
- We identify and explore a novel domain adaptive class-incremental learning paradigm for 3D object
detection.
- We propose a novel 3D object detection framework, in which we design a novel dual-domain copy-
paste augmentation method to address both in-domain data scarcity and cross-domain distribution shift.
- We further enhance the dual-teacher knowledge distillation with multi-level consistency between
different domains.
- We achieve superior performance in extensive experiments.
15
Conclusions
- We identify and explore a novel domain adaptive class-incremental learning paradigm for 3D object
detection.
- We propose a novel 3D object detection framework, in which we design a novel dual-domain copy-
paste augmentation method to address both in-domain data scarcity and cross-domain distribution shift.
- We further enhance the dual-teacher knowledge distillation with multi-level consistency between
different domains.
- We achieve superior performance in extensive experiments.
Thanks
Zhao_Ziyuan@i2r.a-star.edu.sg
https://jacobzhaoziyuan.github.io/

More Related Content

PPTX
Deep learning based object detection
PPTX
Few shot learning/ one shot learning/ machine learning
PDF
Understanding Convolutional Neural Networks
PPTX
Object detection presentation
PDF
Deep learning based object detection basics
PPTX
Object detection
PDF
Image Captioning Generator using Deep Machine Learning
PPTX
Image classification using CNN
Deep learning based object detection
Few shot learning/ one shot learning/ machine learning
Understanding Convolutional Neural Networks
Object detection presentation
Deep learning based object detection basics
Object detection
Image Captioning Generator using Deep Machine Learning
Image classification using CNN

What's hot (20)

PPTX
Convolutional Neural Network
PDF
Introduction to object detection
PPTX
Image classification with Deep Neural Networks
PDF
Generative adversarial networks
PDF
CIFAR-10
PDF
Object detection and Instance Segmentation
PDF
Exploring Generating AI with Diffusion Models
PPTX
Transfer Learning and Fine-tuning Deep Neural Networks
PPTX
Object detection with deep learning
PDF
Deep Learning for Computer Vision: Object Detection (UPC 2016)
PPTX
Explaining video summarization based on the focus of attention
PDF
Seminar(Pattern Recognition)
PDF
A survey of deep learning approaches to medical applications
PPTX
Semi-Supervised Learning
PDF
Machine Learning - Object Detection and Classification
PPTX
Transfer learning-presentation
PDF
Introduction to Few shot learning
PPTX
Introduction to Generative Models.pptx
PPTX
Image Segmentation Using Deep Learning : A survey
Convolutional Neural Network
Introduction to object detection
Image classification with Deep Neural Networks
Generative adversarial networks
CIFAR-10
Object detection and Instance Segmentation
Exploring Generating AI with Diffusion Models
Transfer Learning and Fine-tuning Deep Neural Networks
Object detection with deep learning
Deep Learning for Computer Vision: Object Detection (UPC 2016)
Explaining video summarization based on the focus of attention
Seminar(Pattern Recognition)
A survey of deep learning approaches to medical applications
Semi-Supervised Learning
Machine Learning - Object Detection and Classification
Transfer learning-presentation
Introduction to Few shot learning
Introduction to Generative Models.pptx
Image Segmentation Using Deep Learning : A survey
Ad

Similar to [BMVC 2022] DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detection (20)

PDF
GeoAI: A Model-Agnostic Meta-Ensemble Zero-Shot Learning Method for Hyperspec...
PDF
Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...
PDF
Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018
PDF
Deep Image Clustering Based on Label Similarity and Maximizing Mutual Informa...
PPTX
Unveiling deforestation patterns using satellite imagery.pptx
PPTX
Nuts and Bolts of Transfer Learning.pptx
PPTX
Machine learning clustering
PPTX
Machine learning clustering
PPT
ClusetrigBasic.ppt
PDF
K means Clustering
PPTX
Could a Data Science Program use Data Science Insights?
PPTX
Recent Progress on Object Detection_20170331
PDF
ME Synopsis
PDF
CrossScenarioTransferPersonReidentification_finalManuscript
PDF
Analysis on Domain Adaptation based on different papers
PDF
PPTX
Deep Semi-supervised Learning methods
DOCX
ON DISTRIBUTED FUZZY DECISION TREES FOR BIG DATA
PPTX
Shift-Robust Node Classification via Graph Adversarial Clustering Neurips 202...
PDF
Relational knowledge distillation
GeoAI: A Model-Agnostic Meta-Ensemble Zero-Shot Learning Method for Hyperspec...
Transfer Learning and Domain Adaptation (DLAI D5L2 2017 UPC Deep Learning for...
Transfer Learning and Domain Adaptation - Ramon Morros - UPC Barcelona 2018
Deep Image Clustering Based on Label Similarity and Maximizing Mutual Informa...
Unveiling deforestation patterns using satellite imagery.pptx
Nuts and Bolts of Transfer Learning.pptx
Machine learning clustering
Machine learning clustering
ClusetrigBasic.ppt
K means Clustering
Could a Data Science Program use Data Science Insights?
Recent Progress on Object Detection_20170331
ME Synopsis
CrossScenarioTransferPersonReidentification_finalManuscript
Analysis on Domain Adaptation based on different papers
Deep Semi-supervised Learning methods
ON DISTRIBUTED FUZZY DECISION TREES FOR BIG DATA
Shift-Robust Node Classification via Graph Adversarial Clustering Neurips 202...
Relational knowledge distillation
Ad

More from Ziyuan Zhao (15)

PPTX
[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...
PPTX
[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...
PPTX
[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning for Medical...
PDF
[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...
PPTX
[ICIP 2022] ACT-NET: Asymmetric Co-Teacher Network for Semi-Supervised Memory...
PPTX
[ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning fo...
PPTX
[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...
PPTX
[MICCAI 2022] Meta-hallucinator: Towards Few-Shot Cross-Modality Cardiac Imag...
PDF
[EMBC 2022] Self-supervised Assisted Active Learning for Skin Lesion Segmenta...
PDF
[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age Estimation
PDF
[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...
PPTX
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
PPTX
[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...
PDF
[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...
PDF
[MICCAI 2021 - Poster] MT-UDA: Towards unsupervised cross-modality medical im...
[IJCAI 2023] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network for Effi...
[IJCAI 2023 - Poster] SemiGNN-PPI: Self-Ensembling Multi-Graph Neural Network...
[IAIM 2023 - Poster] Label-efficient Generalizable Deep Learning for Medical...
[BMVC 2022 - Spotlight] DA-CIL: Towards Domain Adaptive Class-Incremental 3D ...
[ICIP 2022] ACT-NET: Asymmetric Co-Teacher Network for Semi-Supervised Memory...
[ICIP 2022] MMGL: Multi-Scale Multi-View Global-Local Contrastive learning fo...
[ICIP 2022 - Poster] MMGL: Multi-Scale Multi-View Global-Local Contrastive le...
[MICCAI 2022] Meta-hallucinator: Towards Few-Shot Cross-Modality Cardiac Imag...
[EMBC 2022] Self-supervised Assisted Active Learning for Skin Lesion Segmenta...
[ICME 2022] Adaptive Mean-Residue Loss for Robust Facial Age Estimation
[MICCAI 2021] MT-UDA: Towards unsupervised cross-modality medical image segme...
[EMBC 2021] Hierarchical Consistency Regularized Mean Teacher for Semi-superv...
[EMBC 2021] Multi Slice Dense Sparse Learning for Efficient Liver and Tumor S...
[ICIP 2020] SEA-Net: Squeeze-and-Excitation Attention Net for Diabetic Retino...
[MICCAI 2021 - Poster] MT-UDA: Towards unsupervised cross-modality medical im...

Recently uploaded (20)

PDF
Spectral efficient network and resource selection model in 5G networks
PPTX
Cloud computing and distributed systems.
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
Machine learning based COVID-19 study performance prediction
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PPTX
Spectroscopy.pptx food analysis technology
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
KodekX | Application Modernization Development
PDF
Electronic commerce courselecture one. Pdf
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PPTX
Big Data Technologies - Introduction.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PPTX
sap open course for s4hana steps from ECC to s4
PDF
Approach and Philosophy of On baking technology
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
Spectral efficient network and resource selection model in 5G networks
Cloud computing and distributed systems.
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
Machine learning based COVID-19 study performance prediction
Diabetes mellitus diagnosis method based random forest with bat algorithm
Spectroscopy.pptx food analysis technology
20250228 LYD VKU AI Blended-Learning.pptx
Unlocking AI with Model Context Protocol (MCP)
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
KodekX | Application Modernization Development
Electronic commerce courselecture one. Pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Building Integrated photovoltaic BIPV_UPV.pdf
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Big Data Technologies - Introduction.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
sap open course for s4hana steps from ECC to s4
Approach and Philosophy of On baking technology
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
MIND Revenue Release Quarter 2 2025 Press Release

[BMVC 2022] DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detection

  • 1. DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detection Ziyuan Zhao1,2, Mingxi Xu1,3, Peisheng Qian1, Ramanpreet Singh Pahwa1,2, and Richard Chang1 Presenter: Zhao Ziyuan 1 Institute for Infocomm Research (I2R), A*STAR, Singapore 2 Artificial Intelligence Analytics & Informatics (AI3), A*STAR, Singapore 3 Nanyang Technological University, Singapore Paper ID: 916
  • 2. 2 Introduction – Catastrophic Forgetting & Domain Shift - Deep learning-based 3D object detection has received considerable attention in 3D point clouds. - Catastrophic forgetting: the performance of DL models on old classes tends to decrease substantially when trained on novel classes. - Domain Shift: DL models trained on one domain (i.e., source domain) always suffer tremendous performance degradation when evaluated on another domain (i.e., target domain).
  • 3. 3 Motivation – Domain Adaptive Class-incremental Learning - Existing CIL and UDA methods are designed for addressing different challenges separately, ignoring that the catastrophic forgetting and domain shift problems concurrently exist in 3D object detection on point clouds of real-world environments. - We aim to leverage labeled old classes on the source domain and labeled new classes on the target domain to close the domain shift and adapt to new classes without forgetting old classes on the target domain in DA-CIL.
  • 4. 4 Method – Transformation-consistent Meta-hallucination - We proposed a new CIL paradigm to enable Domain Adaptive Class-Incremental Learning for 3D object detection. - Dual-domain copy-paste (CP) address both in-domain data scarcity and cross-domain distribution shift. - Dual-teacher knowledge distillation (KD) enforces multi-level consistency regularization (MLC).
  • 5. 5 Method (1) – Dual-Domain Copy-Paste - To relieve data scarcity and reduce the domain gap at the data level, we extensively leverage copy- paste (CP) augmentation techniques for creating cross-domain and in-domain point clouds.
  • 6. 6 Method (2) – Multi-Level Consistency - To facilitate the dual-teacher knowledge transfer, we propose multi-level consistency regularization from two aspects. - Statistics-Level (SC) Consistency ( Batch Normalization parameters alignment) - Bounding Box-Level Consistency ( center-, class and size-level consistency loss) Na Zhao, Tat-Seng Chua, and Gim Hee Lee. Sess: Self-ensembling semi-supervised 3d object detection. CVPR, pp.11079–11087, 2020.
  • 7. 7 Method (3) – Dual-Teacher Training - Cross-domain Teacher: The student model learns the underlying knowledge in base classes from a cross-domain teacher via distillation loss (square of Euclidean distance between the classification logits of different bounding boxes). - In-domain Teacher: Meanwhile, the EMA in-domain teacher helps student model capture structure and semantic invariant information in objects with consistency loss - The mixed labels (pseudo base + real novel) are transformed by the same augmentation step that is applied on the augmented source domain to compute a supervised loss with the backbone VoteNet Charles R Qi, Or Litany, Kaiming He, and Leonidas J Guibas. Deep hough voting for 3d object detection in point clouds. CVPR, pp. 9277–9286, 2019.
  • 8. 8 Experimental Results - Dataset - We evaluate our method on 3D object detection datasets, ScanNet (source) and SUN RGB-D (target). - 5 categories (bathtub, bed, bookshelf, chair, desk) were selected as base classes. - 5 additional categories (dresser, nightstand, sofa, table, toilet) in SUN RGB-D were selected as novel classes in the target domain. ScanNet dataset SUN RGB-D dataset
  • 9. 9 Experimental Results (1) – Comparison with different methods - For class-incremental learning, we evaluated 2 naive transfer learning baselines Freeze-and-add, and Fine-tune. We also used SDCoT as a strong baseline for CIL. - For unsupervised domain adaptation, We implemented one recent popular self-ensembling method, mean teacher (MT). - Our method outperforms many methods by a clear margin in CIL under domain shift. Na Zhao, Tat-Seng Chua, and Gim Hee Lee. Sess: Self-ensembling semi-supervised 3d object detection. CVPR, pp.11079–11087, 2020.
  • 10. 10 Experimental Results (2) – Comparison with different augmentation techniques - We implemented two popular augmentation techniques for comparison, Mix3D and CutMix. - Mix3D: combines 2 point clouds into a mixed point cloud via concatenation, which results in excessive overlaps in the mixed point clouds - CutMix: randomly replaces patches of point clouds with patches from other point clouds, which unnecessarily introduces extra context from the source domain. - Our method outperforms the two augmentation techniques.
  • 11. 11 Experimental Results (3) – Ablation Analysis - Without cross-domain copy-paste, the model achieves a lower result in base classes, which proves the effectiveness of cross-domain copy-paste augmentation in base class recognition. - Similarly, it can be inferred from results without in-domain augmentation that in-domain copy-paste can enhance the model performance in novel classes. - Moreover, statistics-level consistency provides around 0.8% performance increment in all classes.
  • 12. 12 Experimental Results (4) – Qualitative Comparison (good cases) - Our method can accurately detect both old and novel classes in the target domain, overcoming the domain shift in old classes.
  • 13. 13 Experimental Results (4) – Qualitative Comparison (failure cases) - Fail to detect the bookshelf in the point clouds. The failure is due to the loss of the geometric structure of the bookshelf in the point cloud data. - The desk object is misclassified as a table, which is likely due to geometric similarities between the 2 classes. - The bounding box of the desk object deviates from the ground-truth, which can be attributed to the large size and irregular shape of the object.
  • 14. 14 Conclusions - We identify and explore a novel domain adaptive class-incremental learning paradigm for 3D object detection. - We propose a novel 3D object detection framework, in which we design a novel dual-domain copy- paste augmentation method to address both in-domain data scarcity and cross-domain distribution shift. - We further enhance the dual-teacher knowledge distillation with multi-level consistency between different domains. - We achieve superior performance in extensive experiments.
  • 15. 15 Conclusions - We identify and explore a novel domain adaptive class-incremental learning paradigm for 3D object detection. - We propose a novel 3D object detection framework, in which we design a novel dual-domain copy- paste augmentation method to address both in-domain data scarcity and cross-domain distribution shift. - We further enhance the dual-teacher knowledge distillation with multi-level consistency between different domains. - We achieve superior performance in extensive experiments.

Editor's Notes

  • #2: Hello everyone. Welcome to my presentation. My name is Zhao Ziyuan. Here I am presenting our paper titled DA-CIL: Towards Domain Adaptive Class-Incremental 3D Object Detection.
  • #3: Deep learning-based 3D object detection has received considerable attention in 3D point clouds. But there are still many challenges in the real scenario. Firstly, the catastrophic forgetting problem seriously limits model performance in class incremental learning (CIL). Secondly, as shown in the figure, there are distribution gaps between two different datasets, and the distribution gap is part of the domain gap. So, in the presence of the domain shift, deep learning models trained on one domain always suffer tremendous performance degradation when evaluated on another domain.
  • #4: Existing CIL and UDA methods address different challenges separately, ignoring that the catastrophic forgetting and domain shift concurrently exist in 3D object detection on point clouds of real-world environments. And we aim to leverage labeled old classes on the source domain and labeled new classes on the target domain to close the domain shift and adapt to new classes without forgetting old classes on the target domain in DA-CIL.
  • #5: As fig3 shows, our DA-CIL framework contains two steps: First, cross-domain intermediate point clouds and in-domain augmented point clouds are generated by dual-domain copy-paste augmentation. Then, a dual-teacher network is constructed to distill knowledge from these domains for domain adaptive class-incremental learning.
  • #6: As shown in fig 4, our dual-domain copy paste first extracts objects from the source domain to populate the ground-truth database. Then objects are copy-pasted into point clouds from the source domain and the target domain to generate the augmented domain and the intermediate domain, respectively.
  • #7: To facilitate the dual-teacher knowledge transfer, we propose multi-level consistency regularization from two aspects: Statistics-Level Consistency and Bounding Box-Level Consistency. Statistics-Level Consistency is implemented in the feedforward process and Bounding Box-Level Consistency is calculated as above equations.
  • #8: Read slide
  • #9: We evaluate our method on 3D object detection datasets, ScanNet and SUN RGB-D. We set 5 categories as base classes and addition 5 categories in SUN RGB-D as novel classes. The above two figures present the gap between the two datasets intuitively.
  • #10: We evaluated 2 naive transfer learning baselines Freeze-and-add, and Fine-tune, one recent popular self-ensembling method, Mean Teacher and SDCoT, the co-teaching architecture, as a strong baseline for class-incremental learning. As table 1 shows, our method outperforms many methods by a clear margin in CIL under domain shift.
  • #11: Read slide
  • #12: To evaluate the effectiveness of different components in our method for CIL under Domain Shift, we perform ablation studies by removing components in our approach. +slide
  • #13: As illustrated in left figure, our method is able to accurately detect both old and novel classes in the target domain, overcoming the domain shift in old classes. Moreover, our model consistently outperforms the baseline SDCoT in both scenarios.
  • #14: Read
  • #15: Read slide
  • #16: Read slide