SlideShare a Scribd company logo
Deep learning for person re-identification
Jingya Wang
jingya.wang@usyd.edu.au
2
Person re-identification (re-id)
Person re-identification (re-id) aims at matching people
across non-overlapping camera views distributed at distinct
locations.
Camera A Camera B
Presentation Outline
• Supervised Person Re-Identification
• Unsupervised Person Re-Identification
• Active Learning for Person Re-Identification
Supervised Person Re-Identification
• Training and testing data are from same domain
Lavi, B., Serj, M.F. and Ullah, I., 2018. Survey on deep learning techniques for person re-identification task
Contrastive loss Triplet lossClassification loss
https://paperswithcode.com/sota/person-re-identification-on-market-1501
Deep learning for person re-identification
Deep learning for person re-identification
10
300
600
900
1200
1500
1800
2100
2400
2700
3000
3300
3600
3900
4200
4500
4800
5100
5400
5700
6000
6300
6600
6900
7200
7500
7800
8100
8400
8700
9000
9300
9600
9900102001050010800
AnnotationCost
Data Size
Challenge For Re-ID
rapid increase
Presentation Outline
• Supervised Person Re-Identification
• Unsupervised Person Re-Identification
• Active Learning for Person Re-Identification
Unsupervised Person Re-Identification
Training and testing data are from different domain
-> learned on the source domain and transfer the knowledge to
target domain (unsupervised domain adaptation)
Train: Market1501 Test: DUKE
Challenges:
• Source and target domains have unknown camera viewing conditions
• The identity/class between source and target domains are non-overlapping
therefore presents a more challenging open-set recognition problem
-> Transferring knowledge of the source domain to target domain in
attribute space
Wang J, Zhu X, Gong S, Li W. Transferable joint attribute-identity deep learning
for unsupervised person re-identification. CVPR,2018
Unsupervised Person Re-Identification
Wang J, Zhu X, Gong S, Li W. Transferable joint attribute-identity deep learning
for unsupervised person re-identification. CVPR,2018
Unsupervised Person Re-Identification
Unsupervised Target Domain Adaptation
Wang J, Zhu X, Gong S, Li W. Transferable joint attribute-identity deep learning
for unsupervised person re-identification. CVPR,2018
Unsupervised Person Re-Identification
Unsupervised Person Re-Identification
Image-to-image translation method: SPGAN
Deng et al., Image-image domain adaptation with preserved self-similarity
and domain-dissimilarity for person re-identification. CVPR 2018
preserved self-similarity and domain dissimilarity
Unsupervised Person Re-Identification
Image-to-image translation method: CamStyle GAN
Zhong, Zhun, et al. "Camera style adaptation for person re-identification." CVPR. 2018.
10
300
600
900
1200
1500
1800
2100
2400
2700
3000
3300
3600
3900
4200
4500
4800
5100
5400
5700
6000
6300
6600
6900
7200
7500
7800
8100
8400
8700
9000
9300
9600
9900102001050010800
AnnotationCost
Data Size
Challenge For Re-ID
rapid increase
Presentation Outline
• Supervised Person Re-Identification
• Unsupervised Person Re-Identification
• Active Learning for Person Re-Identification
Make AI work in the real world: Human-In-The-Loop
Human-in-the-Loop (HITL) explores human feedback
in an incremental learning cycle of the machine for
rapid model domain adaptation
Active learning is a special case of machine learning in which a learning algorithm
is able to interactively query the user (or some other information source) to obtain
the desired outputs at new data points.
There are three scenarios for Active learning :
1. Membership Query Synthesis: the learner generates/constructs an
instance (from some underlying natural distribution).
2. Stream-Based selective sampling, i.e, each sample is considered separately
in our case for label-querying or rejection. Similarly to online-learning, the
data is not saved, there are no assumptions on data distribution, and
therefore it is adaptive to change.
3. Pool-Based sampling, i.e., sampled are chosen from a pool of
unlabeled data for the purpose of labeling
Training
Pool
Agent
request
label
model
query
Training
Pool
selection
strategy
request
label
model
Active Learning Person Re-Identification
Liu, Z.*, Wang, J *., Gong, S., Lu, H. and Tao, D. Deep Reinforcement
Active Learning for Human-in-the-Loop Person Re-Identification. ICCV,
2019,Oral
Concept
A user annotates few informative pedestrian pairs recommended
by an adaptive agent in a human-in-the-loop learning
mechanism
Re-ID
Model
Sample Selection
(agent)
annotator
Pairwise Data
human-in-the-loop
Agent
action!"
query
ancho
r
query for label
unlabeled gallery
pool
state
reward
Goal: Sample Informative Pair
Action: Select One Sample at Each Step
State: Reflect Sample Correlation
Reward: Uncertainty
Liu, Z.*, Wang, J *., Gong, S., Lu, H. and Tao, D. Deep Reinforcement
Active Learning for Human-in-the-Loop Person Re-Identification. ICCV,
2019,Oral
STATE
annotator
Re-ID Loss(Triplet)
REWARD
ACTION
Sample Selection Strategy
gallery pool
query
q
…
g
1
g
2
g
N
0 0.83 0.71 0.66 0.47 0.36
0.83 0 0.85 0 0.87 0
0.71 0.85 0 0 0 0
0.66 0 0 0 0 0
0.47 0.87 0 0 0 0.77
0.36 0 0 0 0.77 0
gKq
Methodology
Joint Reinforcement Active Learning in A Deep Network
false
CNN
!
Agent
"
Liu, Z.*, Wang, J *., Gong, S., Lu, H. and Tao, D. Deep Reinforcement
Active Learning for Human-in-the-Loop Person Re-Identification. ICCV,
2019,Oral
Action: Select One Sample at Each Step
State: We construct a sparse similarity graph among query and gallery samples and take
it as the state value (Reflect Sample Correlation)
1. Base CNN Network
2. A Deep Reinforced Active Learner - An Agent
Methodology
An example of state updating with different human feedback
Reward: we perform a similar hard triplet loss to measure the uncertainty of data.
Dataset & Result (Market-1501)
Supervised Transfer unsupervised
87.95
84.2
42.79
Ours
0.15%
annotated
100%
annotated
0% annotated
73.25
66.26
20.04
R-1 mAP
Deep learning for person re-identification
Presentation Outline
Link Person Re-Identification with ….
• Attribute Learning
• Detection (Person Search )
• Tracking (Multi-target multi-camera tracking)
Attribute recognition usually denotes local structures of a person
Person Re-Identification and Attribute Learning
Ø How do human brain match person?
Long hair
bag
31
Attribute recognition usually denotes local structures of a person
Person Re-Iden3fica3on and A6ribute Learning
Ø How do human brain match person?
Long hair
bag
Attribute Recognition in in Surveillance
ØChallenges:
• Poor image quality
• Complex background clutter
• Uncontrolled viewing conditions
• Small number of labelled training
ØMain idea:
•Discover the interdependency and correlation among
attributes
•Explore visual context as an extra information source to
assist attribute recognition
ØContributions:
•A novel end-to-end encoder-decoder architecture capable
of jointly learning image level context and attribute level
sequential correlation
•Exploit more latent and richer higher-order dependency
among attributes
Wang, J, et al. "Attribute recognition by joint recurrent learning of
context and correlation." ICCV. 2017
Attribute Recognition in in Surveillance
Person Re-Identification and Attribute Recognition
Lin, Yutian, et al. "Improving person re-identification by attribute and
identity learning." Pattern Recognition (2019).
Attribute-based Person Re-Identification
35
•Teenager
•Backpack
•Pants
•Short bottom wear
•Short top wear
•Long hair
•Female
•Top white
•Bottom blue
Ranked
retrieval
results
Query attribute descriptions
Gallery images
Yin, Zhou, et al. "Adversarial attribute-image person re-identification." IJCAI, 2018.
Attribute-based Person Re-Identification
Presentation Outline
Link Person Re-Identification with ….
• Attribute Learning
• Detection (Person Search )
• Tracking (Multi-target multi-camera tracking)
Person Re-Identification and Detection
Zheng, L., Yang, Y., & Hauptmann, A. G. (2016). Person re-identification: Past, present and future.
Person Re-Identification Datasets
Detection
Xiao, Tong, et al. "Joint detection and identification feature learning for
person search." CVPR. 2017
Person Re-Identification and Detection
Liu, Hao, et al. "Neural person search machines." ICCV. 2017.
Person Re-Identification and Detection
Presentation Outline
Link Person Re-Identification with ….
• Attribute Learning
• Detection (Person Search )
• Tracking (Multi-target multi-camera tracking)
Multi-target multi-camera tracking
1st MTMCT and ReID workshop CVPR 2017
2nd MTMCT and ReID workshop CVPR 2019
Duke MTMC (Multi-Target, Multi-Camera) dataset
Multi-target multi-camera tracking
Ristani, Ergys, and Carlo Tomasi. "Features for multi-target multi-
camera tracking and re-identification." CVPR. 2018.
Conclusion
• Supervised Person Re-Identification
• Unsupervised Person Re-Identification
• Active Learning for Person Re-Identification
Link Person Re-Identification with ….
Ø Attribute Learning
Ø Detection (Person Search )
Ø Tracking (Multi-target multi-camera tracking)
Thank you

More Related Content

PPTX
Object recognition of CIFAR - 10
PDF
diffusion 모델부터 DALLE2까지.pdf
PDF
PR-284: End-to-End Object Detection with Transformers(DETR)
PPTX
EfficientNet
PDF
Introduction to Diffusion Models
PDF
Wasserstein GAN 수학 이해하기 I
DOCX
Packet sniffer repot
PPTX
Human Pose Estimation by Deep Learning
Object recognition of CIFAR - 10
diffusion 모델부터 DALLE2까지.pdf
PR-284: End-to-End Object Detection with Transformers(DETR)
EfficientNet
Introduction to Diffusion Models
Wasserstein GAN 수학 이해하기 I
Packet sniffer repot
Human Pose Estimation by Deep Learning

What's hot (20)

PPTX
Object recognition
PDF
Object Detection Using R-CNN Deep Learning Framework
PDF
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
PDF
Convolutional Neural Networks (CNN)
PPTX
Object Detection using Deep Neural Networks
PPTX
Object detection presentation
PPTX
Object Detection & Tracking
PPTX
PPTX
Object detection
PDF
Feature Extraction
PPTX
PPT
Image segmentation
PDF
Image-to-Image Translation with Conditional Adversarial Nets (UPC Reading Group)
PDF
Introduction to object detection
PPTX
Deep re-id: 关于行人重识别的深度学习方法
PPTX
Image classification using CNN
PPTX
Object Recognition
PPTX
PPTX
CNN Tutorial
PPTX
Recurrent Neural Network (RNN) | RNN LSTM Tutorial | Deep Learning Course | S...
Object recognition
Object Detection Using R-CNN Deep Learning Framework
"Semantic Segmentation for Scene Understanding: Algorithms and Implementation...
Convolutional Neural Networks (CNN)
Object Detection using Deep Neural Networks
Object detection presentation
Object Detection & Tracking
Object detection
Feature Extraction
Image segmentation
Image-to-Image Translation with Conditional Adversarial Nets (UPC Reading Group)
Introduction to object detection
Deep re-id: 关于行人重识别的深度学习方法
Image classification using CNN
Object Recognition
CNN Tutorial
Recurrent Neural Network (RNN) | RNN LSTM Tutorial | Deep Learning Course | S...
Ad

Similar to Deep learning for person re-identification (20)

PDF
Real Time Object Detection with Audio Feedback using Yolo v3
PDF
Backbone can not be trained at once rolling back to pre trained network for p...
PDF
Object Classification and recognition using deep learning methods
PPTX
PDF
Survey on Human Behavior Recognition using CNN
PDF
MediaEval 2018 Pixel Privacy: Task Overview
PDF
Study of assessment of cognitive ability
PPTX
The deep learning technology on coco framework
PDF
Age and gender detection using deep learning
PDF
Paper of Final Year Project.pdf
PDF
Hierarchical cross network for person re identification
PPTX
presentation.pptxpresentation.pptxpresentation.pptx
PDF
Adversarial Multi Scale Features Learning for Person Re Identification
PDF
(2017/06)Practical points of deep learning for medical imaging
PDF
Global-local attention with triplet loss and label smoothed cross-entropy for...
PDF
Beyond Frontal Faces: Improving Person Recognition Using Multiple Cues
PDF
Paper id 25201471
PDF
IRJET- Comparative Analysis of Video Processing Object Detection
PPTX
AGE AND GENDER DETECTION USING DEEP LEARNING.pptx
PDF
Hardoon Image Ranking With Implicit Feedback From Eye Movements
Real Time Object Detection with Audio Feedback using Yolo v3
Backbone can not be trained at once rolling back to pre trained network for p...
Object Classification and recognition using deep learning methods
Survey on Human Behavior Recognition using CNN
MediaEval 2018 Pixel Privacy: Task Overview
Study of assessment of cognitive ability
The deep learning technology on coco framework
Age and gender detection using deep learning
Paper of Final Year Project.pdf
Hierarchical cross network for person re identification
presentation.pptxpresentation.pptxpresentation.pptx
Adversarial Multi Scale Features Learning for Person Re Identification
(2017/06)Practical points of deep learning for medical imaging
Global-local attention with triplet loss and label smoothed cross-entropy for...
Beyond Frontal Faces: Improving Person Recognition Using Multiple Cues
Paper id 25201471
IRJET- Comparative Analysis of Video Processing Object Detection
AGE AND GENDER DETECTION USING DEEP LEARNING.pptx
Hardoon Image Ranking With Implicit Feedback From Eye Movements
Ad

More from 哲东 郑 (20)

PDF
Cross-domain complementary learning with synthetic data for multi-person part...
PDF
Step zhedong
PPTX
Visual saliency
PDF
Image Synthesis From Reconfigurable Layout and Style
PPTX
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval
PPTX
Weijian image retrieval
PPTX
Scops self supervised co-part segmentation
PPTX
Video object detection
PDF
Center nets
PPTX
C2 ae open set recognition
PPTX
Sota semantic segmentation
PPTX
Deep randomized embedding
PPTX
Semantic Image Synthesis with Spatially-Adaptive Normalization
PPTX
Instance level facial attributes transfer with geometry-aware flow
PPTX
Learning to adapt structured output space for semantic
PPTX
Unsupervised Learning of Object Landmarks through Conditional Image Generation
PPTX
Graph based global reasoning networks
PPTX
Style gan
PDF
Vi2vi
PPTX
Variational Discriminator Bottleneck
Cross-domain complementary learning with synthetic data for multi-person part...
Step zhedong
Visual saliency
Image Synthesis From Reconfigurable Layout and Style
Polysemous Visual-Semantic Embedding for Cross-Modal Retrieval
Weijian image retrieval
Scops self supervised co-part segmentation
Video object detection
Center nets
C2 ae open set recognition
Sota semantic segmentation
Deep randomized embedding
Semantic Image Synthesis with Spatially-Adaptive Normalization
Instance level facial attributes transfer with geometry-aware flow
Learning to adapt structured output space for semantic
Unsupervised Learning of Object Landmarks through Conditional Image Generation
Graph based global reasoning networks
Style gan
Vi2vi
Variational Discriminator Bottleneck

Recently uploaded (20)

PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Dropbox Q2 2025 Financial Results & Investor Presentation
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
PPTX
Cloud computing and distributed systems.
PDF
KodekX | Application Modernization Development
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Encapsulation theory and applications.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PDF
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
PPTX
MYSQL Presentation for SQL database connectivity
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Digital-Transformation-Roadmap-for-Companies.pptx
Chapter 3 Spatial Domain Image Processing.pdf
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Dropbox Q2 2025 Financial Results & Investor Presentation
20250228 LYD VKU AI Blended-Learning.pptx
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Advanced methodologies resolving dimensionality complications for autism neur...
Profit Center Accounting in SAP S/4HANA, S4F28 Col11
Cloud computing and distributed systems.
KodekX | Application Modernization Development
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Network Security Unit 5.pdf for BCA BBA.
Encapsulation theory and applications.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
MIND Revenue Release Quarter 2 2025 Press Release
Optimiser vos workloads AI/ML sur Amazon EC2 et AWS Graviton
MYSQL Presentation for SQL database connectivity
Agricultural_Statistics_at_a_Glance_2022_0.pdf

Deep learning for person re-identification