SlideShare a Scribd company logo
One Shot Scene Specific Crowd
Counting
Mohammad Asiful Hossain, Mahesh Kumar K, Mehrdad Hosseinzadeh, Omit
Chanda and Yang Wang
Presented by
Hafsa Moontari Ali
ID: 7880835
Course Title: Research Methodology
Winter 2020, University of Manitoba
What is Crowd Counting?
●A technique used to count or estimate the number of people in a crowd.
●The most solution to crowd counting is to actually count the number of people from a
crowd.
●But it becomes difficult when the images of crowd are captured from open areas such as
streets or parks.
2
Sample Image
3
The problem becomes harder...
4
Where are the potential application areas?
●Urban planning
●Surveillance
●Traffic monitoring
●Geo-political analysis
5
Contribution of this paper
●Addressed a novel problem, “one-shot scene-specific crowd counting”.
●Generated a crowd counting model using Deep Learning.
●Significantly outperformed baseline methods.
6
Proposed Approach
●A density map is predicted from the
input static image.
●Each pixel of the density map
indicates the crowd density at the
corresponding location in the image.
●
●Crowd counting is obtained by
summing the entries of the density
map.
7
Model Architecture
●Dilated Convolutional Neural Networks(CSRNet) architecture is used as backbone.
●It employs convolutional neural network to extract features.
●Dilated convolutional neural network generates output from the features.
●The split of encoder/decoder is flexible and application specific.
8
Model Architecture
Figure 1: One-shot scene-specific adaptation using CSRNet
9
Model Learning
●During training, a collection of labeled training images are used.
●Each scene might correspond to a camera fixed at a particular location.
●It is assumed that each scene has same number of N training images.
●The model can be generalized where different scenes have different number of training
images.
●During training, the model learns the parameters of the encoder network.
10
One Shot Scene Specific Adaptation
●During testing, the crowd counting algorithm is deployed in a specific target scene.
●In this paper, one-shot learning is applied by fine tuning the decoder network.
●The distance between predicted density map and ground truth density map is considered
as loss function.
●Fine tuning is done by computing the gradient of the distance.
●The model is effectively tuned to the target scene.
11
Experimental Results
T
Table 1: Comparison of the performance (MAE and MSE) of our approach and the baselines on the WorldExpo’10
dataset and Trancos dataset. For “ours” and “simple fine-tuning”, either using the last layer or the last two layers of
CSRNet as the decoder are considered.
12
Cross Dataset Testing
Table 2: Performance in the cross-dataset testing with the same(a,b) and different (c,d) object. “W”, “U”, “M” and
“T” are used to denote WorldExpo’10, UCSD, Mall, Trancos, respectively.
13
Future Work
●This paper attempts to deploy a crowd counting model in real-world application.
●In future, this approach can be extended to few shot learning.
●Meta learning, meta-auxiliary learning can also be employed.
●This approach can be extended for unsupervised learning.
14
15
Any Question?

More Related Content

PDF
M.Sc. Thesis - Automatic People Counting in Crowded Scenes
PPTX
Lecture 18: Gaussian Mixture Models and Expectation Maximization
PDF
Deep Learning and CNN Architectures
PPTX
Machine Learning - Ensemble Methods
PPTX
Machine Learning - Breast Cancer Diagnosis
PPTX
Ensemble methods in machine learning
PPTX
Machine Learning by Analogy
PPTX
Speaker Recognition using Gaussian Mixture Model
M.Sc. Thesis - Automatic People Counting in Crowded Scenes
Lecture 18: Gaussian Mixture Models and Expectation Maximization
Deep Learning and CNN Architectures
Machine Learning - Ensemble Methods
Machine Learning - Breast Cancer Diagnosis
Ensemble methods in machine learning
Machine Learning by Analogy
Speaker Recognition using Gaussian Mixture Model

What's hot (20)

PDF
Computer Vision: Feature matching with RANSAC Algorithm
PPTX
Brain tumor detection using convolutional neural network
PPTX
Convolutional Neural Network for Alzheimer’s disease diagnosis with Neuroim...
DOCX
deep learning applications in medical image analysis brain tumor
PDF
Airline flights delay prediction- 2014 Spring Data Mining Project
PPTX
Fuzzy c means manual work
PPTX
BRAIN TUMOR MRI IMAGE SEGMENTATION AND DETECTION IN IMAGE PROCESSING
PPT
K means Clustering Algorithm
PPTX
Application of-image-segmentation-in-brain-tumor-detection
PPTX
Unsupervised learning clustering
PDF
K-means and GMM
PDF
02 Machine Learning - Introduction probability
PDF
Deep Learning for Computer Vision: Medical Imaging (UPC 2016)
PPTX
K-Nearest Neighbor Classifier
PPT
Chapter 12. Outlier Detection.ppt
PPTX
Machine Learning Unit 1 Semester 3 MSc IT Part 2 Mumbai University
PPTX
Neural ODE
PPT
Support Vector Machines
PDF
L4. Ensembles of Decision Trees
ODP
Introduction to Principle Component Analysis
Computer Vision: Feature matching with RANSAC Algorithm
Brain tumor detection using convolutional neural network
Convolutional Neural Network for Alzheimer’s disease diagnosis with Neuroim...
deep learning applications in medical image analysis brain tumor
Airline flights delay prediction- 2014 Spring Data Mining Project
Fuzzy c means manual work
BRAIN TUMOR MRI IMAGE SEGMENTATION AND DETECTION IN IMAGE PROCESSING
K means Clustering Algorithm
Application of-image-segmentation-in-brain-tumor-detection
Unsupervised learning clustering
K-means and GMM
02 Machine Learning - Introduction probability
Deep Learning for Computer Vision: Medical Imaging (UPC 2016)
K-Nearest Neighbor Classifier
Chapter 12. Outlier Detection.ppt
Machine Learning Unit 1 Semester 3 MSc IT Part 2 Mumbai University
Neural ODE
Support Vector Machines
L4. Ensembles of Decision Trees
Introduction to Principle Component Analysis
Ad

Similar to One shot scene specific crowd counting (20)

PPTX
crowd counting.pptx
PPTX
Dimension Reduction And Visualization Of Large High Dimensional Data Via Inte...
PDF
Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detec...
PDF
Web image annotation by diffusion maps manifold learning algorithm
PDF
Introducing New Parameters to Compare the Accuracy and Reliability of Mean-Sh...
PDF
Fuzzy Entropy Based Optimal Thresholding Technique for Image Enhancement
PDF
Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
PDF
Human detection in hours of
PDF
12 SuperAI on Supercomputers
PDF
International Journal of Pharmaceutical Science Invention (IJPSI)
ODT
Probability and random processes project based learning template.pdf
PDF
HUMAN DETECTION IN HOURS OF DARKNESS USING GAUSSIAN MIXTURE MODEL ALGORITHM
PDF
HUMAN DETECTION IN HOURS OF DARKNESS USING GAUSSIAN MIXTURE MODEL ALGORITHM
PPTX
[NS][Lab_Seminar_240909]Sparse Multi-Relational Graph Convolutional Network f...
PPTX
Review of MVSNet(2018)_250110_OJung.pptx
PPT
Where Next
PDF
CLUSTERING HYPERSPECTRAL DATA
PDF
Fuzzy entropy based optimal
PDF
Face recognition using gaussian mixture model & artificial neural network
PPTX
Viii sem
crowd counting.pptx
Dimension Reduction And Visualization Of Large High Dimensional Data Via Inte...
Locate, Size and Count: Accurately Resolving People in Dense Crowds via Detec...
Web image annotation by diffusion maps manifold learning algorithm
Introducing New Parameters to Compare the Accuracy and Reliability of Mean-Sh...
Fuzzy Entropy Based Optimal Thresholding Technique for Image Enhancement
Point-GNN: Graph Neural Network for 3D Object Detection in a Point Cloud
Human detection in hours of
12 SuperAI on Supercomputers
International Journal of Pharmaceutical Science Invention (IJPSI)
Probability and random processes project based learning template.pdf
HUMAN DETECTION IN HOURS OF DARKNESS USING GAUSSIAN MIXTURE MODEL ALGORITHM
HUMAN DETECTION IN HOURS OF DARKNESS USING GAUSSIAN MIXTURE MODEL ALGORITHM
[NS][Lab_Seminar_240909]Sparse Multi-Relational Graph Convolutional Network f...
Review of MVSNet(2018)_250110_OJung.pptx
Where Next
CLUSTERING HYPERSPECTRAL DATA
Fuzzy entropy based optimal
Face recognition using gaussian mixture model & artificial neural network
Viii sem
Ad

Recently uploaded (20)

PDF
Nykaa-Strategy-Case-Fixing-Retention-UX-and-D2C-Engagement (1).pdf
PPTX
3RD-Q 2022_EMPLOYEE RELATION - Copy.pptx
PPTX
Tablets And Capsule Preformulation Of Paracetamol
PPTX
An Unlikely Response 08 10 2025.pptx
PPTX
Introduction-to-Food-Packaging-and-packaging -materials.pptx
PPTX
2025-08-10 Joseph 02 (shared slides).pptx
PPTX
nose tajweed for the arabic alphabets for the responsive
PPTX
Module_4_Updated_Presentation CORRUPTION AND GRAFT IN THE PHILIPPINES.pptx
DOC
LSTM毕业证学历认证,利物浦大学毕业证学历认证怎么认证
PPTX
Human Mind & its character Characteristics
PPTX
MERISTEMATIC TISSUES (MERISTEMS) PPT PUBLIC
PPTX
The Effect of Human Resource Management Practice on Organizational Performanc...
PPTX
fundraisepro pitch deck elegant and modern
PPTX
Hydrogel Based delivery Cancer Treatment
PPTX
Project and change Managment: short video sequences for IBA
PPTX
chapter8-180915055454bycuufucdghrwtrt.pptx
PPTX
Impressionism_PostImpressionism_Presentation.pptx
PPTX
Intro to ISO 9001 2015.pptx wareness raising
PPTX
NORMAN_RESEARCH_PRESENTATION.in education
PPT
First Aid Training Presentation Slides.ppt
Nykaa-Strategy-Case-Fixing-Retention-UX-and-D2C-Engagement (1).pdf
3RD-Q 2022_EMPLOYEE RELATION - Copy.pptx
Tablets And Capsule Preformulation Of Paracetamol
An Unlikely Response 08 10 2025.pptx
Introduction-to-Food-Packaging-and-packaging -materials.pptx
2025-08-10 Joseph 02 (shared slides).pptx
nose tajweed for the arabic alphabets for the responsive
Module_4_Updated_Presentation CORRUPTION AND GRAFT IN THE PHILIPPINES.pptx
LSTM毕业证学历认证,利物浦大学毕业证学历认证怎么认证
Human Mind & its character Characteristics
MERISTEMATIC TISSUES (MERISTEMS) PPT PUBLIC
The Effect of Human Resource Management Practice on Organizational Performanc...
fundraisepro pitch deck elegant and modern
Hydrogel Based delivery Cancer Treatment
Project and change Managment: short video sequences for IBA
chapter8-180915055454bycuufucdghrwtrt.pptx
Impressionism_PostImpressionism_Presentation.pptx
Intro to ISO 9001 2015.pptx wareness raising
NORMAN_RESEARCH_PRESENTATION.in education
First Aid Training Presentation Slides.ppt

One shot scene specific crowd counting

  • 1. One Shot Scene Specific Crowd Counting Mohammad Asiful Hossain, Mahesh Kumar K, Mehrdad Hosseinzadeh, Omit Chanda and Yang Wang Presented by Hafsa Moontari Ali ID: 7880835 Course Title: Research Methodology Winter 2020, University of Manitoba
  • 2. What is Crowd Counting? ●A technique used to count or estimate the number of people in a crowd. ●The most solution to crowd counting is to actually count the number of people from a crowd. ●But it becomes difficult when the images of crowd are captured from open areas such as streets or parks. 2
  • 4. The problem becomes harder... 4
  • 5. Where are the potential application areas? ●Urban planning ●Surveillance ●Traffic monitoring ●Geo-political analysis 5
  • 6. Contribution of this paper ●Addressed a novel problem, “one-shot scene-specific crowd counting”. ●Generated a crowd counting model using Deep Learning. ●Significantly outperformed baseline methods. 6
  • 7. Proposed Approach ●A density map is predicted from the input static image. ●Each pixel of the density map indicates the crowd density at the corresponding location in the image. ● ●Crowd counting is obtained by summing the entries of the density map. 7
  • 8. Model Architecture ●Dilated Convolutional Neural Networks(CSRNet) architecture is used as backbone. ●It employs convolutional neural network to extract features. ●Dilated convolutional neural network generates output from the features. ●The split of encoder/decoder is flexible and application specific. 8
  • 9. Model Architecture Figure 1: One-shot scene-specific adaptation using CSRNet 9
  • 10. Model Learning ●During training, a collection of labeled training images are used. ●Each scene might correspond to a camera fixed at a particular location. ●It is assumed that each scene has same number of N training images. ●The model can be generalized where different scenes have different number of training images. ●During training, the model learns the parameters of the encoder network. 10
  • 11. One Shot Scene Specific Adaptation ●During testing, the crowd counting algorithm is deployed in a specific target scene. ●In this paper, one-shot learning is applied by fine tuning the decoder network. ●The distance between predicted density map and ground truth density map is considered as loss function. ●Fine tuning is done by computing the gradient of the distance. ●The model is effectively tuned to the target scene. 11
  • 12. Experimental Results T Table 1: Comparison of the performance (MAE and MSE) of our approach and the baselines on the WorldExpo’10 dataset and Trancos dataset. For “ours” and “simple fine-tuning”, either using the last layer or the last two layers of CSRNet as the decoder are considered. 12
  • 13. Cross Dataset Testing Table 2: Performance in the cross-dataset testing with the same(a,b) and different (c,d) object. “W”, “U”, “M” and “T” are used to denote WorldExpo’10, UCSD, Mall, Trancos, respectively. 13
  • 14. Future Work ●This paper attempts to deploy a crowd counting model in real-world application. ●In future, this approach can be extended to few shot learning. ●Meta learning, meta-auxiliary learning can also be employed. ●This approach can be extended for unsupervised learning. 14