SlideShare a Scribd company logo
VIVA-Tech International Journal for Research and Innovation Volume 1, Issue 4 (2021)
ISSN(Online): 2581-7280 Article No. X
PP XX-XX
VIVA Institute of Technology
9th
National Conference on Role of Engineers in Nation Building – 2021 (NCRENB-2021)
1
www.viva-technology.org/New/IJRI
People Monitoring and Mask Detection using Real-time video
analyzing
Yogesh Gowari1
, Ritik Gaikwad2
, Aniket Gurav3
, Prof. Vinit Raut4
1,2,3,4
(Computer Engineering, VIVA Institute of Technology, India)
Abstract: People Counting and mask detection based on video is an important field in a Computer Vision.
There is growing interest in video-based solutions for people monitoring and counting in business and security
applications using Computer Vision technology. It has been effectively used in many Artificial Intelligence
fields. Compareing to normal sensor based solutions the one with video based allows more flexible
performance, improved functionalities with lower costs. The system with people counter program requires more
processing because that deals with real-time video, so this particular proposed technique converts a color
image into binary in order to minimize data of image. Reducing processing time is an important term in
Software Engineering to build a good working system. People counting methods based on head detection and
tracking to evaluate the total number of people who move under an overhead camera and check whether that
people are wearing a mask or not. There basically four main features in this proposed system: People counting,
Mask detection, Alarm alert and Scan ID. Based on tracking of head, this method uses the crossing-line
judgment to determine whether the particular head object will get counted or not to be counted. The two main
challenges overcome in this system are: tough estimation of the background scene and the number of persons in
merge split scenarios. A technique for masked face detection using three different steps of estimating eye line
detection, facial part detection and eye detection is used in this system. On exceeding the count of people or in
case mask is not worn then alarm gets alerted
Keywords - Convolution Neural Network, MobileNet SSD, Dataset
I. INTRODUCTION
Public safety has become a very major problem in areas like malls, railway stations and streets during
festive seasons, concerts etc. during any pandemic situation. The massive disasters that happen worldwide
include numerous instances of fatality where people gather in crowds. An efficient automated system to manage
the crowd count is essential. People head tracking provides a way to detect the position, to obtain the motion
trail and to maintain identities of persons in the scene. Managing a crowd of varying densities involves detection
of the individual humans in the crowd. In a high density crowd, because of inter-object closure, detection and
tracking of humans in the crowd will be a challenge in the computer vision field. This system focuses on
training a model for human head detection by some positive samples and negative samples. The trained model is
then used to process the video frames in which the human heads are detected and the count of humans in the
scenario is provided. It also detects whether people are wearing a mask or not. If people are not wearing then the
alarm gets alerted, the same alerting happens when the number of people gathering exceeds. This system can be
used in malls or any other places where crowd should be minimum.
VIVA-Tech International Journal for Research and Innovation Volume 1, Issue 4 (2021)
ISSN(Online): 2581-7280 Article No. X
PP XX-XX
VIVA Institute of Technology
9th
National Conference on Role of Engineers in Nation Building – 2021 (NCRENB-2021)
2
www.viva-technology.org/New/IJRI
II. RELATED WORK
Mingjie Jiang [1], In the proposed paper and high accuracy and efficient mask detector i.e.an Retina
Facemask is proposed which is a one-stage detector, which consists of a feature pyramid network and a module
to focus on detecting face masks. Misbah Ahmad [2], in this paper, a deep neural network model SSD(Single
Shot multibox Detector ) is explored to solve problems like perspective distortion, variations in human pose,
size or orientation; which gives a better accuracy.
M. Martínez-Zarzuela [3], An approach for AdaBoost face detection using haar features in GPU. The
GPU speeds up the performance and a better video resolution. As CPU is ideal it can perform some other
computer Vision tasks. Akshay Mangawati [4], This paper elaborates the exhaustive survey of various object
tracking algorithms under different environmental conditions and identifies efficient algorithms in various types
of tracking.
J. Grönman[5], This paper presents a real-life use case of collecting statistics about bus passengers on a
free-to-ride bus route. The use case utilized cost-effective and off-the-shelf components. Prof. P Y Kumbhar[6],
In this paper the author presents that face detection is an computer technology that determines location and sizes
of human faces in digital image, which detects face and ignores other things like buildings, trees or bodies.
Locating and tracking human faces is a prerequisite face recognition analysis.
RafaelMuñoz-Salinas[7], This work presents a system able to visually detect and track multiple people
using a stereo camera placed at an under-head position. This camera position is especially appropriated for
human–machine applications that require interacting with people or to analyze human facial gestures. Tracking
based exclusively on position information is unreliable when people establish close interactions Thus, we also
include colour information about the people clothes in order to increase the tracking robustness. Zebin Cai [8],
In this paper, we propose a people counting method in crowded scenes by detection the head information from
the video taken from a camera installed straight down on the ceiling.is proposed for people detection.
Combining the head detection and tracking together, a people counting strategy is presented to count the number
of the people in the video frames.
Heemoon Yoon[9], Within this paper the aim is to develop an user friendly Graphical Framework for
object detection API on TensorFlow which is called TensorFlow Graphical Framework (TF-GraF). The TF-
GraF provides independent virtual environments according to user accounts in server-side, additionally,
execution of data preprocessing, training, and evaluation without CLI in client-side.Since TF-GraF takes care of
setting and configuration, it allows anyone to use deep learning technology for their project without spending
time to install complex software and environment. Gretchel Karen L. Alcantara[10], In this paper, the
researchers familiarizes and exposes themselves with OpenCV. OpenCV is an open source computer vision
library that is written in C and C++. First the group aims to have a deeper knowledge and understanding about
head detection and tracking using OpenCV.
S. Syed Ameer Abbas[11], In this paper, we propose a method to manage the crowd by keeping in
track the count of the people in the scene. In our study, we develop a system using Raspberry Pi 3 board that
consists of ARMv8 CPU that detects the human heads and provide a count of humans in the region using
OpenCV-Python. Fabio Dittrich[12], The author presents two novel approaches for people counting in crowded
and open environments that combines the information gathered by multiple views.
III. METHODOLOGY
The Crowd Monitoring and Mask Detection is a simple system used for people counting and detection
of mask in crowded places. This system uses Convolution Neural Network (CNN), which is an image
classification algorithm as well as MobileNet SSD which is used for the same. CNN is made up of neurons, each
having an independent weight assigned to it. CNN is a class of deep neural networks specially used for image
recognition and image processing. MobileNet is a simple but efficient and not very intensive convolutional
neural network for mobile vision applications. MobileNet is widely used in many real-world applications which
include fine-grained classifications, object detection, face attributes, and localization. CNN takes the input as an
image, identifies and assigns priority to various features of the image and it differentiates the features from one
another. Mobilenet is a neural network that is used for classification and recognition whereas the SSD is a
framework that is used to realize the multi detector. Only the combination of both can do object detection. SSD
can be interchanged with RCNN. The preprocessing required for CNN is less and has the ability to learn image
characteristics. CNN consists of several sets of convolution layers, pooling layers, flatten and dense. The sets of
convolution and pooling layers are used for feature extraction and the number of such sets may vary.
Convolution layer is the basic building block of the CNN and is used for extracting features from an input
image. The proposed system uses Convolution model which consists of multiple layers for the purpose of
feature extraction from the image. Training data is provided to the model for better prediction of people wearing
VIVA-Tech International Journal for Research and Innovation Volume 1, Issue 4 (2021)
ISSN(Online): 2581-7280 Article No. X
PP XX-XX
VIVA Institute of Technology
9th
National Conference on Role of Engineers in Nation Building – 2021 (NCRENB-2021)
3
www.viva-technology.org/New/IJRI
a mask or not. The classification of people wearing mask, the input video is converted into frames and then into
RGB format and then is flattened in matrix to extract the information by convolution layer. Multiple
convolutional layers used to provide better predictions with higher accuracy. Figure 1 represents the mask
detection system flow using CNN with MobileNet algorithm is used in this system as it consumes less data
processing time. The testing of the module is done using real time images of people with masks and no mask to
reflect the accuracy of the model. Hence, the model classifies the real time people counting and masks detection
in an efficient way.
Figure 1: System Flow Diagram
IV. CONCLUSION
The mask detection using CNN with MobileNet algorithm is used in this system as it consumes less
data processing time. This System presents a people counting system as a way to manage crowds by keeping the
count of people. Keeping in mind the Pandemic situation Mask-Detection feature is added if the count exceeds
the prohibited count or if the model recognizes whether people are not wearing masks then the alarm gets
alerted. This system will reduce the time taken for humans for counting or checking purposes and ensure them,
this work is done by the system itself in no time. By this model human errors will be reduced to great extents as
the system itself gets trained through large datasets. This process requires comparatively less time and provides
great accuracy. As the system trains itself by doing the same tasks of mask detection so that there is less loss and
provides a better accuracy. As this system is still under progress so we can’t predict accurate accuracy but it
offers better accuracy.
REFERENCES
[1] Mingjie Jiang, Xinqi Fan ―RetinaMask: A Face Mask detector,2020, 7th International conference on
Artificial Intelligience, IEEE, 2020, pg.9.
[2] M. Ahmad, I. Ahmed, K. Ullah, I. Khan, A. Khattak and A. Adnan, ―Energy efficient camera solution
for video surveillance‖, International Journal of Advanced Computer Science and Applications, vol. 10,
no. 3, IEEE, 2019, pg.2.
VIVA-Tech International Journal for Research and Innovation Volume 1, Issue 4 (2021)
ISSN(Online): 2581-7280 Article No. X
PP XX-XX
VIVA Institute of Technology
9th
National Conference on Role of Engineers in Nation Building – 2021 (NCRENB-2021)
4
www.viva-technology.org/New/IJRI
[3] Mario Martínez Zarzuela, Francisco Javier Díaz-Pernas, Miriam Antón-Rodríguez, ―AdaBoost Face
Detection on the GPU Using Haar-Like Features‖, Proceedings of the 4th international conference on
Interplay between natural and artificial computation: new challenges on bioinspired applications-Volume
III, IEEE, 2018, pg.9
[4] Akshay Mangawati; Mohana; Mohammed Leesan; H.V. Ravish Aradhya ―Object Tracking Algorithms
for Video Surveillance Applications‖, 2018 International Conference on Object detection, and motion
sensor, IEEE, 2018, pg.6.
[5] J. Grönman; P. Sillberg; P. Rantanen; M. Saari;” People Counting in a Public Event—Use Case: Free-to-
Ride Bus”,IEEE,2019.
[6] Prof. P Y Kumbhar1 , Mohammad Attaullah2 , Shubham Dhere3 , Shivkumar Hipparagi:” REAL TIME
FACE DETECTION AND TRACKING USING OPENCV”,2019.
[7] RafaelMuñoz-SalinasaEugenioAguirrebMiguelGarcía-Silventeb “People detection and tracking using
stereo vision and color”,2007.
[8] Zebin Cai; Zhu Liang Yu; Hao Liu; Ke Zhang “Counting People in Crowded Scenes by Video
Analyzing”,IEEE,2014.
[9] Heemoon Yoon, Sang-Hee Lee, Mira Park,” TensorFlow with user friendly Graphical Framework for
object detection API”,2020.
[10] Gretchel Karen L. Alcantara; Ivan Darren J. Evangelista; Jerome Vincent B. Malinao; Ofelia B. Ong;
Reginald Steven DM. Rivera “Head Detection and Tracking Using OpenCV”,IEEE,2018.
[11] S. Syed Ameer Abbas; P. Oliver Jayaprakash; M. Anitha; X. Vinitha Jaini “Crowd Detection and
Management using Cascade classifier on ARMv8 and OpenCV-Python”,IEEE,2017.
[12] Fabio Dittrich, Luiz E. S. de Oliveira, Alceu S. Britto Jr. and Alessandro L. Koerich “People Counting in
Crowded and Outdoor Scenes using a Hybrid Multi-Camera Approach”,IEEE,2019.

More Related Content

PDF
Sonsik seminar mask-detection_yagya
PDF
Face Mask Detection And Attendance System
PPTX
Face mask detection
DOCX
Face mask detection using convolutional neural networks article
PDF
IEEE 2013 - 2014 Embedded and Android Titles
PPTX
Deep fakes and beyond
PDF
An efficient system for real time fatigue detection
PDF
A Robust Embedded Based String Recognition for Visually Impaired People
Sonsik seminar mask-detection_yagya
Face Mask Detection And Attendance System
Face mask detection
Face mask detection using convolutional neural networks article
IEEE 2013 - 2014 Embedded and Android Titles
Deep fakes and beyond
An efficient system for real time fatigue detection
A Robust Embedded Based String Recognition for Visually Impaired People

What's hot (19)

PPTX
Quadcopter for Monitoring and Detection
PPTX
Image attendance system
PPT
Face recognition using laplacianfaces
PDF
IRJET - Real-Time Analysis of Video Surveillance using Machine Learning a...
PPTX
Face recognization using artificial nerual network
PDF
An Improved Tracking Using IMU and Vision Fusion for Mobile Augmented Reality...
PDF
A Novel Biometric Approach for Authentication In Pervasive Computing Environm...
PDF
Graduation Project - Face Login : A Robust Face Identification System for Sec...
PDF
A New Proposal for Smartphone Based Drowsiness Detection and Warning System f...
PDF
Image recognition technology (Medical Presentation)
PDF
Report face recognition : ArganRecogn
PDF
M phil-computer-science-biometric-system-projects
PPTX
human face detection using matlab
PPTX
FACE RECOGNITION USING NEURAL NETWORK
PDF
TRANSFER LEARNING WITH CONVOLUTIONAL NEURAL NETWORKS FOR IRIS RECOGNITION
DOC
Face recognition using laplacianfaces
PPTX
Deep learning on face recognition (use case, development and risk)
PPTX
Computer Vision - Real Time Face Recognition using Open CV and Python
PDF
Quadcopter for Monitoring and Detection
Image attendance system
Face recognition using laplacianfaces
IRJET - Real-Time Analysis of Video Surveillance using Machine Learning a...
Face recognization using artificial nerual network
An Improved Tracking Using IMU and Vision Fusion for Mobile Augmented Reality...
A Novel Biometric Approach for Authentication In Pervasive Computing Environm...
Graduation Project - Face Login : A Robust Face Identification System for Sec...
A New Proposal for Smartphone Based Drowsiness Detection and Warning System f...
Image recognition technology (Medical Presentation)
Report face recognition : ArganRecogn
M phil-computer-science-biometric-system-projects
human face detection using matlab
FACE RECOGNITION USING NEURAL NETWORK
TRANSFER LEARNING WITH CONVOLUTIONAL NEURAL NETWORKS FOR IRIS RECOGNITION
Face recognition using laplacianfaces
Deep learning on face recognition (use case, development and risk)
Computer Vision - Real Time Face Recognition using Open CV and Python
Ad

Similar to People Monitoring and Mask Detection using Real-time video analyzing (20)

PDF
IRJET- Comparative Study of Different Techniques for Text as Well as Object D...
PDF
Crime Detection using Machine Learning
PDF
Application To Monitor And Manage People In Crowded Places Using Neural Networks
PDF
Social distance and face mask detector system exploiting transfer learning
PDF
Intelligent System For Face Mask Detection
PDF
Review of Pose Recognition Systems
PDF
Inspection of Suspicious Human Activity in the Crowd Sourced Areas Captured i...
PDF
IRJET- Car Defect Detection using Machine Learning for Insurance
PDF
CRIMINAL IDENTIFICATION FOR LOW RESOLUTION SURVEILLANCE
PDF
Object and Currency Detection for the Visually Impaired
PDF
Covid Mask Detection and Social Distancing Using Raspberry pi
PDF
A NOVEL BIOMETRIC APPROACH FOR AUTHENTICATION IN PERVASIVE COMPUTING ENVIRONM...
PDF
Advanced Computational Intelligence: An International Journal (ACII)
PDF
A Novel Biometric Approach for Authentication In Pervasive Computing Environm...
PDF
FACE MASK DETECTION USING MACHINE LEARNING AND IMAGE PROCESSING
PDF
Toddler monitoring system in vehicle using single shot detector-mobilenet and...
PDF
SMART SURVEILLANCE SYSTEM USING LBPH ALGORITHM
PDF
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...
DOCX
Image Recognition Expert System based on deep learning
PDF
Face Mask Detection System Using Artificial Intelligence
IRJET- Comparative Study of Different Techniques for Text as Well as Object D...
Crime Detection using Machine Learning
Application To Monitor And Manage People In Crowded Places Using Neural Networks
Social distance and face mask detector system exploiting transfer learning
Intelligent System For Face Mask Detection
Review of Pose Recognition Systems
Inspection of Suspicious Human Activity in the Crowd Sourced Areas Captured i...
IRJET- Car Defect Detection using Machine Learning for Insurance
CRIMINAL IDENTIFICATION FOR LOW RESOLUTION SURVEILLANCE
Object and Currency Detection for the Visually Impaired
Covid Mask Detection and Social Distancing Using Raspberry pi
A NOVEL BIOMETRIC APPROACH FOR AUTHENTICATION IN PERVASIVE COMPUTING ENVIRONM...
Advanced Computational Intelligence: An International Journal (ACII)
A Novel Biometric Approach for Authentication In Pervasive Computing Environm...
FACE MASK DETECTION USING MACHINE LEARNING AND IMAGE PROCESSING
Toddler monitoring system in vehicle using single shot detector-mobilenet and...
SMART SURVEILLANCE SYSTEM USING LBPH ALGORITHM
IRJET- Review on Human Action Detection in Stored Videos using Support Vector...
Image Recognition Expert System based on deep learning
Face Mask Detection System Using Artificial Intelligence
Ad

More from vivatechijri (20)

PDF
Design and Implementation of Water Garbage Cleaning Robot
PDF
Software Development Using Python Language For Designing Of Servomotor
PDF
GSM Based Controlling and Monitoring System of UPS Battery
PDF
Electrical Drive Based Floor Cleaning Robot
PDF
IoT BASED FIRE EXTINGUISHER SYSTEM with IOT
PDF
Wave Energy Generation producing electricity in future
PDF
Predictive Maintenance of Motor Using Machine Learning
PDF
Development of an Android App For Designing Of Stepper Motor By Kodular Software
PDF
Implementation Technology to Repair Pothole Using Waste Plastic
PDF
NFC BASED VOTING SYSTEM with Electronic voting devices
PDF
Review on Electrical Audit Management in MATLAB Software.
PDF
DESIGN AND FABRICATION OF AUTOMATIC CEMENT PLASTERING MACHINE
PDF
Research on Inspection Robot for Chemical Industry
PDF
Digital Synchroscope using Arduino microcontroller
PDF
BLDC MACHINE DESIGN SOFTWARE AND CALCULATION
PDF
SIMULATION MODEL OF 3 PHASE TRANSMISSION LINE FAULT ANALYSIS
PDF
Automated Water Supply and Theft Identification Using ESP32
PDF
Multipurpose Swimming Pool Cleaning Device for Observation, Cleaning and Life...
PDF
Annapurna – Waste Food Management system
PDF
A One stop APP for Personal Data management with enhanced Security using Inte...
Design and Implementation of Water Garbage Cleaning Robot
Software Development Using Python Language For Designing Of Servomotor
GSM Based Controlling and Monitoring System of UPS Battery
Electrical Drive Based Floor Cleaning Robot
IoT BASED FIRE EXTINGUISHER SYSTEM with IOT
Wave Energy Generation producing electricity in future
Predictive Maintenance of Motor Using Machine Learning
Development of an Android App For Designing Of Stepper Motor By Kodular Software
Implementation Technology to Repair Pothole Using Waste Plastic
NFC BASED VOTING SYSTEM with Electronic voting devices
Review on Electrical Audit Management in MATLAB Software.
DESIGN AND FABRICATION OF AUTOMATIC CEMENT PLASTERING MACHINE
Research on Inspection Robot for Chemical Industry
Digital Synchroscope using Arduino microcontroller
BLDC MACHINE DESIGN SOFTWARE AND CALCULATION
SIMULATION MODEL OF 3 PHASE TRANSMISSION LINE FAULT ANALYSIS
Automated Water Supply and Theft Identification Using ESP32
Multipurpose Swimming Pool Cleaning Device for Observation, Cleaning and Life...
Annapurna – Waste Food Management system
A One stop APP for Personal Data management with enhanced Security using Inte...

Recently uploaded (20)

PPTX
CH1 Production IntroductoryConcepts.pptx
PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PDF
PPT on Performance Review to get promotions
PDF
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PDF
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
PPTX
web development for engineering and engineering
PDF
Automation-in-Manufacturing-Chapter-Introduction.pdf
PPT
Mechanical Engineering MATERIALS Selection
PPTX
additive manufacturing of ss316l using mig welding
PPTX
Geodesy 1.pptx...............................................
PPTX
OOP with Java - Java Introduction (Basics)
PPTX
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PDF
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
PPTX
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
PPTX
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
PPTX
UNIT 4 Total Quality Management .pptx
PPTX
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
PDF
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf
CH1 Production IntroductoryConcepts.pptx
CYBER-CRIMES AND SECURITY A guide to understanding
PPT on Performance Review to get promotions
PRIZ Academy - 9 Windows Thinking Where to Invest Today to Win Tomorrow.pdf
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
July 2025 - Top 10 Read Articles in International Journal of Software Enginee...
web development for engineering and engineering
Automation-in-Manufacturing-Chapter-Introduction.pdf
Mechanical Engineering MATERIALS Selection
additive manufacturing of ss316l using mig welding
Geodesy 1.pptx...............................................
OOP with Java - Java Introduction (Basics)
IOT PPTs Week 10 Lecture Material.pptx of NPTEL Smart Cities contd
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
SM_6th-Sem__Cse_Internet-of-Things.pdf IOT
FINAL REVIEW FOR COPD DIANOSIS FOR PULMONARY DISEASE.pptx
Engineering Ethics, Safety and Environment [Autosaved] (1).pptx
UNIT 4 Total Quality Management .pptx
M Tech Sem 1 Civil Engineering Environmental Sciences.pptx
BMEC211 - INTRODUCTION TO MECHATRONICS-1.pdf

People Monitoring and Mask Detection using Real-time video analyzing

  • 1. VIVA-Tech International Journal for Research and Innovation Volume 1, Issue 4 (2021) ISSN(Online): 2581-7280 Article No. X PP XX-XX VIVA Institute of Technology 9th National Conference on Role of Engineers in Nation Building – 2021 (NCRENB-2021) 1 www.viva-technology.org/New/IJRI People Monitoring and Mask Detection using Real-time video analyzing Yogesh Gowari1 , Ritik Gaikwad2 , Aniket Gurav3 , Prof. Vinit Raut4 1,2,3,4 (Computer Engineering, VIVA Institute of Technology, India) Abstract: People Counting and mask detection based on video is an important field in a Computer Vision. There is growing interest in video-based solutions for people monitoring and counting in business and security applications using Computer Vision technology. It has been effectively used in many Artificial Intelligence fields. Compareing to normal sensor based solutions the one with video based allows more flexible performance, improved functionalities with lower costs. The system with people counter program requires more processing because that deals with real-time video, so this particular proposed technique converts a color image into binary in order to minimize data of image. Reducing processing time is an important term in Software Engineering to build a good working system. People counting methods based on head detection and tracking to evaluate the total number of people who move under an overhead camera and check whether that people are wearing a mask or not. There basically four main features in this proposed system: People counting, Mask detection, Alarm alert and Scan ID. Based on tracking of head, this method uses the crossing-line judgment to determine whether the particular head object will get counted or not to be counted. The two main challenges overcome in this system are: tough estimation of the background scene and the number of persons in merge split scenarios. A technique for masked face detection using three different steps of estimating eye line detection, facial part detection and eye detection is used in this system. On exceeding the count of people or in case mask is not worn then alarm gets alerted Keywords - Convolution Neural Network, MobileNet SSD, Dataset I. INTRODUCTION Public safety has become a very major problem in areas like malls, railway stations and streets during festive seasons, concerts etc. during any pandemic situation. The massive disasters that happen worldwide include numerous instances of fatality where people gather in crowds. An efficient automated system to manage the crowd count is essential. People head tracking provides a way to detect the position, to obtain the motion trail and to maintain identities of persons in the scene. Managing a crowd of varying densities involves detection of the individual humans in the crowd. In a high density crowd, because of inter-object closure, detection and tracking of humans in the crowd will be a challenge in the computer vision field. This system focuses on training a model for human head detection by some positive samples and negative samples. The trained model is then used to process the video frames in which the human heads are detected and the count of humans in the scenario is provided. It also detects whether people are wearing a mask or not. If people are not wearing then the alarm gets alerted, the same alerting happens when the number of people gathering exceeds. This system can be used in malls or any other places where crowd should be minimum.
  • 2. VIVA-Tech International Journal for Research and Innovation Volume 1, Issue 4 (2021) ISSN(Online): 2581-7280 Article No. X PP XX-XX VIVA Institute of Technology 9th National Conference on Role of Engineers in Nation Building – 2021 (NCRENB-2021) 2 www.viva-technology.org/New/IJRI II. RELATED WORK Mingjie Jiang [1], In the proposed paper and high accuracy and efficient mask detector i.e.an Retina Facemask is proposed which is a one-stage detector, which consists of a feature pyramid network and a module to focus on detecting face masks. Misbah Ahmad [2], in this paper, a deep neural network model SSD(Single Shot multibox Detector ) is explored to solve problems like perspective distortion, variations in human pose, size or orientation; which gives a better accuracy. M. Martínez-Zarzuela [3], An approach for AdaBoost face detection using haar features in GPU. The GPU speeds up the performance and a better video resolution. As CPU is ideal it can perform some other computer Vision tasks. Akshay Mangawati [4], This paper elaborates the exhaustive survey of various object tracking algorithms under different environmental conditions and identifies efficient algorithms in various types of tracking. J. Grönman[5], This paper presents a real-life use case of collecting statistics about bus passengers on a free-to-ride bus route. The use case utilized cost-effective and off-the-shelf components. Prof. P Y Kumbhar[6], In this paper the author presents that face detection is an computer technology that determines location and sizes of human faces in digital image, which detects face and ignores other things like buildings, trees or bodies. Locating and tracking human faces is a prerequisite face recognition analysis. RafaelMuñoz-Salinas[7], This work presents a system able to visually detect and track multiple people using a stereo camera placed at an under-head position. This camera position is especially appropriated for human–machine applications that require interacting with people or to analyze human facial gestures. Tracking based exclusively on position information is unreliable when people establish close interactions Thus, we also include colour information about the people clothes in order to increase the tracking robustness. Zebin Cai [8], In this paper, we propose a people counting method in crowded scenes by detection the head information from the video taken from a camera installed straight down on the ceiling.is proposed for people detection. Combining the head detection and tracking together, a people counting strategy is presented to count the number of the people in the video frames. Heemoon Yoon[9], Within this paper the aim is to develop an user friendly Graphical Framework for object detection API on TensorFlow which is called TensorFlow Graphical Framework (TF-GraF). The TF- GraF provides independent virtual environments according to user accounts in server-side, additionally, execution of data preprocessing, training, and evaluation without CLI in client-side.Since TF-GraF takes care of setting and configuration, it allows anyone to use deep learning technology for their project without spending time to install complex software and environment. Gretchel Karen L. Alcantara[10], In this paper, the researchers familiarizes and exposes themselves with OpenCV. OpenCV is an open source computer vision library that is written in C and C++. First the group aims to have a deeper knowledge and understanding about head detection and tracking using OpenCV. S. Syed Ameer Abbas[11], In this paper, we propose a method to manage the crowd by keeping in track the count of the people in the scene. In our study, we develop a system using Raspberry Pi 3 board that consists of ARMv8 CPU that detects the human heads and provide a count of humans in the region using OpenCV-Python. Fabio Dittrich[12], The author presents two novel approaches for people counting in crowded and open environments that combines the information gathered by multiple views. III. METHODOLOGY The Crowd Monitoring and Mask Detection is a simple system used for people counting and detection of mask in crowded places. This system uses Convolution Neural Network (CNN), which is an image classification algorithm as well as MobileNet SSD which is used for the same. CNN is made up of neurons, each having an independent weight assigned to it. CNN is a class of deep neural networks specially used for image recognition and image processing. MobileNet is a simple but efficient and not very intensive convolutional neural network for mobile vision applications. MobileNet is widely used in many real-world applications which include fine-grained classifications, object detection, face attributes, and localization. CNN takes the input as an image, identifies and assigns priority to various features of the image and it differentiates the features from one another. Mobilenet is a neural network that is used for classification and recognition whereas the SSD is a framework that is used to realize the multi detector. Only the combination of both can do object detection. SSD can be interchanged with RCNN. The preprocessing required for CNN is less and has the ability to learn image characteristics. CNN consists of several sets of convolution layers, pooling layers, flatten and dense. The sets of convolution and pooling layers are used for feature extraction and the number of such sets may vary. Convolution layer is the basic building block of the CNN and is used for extracting features from an input image. The proposed system uses Convolution model which consists of multiple layers for the purpose of feature extraction from the image. Training data is provided to the model for better prediction of people wearing
  • 3. VIVA-Tech International Journal for Research and Innovation Volume 1, Issue 4 (2021) ISSN(Online): 2581-7280 Article No. X PP XX-XX VIVA Institute of Technology 9th National Conference on Role of Engineers in Nation Building – 2021 (NCRENB-2021) 3 www.viva-technology.org/New/IJRI a mask or not. The classification of people wearing mask, the input video is converted into frames and then into RGB format and then is flattened in matrix to extract the information by convolution layer. Multiple convolutional layers used to provide better predictions with higher accuracy. Figure 1 represents the mask detection system flow using CNN with MobileNet algorithm is used in this system as it consumes less data processing time. The testing of the module is done using real time images of people with masks and no mask to reflect the accuracy of the model. Hence, the model classifies the real time people counting and masks detection in an efficient way. Figure 1: System Flow Diagram IV. CONCLUSION The mask detection using CNN with MobileNet algorithm is used in this system as it consumes less data processing time. This System presents a people counting system as a way to manage crowds by keeping the count of people. Keeping in mind the Pandemic situation Mask-Detection feature is added if the count exceeds the prohibited count or if the model recognizes whether people are not wearing masks then the alarm gets alerted. This system will reduce the time taken for humans for counting or checking purposes and ensure them, this work is done by the system itself in no time. By this model human errors will be reduced to great extents as the system itself gets trained through large datasets. This process requires comparatively less time and provides great accuracy. As the system trains itself by doing the same tasks of mask detection so that there is less loss and provides a better accuracy. As this system is still under progress so we can’t predict accurate accuracy but it offers better accuracy. REFERENCES [1] Mingjie Jiang, Xinqi Fan ―RetinaMask: A Face Mask detector,2020, 7th International conference on Artificial Intelligience, IEEE, 2020, pg.9. [2] M. Ahmad, I. Ahmed, K. Ullah, I. Khan, A. Khattak and A. Adnan, ―Energy efficient camera solution for video surveillance‖, International Journal of Advanced Computer Science and Applications, vol. 10, no. 3, IEEE, 2019, pg.2.
  • 4. VIVA-Tech International Journal for Research and Innovation Volume 1, Issue 4 (2021) ISSN(Online): 2581-7280 Article No. X PP XX-XX VIVA Institute of Technology 9th National Conference on Role of Engineers in Nation Building – 2021 (NCRENB-2021) 4 www.viva-technology.org/New/IJRI [3] Mario Martínez Zarzuela, Francisco Javier Díaz-Pernas, Miriam Antón-Rodríguez, ―AdaBoost Face Detection on the GPU Using Haar-Like Features‖, Proceedings of the 4th international conference on Interplay between natural and artificial computation: new challenges on bioinspired applications-Volume III, IEEE, 2018, pg.9 [4] Akshay Mangawati; Mohana; Mohammed Leesan; H.V. Ravish Aradhya ―Object Tracking Algorithms for Video Surveillance Applications‖, 2018 International Conference on Object detection, and motion sensor, IEEE, 2018, pg.6. [5] J. Grönman; P. Sillberg; P. Rantanen; M. Saari;” People Counting in a Public Event—Use Case: Free-to- Ride Bus”,IEEE,2019. [6] Prof. P Y Kumbhar1 , Mohammad Attaullah2 , Shubham Dhere3 , Shivkumar Hipparagi:” REAL TIME FACE DETECTION AND TRACKING USING OPENCV”,2019. [7] RafaelMuñoz-SalinasaEugenioAguirrebMiguelGarcía-Silventeb “People detection and tracking using stereo vision and color”,2007. [8] Zebin Cai; Zhu Liang Yu; Hao Liu; Ke Zhang “Counting People in Crowded Scenes by Video Analyzing”,IEEE,2014. [9] Heemoon Yoon, Sang-Hee Lee, Mira Park,” TensorFlow with user friendly Graphical Framework for object detection API”,2020. [10] Gretchel Karen L. Alcantara; Ivan Darren J. Evangelista; Jerome Vincent B. Malinao; Ofelia B. Ong; Reginald Steven DM. Rivera “Head Detection and Tracking Using OpenCV”,IEEE,2018. [11] S. Syed Ameer Abbas; P. Oliver Jayaprakash; M. Anitha; X. Vinitha Jaini “Crowd Detection and Management using Cascade classifier on ARMv8 and OpenCV-Python”,IEEE,2017. [12] Fabio Dittrich, Luiz E. S. de Oliveira, Alceu S. Britto Jr. and Alessandro L. Koerich “People Counting in Crowded and Outdoor Scenes using a Hybrid Multi-Camera Approach”,IEEE,2019.