SlideShare a Scribd company logo
B-FPGM: Lightweight Face Detection via
Bayesian-Optimized Soft FPGM Pruning
Nikolaos Kaparinos, Vasileios Mezaris
CERTH-ITI, Thermi, Thessaloniki, Greece
Real-World Surveillance
Workshop @ WACV 2025
The Growing Demand for Compact AI Models
● The deployment of AI models on mobile devices, such as smartphones and
drones, is increasingly common.
● Thus, the need for compact and efficient AI models has dramatically
increased.
● Face detectors are a type of model commonly deployed on mobile devices.
● Lightweight face detectors have been proposed in the literature.
● They utilize lightweight backbone networks and other optimization techniques,
such as pruning.
2
Network Pruning
● Network pruning is a technique used to reduce the number of parameters in a
model.
● Pruning methods can also be classified into uniform and non-uniform
approaches.
● FPGM pruning is a structured pruning approach that has demonstrated high
performance in the literature.
● Soft Filter Pruning (SFP) is a pruning method that allows the pruned filters to
be updated during subsequent training steps.
3
B-FPGM
● This work proposes, B-FPGM, a novel non-uniform face detection network
pruning technique.
● This work represents the first application of Bayesian optimization to
structured pruning as well as non-uniform pruning in the literature.
● B-FPGM divides the network layers into 6 groups and employes Bayesian
optimization to optimize the pruning rate of each group.
● The optimal pruning rates are then applied alongside FPGM pruning and
SFP.
4
B-FPGM Advantages
● B-FPGM offers flexibility through its non-universal pruning approach.
● It eliminates the need for engineering expertise to define rules for optimal
pruning rates, effectively taking the ‘human out of the loop’.
● At the same time, it avoids utilizing Reinforcement Learning, which comes
with significant drawbacks.
5
B-FPGM overall pipeline
6
Bayesian optimization step
● The Bayesian optimization step is employed to identify the optimal pruning
rate for each layer group, given a target overall sparsity.
● In each iteration, the pre-trained network is soft-pruned and trained for one
epoch.
● The objective function value is equal to the validation loss, plus an additional
term to ensure that the network is pruned approximately at the target overall
sparsity.
7
Network Layer Groups
8
The number of parameters
in each network layer group.
EResFD model architecture
and layer groups.
Overall B-FPGM algorithm
9
Experimental Setup
● All our experiments were applied to EResFD, the currently smallest (in number of
parameters) well-performing face detector of the literature.
● A small ablation study with a second small face detector, EXTD, is also reported.
● The experiments were performed using the WIDER FACE dataset.
○ 12941 training images
○ Three validation subsets based on difficulty: Easy (1146 images), Medium (1079 images), Hard (1001
images)
● Experiments were conducted with target pruning rates ranging from 10% to 60%.
10
Results on EResFD using the WIDER FACE dataset
11
Hard Subset
Group pruning rates determined by Bayesian
optimization. T is the target pruning rate.
10%
10%
20%
20%
30%
30%
40%
40%
50%
50%
60%
60%
Comparison with SoA models
12
Robustness to Randomness
13
Mean mAP ± standard deviation of B-FPGM on
EResFD across five runs, using different random
seeds, for 20% target pruning rate.
Number of layer groups ablation
14
MAP of B-FPGM on EResFD, on WIDER FACE (Easy, Medium,
Hard subsets), for different network layer groupings. N is the
number of layer groups and T is the target pruning rate.
Inference visual example
15
EResFD 50% pruned using B-FPGM
Thank you for your attention!
Questions?
Nikolaos Kaparinos, kaparinos@iti.gr
Vasileios Mezaris, bmezaris@iti.gr
Source code and pruned models available at:
https://github.com/IDT-ITI/B-FPGM
This work was supported by the EU Horizon Europe and Horizon 2020 programmes
under grant agreements 101070093 vera.ai and 951911 AI4Media, respectively.
16

More Related Content

PDF
PR-185: RetinaFace: Single-stage Dense Face Localisation in the Wild
PDF
Sparse Support Faces - Battista Biggio - Int'l Conf. Biometrics, ICB 2015, Ph...
PDF
Facial recognition based on enhanced neural network
PDF
HC-4012, Complex Network Clustering Using GPU-based Parallel Non-negative Mat...
PPT
Software tookits for machine learning and graphical models
PDF
Localization, Classification, and Evaluation.pdf
PDF
Beyond relational: «neural» DBMS?
PPTX
Face recognition technology
PR-185: RetinaFace: Single-stage Dense Face Localisation in the Wild
Sparse Support Faces - Battista Biggio - Int'l Conf. Biometrics, ICB 2015, Ph...
Facial recognition based on enhanced neural network
HC-4012, Complex Network Clustering Using GPU-based Parallel Non-negative Mat...
Software tookits for machine learning and graphical models
Localization, Classification, and Evaluation.pdf
Beyond relational: «neural» DBMS?
Face recognition technology

Similar to B-FPGM: Lightweight Face Detection via Bayesian-Optimized Soft FPGM Pruning (20)

PDF
Not Enough Measurements, Too Many Measurements
PDF
Solr and Machine Vision - Scott Cote, Lucidworks & Trevor Grant, IBM
PDF
Images as Occlusions of Textures: A Framework for Segmentation
PDF
20120140504019
PPTX
Facial emotion detection on babies' emotional face using Deep Learning.
PDF
And Then There Are Algorithms - Danilo Poccia - Codemotion Rome 2018
PDF
Lecture 7&8 computer vision face_recognition
PDF
IEEE Pattern analysis and machine intelligence 2016 Title and Abstract
PDF
ICCV2009: MAP Inference in Discrete Models: Part 5
PDF
FPGA ARCHITECTURE FOR FACIAL-FEATURES AND COMPONENTS EXTRACTION
PDF
International Journal of Computer Science, Engineering and Information Techno...
PDF
Adversarial attack driven data augmentation for medical images
PDF
IRJET- Image Segmentation using Classification of Radial Basis Function of Ne...
PPTX
Unsupervised Feature Learning
PDF
INTELLIGENT MALWARE DETECTION USING EXTREME LEARNING MACHINE
PPTX
Modelling Probability Distributions using Neural Networks: Applications to Me...
PPTX
Detection and recognition of face using neural network
PDF
Smart Face Recognition System Analysis
PDF
Design and analysis of face recognition system based on VGG-Face-16 with vari...
PDF
Ag044216224
Not Enough Measurements, Too Many Measurements
Solr and Machine Vision - Scott Cote, Lucidworks & Trevor Grant, IBM
Images as Occlusions of Textures: A Framework for Segmentation
20120140504019
Facial emotion detection on babies' emotional face using Deep Learning.
And Then There Are Algorithms - Danilo Poccia - Codemotion Rome 2018
Lecture 7&8 computer vision face_recognition
IEEE Pattern analysis and machine intelligence 2016 Title and Abstract
ICCV2009: MAP Inference in Discrete Models: Part 5
FPGA ARCHITECTURE FOR FACIAL-FEATURES AND COMPONENTS EXTRACTION
International Journal of Computer Science, Engineering and Information Techno...
Adversarial attack driven data augmentation for medical images
IRJET- Image Segmentation using Classification of Radial Basis Function of Ne...
Unsupervised Feature Learning
INTELLIGENT MALWARE DETECTION USING EXTREME LEARNING MACHINE
Modelling Probability Distributions using Neural Networks: Applications to Me...
Detection and recognition of face using neural network
Smart Face Recognition System Analysis
Design and analysis of face recognition system based on VGG-Face-16 with vari...
Ag044216224
Ad

More from VasileiosMezaris (20)

PDF
Combatting video-borne disinformation and increasing trust in AI methods
PDF
An LLM Framework for Long-form Video Retrieval and Audio-Visual Question Answ...
PDF
Improving the Perturbation-Based Explanation of Deepfake Detectors Through th...
PPTX
LMM-Regularized CLIP Embeddings for Image Classification
PPTX
Disturbing Image Detection Using LMM-Elicited Emotion Embeddings
PPTX
Exploiting LMM based knowledge for image classification tasks
PPTX
Detecting visual-media-borne disinformation: a summary of latest advances at ...
PPTX
Dataset and methods for 360-degree video summarization
PPTX
Explainable Deepfake Image/Video Detection
PPTX
Multi-Modal Fusion for Image Manipulation Detection and Localization
PDF
CERTH-ITI at MediaEval 2023 NewsImages Task
PPTX
Spatio-Temporal Summarization of 360-degrees Videos
PPTX
Masked Feature Modelling for the unsupervised pre-training of a Graph Attenti...
PPTX
Cross-modal Networks and Dual Softmax Operation for MediaEval NewsImages 2022
PPTX
TAME: Trainable Attention Mechanism for Explanations
PPTX
Gated-ViGAT
PPTX
Explaining video summarization based on the focus of attention
PPTX
Combining textual and visual features for Ad-hoc Video Search
PPTX
Explaining the decisions of image/video classifiers
PPTX
Learning visual explanations for DCNN-based image classifiers using an attent...
Combatting video-borne disinformation and increasing trust in AI methods
An LLM Framework for Long-form Video Retrieval and Audio-Visual Question Answ...
Improving the Perturbation-Based Explanation of Deepfake Detectors Through th...
LMM-Regularized CLIP Embeddings for Image Classification
Disturbing Image Detection Using LMM-Elicited Emotion Embeddings
Exploiting LMM based knowledge for image classification tasks
Detecting visual-media-borne disinformation: a summary of latest advances at ...
Dataset and methods for 360-degree video summarization
Explainable Deepfake Image/Video Detection
Multi-Modal Fusion for Image Manipulation Detection and Localization
CERTH-ITI at MediaEval 2023 NewsImages Task
Spatio-Temporal Summarization of 360-degrees Videos
Masked Feature Modelling for the unsupervised pre-training of a Graph Attenti...
Cross-modal Networks and Dual Softmax Operation for MediaEval NewsImages 2022
TAME: Trainable Attention Mechanism for Explanations
Gated-ViGAT
Explaining video summarization based on the focus of attention
Combining textual and visual features for Ad-hoc Video Search
Explaining the decisions of image/video classifiers
Learning visual explanations for DCNN-based image classifiers using an attent...
Ad

Recently uploaded (20)

PPTX
Derivatives of integument scales, beaks, horns,.pptx
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PDF
. Radiology Case Scenariosssssssssssssss
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PPTX
2. Earth - The Living Planet earth and life
PPTX
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
PDF
AlphaEarth Foundations and the Satellite Embedding dataset
PDF
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
PPTX
Microbiology with diagram medical studies .pptx
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PPTX
INTRODUCTION TO EVS | Concept of sustainability
PPTX
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
PPTX
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
PPT
Chemical bonding and molecular structure
PPTX
SCIENCE10 Q1 5 WK8 Evidence Supporting Plate Movement.pptx
PDF
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
Derivatives of integument scales, beaks, horns,.pptx
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
. Radiology Case Scenariosssssssssssssss
Taita Taveta Laboratory Technician Workshop Presentation.pptx
2. Earth - The Living Planet earth and life
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
AlphaEarth Foundations and the Satellite Embedding dataset
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
Microbiology with diagram medical studies .pptx
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
INTRODUCTION TO EVS | Concept of sustainability
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
Biophysics 2.pdffffffffffffffffffffffffff
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
ECG_Course_Presentation د.محمد صقران ppt
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
Chemical bonding and molecular structure
SCIENCE10 Q1 5 WK8 Evidence Supporting Plate Movement.pptx
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...

B-FPGM: Lightweight Face Detection via Bayesian-Optimized Soft FPGM Pruning

  • 1. B-FPGM: Lightweight Face Detection via Bayesian-Optimized Soft FPGM Pruning Nikolaos Kaparinos, Vasileios Mezaris CERTH-ITI, Thermi, Thessaloniki, Greece Real-World Surveillance Workshop @ WACV 2025
  • 2. The Growing Demand for Compact AI Models ● The deployment of AI models on mobile devices, such as smartphones and drones, is increasingly common. ● Thus, the need for compact and efficient AI models has dramatically increased. ● Face detectors are a type of model commonly deployed on mobile devices. ● Lightweight face detectors have been proposed in the literature. ● They utilize lightweight backbone networks and other optimization techniques, such as pruning. 2
  • 3. Network Pruning ● Network pruning is a technique used to reduce the number of parameters in a model. ● Pruning methods can also be classified into uniform and non-uniform approaches. ● FPGM pruning is a structured pruning approach that has demonstrated high performance in the literature. ● Soft Filter Pruning (SFP) is a pruning method that allows the pruned filters to be updated during subsequent training steps. 3
  • 4. B-FPGM ● This work proposes, B-FPGM, a novel non-uniform face detection network pruning technique. ● This work represents the first application of Bayesian optimization to structured pruning as well as non-uniform pruning in the literature. ● B-FPGM divides the network layers into 6 groups and employes Bayesian optimization to optimize the pruning rate of each group. ● The optimal pruning rates are then applied alongside FPGM pruning and SFP. 4
  • 5. B-FPGM Advantages ● B-FPGM offers flexibility through its non-universal pruning approach. ● It eliminates the need for engineering expertise to define rules for optimal pruning rates, effectively taking the ‘human out of the loop’. ● At the same time, it avoids utilizing Reinforcement Learning, which comes with significant drawbacks. 5
  • 7. Bayesian optimization step ● The Bayesian optimization step is employed to identify the optimal pruning rate for each layer group, given a target overall sparsity. ● In each iteration, the pre-trained network is soft-pruned and trained for one epoch. ● The objective function value is equal to the validation loss, plus an additional term to ensure that the network is pruned approximately at the target overall sparsity. 7
  • 8. Network Layer Groups 8 The number of parameters in each network layer group. EResFD model architecture and layer groups.
  • 10. Experimental Setup ● All our experiments were applied to EResFD, the currently smallest (in number of parameters) well-performing face detector of the literature. ● A small ablation study with a second small face detector, EXTD, is also reported. ● The experiments were performed using the WIDER FACE dataset. ○ 12941 training images ○ Three validation subsets based on difficulty: Easy (1146 images), Medium (1079 images), Hard (1001 images) ● Experiments were conducted with target pruning rates ranging from 10% to 60%. 10
  • 11. Results on EResFD using the WIDER FACE dataset 11 Hard Subset Group pruning rates determined by Bayesian optimization. T is the target pruning rate. 10% 10% 20% 20% 30% 30% 40% 40% 50% 50% 60% 60%
  • 12. Comparison with SoA models 12
  • 13. Robustness to Randomness 13 Mean mAP ± standard deviation of B-FPGM on EResFD across five runs, using different random seeds, for 20% target pruning rate.
  • 14. Number of layer groups ablation 14 MAP of B-FPGM on EResFD, on WIDER FACE (Easy, Medium, Hard subsets), for different network layer groupings. N is the number of layer groups and T is the target pruning rate.
  • 15. Inference visual example 15 EResFD 50% pruned using B-FPGM
  • 16. Thank you for your attention! Questions? Nikolaos Kaparinos, kaparinos@iti.gr Vasileios Mezaris, bmezaris@iti.gr Source code and pruned models available at: https://github.com/IDT-ITI/B-FPGM This work was supported by the EU Horizon Europe and Horizon 2020 programmes under grant agreements 101070093 vera.ai and 951911 AI4Media, respectively. 16