SlideShare a Scribd company logo
CANEXPOSURE,NOISEANDCOMPRESSION
AFFECTIMAGERECOGNITION?
An Assessment of the Impacts
on State-of-the-art ConvNets
STEFFENS, Cristiano R.; MESSIAS, Lucas R. V.;
DREWS-JR, Paulo J. L.;BOTELHO, Silvia S. d. C.
cristianosteffens@furg.br
OUTLINE
Evaluating image recognition models behiond validation sets
• Perception / Vision is an important component of modern autonomous systems
• CNNs hold the state-of-the-art in image recognition
• Growing interest in reliability / robustness
• Comprehensive assessment
• Clear methodology
• State-of-the-art models
• Several types of distortion
• Further directions
• How can we build better models?
• Can we prevent systems from operating on faulty data?
• Can we build better pipelines?
2
MOTIVATION
Under-exposure conditions
3
• Weakly illuminated scenes
• Time constraints (i.e. the robot depends on the image acquisition/processing to make a decision)
• Scenes with high dynamic ranges
• Small apperture (hardware construction)
• Low quality/cost sensors
Properly exposed Low Range Gamma 2 Gamma 4 Gamma 8
MOTIVATION
Some common conditions
4
MOTIVATION
Over-Exposure
5
• Scene with high dynamic range
• Ill adjusted optics/gain
• Time constraints
• Reflective surfaces
• Low dynamic range sensors
Properly exposed Low Range Gamma 1/2 Gamma 1/4 Gamma 1/8
MOTIVATION
Lossy Compression, Poisson, Gaussian, Salt & Pepper and Speckle Noise
6
• Bandwidth limitation
• Storage limitation
• Sensor quality
• Dead pixels (always off or always on)
• Wear and tear
• Dust, damage on lens and sensors, noise
Over-compression Poisson Noise Gaussian Noise Salt & Pepper Speckle Noise
EVALUATED MODELS
Imagenet Large Scale Visual Recognition Challenge (ILSVRC) State-of-the-Art holders
Model Year Size Parameters
Top-1
Acuracy
Input Size
Inception-
ResNet-v2
2017 215 MB 55,873,736 0,80 299
MobileNetV1 2017 16 MB 4,253,864 0,70 224
NASNetLarge 2018 343 MB 88,949,818 0,83 331
NASNetMobile 2018 23 MB 5,326,716 0,74 224
VGG16 2014 528 MB 138,357,544 0,71 224
Xception 2017 88 MB 22,910,480 0,79 299
7
PROCEDURE
A procedure that can be reproduced and used for any vision task
• We use pre-trained image recognition models
• No fine-tuning
• Exact same preprocessing as in the original implementation
• Oficial Imagenet validation set
• 1000 classes
• 50 images per class
• Inference on:
• Original set (to avoid hardware related, interpolation and other bias)
• 8 levels of misexposure
• Over-compressed images
• 4 types of typical noise
8
RESULTS
Overview – Top-1 Accuracy
9
RESULTS - INCEPTION-RESNET-V2
Overall good performance. Robust towards mild mis-exposure, compression, Gaussian and Poisson
10
RESULTS - INCEPTION-RESNET-V2
Overall good performance. Robust towards mild mis-exposure, compression, Gaussian and Poisson
11
FNs are limited to 50 due to the validation
dataset properties
No upper bound for FP
Statistics are per class:
A median of 10 means that 50% of the
classes in the dataset presented 10 or less
false negatives.
What is more important?
Would you rather overrun a person due to a
FN or stop in the middle of the road due to a
FP ?
RESULTS - MOBILENETV1
No robustness to S&P and Speckle Noise. Highly affected by moderate mis-exposure.
12
RESULTS - NASNET LARGE
Best accuracy, precision, and F1-Score among all models considered in this study
13
RESULTS - NASNET MOBILE
Significantly affected by severe miss-exposure conditions, S&P, and Speckle noise
14
RESULTS – VGG16
Slightly affected by mild mis-exposure and Poisson noise
15
RESULTS – XCEPTION
Robust towards moderate mis-exposure, over-compression, Gaussian and Poisson noise
16
CONCLUSION
New is Always better! Larger is better!
• Relevant
• Autonomous systems
• Robotics
• Applications that rely on visual perception
• Comprehensive experiment
• Broad set of classifiers
• Based on standard ILSVRC validation set
• Poor exposure
• Heavy compression
• Signal independent noise
• Signal dependent noise
• Reproducible procedure
• Objective evaluation
• No human bias
17
CONCLUSION
New is Always better! Larger is better!
• Most models are
• Little affected by mild miss-exposure.
• Robust towards Poisson and Gaussian noise
• Critically affected by moderate to severe miss exposure
• Critically affected by S&P and Speckle noise
• CNNs are evolving
• Modern architectures, such as NASNet, Inception Resnet v2 and Xception are more robust
• VGG is among the least robust
• Large models are better
• NOT you VGG!!
• NASNet Large performs significantly better than its Mobile version (while both share the same building
blocks)
• Mobile models are most affected
18
ONGOINGAND FUTURE WORK
We have a real issue! How can we solve it?
• Could the models’ accuracy be improved by adding these
common distortions in training time?
😞 Preliminary results show small improvement
• Can we build image processing pipelines which protect the
application from failing due to faulty data?
😃 Absolutelly! Preliminary results are promising 👉
• Can we prevent ill exposure in mobile/outdoor robotics?
⏳ Future Work
• Can we improve classification models by putting more
emphasis on image classes that are more prone to error?
⏳ Future Work
19
👈 Damaged
👈 Restored
☝️ Original
THANKS!
Cristiano Steffens
cristianosteffens@furg.br
researchgate.net/profile/Cristiano_Steffens
20
cristianosteffens@ieee.org.br
github.com/steffensbola
This study was financed in part by the Coordenação de Aperfeiçoamento de
Pessoal de Nível Superior – Brasil (CAPES) – Finance Code 001.

More Related Content

PPTX
A pipelined approach to deal with image distortion in computer vision - BRACI...
PPT
2007- SSVS for hand held microscopes
PDF
Augmix review [cdm]
PDF
Improving Hardware Efficiency for DNN Applications
PPTX
patil.pptx.deep fake image on ppt slide share
PDF
"How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"...
PPTX
SLICING AIDED HYPER INFERENCE AND FINE-TUNING FOR SMALL.pptx
PDF
Full reference video quality assessment
A pipelined approach to deal with image distortion in computer vision - BRACI...
2007- SSVS for hand held microscopes
Augmix review [cdm]
Improving Hardware Efficiency for DNN Applications
patil.pptx.deep fake image on ppt slide share
"How Image Sensor and Video Compression Parameters Impact Vision Algorithms,"...
SLICING AIDED HYPER INFERENCE AND FINE-TUNING FOR SMALL.pptx
Full reference video quality assessment

Similar to Can Exposure, Noise and Compression affect Image Recognition? An Assessment of the Impacts on State-of-the-art ConvNets (20)

PPTX
Sign Language Recognition System.pptx
PPTX
Cvpr 2018 papers review (efficient computing)
PPTX
Face Detection.pptx
PDF
Steer and/or sink the supertanker by Andrew Rendell
PDF
Semi-Supervised Deep Learning
PDF
Env2Vec: Accelerating VNF Testing with Deep Learning
PDF
Robustness of compressed CNNs
PPTX
408187464-Age-and-Gender-Detection-3-pptx.pptx
PPTX
ROLE OF DIGITAL IMAGING IN PATHOLOGY.pptx
PDF
06 13sept 8313 9997-2-ed an adaptive (edit lafi)
PPTX
Convolutional neural networks 이론과 응용
PDF
“A Highly Data-Efficient Deep Learning Approach,” a Presentation from Samsung
PDF
SafeguardAI and Surprise Based Learning -- Protect your AI solutions from Uni...
PDF
Senslab - open hardware - fossa2010
PPTX
SCREENLESS DISPLAY.pptx
PPTX
Edge ai in iot automation and communication
PPTX
A DEEP LEARNING APPROACH FOR SEMANTIC SEGMENTATION IN BRAIN TUMOR IMAGES
PDF
DATI, AI E ROBOTICA @POLITO
PPTX
ICEATsssefggfdrghhkoolknvdesfjonbeeffd0.pptx
PPTX
Feature based heart disease prediction approach
Sign Language Recognition System.pptx
Cvpr 2018 papers review (efficient computing)
Face Detection.pptx
Steer and/or sink the supertanker by Andrew Rendell
Semi-Supervised Deep Learning
Env2Vec: Accelerating VNF Testing with Deep Learning
Robustness of compressed CNNs
408187464-Age-and-Gender-Detection-3-pptx.pptx
ROLE OF DIGITAL IMAGING IN PATHOLOGY.pptx
06 13sept 8313 9997-2-ed an adaptive (edit lafi)
Convolutional neural networks 이론과 응용
“A Highly Data-Efficient Deep Learning Approach,” a Presentation from Samsung
SafeguardAI and Surprise Based Learning -- Protect your AI solutions from Uni...
Senslab - open hardware - fossa2010
SCREENLESS DISPLAY.pptx
Edge ai in iot automation and communication
A DEEP LEARNING APPROACH FOR SEMANTIC SEGMENTATION IN BRAIN TUMOR IMAGES
DATI, AI E ROBOTICA @POLITO
ICEATsssefggfdrghhkoolknvdesfjonbeeffd0.pptx
Feature based heart disease prediction approach
Ad

More from Cristiano Rafael Steffens (20)

PDF
CONVOLUTIONAL NEURAL NETWORKS: The workhorse of image and video
PPTX
A CNN BASED MODEL TO RESTORE ILL EXPOSED IMAGES
PPTX
MODELAGEM DAS DINÂMICAS DA FORMAÇÃO DA GOTA E TRANSFERÊNCIA DE MASSA EM PROCE...
PPTX
UMA ABORDAGEM COMPARATIVA ENTRE MICROCONTROLADORES: ARDUINO MEGA X ARDUINO DU...
PPTX
FPGA-based sensor integration and communication protocols for automated
PPTX
Lars 2016 A Texture Driven Approach for Visible Spectrum Fire Detection
PDF
Php Math and arrays
PPTX
ICRA 2016 - Interactive section Presentation
PDF
Vision-Based System for Welding Groove Measurements for Robotic Welding Appli...
PPT
Simpósio Unicruz: OpenCV + Python (parte 1)
PPTX
Welding Groove Mapping: Image Acquisition and Processing on Shiny Surfaces - ...
PPTX
Automated control module based on VBM for shipyard welding applications: Stud...
PPTX
An Unconstrained Dataset for Non-stationary Video Based Fire Detection
PPTX
Introdução ao processamento de imagens com OpenCV (cont)
PPTX
Introdução OpenCV (Pt-Br) com exemplos
PPTX
Um Sistema De Detecção De Fogo Baseado Em Vídeo
PPTX
Um sistema de detecção de chamas utilizando RF e SVM (Short Version)
PPT
G xserver curso-actualizgxxev1
PDF
20 pratico cursogxxbr
PDF
18 indice cursogxx
CONVOLUTIONAL NEURAL NETWORKS: The workhorse of image and video
A CNN BASED MODEL TO RESTORE ILL EXPOSED IMAGES
MODELAGEM DAS DINÂMICAS DA FORMAÇÃO DA GOTA E TRANSFERÊNCIA DE MASSA EM PROCE...
UMA ABORDAGEM COMPARATIVA ENTRE MICROCONTROLADORES: ARDUINO MEGA X ARDUINO DU...
FPGA-based sensor integration and communication protocols for automated
Lars 2016 A Texture Driven Approach for Visible Spectrum Fire Detection
Php Math and arrays
ICRA 2016 - Interactive section Presentation
Vision-Based System for Welding Groove Measurements for Robotic Welding Appli...
Simpósio Unicruz: OpenCV + Python (parte 1)
Welding Groove Mapping: Image Acquisition and Processing on Shiny Surfaces - ...
Automated control module based on VBM for shipyard welding applications: Stud...
An Unconstrained Dataset for Non-stationary Video Based Fire Detection
Introdução ao processamento de imagens com OpenCV (cont)
Introdução OpenCV (Pt-Br) com exemplos
Um Sistema De Detecção De Fogo Baseado Em Vídeo
Um sistema de detecção de chamas utilizando RF e SVM (Short Version)
G xserver curso-actualizgxxev1
20 pratico cursogxxbr
18 indice cursogxx
Ad

Recently uploaded (20)

PDF
AlphaEarth Foundations and the Satellite Embedding dataset
PDF
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
PDF
. Radiology Case Scenariosssssssssssssss
PPTX
Cell Membrane: Structure, Composition & Functions
PPTX
Derivatives of integument scales, beaks, horns,.pptx
PPTX
7. General Toxicologyfor clinical phrmacy.pptx
PPTX
Introduction to Cardiovascular system_structure and functions-1
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PPTX
2Systematics of Living Organisms t-.pptx
PPTX
Microbiology with diagram medical studies .pptx
PPTX
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PPTX
neck nodes and dissection types and lymph nodes levels
PDF
Sciences of Europe No 170 (2025)
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PDF
The scientific heritage No 166 (166) (2025)
PDF
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
PPTX
BIOMOLECULES PPT........................
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PPTX
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...
AlphaEarth Foundations and the Satellite Embedding dataset
Unveiling a 36 billion solar mass black hole at the centre of the Cosmic Hors...
. Radiology Case Scenariosssssssssssssss
Cell Membrane: Structure, Composition & Functions
Derivatives of integument scales, beaks, horns,.pptx
7. General Toxicologyfor clinical phrmacy.pptx
Introduction to Cardiovascular system_structure and functions-1
Classification Systems_TAXONOMY_SCIENCE8.pptx
2Systematics of Living Organisms t-.pptx
Microbiology with diagram medical studies .pptx
cpcsea ppt.pptxssssssssssssssjjdjdndndddd
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
neck nodes and dissection types and lymph nodes levels
Sciences of Europe No 170 (2025)
ECG_Course_Presentation د.محمد صقران ppt
The scientific heritage No 166 (166) (2025)
CAPERS-LRD-z9:AGas-enshroudedLittleRedDotHostingaBroad-lineActive GalacticNuc...
BIOMOLECULES PPT........................
Taita Taveta Laboratory Technician Workshop Presentation.pptx
Vitamins & Minerals: Complete Guide to Functions, Food Sources, Deficiency Si...

Can Exposure, Noise and Compression affect Image Recognition? An Assessment of the Impacts on State-of-the-art ConvNets

  • 1. CANEXPOSURE,NOISEANDCOMPRESSION AFFECTIMAGERECOGNITION? An Assessment of the Impacts on State-of-the-art ConvNets STEFFENS, Cristiano R.; MESSIAS, Lucas R. V.; DREWS-JR, Paulo J. L.;BOTELHO, Silvia S. d. C. cristianosteffens@furg.br
  • 2. OUTLINE Evaluating image recognition models behiond validation sets • Perception / Vision is an important component of modern autonomous systems • CNNs hold the state-of-the-art in image recognition • Growing interest in reliability / robustness • Comprehensive assessment • Clear methodology • State-of-the-art models • Several types of distortion • Further directions • How can we build better models? • Can we prevent systems from operating on faulty data? • Can we build better pipelines? 2
  • 3. MOTIVATION Under-exposure conditions 3 • Weakly illuminated scenes • Time constraints (i.e. the robot depends on the image acquisition/processing to make a decision) • Scenes with high dynamic ranges • Small apperture (hardware construction) • Low quality/cost sensors Properly exposed Low Range Gamma 2 Gamma 4 Gamma 8
  • 5. MOTIVATION Over-Exposure 5 • Scene with high dynamic range • Ill adjusted optics/gain • Time constraints • Reflective surfaces • Low dynamic range sensors Properly exposed Low Range Gamma 1/2 Gamma 1/4 Gamma 1/8
  • 6. MOTIVATION Lossy Compression, Poisson, Gaussian, Salt & Pepper and Speckle Noise 6 • Bandwidth limitation • Storage limitation • Sensor quality • Dead pixels (always off or always on) • Wear and tear • Dust, damage on lens and sensors, noise Over-compression Poisson Noise Gaussian Noise Salt & Pepper Speckle Noise
  • 7. EVALUATED MODELS Imagenet Large Scale Visual Recognition Challenge (ILSVRC) State-of-the-Art holders Model Year Size Parameters Top-1 Acuracy Input Size Inception- ResNet-v2 2017 215 MB 55,873,736 0,80 299 MobileNetV1 2017 16 MB 4,253,864 0,70 224 NASNetLarge 2018 343 MB 88,949,818 0,83 331 NASNetMobile 2018 23 MB 5,326,716 0,74 224 VGG16 2014 528 MB 138,357,544 0,71 224 Xception 2017 88 MB 22,910,480 0,79 299 7
  • 8. PROCEDURE A procedure that can be reproduced and used for any vision task • We use pre-trained image recognition models • No fine-tuning • Exact same preprocessing as in the original implementation • Oficial Imagenet validation set • 1000 classes • 50 images per class • Inference on: • Original set (to avoid hardware related, interpolation and other bias) • 8 levels of misexposure • Over-compressed images • 4 types of typical noise 8
  • 10. RESULTS - INCEPTION-RESNET-V2 Overall good performance. Robust towards mild mis-exposure, compression, Gaussian and Poisson 10
  • 11. RESULTS - INCEPTION-RESNET-V2 Overall good performance. Robust towards mild mis-exposure, compression, Gaussian and Poisson 11 FNs are limited to 50 due to the validation dataset properties No upper bound for FP Statistics are per class: A median of 10 means that 50% of the classes in the dataset presented 10 or less false negatives. What is more important? Would you rather overrun a person due to a FN or stop in the middle of the road due to a FP ?
  • 12. RESULTS - MOBILENETV1 No robustness to S&P and Speckle Noise. Highly affected by moderate mis-exposure. 12
  • 13. RESULTS - NASNET LARGE Best accuracy, precision, and F1-Score among all models considered in this study 13
  • 14. RESULTS - NASNET MOBILE Significantly affected by severe miss-exposure conditions, S&P, and Speckle noise 14
  • 15. RESULTS – VGG16 Slightly affected by mild mis-exposure and Poisson noise 15
  • 16. RESULTS – XCEPTION Robust towards moderate mis-exposure, over-compression, Gaussian and Poisson noise 16
  • 17. CONCLUSION New is Always better! Larger is better! • Relevant • Autonomous systems • Robotics • Applications that rely on visual perception • Comprehensive experiment • Broad set of classifiers • Based on standard ILSVRC validation set • Poor exposure • Heavy compression • Signal independent noise • Signal dependent noise • Reproducible procedure • Objective evaluation • No human bias 17
  • 18. CONCLUSION New is Always better! Larger is better! • Most models are • Little affected by mild miss-exposure. • Robust towards Poisson and Gaussian noise • Critically affected by moderate to severe miss exposure • Critically affected by S&P and Speckle noise • CNNs are evolving • Modern architectures, such as NASNet, Inception Resnet v2 and Xception are more robust • VGG is among the least robust • Large models are better • NOT you VGG!! • NASNet Large performs significantly better than its Mobile version (while both share the same building blocks) • Mobile models are most affected 18
  • 19. ONGOINGAND FUTURE WORK We have a real issue! How can we solve it? • Could the models’ accuracy be improved by adding these common distortions in training time? 😞 Preliminary results show small improvement • Can we build image processing pipelines which protect the application from failing due to faulty data? 😃 Absolutelly! Preliminary results are promising 👉 • Can we prevent ill exposure in mobile/outdoor robotics? ⏳ Future Work • Can we improve classification models by putting more emphasis on image classes that are more prone to error? ⏳ Future Work 19 👈 Damaged 👈 Restored ☝️ Original
  • 20. THANKS! Cristiano Steffens cristianosteffens@furg.br researchgate.net/profile/Cristiano_Steffens 20 cristianosteffens@ieee.org.br github.com/steffensbola This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior – Brasil (CAPES) – Finance Code 001.