Rapid Object Detection using a Boosted
        Cascade of Simple Features
                    Authors: Paul Viola and Michael Jones
                 Presented by: Hirantha Pradeep Weerarathna
                      Supervisor: Dr. Anuja Dharmaratne




University of Colombo School of Computing
Authors

Paul Viola

   – Received his PhD from MIT in 1995
   – Current designations:
       • Manager, Architect, Developer, and Researcher at Microsoft
       • Professor at MIT
   – Research interests: Machine Learning, Web Search, and Image
     Processing
Authors

Michael Jones



   – Received his PhD from the Massachusetts Institute of Technology in 1997
   – Currently a senior member of MERL Research
   – Research interests: Computer Vision and Machine Learning
Road Map

−   Introduction
−   Key Contributions
−   Results and Comparisons
−   Conclusion
 • A prominent paper in the domain of object detection.
 • More than 6000 citations to date.
 • The first solution to provide real-time object detection.
 • Still one of the most widely used solutions for object detection.
Object detection

Detecting instances of semantic objects of a certain class (such as
humans, buildings, or cars) in digital images and videos.
                                          ~wikipedia.org



Applications:
        Robotic Applications
        Surveillance Systems
        Sports Applications
        Security Systems
        And many more
Face Detection
Determining the locations and sizes of human faces in arbitrary
images.
                                        ~wikipedia.org
Image Representation

 • Represent and classify images based on simple features rather than
   pixels directly. The reasons are:
    1.   Features can encode ad-hoc domain knowledge that is difficult to learn
         from a finite quantity of training data.
    2.   Feature-based systems operate much faster than pixel-based systems.
Features
 • Three kinds of simple features are used:
     1.   Two-rectangle features
     2.   Three-rectangle features
     3.   Four-rectangle features

 • Feature value calculation

     ∑ pixel values in white area - ∑ pixel values in gray area
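
The feature value above can be sketched directly from its definition. This is a minimal illustration, not the authors' implementation: the function names, the tiny 4x4 test image, and the choice of the left half as the "white" area are assumptions made for the example.

```python
# Sketch of a horizontal two-rectangle feature computed by brute force.
# Assumption: the left half of the feature is "white", the right half "gray".

def region_sum(img, top, left, height, width):
    """Sum the pixels of the rectangle with the given top-left corner and size."""
    return sum(
        img[r][c]
        for r in range(top, top + height)
        for c in range(left, left + width)
    )


def two_rect_feature(img, top, left, height, width):
    """White-area sum minus gray-area sum for two side-by-side rectangles."""
    half = width // 2
    white = region_sum(img, top, left, height, half)
    gray = region_sum(img, top, left + half, height, half)
    return white - gray


# Hypothetical 4x4 grayscale image used only for illustration.
image = [
    [10, 20, 10, 20],
    [20, 10, 10, 10],
    [30, 10, 10, 20],
    [10, 20, 30, 20],
]

# Feature covering the top two rows: white = 60, gray = 50.
print(two_rect_feature(image, 0, 0, 2, 4))
```

Summing pixels this way costs time proportional to the rectangle area, which is the cost the integral image removes.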
Key Contributions

 • Three main contributions:
    1. Introduction of the integral image
    2. A learning algorithm based on AdaBoost
    3. Combining classifiers in a cascade
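Integral Image
 • The paper introduced this image representation to object detection under the
   name "integral image" (it is closely related to summed-area tables
   from computer graphics).
 • Each entry of the integral image holds the sum of all pixels above and to
   the left of that location, inclusive.
 • The integral image is computed from an image using a few operations per
   pixel.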



                      y                            y
            10      20    10      20       10      30    40      60
            20      10    10      10       30      60    80      110
        x   30      10    10      20   x   60     100   130      180
            10      20    30      20       70     130   190      260

                 Original Image                 Integral Image
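
The table above can be reproduced with a short sketch. The function name is an assumption; the input is the "Original Image" grid from the slide, and the single pass with a running row sum is one common way to build the integral image.

```python
def integral_image(img):
    """Return ii where ii[r][c] is the sum of img over rows 0..r and columns 0..c."""
    rows, cols = len(img), len(img[0])
    ii = [[0] * cols for _ in range(rows)]
    for r in range(rows):
        row_sum = 0  # cumulative sum of the current row up to column c
        for c in range(cols):
            row_sum += img[r][c]
            above = ii[r - 1][c] if r > 0 else 0
            ii[r][c] = row_sum + above
    return ii


# "Original Image" values from the slide.
original = [
    [10, 20, 10, 20],
    [20, 10, 10, 10],
    [30, 10, 10, 20],
    [10, 20, 30, 20],
]

for row in integral_image(original):
    print(row)
```

Each output entry needs one addition to the running row sum and one addition of the entry above it, so the whole table is built in a single pass over the image.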
Integral Image
 • Using the integral image, the pixel sum of any rectangle can be calculated
   using 4 array references.
 • This leads to rapid evaluation of rectangle features.
 • Each feature is evaluated in constant time.

                        A      1       B         2




                        C              D
                                   3             4




                  Pixel sum of area D = ii(4) + ii(1) - ii(2) - ii(3),
                  where ii(k) is the integral image value at corner point k
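
The four-reference lookup can be sketched as follows. The function names and the zero-padded layout (one extra row and column of zeros, so that no boundary checks are needed) are assumptions for this example, not details taken from the slides.

```python
def integral_image_padded(img):
    """Integral image with an extra leading row and column of zeros.

    ii[r][c] holds the sum of img over rows 0..r-1 and columns 0..c-1.
    """
    rows, cols = len(img), len(img[0])
    ii = [[0] * (cols + 1) for _ in range(rows + 1)]
    for r in range(rows):
        for c in range(cols):
            ii[r + 1][c + 1] = img[r][c] + ii[r][c + 1] + ii[r + 1][c] - ii[r][c]
    return ii


def rect_sum(ii, top, left, height, width):
    """Pixel sum of a rectangle using four array references.

    Corner points follow the slide: 1 = top-left, 2 = top-right,
    3 = bottom-left, 4 = bottom-right.
    """
    bottom, right = top + height, left + width
    return ii[bottom][right] + ii[top][left] - ii[top][right] - ii[bottom][left]


pixels = [
    [10, 20, 10, 20],
    [20, 10, 10, 10],
    [30, 10, 10, 20],
    [10, 20, 30, 20],
]
padded = integral_image_padded(pixels)

# Sum of the central 2x2 block (four pixels of value 10).
print(rect_sum(padded, 1, 1, 2, 2))
```

The cost of `rect_sum` does not depend on the rectangle size, which is why rectangle features can be evaluated in constant time at any scale.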
Learning Algorithm based on AdaBoost
 • AdaBoost is used for both feature selection and classifier training.
 • It can select a small set of good features from a very large
   feature set.
 • AdaBoost combines a set of weak learners to form a strong one.
 • The training error of the strong classifier is guaranteed to approach zero
   exponentially in the number of rounds.
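
For reference, one boosting round can be written out as below. The symbols follow the usual AdaBoost presentation for labels in {0, 1}; they are not defined on the slides, so treat the notation (w for example weights, h_t for the weak classifier chosen in round t, T for the number of rounds) as assumptions of this sketch.

```latex
% Weighted error of the weak classifier chosen in round t
\epsilon_t = \sum_i w_{t,i}\,\bigl|h_t(x_i) - y_i\bigr|

% Weight update: correctly classified examples (e_i = 0) are down-weighted,
% misclassified examples (e_i = 1) keep their weight before renormalization
w_{t+1,i} = w_{t,i}\,\beta_t^{\,1 - e_i},
\qquad \beta_t = \frac{\epsilon_t}{1 - \epsilon_t}

% Strong classifier: weighted vote of the T weak classifiers
h(x) =
\begin{cases}
1 & \text{if } \sum_{t=1}^{T} \alpha_t h_t(x) \ge \tfrac{1}{2}\sum_{t=1}^{T} \alpha_t \\
0 & \text{otherwise}
\end{cases}
\qquad \alpha_t = \log\frac{1}{\beta_t}
```

Because each weak classifier here depends on a single feature, every round of boosting also selects one feature.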
Learning Algorithm based on AdaBoost
 • Each weak learner selects the single rectangle feature that best separates
   the positive and negative examples.
 • For each feature, the weak learner determines the optimal threshold, such that
   the fewest examples are misclassified.
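
The threshold search for one feature can be sketched as a decision stump. This is an illustrative brute-force version under stated assumptions: the function name, the candidate thresholds (midpoints between sorted feature values), and the toy data are made up for the example, and a practical trainer would use a faster sorted scan.

```python
def best_stump(values, labels, weights):
    """Find the threshold and polarity with the lowest weighted error.

    The stump predicts 1 when polarity * value < polarity * threshold, else 0.
    Returns (weighted_error, threshold, polarity).
    """
    distinct = sorted(set(values))
    # Candidate thresholds: below all values, between neighbours, above all values.
    candidates = (
        [distinct[0] - 1]
        + [(a + b) / 2 for a, b in zip(distinct, distinct[1:])]
        + [distinct[-1] + 1]
    )
    best = (float("inf"), None, None)
    for threshold in candidates:
        for polarity in (1, -1):
            error = 0.0
            for value, label, weight in zip(values, labels, weights):
                prediction = 1 if polarity * value < polarity * threshold else 0
                if prediction != label:
                    error += weight
            if error < best[0]:
                best = (error, threshold, polarity)
    return best


# Toy feature values: positives (label 1) have small values, negatives large ones.
feature_values = [1, 2, 3, 10, 11, 12]
example_labels = [1, 1, 1, 0, 0, 0]
example_weights = [1 / 6] * 6

print(best_stump(feature_values, example_labels, example_weights))
```

Running this search for every rectangle feature and keeping the one with the lowest weighted error gives the weak classifier for the current boosting round.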
Combine Classifiers in Cascade
 • Building a cascade of classifiers
         • Increases detection performance
         • Radically reduces computation time
 • Simpler classifiers are applied first and reject the majority of sub-windows;
   more complex classifiers are then applied to achieve a low false positive rate.
 • Each subsequent classifier is trained using only the examples that pass
   through all the previous stages.
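
The early-rejection behaviour can be sketched as below. This is a simplified illustration: the data layout (each stage as a list of weak classifiers with weights plus a stage threshold) and the toy "windows" represented by a single brightness number are assumptions for the example.

```python
def stage_passes(window, weak_classifiers, stage_threshold):
    """Evaluate one boosted stage: weighted vote of weak classifiers vs. a threshold."""
    score = sum(alpha * classify(window) for classify, alpha in weak_classifiers)
    return score >= stage_threshold


def cascade_classify(window, stages):
    """Return True only if the window passes every stage; stop at the first rejection."""
    for weak_classifiers, stage_threshold in stages:
        if not stage_passes(window, weak_classifiers, stage_threshold):
            return False  # rejected early, later (more expensive) stages never run
    return True


# Hypothetical two-stage cascade over toy windows described by one number.
toy_stages = [
    # Stage 1: one cheap test.
    ([(lambda w: 1 if w > 10 else 0, 1.0)], 0.5),
    # Stage 2: two tests, both needed to reach the threshold.
    ([(lambda w: 1 if w > 20 else 0, 1.0), (lambda w: 1 if w < 100 else 0, 0.5)], 1.5),
]

for toy_window in (5, 15, 50):
    print(toy_window, cascade_classify(toy_window, toy_stages))
```

Since most sub-windows in an image are background, most of them exit at the first stage, and the average cost per window stays close to the cost of the cheapest classifier.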
Combine Classifiers in Cascade
 • Cascade training involves a trade-off between two effects:
        1. Classifiers with more features achieve a higher detection rate (DR) and
           a lower false positive rate (FPR)
        2. Classifiers with more features require more computation

 • In principle, one can define an optimization framework that trades off
        1. The number of classifier stages
        2. The number of features in each stage
        3. The threshold of each stage

 • The goal is to select the minimum number of features such that the expected DR
   and FPR are achieved.
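
The way stage rates combine can be made concrete with a short worked calculation. The symbols (K stages, per-stage rates f_i and d_i, overall rates F and D) and the example numbers are chosen for illustration, and the product form assumes each stage is evaluated only on windows that passed the previous ones.

```latex
% Overall false positive rate and detection rate of a K-stage cascade
F = \prod_{i=1}^{K} f_i, \qquad D = \prod_{i=1}^{K} d_i

% Example: 10 stages, each with f_i = 0.3 and d_i = 0.99
F = 0.3^{10} \approx 5.9 \times 10^{-6}, \qquad D = 0.99^{10} \approx 0.904
```

Each stage can therefore be a fairly weak filter on its own, as long as it keeps almost all true objects.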
Combine Classifiers in Cascade
 • A simple framework is used to produce an effective and
   highly efficient cascade:
        1. The user selects the maximum acceptable FPR and the minimum acceptable DR for
           each stage
        2. The user selects the target overall FPR and DR
        3. Each stage is trained by adding features until its target DR and FPR are
           met
        4. Stages are added until the overall targets for DR and FPR are met
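
Steps 1, 2, and 4 above can be sketched as a planning calculation. This is a deliberate simplification: it assumes every stage exactly meets its per-stage targets, whereas real training measures the rates on a validation set after each added feature. The function name and the example numbers are assumptions.

```python
def plan_cascade(stage_fpr, stage_dr, target_fpr):
    """Count the stages needed to reach an overall false positive target.

    Assumes each stage achieves exactly stage_fpr and stage_dr.
    Returns (number_of_stages, overall_fpr, overall_dr).
    """
    if not 0.0 < stage_fpr < 1.0:
        raise ValueError("stage_fpr must be strictly between 0 and 1")
    stages, overall_fpr, overall_dr = 0, 1.0, 1.0
    while overall_fpr > target_fpr:
        stages += 1
        overall_fpr *= stage_fpr  # false positives shrink multiplicatively
        overall_dr *= stage_dr    # detections are also lost multiplicatively
    return stages, overall_fpr, overall_dr


# Example: each stage passes at most 50% of non-objects and keeps 99% of objects.
print(plan_cascade(stage_fpr=0.5, stage_dr=0.99, target_fpr=1e-3))
```

The resulting overall detection rate shows whether the per-stage DR target in step 1 is strict enough to meet the overall DR target in step 2.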
Results
 • Testing was done on the MIT+CMU test set, which consists of 507
   faces in 130 images.
 • The detector is a cascade of 38 layers.
 • The cascade was trained using 4916 face images and 9544 non-face
   images.
 • Testing was done with a scaling factor of 1.25 and a window
   shift step of 1.0.
 • On a conventional 700 MHz Pentium III machine,
   the detector achieved a speed of 15 frames/sec.
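
To show what a scaling factor of 1.25 means in practice, the sketch below lists the detector window sizes scanned over one image. The base window of 24 pixels and the 384x288 image size are assumptions for this example (they do not appear on the slides), as is the function name.

```python
def detector_sizes(base_size, image_width, image_height, scale_factor=1.25):
    """List the square window sizes scanned, growing by scale_factor each step."""
    sizes = []
    scale = 1.0
    while int(base_size * scale) <= min(image_width, image_height):
        sizes.append(int(base_size * scale))  # window side in pixels, rounded down
        scale *= scale_factor
    return sizes


# Assumed example: 24-pixel base detector on a 384x288 image.
window_sizes = detector_sizes(24, 384, 288)
print(len(window_sizes), window_sizes)
```

Because rectangle features are computed from the integral image, the detector itself is rescaled at each size instead of resizing the image.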
Results
ROC Curve for Face Detector
Results
Detection Rate Comparison with Contemporary Solutions
Conclusion
 • The solution achieves the goal of real-time object detection.
 • The combination of simple rectangle features and the integral image
   gives an efficient feature representation.
 • AdaBoost is used for feature selection and classifier
   training.
 • The cascade of classifiers quickly discards background
   regions and concentrates computation on object-like regions.

Editor's Notes

  • #6: Replace the word "prominent". Emphasize "Still most widely used" and "First solution". The authors say the solution can process 15 frames/s on a conventional Pentium III machine.
  • #8: The paper describes an object detection solution using face detection (FD) as the application.
  • #13: The value at location 1 is the sum of the pixel values in rectangle A, the value at location 2 is A + B, at location 3 it is A + C, and at location 4 it is A + B + C + D. Therefore the sum of the pixel values in rectangle D can be calculated as the value at location 4 plus the value at location 1, minus the values at locations 2 and 3.
  • #17: With respect to the image: a positive result from the first classifier…
  • #18: On "Can define an optimization framework": finding this optimum is extremely difficult.
  • #20: 15 frames/sec, which was up to about 600 times faster than contemporary solutions.
  • #22: The detector was run using a step size of 1.0 and a starting scale of 1.0. The ROC curve was drawn by removing classifier layers.