SlideShare a Scribd company logo
Tracking and Recognition Presented by: Amir Rosenberger Instructor : Dr. Lior Wolf Support Vector Tracking, Shai Avidan, CVPR 2001 Ensemble Tracking, Shai Avidan, CVPR 2005 Shai Avidan is credited for most of this presentation’s slides
What is Tracking? Problem of estimating the trajectory of an object in the image plane as it moves around a scene. Tracking provides: Constants labels to objects Object centric information
Challenges Problems: Noise in images Complex object motion Camera Movements Partial and full occlusions Scene illuminations changes Real time processing requirements
Articles Support Vector Tracking (CVPR 2001) Ensemble Tracking  (CVPR 2005) Both works were done by Shai Avidan
SVT Input Farme at time T+1 Initial guess of the  object in time T Process Calculate SVM scores in the local surrounding and return a rectangle with max score
SVT Algorithm
SVT Algorithm Find a solution to
SVT Algorithm
Implementation Iterations are made to refine result. Use of Pyramid SVT Use homogeneous quadratic polynomial kernel Scale of rectangle to SV size
Implementation Training: Reduce all images to 20x20 pixels image Classification    ~10,000 examples with 97% equal error rate Test    ~10,000 examples with 92% equal error rate Total of ~2000 Support Vectors, reduced to only 50
Initial Guess Demonstration
Initial Guess Demonstration
Initial Guess Demonstration Initial: -8.3645 Final:  2.9272
Results Frame # SVM score
Results Frame # SVM score
Results Frame # SVM score
SVT Questions?
Ensemble Tracking Use of classifiers to distinguish the object from the background.  Classifiers are used to create a confidence map Training of new weak classifiers and adding them to the ensemble of weak classifiers
Ensemble Tracking Advantages Breaks the time consuming training phase into a sequence of simple learning tasks that can be performed on-line Can integrate off-line and on-line learning seamlessly. Stability of tracker is improved in cases of partial occlusions or illumination changes.
General Algorithm Initialization (for frame  I 1 ): Train several weak classifiers and add them to the ensemble For each new frame  I j   do: Test all pixels in frame  I j   using the current strong classifier and create a confidence map  L j Run mean shift on the confidence map  L j   and report new object rectangle  r j
General Algorithm Label pixels inside rectangle  r j   as object and all those outside it as background Remove old weak classifiers Train new weak classifiers on frame  I j  and add them to the ensemble
Train: Feature Space Time  T
Test: Feature Space Time  T+1
Track: Time  T+1
Train Feature Space Time  T+1
Train Feature Space Time  T+1
Feature vector for every pixel Use: 8 bin local edge orientation histogram Color (R,G,B) This gives an 11D feature space to work in
Object and Background Pixels in a rectangle that bounds the object belongs to the object Pixels outside that rectangle and inside a larger bounding rectangle belongs to the background
Confidence Measure weak classifier    confidence measure [-∞, +∞]    [0, 1] negative    0 positive    (0, 1]
Mean Shift Mean shift procedure: 1.Choose the radius r for the search window 2.Initialize the location of the window xk, k=1 3.Compute the m.s.v. Mh,k(xk) 4.Translate the window by xk+1= Mh,k(xk)+xk, k=k+1 5.Repeat step 3 and 4 until convergence
Intuitive Description Distribution of identical billiard balls Region of interest Center of mass Mean Shift vector Objective  : Find the densest region This sequence was taken from a presentation of Yaron Ukrainitz & Bernard Sarel on mean shift
Intuitive Description Distribution of identical billiard balls Region of interest Center of mass Mean Shift vector Objective  : Find the densest region
Intuitive Description Distribution of identical billiard balls Region of interest Center of mass Mean Shift vector Objective  : Find the densest region
Intuitive Description Distribution of identical billiard balls Region of interest Center of mass Mean Shift vector Objective  : Find the densest region
Intuitive Description Distribution of identical billiard balls Region of interest Center of mass Mean Shift vector Objective  : Find the densest region
Intuitive Description Distribution of identical billiard balls Region of interest Center of mass Mean Shift vector Objective  : Find the densest region
Intuitive Description Distribution of identical billiard balls Region of interest Center of mass Objective  : Find the densest region
Integration over time 1 3 2 4 5 6
Integration over time 1 3 2 4 5 6
Learning by AdaBoost For each weak learner, determine its error rate, given as:  w i   is pixel weight,  h t (x i )  is classification of pixel  x i ,  y i  is the proper label of pixel  x i
Learning by AdaBoost Weight of each weak learner is determined by error rate:  Classifiers with better performance have higher weights Classifiers with error rates at %50 or higher are no better than chance, and are discarded
Learning by AdaBoost Pixel weights are increased if pixel is misclassified: Weight updated as:
Learning by AdaBoost After all weak learners are properly weighted, strong classifier defined as:
Outlier Rejection AdaBoost is sensitive to outliers Threshold is used to change labels of “difficult” examples:
Multi-resolution Tracking Scale the image to half and quarter Use different ensembles independently to create confidence maps Average all the maps to one map on which the mean shift is performed
Tracking results
Tracking results
Changing weights Frame 10 Frame 40 Frame 70 Color Histogram Weak classifiers
Comparing fixed Vs. dynamic ensemble
Handle moving camera
even grayscale sequences
Handling Occlusions
Ensemble tracking Questions?
Summary Two algorithms: Offline learning vs. online learning Classifiers of SV vs. ensemble of weak classifiers Both use multi-resolution methods
The End

More Related Content

PDF
Review: Incremental Few-shot Instance Segmentation [CDM]
PPTX
Object detection - RCNNs vs Retinanet
PDF
How much position information do convolutional neural networks encode? review...
PDF
Review: You Only Look One-level Feature
PDF
Deep Learning for Computer Vision: Object Detection (UPC 2016)
PDF
Deformable DETR Review [CDM]
PDF
ViT (Vision Transformer) Review [CDM]
PDF
Deep learning based object detection basics
Review: Incremental Few-shot Instance Segmentation [CDM]
Object detection - RCNNs vs Retinanet
How much position information do convolutional neural networks encode? review...
Review: You Only Look One-level Feature
Deep Learning for Computer Vision: Object Detection (UPC 2016)
Deformable DETR Review [CDM]
ViT (Vision Transformer) Review [CDM]
Deep learning based object detection basics

What's hot (20)

PDF
Mask-RCNN for Instance Segmentation
PDF
[Review] BoxInst: High-Performance Instance Segmentation with Box Annotations...
PDF
Object Detection Using R-CNN Deep Learning Framework
PDF
CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018
PDF
Transformer in Computer Vision
PDF
Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...
PPTX
Object Detection using Deep Neural Networks
PPTX
Object Detection Methods using Deep Learning
PDF
YolactEdge Review [cdm]
PDF
Review : Prototype Mixture Models for Few-shot Semantic Segmentation
PDF
Recent Object Detection Research & Person Detection
PDF
Object Segmentation (D2L7 Insight@DCU Machine Learning Workshop 2017)
PDF
Recurrent Instance Segmentation (UPC Reading Group)
PPTX
Visual Object Analysis using Regions and Local Features
PDF
Class Weighted Convolutional Features for Image Retrieval
PDF
object detection paper review
PDF
Detection focal loss 딥러닝 논문읽기 모임 발표자료
PPTX
Deep image retrieval - learning global representations for image search - ub ...
PPTX
Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...
Mask-RCNN for Instance Segmentation
[Review] BoxInst: High-Performance Instance Segmentation with Box Annotations...
Object Detection Using R-CNN Deep Learning Framework
CNN vs SIFT-based Visual Localization - Laura Leal-Taixé - UPC Barcelona 2018
Transformer in Computer Vision
Review : PolarMask: Single Shot Instance Segmentation with Polar Representati...
Object Detection using Deep Neural Networks
Object Detection Methods using Deep Learning
YolactEdge Review [cdm]
Review : Prototype Mixture Models for Few-shot Semantic Segmentation
Recent Object Detection Research & Person Detection
Object Segmentation (D2L7 Insight@DCU Machine Learning Workshop 2017)
Recurrent Instance Segmentation (UPC Reading Group)
Visual Object Analysis using Regions and Local Features
Class Weighted Convolutional Features for Image Retrieval
object detection paper review
Detection focal loss 딥러닝 논문읽기 모임 발표자료
Deep image retrieval - learning global representations for image search - ub ...
Vision Transformer(ViT) / An Image is Worth 16*16 Words: Transformers for Ima...
Ad

Similar to Shai Avidan's Support vector tracking and ensemble tracking (20)

PDF
机器学习Adaboost
PPT
Action Recognition (Thesis presentation)
PPT
Support Vector Machines
PPT
20100822 computervision veksler
PPTX
Introduction to convolutional networks .pptx
PPT
iccv2009 tutorial: boosting and random forest - part III
PPTX
Super Resolution of Image
PPT
CBIR Final project1
PDF
Computer Vision Computer Vision: Algorithms and Applications Richard Szeliski
PDF
MLIP - Chapter 5 - Detection, Segmentation, Captioning
PDF
Fast Object Recognition from 3D Depth Data with Extreme Learning Machine
PPT
Digital Image Processing UNIT-2.ppt
PDF
Performance Evaluation of Object Tracking Technique Based on Position Vectors
PDF
PCSJ-IMPS2024招待講演「動作認識と動画像符号化」2024年度画像符号化シンポジウム(PCSJ 2024) 2024年度映像メディア処理シンポジ...
PPT
Raskar Keynote at Stereoscopic Display Jan 2011
PPTX
ML-Lec-17-SVM,sshwqw - Non-Linear (1).pptx
PPTX
CNN and its applications by ketaki
PDF
Object Tracking with Instance Matching and Online Learning
PDF
Overview of Convolutional Neural Networks
PDF
D3L4-objects.pdf
机器学习Adaboost
Action Recognition (Thesis presentation)
Support Vector Machines
20100822 computervision veksler
Introduction to convolutional networks .pptx
iccv2009 tutorial: boosting and random forest - part III
Super Resolution of Image
CBIR Final project1
Computer Vision Computer Vision: Algorithms and Applications Richard Szeliski
MLIP - Chapter 5 - Detection, Segmentation, Captioning
Fast Object Recognition from 3D Depth Data with Extreme Learning Machine
Digital Image Processing UNIT-2.ppt
Performance Evaluation of Object Tracking Technique Based on Position Vectors
PCSJ-IMPS2024招待講演「動作認識と動画像符号化」2024年度画像符号化シンポジウム(PCSJ 2024) 2024年度映像メディア処理シンポジ...
Raskar Keynote at Stereoscopic Display Jan 2011
ML-Lec-17-SVM,sshwqw - Non-Linear (1).pptx
CNN and its applications by ketaki
Object Tracking with Instance Matching and Online Learning
Overview of Convolutional Neural Networks
D3L4-objects.pdf
Ad

More from wolf (13)

PPT
Eigenfaces and Fisherfaces
PPT
Constellation Models and Unsupervised Learning for Object Class Recognition
PPT
A bayesian framework for unsupervised one-shot learning of object categories
PPT
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
PPT
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Fe...
PPT
Recovering 3D human body configurations using shape contexts
PPT
Rafi Zachut's slides on class specific segmentation
PPT
Avihu Efrat's Viola and Jones face detection slides
PPT
Ala Stolpnik's Standard Model talk
PPT
Michal Erel's SIFT presentation
PPT
Gil Shapira's Active Appearance Model slides
PPT
Moshe Guttmann's slides on eigenface
PPT
Object recognition seminar S2006E01
Eigenfaces and Fisherfaces
Constellation Models and Unsupervised Learning for Object Class Recognition
A bayesian framework for unsupervised one-shot learning of object categories
PCA-SIFT: A More Distinctive Representation for Local Image Descriptors
The Pyramid Match Kernel: Discriminative Classification with Sets of Image Fe...
Recovering 3D human body configurations using shape contexts
Rafi Zachut's slides on class specific segmentation
Avihu Efrat's Viola and Jones face detection slides
Ala Stolpnik's Standard Model talk
Michal Erel's SIFT presentation
Gil Shapira's Active Appearance Model slides
Moshe Guttmann's slides on eigenface
Object recognition seminar S2006E01

Recently uploaded (20)

PDF
Empathic Computing: Creating Shared Understanding
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PPTX
sap open course for s4hana steps from ECC to s4
PPTX
Spectroscopy.pptx food analysis technology
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Unlocking AI with Model Context Protocol (MCP)
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
PDF
Encapsulation theory and applications.pdf
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PDF
Reach Out and Touch Someone: Haptics and Empathic Computing
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...
Empathic Computing: Creating Shared Understanding
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
The AUB Centre for AI in Media Proposal.docx
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
sap open course for s4hana steps from ECC to s4
Spectroscopy.pptx food analysis technology
Review of recent advances in non-invasive hemoglobin estimation
Unlocking AI with Model Context Protocol (MCP)
20250228 LYD VKU AI Blended-Learning.pptx
NewMind AI Weekly Chronicles - August'25 Week I
Diabetes mellitus diagnosis method based random forest with bat algorithm
Encapsulation theory and applications.pdf
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Encapsulation_ Review paper, used for researhc scholars
Digital-Transformation-Roadmap-for-Companies.pptx
Reach Out and Touch Someone: Haptics and Empathic Computing
Network Security Unit 5.pdf for BCA BBA.
Chapter 3 Spatial Domain Image Processing.pdf
Agricultural_Statistics_at_a_Glance_2022_0.pdf
TokAI - TikTok AI Agent : The First AI Application That Analyzes 10,000+ Vira...

Shai Avidan's Support vector tracking and ensemble tracking