SlideShare a Scribd company logo
Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014
DOI : 10.5121/sipij.2014.5211 119
DESIGN AND IMPLEMENTATION OF VIDEO
TRACKING SYSTEM BASED ON CAMERA FIELD
OF VIEW
*Hassan E. El-Taher, **Kasim M. Al-hity, and Mubarak M.Ahmed***
*ZEC Company Cooperation with ***Sudan University of Science and Technology
**Omdurman Islamic University-Faculty of Engineering Science.
ABSTRACT
The basic idea of this paper is to design and implement of video tracking system based on Camera Field of
View (CFOV), Otsu’s method was used to detect targets such as vehicles and people. Whereas most
algorithms were spent a lot of time to execute the process, an algorithm was developed to achieve it in a
little time. The histogram projection was used in both directional to detect target from search region,
which is robust to various light conditions in Charge Couple Device (CCD) camera images and saves
computation time.
Our algorithm based on background subtraction, and normalize cross correlation operation from a series
of sequential sub images can estimate the motion vector. Camera field of view (CFOV) was determined and
calibrated to find the relation between real distance and image distance. The system was tested by
measuring the real position of object in the laboratory and compares it with the result of computed one. So
these results are promising to develop the system in future.
KEYWORDS
Camera field of view, Image segmentation, Normalize Cross Correlation, Otsu’s method.
1. INTRODUCTION
Compared to radar and laser tracking systems, the applications of image processing are modern
tracking technology. The image processing technology was used on the static images in the field
of the meteorology application [1,2]. In the last decade, dynamic image processing was
extensively used in different fields, especially, pattern recognition is very important for
automation in industrial applications. The processing of static images and dynamic images is
different. The processing of a series of images is the base of video image processing. So far,
many methods for dynamic image processing such as cross-correlation matching, template
matching, frame difference and motion vector estimation have been done. Visual servo can be
considered the fusion of computer vision, robotics and control. There are two basic approaches to
visual servo control: image-based visual servo and position-based visual servo [3]. Each approach
brings the obvious vision problem: Feddema and Mitchell used the image Jacobian method based
on the motion of points in the image to relate image-plane velocity of a point to the relative
Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014
120
velocity of the point with respect to the camera. With the progress of research in vision servo,
each has its own advantages in specific application areas [4]. In this paper, a video tracking
system based on Normalize cross correlation and background subtraction was described, which
executes a series of operations from obtaining sub images and processing them to correspond
with the target position.
The tracking algorithms used in the system is based on rejecting the background from a certain
region of interest to highlight the target. This is achieved by manually placing a track-window
around the target to specify the region of interest. The boundaries of the tracking-window are
processed and the corresponding intensity values are marked as the background region. A
histogram of the intensities for the region inside the tracking-window is made and everything
inside it that has an intensity level different from the range of background intensities is nominated
as the target [5].
Video tracking system is usually composed of image acquisition, image processing using
MatLab, PC. The image acquisition includes MINI CCD camera and interface card (Frame
Grabber). The cross-correlation algorithm was developed to get the motion vectors. System was
used a MINI CCD camera which is restricted by video rates, e.g. 30 Hz (NTSC) or 25 Hz (PAL).
In this paper, Camera field of view (CFOV) was determined and calibrated to find the relation
between real distance and image distance.
2. IMAGE SEGMENTATION ALGORITHMS
Segmentation involves separating an image into regions (or their contours) corresponding to
objects. Background can be identified by identifying differences between regions. A natural way
to segment such regions is through thresholding to separate light and dark regions. Thresholding
creates binary images from gray level ones by turning all pixels below some threshold to zero and
all pixels above that threshold to one. If g(x, y) is a thresholding binary image of f(x, y) at some
global threshold T [6].
Extraneous pixels that aren’t part of the desired region can be included, and isolated pixels within
the region can be lost (especially near the boundaries of the region). These effects get worse as
the noise gets worse, simply because it’s more likely that pixels intensity doesn’t represent the
normal intensity in the region. Sometimes some information can be lost and sometimes getting
too many extraneous background pixels. Another problem with global thresholding is that
changes in illumination across the scene may cause some parts to be brighter and some parts
darker. It can deal, at least in part, with such uneven illumination by determining thresholds
locally. That is, instead of having a single global threshold, the threshold was allowing itself to
smoothly vary across the image.
To set a global threshold or to adapt a local threshold to an area, usually one have to look at the
histogram to see if one can find two or more distinct modes for the foreground and for the
background. Converting a grayscale image to monochrome is a common image processing task.
Otsu's method, named after its inventor Nobuyuki Otsu, is one of many binarization algorithms
Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014
121
[6]. This section describes how the algorithm works and provides a Matlab implementation,
which can be easily ported to other languages [7,8,9,10].The result of this algorithm is shown in
Figure (1).
Figure (1) Difference values Otsu threshold and image histogram (Plane target)
3. TRACKING ALGORITHMS
3.1 Background Subtraction Algorithm
The basic idea underlying the background subtraction is image differentiation between the
present frame image which contains the subject of interest and the reference images [11]. Thus, a
set of background image frames will be functioned as a reference set of images. In this work, a
better technique involves the background model as mentioned in [12] is applied. Next, each of the
sequence frames from the input video that consist of object motion will be processed to extract
the object boundaries. The previous background model is used to segment out the foreground
pixels from the background scene. By taking the Otsu thresholding value, this was calculated in
section (2). It is well known that thresholding is an important aspect in background subtraction
procedure and it is quite crucial to determine the most suitable threshold value. Therefore,
morphological processes are applied for a more perfect silhouette. Alternatively, morphology is a
broad set of image processing operations that process images based on shapes [13]. In this work,
Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014
122
morphological operations namely opening and closing were applied. Both of these operations can
be manipulated using single or combination of specific structuring elements.
The main purpose of this step is to filter the presence of noisy pixels in the foreground image as
shown in Figure (2).
Figure (2) Background subtraction tracking algorithm
3.2 Normalize Cross Correlation (NCC)
The template matching method is simplest and widely used for detecting targets in image
information, which is obtained from camera. The Template matching method is searching for the
most similar frame target in the image with a template image obtained from prearranged
information. It is important what similarity index is used for comparing the target with input
frame [14]. The correlation process entails the correlation of frame and target. The Region of
Interest (ROI) is a 101x101 pixel area from a given image segment and target is a 34x34 pixel
area from an adjacent overlapping image segment.
When correlating the frame and target window using the correlation equation, a similarity in
feature information will exist [15]. The correlation equation is as shown in equation (2).
Where f(x,y) denote the intensity value of the ROI of size at point (x,y), x ε{0,…, -1},y ε{0,…, }.
The target is represented by a given template T of size .Common way to calculate the position ( )
of the target in the ROI is to evaluate the normalized cross correlation value NCC at each point
(u,v) for f and the target T, which has been shifted by u steps in the x direction and by v steps in
the y direction[16].
denotes the mean value of f(x,y) within the area of target T shifted to (u,v). When feature
information is identical within the frame and target, the correlation coefficient will be 1, which is
denoting an exact matching. Figure (3) shows flow chart of algorithm.
Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014
123
Figure (3) Flow chart of NCC algorithm
4. MOTION VECTOR
Because the position is the horizontal component and the vertical
component of the orthogonal coordinate, a transformation is used to transform the motion vector
into the moving angles for motion control. The transformation in the pan direction is depicted in
Figure (4) where the image plane is parallel to central plane of lens and the target moves in any
direction. The locations of moving target are points A and B at time and tn , respectively.
The target moving in real distances Dr so, to calculate the moving target in image using the
formula:
Angles were calculated during the tracking mode equation (7) & (8) show that, the
target was controlled at central point of image plane. In general case, the moving target is
Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014
124
faraway from the central plane of lens; therefore, the can be treated as θ. Fig.6 are
the errors of tracking deviation between motion vectors and real moving distances in x and y
directions, respectively. The significant meaning of these deviations is that the core of the target
dose not matches the specific sampling points. Especially, the error deviations in the y direction
are more than in the x direction. Exploring the reason, we find out that the illumination in x and y
direction is different, which causes the diameter change of the target image in the different
moving directions. The transformation in the tilt direction has the same fact as in pan direction.
Figure (4) Image plane, central plane of lens, and target moving direction
4.1 Camera Field of View (CFOV)
Camera field of view is shown in Figure (5) the distance between camera and object D, the height
and width of object in image plan is WH and HV, so the vertical field of view is given as:
Where: is angle vertical field of view.
The ratio of VFOV and image height as:
Same as in horizontal field of view.
Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014
125
The ratio of HFOV and image width as:
The ratio of VFOV and image width as:
Figure (5) Shown camera field of view in vertical direction
Pan and tilt angles can be calculated as:
4.2 Target Moving in Image Plane
Table (1) shows the transformation of target in image plane the tracking mode get the distance
measurement in image plane compare it with the real distance in world space.
Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014
126
Table (1) the motion angle, distance and actual moving angle and distances of the tracking target
Note : (-) negative sign means the opposite direction
σTiltRatio = 13.02 pixels per degree, σPanRatio = 11.36 pixels per degree, σTR is a pixels Ratio in x
direction and σPR is a pixels Ratio in y direction. Target distance from Camera = 400 mm.
5. EXPERIMENTAL RESULTS
To test the normalize cross correlation on the scene, different experiments were carried out to test
the proposed algorithm to track object in the image frames to identify any position of target in the
scene (static scene static camera(video file.avi)). Then some testes were carried out to track
objects in real time (static camera moving object).
5. 1. Static Scene Static Camera
This test was used to detect object based on Otsu threshold, because the frame difference method
cannot detect static objects. Estimated positions of the object (based on camera field of view)
with respect to real target positions are summarized in Table.1. All values are measured with
respect to center of camera view. Table.2 and Table.3 show that the measurement positions
(θpan_measurement, θTilt_mesurement ) are close to real postions ( θpan_real, θTilt_real). The algorithm can detect
and segment the background object form foreground. The specified area from multiple moving
objects are appears in the scene. The Otsu algorithm implemented to the segmented object at each
frame. Estimated position (degree) in (x, y) directions used the ratio between number of pixels
and camera field of view using equation (9) and equation (10). The real time object tracking has
special performance of equipment, this system used low cost mini CCD camera it has wide field
of view and short rang it can't detect and track target moving in long rang, it can detect target
moving in short rang also this system has special performance it can work in and out door.
Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014
127
Although system has high accuracy to detect and track target in short rang, it was compared with
the result based on the camera calibration. The results of this system better and fewer
requirements than system used camera calibration.
To detect and track target it should make target moving in track-window in this time system
received command form user to start tracking target, if target moving in high speed may be the
algorithm failure to tracking the target because target moving out of track-window in this case it
better to zoom the track-window until the target found, before that reduce the track-window
again.
Table (2) Summary for estimated and measured Angles values (angles)
Table (3) Summary for estimated and measured position values (distance)
Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014
128
Figure (6) Desired position and measured position in X , Y direction
6. CONCLUSIONS
The real time object tracking based on camera field of view was achieved. The algorithms merge
the histogram projection and normalize cross correlation technique to detect, segment and track
static and moving object in real time. The ratio σTiltRatio and σPanRatio was calculated and
implemented to determine the target distance in world coordinates. Different tests are carried to
verify the validity of the proposed algorithm such as, static scene static camera and moving
object static camera. The results show that, the static and moving object can be detected and
segmented without noise and false detections in the scene. The average processing time was
observed when we used track-window is better than process full frame. It is depending on the
position of the object with respect to the camera view. The key factors to limit the application of
this system in specific areas are the FOV limit of CCD camera and one target detected by the
cross-correlation method at one time. Because of illumination effects on the moving object, it is
not easy to get more precise tracking.
7. FUTURE WORK
These results are promising to develop the system in future it needs to include software and
hardware to get high performance of system.
It is better to use open or close loop control with shaft encoder to know the position of motor
when system connected to other system. Embedded system has high response, speed of
processing and less weight.
The GPS data can be used in one time by interring manually, GPS and digital compass to know
the directional and position.
The System can be integrated with Laser Rang Finder (LRF) to get target distance at each time.
Matlab software was used to implemented algorithms, it has good features to test all algorithms,
but in real time application need to other platform software.
Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014
129
REFERENCES
[1] Bluestien H.B, Prediction of Satellite Cloud Patterns Using Spatial Fourier Transforms, S.B, Massa
chusette Institute of Technology [1971].
[2] Parcak H. Sarah, Satellite Remote Sensing for Archaeology, London & Newyork [2009].
[3] Shiao S.Ying, Design and Implementation of Real-Time Tracking System Based on Vision Servo
Control, Tamkang Journal of Science and Engineering, Vol. 4, No. 1, pp. 45-58 [2001].
[4] Gonzalez C. Rafael & Woods R. E, Digital Image Processing. Addison-Wesley Publishing
Company[1993].
[5] Dr. Greensted A, Digital Image Processing (Java Software). Wikipedia, july [2010].
[6] Ahmed N & Roa K.R, Orthogonal Transforms for Digital Signal Processing, New York Spring
[1975].
[7] Gonzales C. Rafael, Digital Image Processing Using Matlab, Math Work [2004].
[8] Liang Xu, Threshold Selection Method from Gray-Level Histograms, Electro-Technical Laboratory,
Tokyo University(2007), Tokyo, Japan, IEEE Transactions on System, Man, and Cybernetics. [
1979].
[9] Sezgin M. & Sankur B, "Survey over Image Thresholding Techniques and Quantitative Performance
Evaluation", Journal of Electronic Imaging [2003].
[10] Baldini G, Campadelli P, Cozzi D & Lanzarotti R, A Simple and Robust Method for Moving Target
Tracking, roceedings of the International Conference Signal Processing, Pattern Recognition and
Applications (SPPRA2002), Crete, Greece June [2002].
[11] John C.Russ, The Image Processing Hand Book 3rd Edition, ACRC Published in coorperation with
IEEE Press [1998].
[12] Padmini J, Abhishek S & Suman K Mitra, Background Subtraction in Videos Using Bayesian
Learning with Motion Information, Dhirubhai Ambani Institute of Information and Communication
Technology, India [2007].
[13] Jong S. Bae &Taek L.Song, Image Tracking Algorithm Using Template Matching and PSNF-m,
International Journal of Control, Automation, and Systems [June 2008].
[14] JaneenM.Carter, Correlation Evaluation of Intensity and Color Band Images, Morgan State
University [Jan 23 1995].
[15] Kai B & Hanebeck U, Template Matching Using Fast Normalize Cross Correlation, institute of
Automatic control Engineering, technishche university Munchen , Germany [2001].
AUTHORS
HASSAN ELPASERI ELAMIN ELTAHER was born in July 5, 1982, Khartoum, Sudan.
He received his M.Sc. in image processing, College of Science 2012 Sudan University of
Science & technology, Sudan, high diploma in general physic 2007, Sudan University of
Science & technology, B.Sc. in physic Science 2005 Sudan University of Science &
technology, Sudan. Currently he is DIP & Thermal Imaging System Researcher in
institute of Laser, Sudan University.

More Related Content

PDF
Moving object detection using background subtraction algorithm using simulink
PPTX
Background subtraction
PPTX
various methods for image segmentation
PDF
Development of Human Tracking System For Video Surveillance
PDF
AN ENHANCED EDGE ADAPTIVE STEGANOGRAPHY APPROACH USING THRESHOLD VALUE FOR RE...
PDF
Leader Follower Formation Control of Ground Vehicles Using Dynamic Pixel Coun...
PDF
G04743943
PDF
Background Subtraction Algorithm for Moving Object Detection Using Denoising ...
Moving object detection using background subtraction algorithm using simulink
Background subtraction
various methods for image segmentation
Development of Human Tracking System For Video Surveillance
AN ENHANCED EDGE ADAPTIVE STEGANOGRAPHY APPROACH USING THRESHOLD VALUE FOR RE...
Leader Follower Formation Control of Ground Vehicles Using Dynamic Pixel Coun...
G04743943
Background Subtraction Algorithm for Moving Object Detection Using Denoising ...

What's hot (19)

PDF
A ROBUST BACKGROUND REMOVAL ALGORTIHMS USING FUZZY C-MEANS CLUSTERING
DOCX
LEARNING FINGERPRINT RECONSTRUCTION: FROM MINUTIAE TO IMAGE
PDF
An Efficient Block Matching Algorithm Using Logical Image
PDF
Fast Full Search for Block Matching Algorithms
PDF
Tracking Chessboard Corners Using Projective Transformation for Augmented Rea...
PPTX
motion and feature based person tracking in survillance videos
PPTX
A study and comparison of different image segmentation algorithms
PDF
3 d mrf based video tracking in the compressed domain
PDF
3 d mrf based video tracking in the compressed domain
DOC
All optical image processing using third harmonic generation for image correl...
PPTX
Arp zmp
PPTX
Comparison of Segmentation Algorithms and Estimation of Optimal Segmentation ...
PDF
Ijnsa050207
PDF
Online framework for video stabilization
PDF
K-Means Clustering in Moving Objects Extraction with Selective Background
PDF
Video Shot Boundary Detection Using The Scale Invariant Feature Transform and...
PDF
A Comparison of Block-Matching Motion Estimation Algorithms
PDF
Fahad Fazal Elahi Guraya
PDF
Unsupervised region of interest
A ROBUST BACKGROUND REMOVAL ALGORTIHMS USING FUZZY C-MEANS CLUSTERING
LEARNING FINGERPRINT RECONSTRUCTION: FROM MINUTIAE TO IMAGE
An Efficient Block Matching Algorithm Using Logical Image
Fast Full Search for Block Matching Algorithms
Tracking Chessboard Corners Using Projective Transformation for Augmented Rea...
motion and feature based person tracking in survillance videos
A study and comparison of different image segmentation algorithms
3 d mrf based video tracking in the compressed domain
3 d mrf based video tracking in the compressed domain
All optical image processing using third harmonic generation for image correl...
Arp zmp
Comparison of Segmentation Algorithms and Estimation of Optimal Segmentation ...
Ijnsa050207
Online framework for video stabilization
K-Means Clustering in Moving Objects Extraction with Selective Background
Video Shot Boundary Detection Using The Scale Invariant Feature Transform and...
A Comparison of Block-Matching Motion Estimation Algorithms
Fahad Fazal Elahi Guraya
Unsupervised region of interest
Ad

Viewers also liked (19)

PDF
Face detection using the 3 x3 block rank patterns of gradient magnitude images
PDF
Fast nas rif algorithm using iterative conjugate gradient method
PDF
A new hybrid method for the segmentation of the brain mris
PDF
Intelligent indoor mobile robot navigation using stereo vision
PPTX
Hcc presentation deck for corporates
PPTX
Isaac newton
PPTX
Happy Camera Club
PPTX
The flamingo fiasco
PDF
Cyber nanny cyber patrol
PDF
A comparative study of histogram equalization based image enhancement techniq...
PDF
HappyCameraClub Offering to NGO's
PDF
Happy Camera Club report for Ashwini Charitable Trust Workshop April - May 2013
PDF
Immersive 3 d visualization of remote sensing data
PDF
Vehicle detection and tracking techniques a concise review
PDF
Extraction of spots in dna microarrays using genetic algorithm
PPT
Cloud computing un fenomeno che potrebbe cambiare il mondo - presentazione te...
PPTX
Energia formak
PPTX
neuromarketing_ verità e bugie su quello che compriamo
PPTX
Energia iturri berriztagarriak eta berriztaezinak
Face detection using the 3 x3 block rank patterns of gradient magnitude images
Fast nas rif algorithm using iterative conjugate gradient method
A new hybrid method for the segmentation of the brain mris
Intelligent indoor mobile robot navigation using stereo vision
Hcc presentation deck for corporates
Isaac newton
Happy Camera Club
The flamingo fiasco
Cyber nanny cyber patrol
A comparative study of histogram equalization based image enhancement techniq...
HappyCameraClub Offering to NGO's
Happy Camera Club report for Ashwini Charitable Trust Workshop April - May 2013
Immersive 3 d visualization of remote sensing data
Vehicle detection and tracking techniques a concise review
Extraction of spots in dna microarrays using genetic algorithm
Cloud computing un fenomeno che potrebbe cambiare il mondo - presentazione te...
Energia formak
neuromarketing_ verità e bugie su quello che compriamo
Energia iturri berriztagarriak eta berriztaezinak
Ad

Similar to Design and implementation of video tracking system based on camera field of view (20)

PDF
I0343065072
PDF
A Moving Target Detection Algorithm Based on Dynamic Background
PDF
A Novel Background Subtraction Algorithm for Dynamic Texture Scenes
PDF
IMAGE RECOGNITION USING MATLAB SIMULINK BLOCKSET
PDF
A ROS IMPLEMENTATION OF THE MONO-SLAM ALGORITHM
DOCX
Motion Object Detection Using BGS Technique
DOCX
Motion Object Detection Using BGS Technique
PDF
Image Recognition Using MATLAB Simulink Blockset
PDF
Shot Boundary Detection In Videos Sequences Using Motion Activities
PDF
J017377578
PDF
Real-time Moving Object Detection using SURF
PDF
Ay33292297
PDF
Ay33292297
PDF
SENSITIVITY OF A VIDEO SURVEILLANCE SYSTEM BASED ON MOTION DETECTION
PDF
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
PDF
Kurmi 2015-ijca-905317
PDF
Performance analysis on color image mosaicing techniques on FPGA
PDF
A Novel Approach for Tracking with Implicit Video Shot Detection
PDF
Automated traffic sign board
PDF
Matching algorithm performance analysis for autocalibration method of stereo ...
I0343065072
A Moving Target Detection Algorithm Based on Dynamic Background
A Novel Background Subtraction Algorithm for Dynamic Texture Scenes
IMAGE RECOGNITION USING MATLAB SIMULINK BLOCKSET
A ROS IMPLEMENTATION OF THE MONO-SLAM ALGORITHM
Motion Object Detection Using BGS Technique
Motion Object Detection Using BGS Technique
Image Recognition Using MATLAB Simulink Blockset
Shot Boundary Detection In Videos Sequences Using Motion Activities
J017377578
Real-time Moving Object Detection using SURF
Ay33292297
Ay33292297
SENSITIVITY OF A VIDEO SURVEILLANCE SYSTEM BASED ON MOTION DETECTION
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
Kurmi 2015-ijca-905317
Performance analysis on color image mosaicing techniques on FPGA
A Novel Approach for Tracking with Implicit Video Shot Detection
Automated traffic sign board
Matching algorithm performance analysis for autocalibration method of stereo ...

Recently uploaded (20)

PDF
Advanced methodologies resolving dimensionality complications for autism neur...
PDF
Modernizing your data center with Dell and AMD
PDF
Network Security Unit 5.pdf for BCA BBA.
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Encapsulation_ Review paper, used for researhc scholars
PPT
“AI and Expert System Decision Support & Business Intelligence Systems”
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PDF
Mobile App Security Testing_ A Comprehensive Guide.pdf
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
Empathic Computing: Creating Shared Understanding
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PPTX
MYSQL Presentation for SQL database connectivity
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
PDF
NewMind AI Weekly Chronicles - August'25 Week I
PDF
NewMind AI Monthly Chronicles - July 2025
PDF
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...
Advanced methodologies resolving dimensionality complications for autism neur...
Modernizing your data center with Dell and AMD
Network Security Unit 5.pdf for BCA BBA.
Building Integrated photovoltaic BIPV_UPV.pdf
CIFDAQ's Market Insight: SEC Turns Pro Crypto
The AUB Centre for AI in Media Proposal.docx
Encapsulation_ Review paper, used for researhc scholars
“AI and Expert System Decision Support & Business Intelligence Systems”
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Mobile App Security Testing_ A Comprehensive Guide.pdf
Spectral efficient network and resource selection model in 5G networks
Empathic Computing: Creating Shared Understanding
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
MYSQL Presentation for SQL database connectivity
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
NewMind AI Weekly Chronicles - August'25 Week I
NewMind AI Monthly Chronicles - July 2025
Shreyas Phanse Resume: Experienced Backend Engineer | Java • Spring Boot • Ka...

Design and implementation of video tracking system based on camera field of view

  • 1. Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014 DOI : 10.5121/sipij.2014.5211 119 DESIGN AND IMPLEMENTATION OF VIDEO TRACKING SYSTEM BASED ON CAMERA FIELD OF VIEW *Hassan E. El-Taher, **Kasim M. Al-hity, and Mubarak M.Ahmed*** *ZEC Company Cooperation with ***Sudan University of Science and Technology **Omdurman Islamic University-Faculty of Engineering Science. ABSTRACT The basic idea of this paper is to design and implement of video tracking system based on Camera Field of View (CFOV), Otsu’s method was used to detect targets such as vehicles and people. Whereas most algorithms were spent a lot of time to execute the process, an algorithm was developed to achieve it in a little time. The histogram projection was used in both directional to detect target from search region, which is robust to various light conditions in Charge Couple Device (CCD) camera images and saves computation time. Our algorithm based on background subtraction, and normalize cross correlation operation from a series of sequential sub images can estimate the motion vector. Camera field of view (CFOV) was determined and calibrated to find the relation between real distance and image distance. The system was tested by measuring the real position of object in the laboratory and compares it with the result of computed one. So these results are promising to develop the system in future. KEYWORDS Camera field of view, Image segmentation, Normalize Cross Correlation, Otsu’s method. 1. INTRODUCTION Compared to radar and laser tracking systems, the applications of image processing are modern tracking technology. The image processing technology was used on the static images in the field of the meteorology application [1,2]. In the last decade, dynamic image processing was extensively used in different fields, especially, pattern recognition is very important for automation in industrial applications. The processing of static images and dynamic images is different. The processing of a series of images is the base of video image processing. So far, many methods for dynamic image processing such as cross-correlation matching, template matching, frame difference and motion vector estimation have been done. Visual servo can be considered the fusion of computer vision, robotics and control. There are two basic approaches to visual servo control: image-based visual servo and position-based visual servo [3]. Each approach brings the obvious vision problem: Feddema and Mitchell used the image Jacobian method based on the motion of points in the image to relate image-plane velocity of a point to the relative
  • 2. Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014 120 velocity of the point with respect to the camera. With the progress of research in vision servo, each has its own advantages in specific application areas [4]. In this paper, a video tracking system based on Normalize cross correlation and background subtraction was described, which executes a series of operations from obtaining sub images and processing them to correspond with the target position. The tracking algorithms used in the system is based on rejecting the background from a certain region of interest to highlight the target. This is achieved by manually placing a track-window around the target to specify the region of interest. The boundaries of the tracking-window are processed and the corresponding intensity values are marked as the background region. A histogram of the intensities for the region inside the tracking-window is made and everything inside it that has an intensity level different from the range of background intensities is nominated as the target [5]. Video tracking system is usually composed of image acquisition, image processing using MatLab, PC. The image acquisition includes MINI CCD camera and interface card (Frame Grabber). The cross-correlation algorithm was developed to get the motion vectors. System was used a MINI CCD camera which is restricted by video rates, e.g. 30 Hz (NTSC) or 25 Hz (PAL). In this paper, Camera field of view (CFOV) was determined and calibrated to find the relation between real distance and image distance. 2. IMAGE SEGMENTATION ALGORITHMS Segmentation involves separating an image into regions (or their contours) corresponding to objects. Background can be identified by identifying differences between regions. A natural way to segment such regions is through thresholding to separate light and dark regions. Thresholding creates binary images from gray level ones by turning all pixels below some threshold to zero and all pixels above that threshold to one. If g(x, y) is a thresholding binary image of f(x, y) at some global threshold T [6]. Extraneous pixels that aren’t part of the desired region can be included, and isolated pixels within the region can be lost (especially near the boundaries of the region). These effects get worse as the noise gets worse, simply because it’s more likely that pixels intensity doesn’t represent the normal intensity in the region. Sometimes some information can be lost and sometimes getting too many extraneous background pixels. Another problem with global thresholding is that changes in illumination across the scene may cause some parts to be brighter and some parts darker. It can deal, at least in part, with such uneven illumination by determining thresholds locally. That is, instead of having a single global threshold, the threshold was allowing itself to smoothly vary across the image. To set a global threshold or to adapt a local threshold to an area, usually one have to look at the histogram to see if one can find two or more distinct modes for the foreground and for the background. Converting a grayscale image to monochrome is a common image processing task. Otsu's method, named after its inventor Nobuyuki Otsu, is one of many binarization algorithms
  • 3. Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014 121 [6]. This section describes how the algorithm works and provides a Matlab implementation, which can be easily ported to other languages [7,8,9,10].The result of this algorithm is shown in Figure (1). Figure (1) Difference values Otsu threshold and image histogram (Plane target) 3. TRACKING ALGORITHMS 3.1 Background Subtraction Algorithm The basic idea underlying the background subtraction is image differentiation between the present frame image which contains the subject of interest and the reference images [11]. Thus, a set of background image frames will be functioned as a reference set of images. In this work, a better technique involves the background model as mentioned in [12] is applied. Next, each of the sequence frames from the input video that consist of object motion will be processed to extract the object boundaries. The previous background model is used to segment out the foreground pixels from the background scene. By taking the Otsu thresholding value, this was calculated in section (2). It is well known that thresholding is an important aspect in background subtraction procedure and it is quite crucial to determine the most suitable threshold value. Therefore, morphological processes are applied for a more perfect silhouette. Alternatively, morphology is a broad set of image processing operations that process images based on shapes [13]. In this work,
  • 4. Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014 122 morphological operations namely opening and closing were applied. Both of these operations can be manipulated using single or combination of specific structuring elements. The main purpose of this step is to filter the presence of noisy pixels in the foreground image as shown in Figure (2). Figure (2) Background subtraction tracking algorithm 3.2 Normalize Cross Correlation (NCC) The template matching method is simplest and widely used for detecting targets in image information, which is obtained from camera. The Template matching method is searching for the most similar frame target in the image with a template image obtained from prearranged information. It is important what similarity index is used for comparing the target with input frame [14]. The correlation process entails the correlation of frame and target. The Region of Interest (ROI) is a 101x101 pixel area from a given image segment and target is a 34x34 pixel area from an adjacent overlapping image segment. When correlating the frame and target window using the correlation equation, a similarity in feature information will exist [15]. The correlation equation is as shown in equation (2). Where f(x,y) denote the intensity value of the ROI of size at point (x,y), x ε{0,…, -1},y ε{0,…, }. The target is represented by a given template T of size .Common way to calculate the position ( ) of the target in the ROI is to evaluate the normalized cross correlation value NCC at each point (u,v) for f and the target T, which has been shifted by u steps in the x direction and by v steps in the y direction[16]. denotes the mean value of f(x,y) within the area of target T shifted to (u,v). When feature information is identical within the frame and target, the correlation coefficient will be 1, which is denoting an exact matching. Figure (3) shows flow chart of algorithm.
  • 5. Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014 123 Figure (3) Flow chart of NCC algorithm 4. MOTION VECTOR Because the position is the horizontal component and the vertical component of the orthogonal coordinate, a transformation is used to transform the motion vector into the moving angles for motion control. The transformation in the pan direction is depicted in Figure (4) where the image plane is parallel to central plane of lens and the target moves in any direction. The locations of moving target are points A and B at time and tn , respectively. The target moving in real distances Dr so, to calculate the moving target in image using the formula: Angles were calculated during the tracking mode equation (7) & (8) show that, the target was controlled at central point of image plane. In general case, the moving target is
  • 6. Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014 124 faraway from the central plane of lens; therefore, the can be treated as θ. Fig.6 are the errors of tracking deviation between motion vectors and real moving distances in x and y directions, respectively. The significant meaning of these deviations is that the core of the target dose not matches the specific sampling points. Especially, the error deviations in the y direction are more than in the x direction. Exploring the reason, we find out that the illumination in x and y direction is different, which causes the diameter change of the target image in the different moving directions. The transformation in the tilt direction has the same fact as in pan direction. Figure (4) Image plane, central plane of lens, and target moving direction 4.1 Camera Field of View (CFOV) Camera field of view is shown in Figure (5) the distance between camera and object D, the height and width of object in image plan is WH and HV, so the vertical field of view is given as: Where: is angle vertical field of view. The ratio of VFOV and image height as: Same as in horizontal field of view.
  • 7. Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014 125 The ratio of HFOV and image width as: The ratio of VFOV and image width as: Figure (5) Shown camera field of view in vertical direction Pan and tilt angles can be calculated as: 4.2 Target Moving in Image Plane Table (1) shows the transformation of target in image plane the tracking mode get the distance measurement in image plane compare it with the real distance in world space.
  • 8. Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014 126 Table (1) the motion angle, distance and actual moving angle and distances of the tracking target Note : (-) negative sign means the opposite direction σTiltRatio = 13.02 pixels per degree, σPanRatio = 11.36 pixels per degree, σTR is a pixels Ratio in x direction and σPR is a pixels Ratio in y direction. Target distance from Camera = 400 mm. 5. EXPERIMENTAL RESULTS To test the normalize cross correlation on the scene, different experiments were carried out to test the proposed algorithm to track object in the image frames to identify any position of target in the scene (static scene static camera(video file.avi)). Then some testes were carried out to track objects in real time (static camera moving object). 5. 1. Static Scene Static Camera This test was used to detect object based on Otsu threshold, because the frame difference method cannot detect static objects. Estimated positions of the object (based on camera field of view) with respect to real target positions are summarized in Table.1. All values are measured with respect to center of camera view. Table.2 and Table.3 show that the measurement positions (θpan_measurement, θTilt_mesurement ) are close to real postions ( θpan_real, θTilt_real). The algorithm can detect and segment the background object form foreground. The specified area from multiple moving objects are appears in the scene. The Otsu algorithm implemented to the segmented object at each frame. Estimated position (degree) in (x, y) directions used the ratio between number of pixels and camera field of view using equation (9) and equation (10). The real time object tracking has special performance of equipment, this system used low cost mini CCD camera it has wide field of view and short rang it can't detect and track target moving in long rang, it can detect target moving in short rang also this system has special performance it can work in and out door.
  • 9. Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014 127 Although system has high accuracy to detect and track target in short rang, it was compared with the result based on the camera calibration. The results of this system better and fewer requirements than system used camera calibration. To detect and track target it should make target moving in track-window in this time system received command form user to start tracking target, if target moving in high speed may be the algorithm failure to tracking the target because target moving out of track-window in this case it better to zoom the track-window until the target found, before that reduce the track-window again. Table (2) Summary for estimated and measured Angles values (angles) Table (3) Summary for estimated and measured position values (distance)
  • 10. Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014 128 Figure (6) Desired position and measured position in X , Y direction 6. CONCLUSIONS The real time object tracking based on camera field of view was achieved. The algorithms merge the histogram projection and normalize cross correlation technique to detect, segment and track static and moving object in real time. The ratio σTiltRatio and σPanRatio was calculated and implemented to determine the target distance in world coordinates. Different tests are carried to verify the validity of the proposed algorithm such as, static scene static camera and moving object static camera. The results show that, the static and moving object can be detected and segmented without noise and false detections in the scene. The average processing time was observed when we used track-window is better than process full frame. It is depending on the position of the object with respect to the camera view. The key factors to limit the application of this system in specific areas are the FOV limit of CCD camera and one target detected by the cross-correlation method at one time. Because of illumination effects on the moving object, it is not easy to get more precise tracking. 7. FUTURE WORK These results are promising to develop the system in future it needs to include software and hardware to get high performance of system. It is better to use open or close loop control with shaft encoder to know the position of motor when system connected to other system. Embedded system has high response, speed of processing and less weight. The GPS data can be used in one time by interring manually, GPS and digital compass to know the directional and position. The System can be integrated with Laser Rang Finder (LRF) to get target distance at each time. Matlab software was used to implemented algorithms, it has good features to test all algorithms, but in real time application need to other platform software.
  • 11. Signal & Image Processing : An International Journal (SIPIJ) Vol.5, No.2, April 2014 129 REFERENCES [1] Bluestien H.B, Prediction of Satellite Cloud Patterns Using Spatial Fourier Transforms, S.B, Massa chusette Institute of Technology [1971]. [2] Parcak H. Sarah, Satellite Remote Sensing for Archaeology, London & Newyork [2009]. [3] Shiao S.Ying, Design and Implementation of Real-Time Tracking System Based on Vision Servo Control, Tamkang Journal of Science and Engineering, Vol. 4, No. 1, pp. 45-58 [2001]. [4] Gonzalez C. Rafael & Woods R. E, Digital Image Processing. Addison-Wesley Publishing Company[1993]. [5] Dr. Greensted A, Digital Image Processing (Java Software). Wikipedia, july [2010]. [6] Ahmed N & Roa K.R, Orthogonal Transforms for Digital Signal Processing, New York Spring [1975]. [7] Gonzales C. Rafael, Digital Image Processing Using Matlab, Math Work [2004]. [8] Liang Xu, Threshold Selection Method from Gray-Level Histograms, Electro-Technical Laboratory, Tokyo University(2007), Tokyo, Japan, IEEE Transactions on System, Man, and Cybernetics. [ 1979]. [9] Sezgin M. & Sankur B, "Survey over Image Thresholding Techniques and Quantitative Performance Evaluation", Journal of Electronic Imaging [2003]. [10] Baldini G, Campadelli P, Cozzi D & Lanzarotti R, A Simple and Robust Method for Moving Target Tracking, roceedings of the International Conference Signal Processing, Pattern Recognition and Applications (SPPRA2002), Crete, Greece June [2002]. [11] John C.Russ, The Image Processing Hand Book 3rd Edition, ACRC Published in coorperation with IEEE Press [1998]. [12] Padmini J, Abhishek S & Suman K Mitra, Background Subtraction in Videos Using Bayesian Learning with Motion Information, Dhirubhai Ambani Institute of Information and Communication Technology, India [2007]. [13] Jong S. Bae &Taek L.Song, Image Tracking Algorithm Using Template Matching and PSNF-m, International Journal of Control, Automation, and Systems [June 2008]. [14] JaneenM.Carter, Correlation Evaluation of Intensity and Color Band Images, Morgan State University [Jan 23 1995]. [15] Kai B & Hanebeck U, Template Matching Using Fast Normalize Cross Correlation, institute of Automatic control Engineering, technishche university Munchen , Germany [2001]. AUTHORS HASSAN ELPASERI ELAMIN ELTAHER was born in July 5, 1982, Khartoum, Sudan. He received his M.Sc. in image processing, College of Science 2012 Sudan University of Science & technology, Sudan, high diploma in general physic 2007, Sudan University of Science & technology, B.Sc. in physic Science 2005 Sudan University of Science & technology, Sudan. Currently he is DIP & Thermal Imaging System Researcher in institute of Laser, Sudan University.