SlideShare a Scribd company logo
Manaram Gnanasekera & Nalan Karunanayake
International Journal of Computer Science and Security (IJCSS), Volume (10) : Issue (1) : 2016 1
An Enhanced Computer Vision Based Hand Movement
Capturing System with Stereo Vision
Manaram Gnanasekera manaramcv@gmail.com
Department of Electrical and Computer Engineering
Sri Lanka Institute of Information Technology
Malabe, 10115, Sri Lanka
Nalan Karunanayake nalan91karu@gmail.com
Department of Computer Science
University of Sri Jayawardenapura
Nugegoda, 10250, Sri Lanka
Abstract
This framework is a hand movement capturing method which could be done in three different
depth levels. The algorithm has the capability of capturing and identifying when the hand is
moving up, down, right and left. From these captured movements four signals could be
generated. Moreover, when these hand movements are done, 15cm-75cm, 75cm-100cm, 100cm-
200cm from the camera (3 depth levels), twelve different signals could be generated. These
generated signals could be used for applications such as game controlling (gaming).The existing
method uses an object area based method for depth analysis. The results of the proposed work
shows it has high accuracy compared to the existing method when tested for depth analysis.
Keywords: Computer Vision, Object tracking, Meanshift, Stereo Vision.
1. INTRODUCTION
Human body movement recognition has been one of the main interested research areas among
researchers. Out of many applications of human body movement recognition, gaming is one of
the most benefitted industries out of this technology which had gained huge amounts of success.
Xbox 360 is one of the best examples for this fact.
Among various studies done on human body movement recognition, in [1] a hand movement
capturing system based on the Camshift algorithm has been developed. There are six different
hand movements, namely, hand moving up, down, right, left, back and forth which are captured
by this algorithm. Unlike the other movements a special technique is used to capture hand
moving back and forth. The increase of the area of the hand (in the computer screen) when
moving towards the camera is used to track the forward movement. Similarly the area of the hand
decreases (in the computer screen) when it goes away from the camera. In essence, the area of
the hand is used to find the depth. Although this method has reached success in this research
([1]) the best method for depth analysis in computer vision is Stereo Vision. By motivated in this
fact the work undertaken enhances the algorithm given in [1] by combining Stereo Vision.
Moreover, instead of the forward and backward movements the proposed algorithm introduces a
novel method to capture the up, down, left and right movements in three depth levels (15cm-
75cm, 75cm-100cm and 100cm-200cm). Figure1 explains the scenario.
Manaram Gnanasekera & Nalan Karunanayake
International Journal of Computer Science and Security (IJCSS), Volume (10) : Issue (1) : 2016 2
FIGURE 1: Four movements in three different depth levels.
All together there will be 12 different movements which could be captured. With the aid of these
12 different movements there will be 12 different signals generated. These generated signals
could be used for different applications such as gaming.
In section 2 of the paper, the algorithms which are used to capture the movements of the hand is
discussed. Then in section 3 the depth analyzing method is presented. Finally, in section 4 three
experiments which were conducted in [1] are performed and hence, it is proved that the proposed
method performs well in different depth levels compared to the area based method used in [1].
2. HAND MOVEMENT CAPTURING
The same approach as in [1] will be taken for the object recognition, which is a histogram based
method. The histogram of the hand will be taken and a contour will be drawn around. This
histogram will be unique for the hand (the environment around the hand will have different
histograms). Therefore a contour could be perfectly drawn along the margin of the hand. After
recognizing the object the next task will be to capture the movement of it. An extended version of
the Meanshift tracker [2] named as Camshift [3] has been used for this task. Camshift or
“Continuously Adaptive Meanshift” adjusts the search window size accordingly. At first the
Camshift algorithm closes the initial location of the search window. Then the Meanshift operation
is performed and the zeroth moment is stored. Finally, the search window size will set equal to a
function of the stored zeroth moment. By the aid of the Camshift algorithm, up, down, right and
left movement capturing has been tackled.
3. STEREO VISION DEPTH ANALYSIS
Stereo imaging is a technique which is used for estimating a 3D model to extract the depth
information of the 2D coordinates of a 2D scene by taking two images using two cameras which
are separated horizontally [4]. Furthermore, using the two 2D images can determine the three
dimensional locations of the image points which corresponds to the exact physical points in space
Manaram Gnanasekera & Nalan Karunanayake
International Journal of Computer Science and Security (IJCSS), Volume (10) : Issue (1) : 2016 3
[5]. When considering the two images from the left and right cameras, the horizontal points of the
right image is lower than the horizontal coordinate points in the left image. This obtrusive shift is
called disparity. The disparity obtained from the two images could be used to calculate the 3D
location and the depth information of the particular points since the disparity is inversely
proportional to the depth of the particular point [6].
3.1 Stereo Triangulation
FIGURE 2: Stereo Camera model.
The disparity is,
(1)
Hence the depth of the point is,
Where,
b is the baseline, or distance between the two cameras left and right.
f is the focal length of a cameras.
XA is the X-axis of a camera.
ZA is the optical axis of a camera.
P is the physical point defined by the coordinates X, Y, and Z.
UL is the projection of the real-world physical point P in an image acquired by the left camera.
UR is the projection of the real-world physical point P in an image acquired by the right camera
There are three stereo correspondence algorithms in the current literature as pixel based
methods, area based methods and feature based methods [7, 8].
1
2
Manaram Gnanasekera & Nalan Karunanayake
International Journal of Computer Science and Security (IJCSS), Volume (10) : Issue (1) : 2016 4
In the proposed method the sum of square differences (SSD) block matching method is used as
the correspondence cost function due to it’s simplicity than the other block matching cost
functions [9].
The sum of square difference (SSD) function can be expressed as,
Where,
Il(x,y) and Ir(x,y) are n x n symmetric matrices of the left and right images.
Before starting the stereo matching process, stereo calibration and stereo rectification should be
done for better results [10]. Using the stereo vision technique the depth to the center of the
captured hand could be calculated using (1). As a result the given three rangers (15cm-75cm,
75cm-100cm, 100cm-200cm) could be successfully identified.
4. RESULTS & DISCUSSION
The aim of the proposed work is to enhance the method given in [1] with stereo vision. Therefore
some tests which were conducted in [1] will be performed again in order for comparison with [1].
The tracking accuracy has been taken as a percentage by considering the number of miss-tracks
and correct tracks occurred when tested under different conditions.
4.1 Performance In Different Light Conditions
When the algorithm was tested under different light conditions (different LUX levels) the following
results were observed.
FIGURE 3: LUX level vs Accuracy.
The above observed results are very close to the light condition test’s results in [1], which means
the addition of stereo vision hasn’t been able to increase the performance under different light
conditions.
3
Manaram Gnanasekera & Nalan Karunanayake
International Journal of Computer Science and Security (IJCSS), Volume (10) : Issue (1) : 2016 5
4.2 Usability for a Hand Amputated Person
Algorithm in [1] has extended it’s usability for hand amputated users. A sock is inserted to the
amputated arm and the sock is tracked instead of the hand. Therefore a test is done to check the
accuracy of this method as well. The socked arm will be first kept parallel to the ground
(perpendicular to the camera) and the accuracy is observed. Then the hand will be kept
perpendicular to the ground (parallel to the camera) and the testing will be done. The same tests
will be done for the proposed algorithm to check the usability.
S.H.P.G – Socked Hand Parallel to the Ground
S.H.Pe.G – Socked Hand Perpendicular to the Ground
The observed accuracy plots are as follows,
FIGURE 4: Usability vs Accuracy.
Similar to 4.1 the results of this test is almost equal to the usability test done in [1]. In essence,
the stereo vision based method hasn’t done any contribution towards developing the usability.
4.3 Distance of Tracking
The distance between the hand and the camera is varied or in other words, the hand is kept in
different depth levels from the camera. This test is done to check the sensitiveness of the
algorithm when the object (hand) moves away from the camera.
Manaram Gnanasekera & Nalan Karunanayake
International Journal of Computer Science and Security (IJCSS), Volume (10) : Issue (1) : 2016 6
FIGURE 5: Depth vs Accuracy.
Although the tests conducted in section 4.1 and 4.2 are more likely to the corresponding tests
done in [1], a vast improvement has been shown in the results of this experiment when the depth
is increased. When the experiment is done between 15cm to 75cm the observed accuracy is
about 95%. Whereas for the same experiment in [1] the accuracy is roughly around 90%. When it
is done at 75cm to 100cm the accuracy of the proposed algorithm is around 93% and the
accuracy in [1] is around 60%. When the distance is increased between 100cm to 200cm, an
accuracy of 90% is still observed. However, a very low level of accuracy (20%) has been
observed when the same test is done in [1]. Even though the distance between the hand and the
camera is increased the proposed stereo vision based method has been able to maintain a
tracking accuracy around 90% compared with the method proposed in [1]. The speciality is, even
though the accuracy is significantly dropped in the method given in [1], a 90% accuracy is
maintained by the proposed algorithm.
5. CONCLUSION
In conclusion, the work undertaken enhances the algorithm of “Computer Vision Based Hand
Movement Capturing System” ([1]) by using Stereo Vision. The work presented proposes a hand
movement capturing method which could be done in three depth levels. The captured hand
movements are namely, moving up, down, right and left and this capturing could be done in
15cm-75cm, 75cm-100cm and 100cm-200cm from the camera. Camshift algorithm is used to
track the basic movements of the hand. Three tests which were conducted in [1] were performed
back for the purpose of comparison. The experiment done to investigate the performance in
different light conditions had no difference between the proposed method and [1] when the results
were compared. Similar results were observed when the algorithm was tested with an amputated
hand. However, when the distance with the camera was varied (depth level) the proposed
method has shown more than 90% accuracy. This is a vast improvement compared to the results
observed in [1]. Finally there are six different movements captured in [1] which generates six
different signals. The proposed method could generate twelve (twice) signals with the captured
twelve movements.
6. FUTURE WORK
As future expansions this algorithm will be modified for a full body tracking in different depth
levels. Furthermore it is also planned to increase the number of depth levels.
Manaram Gnanasekera & Nalan Karunanayake
International Journal of Computer Science and Security (IJCSS), Volume (10) : Issue (1) : 2016 7
7. REFERENCES
[1] M. Gnanasekera, "Computer Vision Based Hand Movement Capturing System," in The
8th International Conference on Computer Science & Education (ICCSE 2013), Colombo,
2013.
[2] D. Comaniciu et al, "Real-Time Tracking of Non-Rigid Objects using Mean Shift," in
Conference on Computer Vision and Pattern Recognition (CVPR), Hilton Head, 2000.
[3] G. R. Bradski, "Computer Vision Face Tracking for Use in a Perceptual User Interface,"
Microcomputer Research Lab, Santa Clara, 2002.
[4] Richard Szeliski. “Stereo Correspondence” in Computer Vision: Algorithms and Applications,
1st ed., vol. 1, Springer 2010, pp.537-571.
[5] Myron Z. Brown, Darius Burschka, “Advances in Computational Stereo” IEEE Transactions
on Pattern Analysis and Machine Intelligence, vol. 25, pp.993-1008, August, 2003.
[6] Samarath Brahmbhatt, “3D Geometry and Stereo Vision” in Practical OpenCV,1st ed., vol 1,
Apress 2013, pp. 173-200.
[7] U. R Dhond., J. K. Aggarwal, “Structure from Stereo.” IEEE Transaction on Systems, Man
and Cybernatics, vol. 19, pp. 1489-1510, November 1989.
[8] Vibin N. Valsan and C.Y Patil. “A System on Chip based Stereo Vision Approach for Disparity
Measurement,” in Proc. 2015 International Conference on Industrial Instrumentation and
Control (ICIC), 2015, pp. 1284-1287.
[9] Rong Xiang, Tao Hong, Ming Zhou. “Analysis of Depth Measurement Errors of Tomatoes
Using Binocular Stereo Vision Based on Single Factor Experiments,” in Proc. 13th
International Conference on Control, Automation, Robotics & Vision (ICARCV), December
2014, pp. 88-93.
[10] G. Balakrishnan, G. Sainarayanan, R. Nagarajan “Stereo Image Sonification for Blind
Navigation”, Tamkang Journal of Science and Engineering, vol. 10, pp. 67_76, 2007.

More Related Content

PDF
Hazard Detection Algorithm for Safe Autonomous Landing
PDF
Enhanced Algorithm for Obstacle Detection and Avoidance Using a Hybrid of Pla...
PDF
An automatic algorithm for object recognition and detection based on asift ke...
PPTX
Implementing Camshift on a Mobile Robot for Person Tracking and Pursuit_ICDM
PDF
Estimation of Terrain Gradient Conditions & Obstacle Detection Using a Monocu...
PDF
Matching algorithm performance analysis for autocalibration method of stereo ...
PDF
Motion Human Detection & Tracking Based On Background Subtraction
PDF
The flow of baseline estimation using a single omnidirectional camera
Hazard Detection Algorithm for Safe Autonomous Landing
Enhanced Algorithm for Obstacle Detection and Avoidance Using a Hybrid of Pla...
An automatic algorithm for object recognition and detection based on asift ke...
Implementing Camshift on a Mobile Robot for Person Tracking and Pursuit_ICDM
Estimation of Terrain Gradient Conditions & Obstacle Detection Using a Monocu...
Matching algorithm performance analysis for autocalibration method of stereo ...
Motion Human Detection & Tracking Based On Background Subtraction
The flow of baseline estimation using a single omnidirectional camera

What's hot (20)

PDF
Real-time Moving Object Detection using SURF
PDF
Feature extraction based retrieval of
DOC
Research Paper v2.0
PDF
02_atiqa ijaz khan_05_2014
PDF
Final Paper
PDF
Image Segmentation using Otsu's Method - Computer Graphics (UCS505) Project R...
PPT
Photogrammetry - areaotriangulation
PDF
Efficient 3D stereo vision stabilization for multi-camera viewpoints
PDF
Single Image Fog Removal Based on Fusion Strategy
PDF
V.KARTHIKEYAN PUBLISHED ARTICLE 1
PPTX
Scanning 3 d full human bodies using kinects
PPTX
Camshaft
PPTX
Two Dimensional Image Reconstruction Algorithms
PPT
CenterForDomainSpecificComputing-Poster
PDF
Passive stereo vision with deep learning
PDF
Fast and High-Precision 3D Tracking and Position Measurement with MEMS Microm...
PPTX
Human action recognition with kinect using a joint motion descriptor
PPTX
Image Segmentation using Otsu's Method - Computer Graphics (UCS505) Project PPT
PPTX
Image Reconstruction in Computed Tomography
PDF
A new hybrid method for the segmentation of the brain mris
Real-time Moving Object Detection using SURF
Feature extraction based retrieval of
Research Paper v2.0
02_atiqa ijaz khan_05_2014
Final Paper
Image Segmentation using Otsu's Method - Computer Graphics (UCS505) Project R...
Photogrammetry - areaotriangulation
Efficient 3D stereo vision stabilization for multi-camera viewpoints
Single Image Fog Removal Based on Fusion Strategy
V.KARTHIKEYAN PUBLISHED ARTICLE 1
Scanning 3 d full human bodies using kinects
Camshaft
Two Dimensional Image Reconstruction Algorithms
CenterForDomainSpecificComputing-Poster
Passive stereo vision with deep learning
Fast and High-Precision 3D Tracking and Position Measurement with MEMS Microm...
Human action recognition with kinect using a joint motion descriptor
Image Segmentation using Otsu's Method - Computer Graphics (UCS505) Project PPT
Image Reconstruction in Computed Tomography
A new hybrid method for the segmentation of the brain mris
Ad

Similar to An Enhanced Computer Vision Based Hand Movement Capturing System with Stereo Vision (20)

PDF
A Review Paper on Stereo Vision Based Depth Estimation
PDF
Stereo matching algorithm using census transform and segment tree for depth e...
PDF
Vision based non-invasive tool for facial swelling assessment
PDF
Stereo matching based on absolute differences for multiple objects detection
PPTX
Concept of stereo vision based virtual touch
PDF
Improving image resolution through the cra algorithm involved recycling proce...
PDF
IMPROVING IMAGE RESOLUTION THROUGH THE CRA ALGORITHM INVOLVED RECYCLING PROCE...
PDF
An Assessment of Image Matching Algorithms in Depth Estimation
PDF
A new function of stereo matching algorithm based on hybrid convolutional neu...
PDF
A new function of stereo matching algorithm based on hybrid convolutional neu...
PDF
Development of stereo matching algorithm based on sum of absolute RGB color d...
PDF
A deep learning based stereo matching model for autonomous vehicle
PDF
Gesture Recognition Based Video Game Controller
PDF
G0342039042
PDF
6 - Conception of an Autonomous UAV using Stereo Vision (presented in an Indo...
PDF
Developing 3D Viewing Model from 2D Stereo Pair with its Occlusion Ratio
PDF
Visual Odometry using Stereo Vision
PDF
A novel tool for stereo matching of images
PDF
A novel tool for stereo matching of images
PDF
A novel tool for stereo matching of images
A Review Paper on Stereo Vision Based Depth Estimation
Stereo matching algorithm using census transform and segment tree for depth e...
Vision based non-invasive tool for facial swelling assessment
Stereo matching based on absolute differences for multiple objects detection
Concept of stereo vision based virtual touch
Improving image resolution through the cra algorithm involved recycling proce...
IMPROVING IMAGE RESOLUTION THROUGH THE CRA ALGORITHM INVOLVED RECYCLING PROCE...
An Assessment of Image Matching Algorithms in Depth Estimation
A new function of stereo matching algorithm based on hybrid convolutional neu...
A new function of stereo matching algorithm based on hybrid convolutional neu...
Development of stereo matching algorithm based on sum of absolute RGB color d...
A deep learning based stereo matching model for autonomous vehicle
Gesture Recognition Based Video Game Controller
G0342039042
6 - Conception of an Autonomous UAV using Stereo Vision (presented in an Indo...
Developing 3D Viewing Model from 2D Stereo Pair with its Occlusion Ratio
Visual Odometry using Stereo Vision
A novel tool for stereo matching of images
A novel tool for stereo matching of images
A novel tool for stereo matching of images
Ad

Recently uploaded (20)

PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PPTX
Lesson notes of climatology university.
PDF
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
Classroom Observation Tools for Teachers
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PDF
Insiders guide to clinical Medicine.pdf
PPTX
Institutional Correction lecture only . . .
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
01-Introduction-to-Information-Management.pdf
102 student loan defaulters named and shamed – Is someone you know on the list?
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Lesson notes of climatology university.
Chapter 2 Heredity, Prenatal Development, and Birth.pdf
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
Final Presentation General Medicine 03-08-2024.pptx
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
FourierSeries-QuestionsWithAnswers(Part-A).pdf
STATICS OF THE RIGID BODIES Hibbelers.pdf
Classroom Observation Tools for Teachers
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
Insiders guide to clinical Medicine.pdf
Institutional Correction lecture only . . .
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
Pharmacology of Heart Failure /Pharmacotherapy of CHF
Supply Chain Operations Speaking Notes -ICLT Program
01-Introduction-to-Information-Management.pdf

An Enhanced Computer Vision Based Hand Movement Capturing System with Stereo Vision

  • 1. Manaram Gnanasekera & Nalan Karunanayake International Journal of Computer Science and Security (IJCSS), Volume (10) : Issue (1) : 2016 1 An Enhanced Computer Vision Based Hand Movement Capturing System with Stereo Vision Manaram Gnanasekera manaramcv@gmail.com Department of Electrical and Computer Engineering Sri Lanka Institute of Information Technology Malabe, 10115, Sri Lanka Nalan Karunanayake nalan91karu@gmail.com Department of Computer Science University of Sri Jayawardenapura Nugegoda, 10250, Sri Lanka Abstract This framework is a hand movement capturing method which could be done in three different depth levels. The algorithm has the capability of capturing and identifying when the hand is moving up, down, right and left. From these captured movements four signals could be generated. Moreover, when these hand movements are done, 15cm-75cm, 75cm-100cm, 100cm- 200cm from the camera (3 depth levels), twelve different signals could be generated. These generated signals could be used for applications such as game controlling (gaming).The existing method uses an object area based method for depth analysis. The results of the proposed work shows it has high accuracy compared to the existing method when tested for depth analysis. Keywords: Computer Vision, Object tracking, Meanshift, Stereo Vision. 1. INTRODUCTION Human body movement recognition has been one of the main interested research areas among researchers. Out of many applications of human body movement recognition, gaming is one of the most benefitted industries out of this technology which had gained huge amounts of success. Xbox 360 is one of the best examples for this fact. Among various studies done on human body movement recognition, in [1] a hand movement capturing system based on the Camshift algorithm has been developed. There are six different hand movements, namely, hand moving up, down, right, left, back and forth which are captured by this algorithm. Unlike the other movements a special technique is used to capture hand moving back and forth. The increase of the area of the hand (in the computer screen) when moving towards the camera is used to track the forward movement. Similarly the area of the hand decreases (in the computer screen) when it goes away from the camera. In essence, the area of the hand is used to find the depth. Although this method has reached success in this research ([1]) the best method for depth analysis in computer vision is Stereo Vision. By motivated in this fact the work undertaken enhances the algorithm given in [1] by combining Stereo Vision. Moreover, instead of the forward and backward movements the proposed algorithm introduces a novel method to capture the up, down, left and right movements in three depth levels (15cm- 75cm, 75cm-100cm and 100cm-200cm). Figure1 explains the scenario.
  • 2. Manaram Gnanasekera & Nalan Karunanayake International Journal of Computer Science and Security (IJCSS), Volume (10) : Issue (1) : 2016 2 FIGURE 1: Four movements in three different depth levels. All together there will be 12 different movements which could be captured. With the aid of these 12 different movements there will be 12 different signals generated. These generated signals could be used for different applications such as gaming. In section 2 of the paper, the algorithms which are used to capture the movements of the hand is discussed. Then in section 3 the depth analyzing method is presented. Finally, in section 4 three experiments which were conducted in [1] are performed and hence, it is proved that the proposed method performs well in different depth levels compared to the area based method used in [1]. 2. HAND MOVEMENT CAPTURING The same approach as in [1] will be taken for the object recognition, which is a histogram based method. The histogram of the hand will be taken and a contour will be drawn around. This histogram will be unique for the hand (the environment around the hand will have different histograms). Therefore a contour could be perfectly drawn along the margin of the hand. After recognizing the object the next task will be to capture the movement of it. An extended version of the Meanshift tracker [2] named as Camshift [3] has been used for this task. Camshift or “Continuously Adaptive Meanshift” adjusts the search window size accordingly. At first the Camshift algorithm closes the initial location of the search window. Then the Meanshift operation is performed and the zeroth moment is stored. Finally, the search window size will set equal to a function of the stored zeroth moment. By the aid of the Camshift algorithm, up, down, right and left movement capturing has been tackled. 3. STEREO VISION DEPTH ANALYSIS Stereo imaging is a technique which is used for estimating a 3D model to extract the depth information of the 2D coordinates of a 2D scene by taking two images using two cameras which are separated horizontally [4]. Furthermore, using the two 2D images can determine the three dimensional locations of the image points which corresponds to the exact physical points in space
  • 3. Manaram Gnanasekera & Nalan Karunanayake International Journal of Computer Science and Security (IJCSS), Volume (10) : Issue (1) : 2016 3 [5]. When considering the two images from the left and right cameras, the horizontal points of the right image is lower than the horizontal coordinate points in the left image. This obtrusive shift is called disparity. The disparity obtained from the two images could be used to calculate the 3D location and the depth information of the particular points since the disparity is inversely proportional to the depth of the particular point [6]. 3.1 Stereo Triangulation FIGURE 2: Stereo Camera model. The disparity is, (1) Hence the depth of the point is, Where, b is the baseline, or distance between the two cameras left and right. f is the focal length of a cameras. XA is the X-axis of a camera. ZA is the optical axis of a camera. P is the physical point defined by the coordinates X, Y, and Z. UL is the projection of the real-world physical point P in an image acquired by the left camera. UR is the projection of the real-world physical point P in an image acquired by the right camera There are three stereo correspondence algorithms in the current literature as pixel based methods, area based methods and feature based methods [7, 8]. 1 2
  • 4. Manaram Gnanasekera & Nalan Karunanayake International Journal of Computer Science and Security (IJCSS), Volume (10) : Issue (1) : 2016 4 In the proposed method the sum of square differences (SSD) block matching method is used as the correspondence cost function due to it’s simplicity than the other block matching cost functions [9]. The sum of square difference (SSD) function can be expressed as, Where, Il(x,y) and Ir(x,y) are n x n symmetric matrices of the left and right images. Before starting the stereo matching process, stereo calibration and stereo rectification should be done for better results [10]. Using the stereo vision technique the depth to the center of the captured hand could be calculated using (1). As a result the given three rangers (15cm-75cm, 75cm-100cm, 100cm-200cm) could be successfully identified. 4. RESULTS & DISCUSSION The aim of the proposed work is to enhance the method given in [1] with stereo vision. Therefore some tests which were conducted in [1] will be performed again in order for comparison with [1]. The tracking accuracy has been taken as a percentage by considering the number of miss-tracks and correct tracks occurred when tested under different conditions. 4.1 Performance In Different Light Conditions When the algorithm was tested under different light conditions (different LUX levels) the following results were observed. FIGURE 3: LUX level vs Accuracy. The above observed results are very close to the light condition test’s results in [1], which means the addition of stereo vision hasn’t been able to increase the performance under different light conditions. 3
  • 5. Manaram Gnanasekera & Nalan Karunanayake International Journal of Computer Science and Security (IJCSS), Volume (10) : Issue (1) : 2016 5 4.2 Usability for a Hand Amputated Person Algorithm in [1] has extended it’s usability for hand amputated users. A sock is inserted to the amputated arm and the sock is tracked instead of the hand. Therefore a test is done to check the accuracy of this method as well. The socked arm will be first kept parallel to the ground (perpendicular to the camera) and the accuracy is observed. Then the hand will be kept perpendicular to the ground (parallel to the camera) and the testing will be done. The same tests will be done for the proposed algorithm to check the usability. S.H.P.G – Socked Hand Parallel to the Ground S.H.Pe.G – Socked Hand Perpendicular to the Ground The observed accuracy plots are as follows, FIGURE 4: Usability vs Accuracy. Similar to 4.1 the results of this test is almost equal to the usability test done in [1]. In essence, the stereo vision based method hasn’t done any contribution towards developing the usability. 4.3 Distance of Tracking The distance between the hand and the camera is varied or in other words, the hand is kept in different depth levels from the camera. This test is done to check the sensitiveness of the algorithm when the object (hand) moves away from the camera.
  • 6. Manaram Gnanasekera & Nalan Karunanayake International Journal of Computer Science and Security (IJCSS), Volume (10) : Issue (1) : 2016 6 FIGURE 5: Depth vs Accuracy. Although the tests conducted in section 4.1 and 4.2 are more likely to the corresponding tests done in [1], a vast improvement has been shown in the results of this experiment when the depth is increased. When the experiment is done between 15cm to 75cm the observed accuracy is about 95%. Whereas for the same experiment in [1] the accuracy is roughly around 90%. When it is done at 75cm to 100cm the accuracy of the proposed algorithm is around 93% and the accuracy in [1] is around 60%. When the distance is increased between 100cm to 200cm, an accuracy of 90% is still observed. However, a very low level of accuracy (20%) has been observed when the same test is done in [1]. Even though the distance between the hand and the camera is increased the proposed stereo vision based method has been able to maintain a tracking accuracy around 90% compared with the method proposed in [1]. The speciality is, even though the accuracy is significantly dropped in the method given in [1], a 90% accuracy is maintained by the proposed algorithm. 5. CONCLUSION In conclusion, the work undertaken enhances the algorithm of “Computer Vision Based Hand Movement Capturing System” ([1]) by using Stereo Vision. The work presented proposes a hand movement capturing method which could be done in three depth levels. The captured hand movements are namely, moving up, down, right and left and this capturing could be done in 15cm-75cm, 75cm-100cm and 100cm-200cm from the camera. Camshift algorithm is used to track the basic movements of the hand. Three tests which were conducted in [1] were performed back for the purpose of comparison. The experiment done to investigate the performance in different light conditions had no difference between the proposed method and [1] when the results were compared. Similar results were observed when the algorithm was tested with an amputated hand. However, when the distance with the camera was varied (depth level) the proposed method has shown more than 90% accuracy. This is a vast improvement compared to the results observed in [1]. Finally there are six different movements captured in [1] which generates six different signals. The proposed method could generate twelve (twice) signals with the captured twelve movements. 6. FUTURE WORK As future expansions this algorithm will be modified for a full body tracking in different depth levels. Furthermore it is also planned to increase the number of depth levels.
  • 7. Manaram Gnanasekera & Nalan Karunanayake International Journal of Computer Science and Security (IJCSS), Volume (10) : Issue (1) : 2016 7 7. REFERENCES [1] M. Gnanasekera, "Computer Vision Based Hand Movement Capturing System," in The 8th International Conference on Computer Science & Education (ICCSE 2013), Colombo, 2013. [2] D. Comaniciu et al, "Real-Time Tracking of Non-Rigid Objects using Mean Shift," in Conference on Computer Vision and Pattern Recognition (CVPR), Hilton Head, 2000. [3] G. R. Bradski, "Computer Vision Face Tracking for Use in a Perceptual User Interface," Microcomputer Research Lab, Santa Clara, 2002. [4] Richard Szeliski. “Stereo Correspondence” in Computer Vision: Algorithms and Applications, 1st ed., vol. 1, Springer 2010, pp.537-571. [5] Myron Z. Brown, Darius Burschka, “Advances in Computational Stereo” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 25, pp.993-1008, August, 2003. [6] Samarath Brahmbhatt, “3D Geometry and Stereo Vision” in Practical OpenCV,1st ed., vol 1, Apress 2013, pp. 173-200. [7] U. R Dhond., J. K. Aggarwal, “Structure from Stereo.” IEEE Transaction on Systems, Man and Cybernatics, vol. 19, pp. 1489-1510, November 1989. [8] Vibin N. Valsan and C.Y Patil. “A System on Chip based Stereo Vision Approach for Disparity Measurement,” in Proc. 2015 International Conference on Industrial Instrumentation and Control (ICIC), 2015, pp. 1284-1287. [9] Rong Xiang, Tao Hong, Ming Zhou. “Analysis of Depth Measurement Errors of Tomatoes Using Binocular Stereo Vision Based on Single Factor Experiments,” in Proc. 13th International Conference on Control, Automation, Robotics & Vision (ICARCV), December 2014, pp. 88-93. [10] G. Balakrishnan, G. Sainarayanan, R. Nagarajan “Stereo Image Sonification for Blind Navigation”, Tamkang Journal of Science and Engineering, vol. 10, pp. 67_76, 2007.