SlideShare a Scribd company logo
ARJUN AK
S7IT
TKR18IT003. 1
CONTENTS
● INTRODUCTION
● LITURATURE SURVEY
● METHODOLOGY
● FLOWCHART
● NETWORK ARCHITECTURE
● SELECTION OF GAME ENGINE
● SELECTION OF AR SDK
● VUFORIA
● ARCore AND ARKit
● ADVANTAGE AND DISADVANTAGES
● CONCLUSION
● REFERENCES
22
INTRODUCTION
● User Experience is a very important factor of any software product and thus
companies spend a lot of time and money in development stage to make sure that
customers get a hassle-free experience.
● In today’s age it is observed that in order to further improve user experience the use
of Machine Learning is to be employed.
● We plan on surveying all the possible ways to implement Computer Vision based
marker less object tracking and integrate it with Augmented Reality Software
Development Kits to create an interface ready for customers to use.
● The main idea is to track hand landmarks using Computer Vision and use them as
placeholders for AR watch object.
33
LITERATURE SURVEY
SL NO PAPER AUTHOR YEAR
1 MediaPipe Hands: On-device Real-
time Hand Tracking
Fan Zhang, Valentin
Bazarevsky, Andrey
Vakunov, Andrei
Tkachenka, George Sung
18 Jun 2020.
2 Pose Anchor: A Single-stage Hand
Keypoint Detection Network
Yuan Li, Xinggang
Wang, Wenyu Liu and
Bin Feng
July 2020
3 A Study on MQTT based
Environmental Parameters,
Monitoring and Alarming System
K.J., Reshmaa, Selvin
Peter Paul J., and
Swetha V
2018
4 A Survey of Frameworks and Game
Engines for Serious Game
Development
Brent Cowan and Bill
Kapralos
2014
METHODOLOGY
“Researchers from Google “present’ on-device a real-time solution predicting a
human palm skeleton from a single RGB camera for AR products.
Two models comprise the pipeline:
•A palm detector responsible for bounding box generation across palm and provides it
to,
•A hand landmark model, that maps a palm sketch on the full hand. This is
implemented through MediaPipe ,ML solutions building platform.
55
FLOWCHART
666
● The graph can be divided into 2 subparts, one for hand detection and the other for
points of reference mapping.
● A key optimization here is that the hand detection only runs as needed, saving a lot
of processing power
● To accomplish this start by extracting the palm location in the present video frame
from the generated hand landmarks in the preceding frame, thus preventing the
palm detection each frame
● For toughness, the tracker even provides confidence score for captured hand.
whenever the confidence drops below the threshold then only the palm detector is
reapplied to the upcoming frame.
● “Koller, Hermann and Bowden’s” work shows a novel perspective to learning a
classifier which is on the basis of frames on poorly differently marked sequence
data by combining a Convolution Neural Net in an Expectation Maximisation
algorithm.
METHODOLOGY
77
● This permits the C.N.N. to be trained on a big set of trial images even if limited sequence
level knowledge is available for the source videos.
● This method is explained in situation related to hand shape detection, it could be of a
wider use to any video based recognition task which doesn’t have frame-level labelling
● Instead of using 2 stage architecture pipeline consisting of localizing hand and mapping
landmarks, Pose Anchor presents a single effective network architecture for hand
landmarks detection. An end-to-end C.N.N. is trained on a newly proposed pose anchor
network, which is based on RPN in Faster Region-based C.N.N
● Instead of manually designing hand pose anchors, they are generated using K-means
clustering based on OKS
● The main reason for using pose anchors is to mitigate occlusion to some extent by using
the prior knowledge of hand pose/structure Experiments conducted on (LSM-HPD) and
NZSL were used to show the robustness and feasibility of the architecture.
88
NETWORK DIAGRAM
99
HEAT MAP GENERATED
1010
Selection of Game Engine:
There are many game engines available to make AR applications. To chose one among
them many factors must be kept in mind, they are:
● Scripting: The code written to provide the instructions.
● Rendering: The generation of the 3D scene, the factors to be measured are speed and
accuracy.
● Animation: the change in render per frame to simulate movement.
● Artificial Intelligence: Ability of the computer to make decisions.
● Physics: Real world based calculations on simulated physical interactions.
● Audio: Audio feedback provided that can be spatially distinguished.
● Networking : Users have the ability to play with others online
1111
●A logical approach would be to go with a game engine which is widely used in the market,
so we used 2 Surveys to determine a suitable game engine
●Survey 1: Scan through a database of approximately 200 academic publications for
“serious game”, “educational game”, and “simulator”.
●Survey 2: Narrow down these selected engines from Survey 1 only based on “serious
gaming”
1212
GAME ENGINE AND FRAMEWORK
1313
●By analyzing given information we can clearly conclude that Unity and Unreal are
the most beneficial game engines, we chose Unity as our preferred game engine as the
only advantage of Unreal over Unity is C++ support, but the programming language
to be used is not an restriction in our situation.
14
Unity Engine Properties:
● Lighting can be done before hand or at run time, custom shaders can be created
using shader graphs and shadder programming thus replicating realistic lighting
conditions as well as textures which can be changed based on input to change the
simulation environment.
● Unity uses C# as the programming language behind the implementation of any
application built using the framework and thus complicated logical simulations and
inputs can be generated.
1515
The Machine Learning-Agents Toolkit
provided by the game engine:
●This is an free to use SDK available to integrate ML models with Unity.The 3 main features
in the SDK are
● Sensors, Agents, and Academy
●The agents collect, observe and execute actions. The agent is the component being trained by
constantly optimizing it’s policy known as Brain.
●The Academy manages the simulation,it is a singleton and thus used to keep track of all the
steps involved in the simulation.
●Singleton is defined as a software design pattern that restricts the instantiation of a class to
one "single" instance, any other instances which are created are automatically destroyed.
1616
ML AGENTS DOCUMENTATION
1717
Integration of the model with Unity:
The study for Gesture Recognition for non RC Drone have successfully imported Leap Motion
SDK and Gesture simple control package and used it for detection.
This shows that any kind of trained model can be imported into Unity.
The recommended method to import ML model into Unity:
● STEP 1: Save and export the model to ONNX format as this is the format supported by
Unity.
● STEP 2: Import Unity ML Agents
● STEP 3: Import the model into Unity.
● STEP 4: Access the camera feed frame by frame.
● STEP 5: Resize the render texture if required.
● STEP 6: Read the Softmax layer output from the model.
18
Selection of AR SDK
● SDK has multiple components within the application: identification, tracking, etc
● There are many AR SDKs to chose from like:
Vuforia, Metatio, Wikitude, ARToolKit, D’Fusion, ARmedi, ARCore, ARKit.
● There is another study by Anasse HANAFI in which they are focusing on those SDKs
which give a platform and support and function with hardware.
● They had tested these SDKs on the basis of licence type, target platform, development
platform, tracking type, functionality.
● The most important feature as per our requirement is tracking type and the results are
tabulated in Table
1919
AR SDKs COMPARISON BASED ON TRACKING
2020
Vuforia
●Vuforia is an augmented reality software development kit for mobile devices that enables the
creation of augmented reality applications.
●It uses computer vision technology to recognize and track planar images and 3D objects in real
time .
●Ms. Geetanjali Bhola and Amogh Bansal used Vuforia for Markerless Tracking in Retail
Industry. Their observation is can be referenced for our work to find which will be more suitable
for the process in hand .
2121
VUFORIA RESULTS BASED ON DISTANCE
2222
● Thus we can see that the results are perfect for our scenario, so it can be considered to
use Vuforia , but from Table our requirement is of Motion Tracking thus it would be a
more suitable choice would be to go for ARCore or ARKit.
2323
ARCore and ARKit:
●ARCore and ARKit are Google and Apple’s respective Augmented reality frameworks for
bringing more AR apps to thier platforms . They use the smartphone cameras to add
interactive elements to an existing environment.
●ARCore is used to build for Android Devices and ARKit is used to build for iOS devices.
Unity offers a wrapper class known as ARFoundation which includes both of these SDKs
and an added advantage is that it is not required to import these from an external source.
●Khalid Satori did a study to discover if ARFoundation is a suitable replacement for
ARKit or ARCore. Their observations are shown in Table
2424
AR FOUNDATION TABLE
2525
● As it can be seen from Table that ARFoundation either has all the features or
they are under development. Thus to keep the implementation simple, future
ready and cover all target platforms it is suitable to use ARFoundation as our
SDK.
● ARFoundation can be imported from Unity’s Package Manager which comes
along with Unity during installation.
2626
ADVANTAGES
● Once the content is placed in a room, it is more flexible than marker-based
alternatives.
● Marker less AR significantly increases the average range of motion.
● Marker based alternative relies on the image recognisability whereas
maker less doesn’t.
2727
DISADVANTAGES
● The augmented reality content may not make sense in certain context
● For better experience it is required that the surface has a texture for computer
vision to recognize it.
2828
CONCLUSION
We had a look at various computer vision techniques for implementing marker less
tracking on hands and also determined an appropriate game engine to import that
model into. We also compared various Augmented reality SDKs which can be used to
properly instantiate the desired objects. We found using ARfoundation to implement
the tracking will be much more efficient because of its simplistic nature in
implementation , future ready features and tracking and beacuse of its ablity to cover
all target platforms .
2929
REFERENCES
1. Fan Zhang, Valentin Bazarevsky, Andrey Vakunov, Andrei Tkachenka, George Sung, Chuo-Ling
Chang, Matthias Grundmann. “MediaPipe Hands: On-device Real-time Hand Tracking”
arXiv:2006.10214v1 [cs.CV] 18 Jun 2020.
2. HasCamillo Lugaresi, Jiuqiang Tang, Hadon Nash, Chris Mc-Clanahan, Esha Uboweja, Michael
Hays, Fan Zhang, Chuo-Ling Chang, Ming Guang Yong, Juhyun Lee, Wan-TehChang, Wei Hua,
Manfred Georg, and Matthias Grundmann.Mediapipe: A framework for building perception
pipelines.volume abs/1906.08172, 2019
3. Oscar Koller, Hermann Ney, and Richard Bowden. “Deep Hand: How to Train a CNN on 1 Million
Hand Images When Your Data is Continuous and Weakly Labelled”. 2016 IEEE Conference on
Computer Vision and Pattern Recognition (CVPR).
4. Yuan Li, Xinggang Wang, Wenyu Liu and Bin Feng. “Pose Anchor: A Single-stage Hand Keypoint
Detection Network”. IEEE Transactions on Circuits and Systems for Video Technology (July 2020)
3030
5 .S. Ren, K. He, R. Girshick, and J. Sun, “Faster r-cnn: Towards real-time object detection with region
proposal networks,” in Advances in neural information processing systems, 2015, pp. 91–99
6. T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollar, and C. L. Zitnick,
“Microsoft coco: Common objects in context,” in European conference on computer vision. Springer,
2014, pp. 740–755.
7. Brent Cowan and Bill Kapralos “A Survey of Frameworks and Game Engines for Serious Game
Development” in 2014 IEEE 14th International Conference on Advanced Learning Technologies
8. Arthur Juliani,Vincent-Pierre Berges,Ervin Teng,Andrew Cohen,Jonathan Harper,Chris Elion,Chris
Goy,Yuan Gao,Hunter Henry,Marwan Mattar,Danny Lange. “Unity: A General Platform for Intelligent
Agents” in arxiv.org by Cornell University
9. K.J., Reshmaa, Selvin Peter Paul J., and Swetha V. "A Study on MQTT based Environmental
Parameters, Monitoring and Alarming System". Eurasian Journal of Analytical Chemistry 13 no. SP
(2018)
3131
THANK
YOU
3232

More Related Content

PPTX
Philipp Nagele (Wikitude): Context Is for Kings: Putting Context in the Hands...
DOCX
AbstractThis work presents the design and implementation of an.docx
PDF
Mobile Augmented Reality Development tools
PDF
TICTACTOE GAME MANAGEMENT SYSTEM PROJECT REPORT.
PDF
Kinect on Android Pandaboard
PDF
Tictactoe game management system project report.pdf
PDF
Augmenting reality: Bring digital objects into the real world
DOCX
virendra
Philipp Nagele (Wikitude): Context Is for Kings: Putting Context in the Hands...
AbstractThis work presents the design and implementation of an.docx
Mobile Augmented Reality Development tools
TICTACTOE GAME MANAGEMENT SYSTEM PROJECT REPORT.
Kinect on Android Pandaboard
Tictactoe game management system project report.pdf
Augmenting reality: Bring digital objects into the real world
virendra

Similar to Marker less augmentedd reality using computer vision (20)

PDF
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
PDF
How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...
PDF
virtual_chess
PDF
ppt_seminar_zcvxvvvxzczxczxzxchristy-1.pdf
DOC
Android Report
DOC
Doc muntation of android
PDF
IRJET- Proposed Design for 3D Map Generation using UAV
PDF
Debug, Analyze and Optimize Games with Intel Tools
PDF
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
PDF
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
PPTX
Kudan deck slide share e
PDF
JIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdf
PDF
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
PDF
Computer-Vision_Integrating-Technology_MOB_17.06.16
PDF
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
PDF
F04402038042
PDF
Iirdem design and implementation of finger writing in air by using open cv (c...
PDF
The Real Time Drowisness Detection Using Arm 9
PDF
IRJET-Implementation of Image Processing using Augmented Reality
FACE COUNTING USING OPEN CV & PYTHON FOR ANALYZING UNUSUAL EVENTS IN CROWDS
How ABB shapes the future of industry with Microsoft HoloLens and Unity - Uni...
virtual_chess
ppt_seminar_zcvxvvvxzczxczxzxchristy-1.pdf
Android Report
Doc muntation of android
IRJET- Proposed Design for 3D Map Generation using UAV
Debug, Analyze and Optimize Games with Intel Tools
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Debug, Analyze and Optimize Games with Intel Tools - Matteo Valoriani - Codem...
Kudan deck slide share e
JIT Spraying Never Dies - Bypass CFG By Leveraging WARP Shader JIT Spraying.pdf
IRJET - Positioning and Tracking of a Person using Embedded Controller in a D...
Computer-Vision_Integrating-Technology_MOB_17.06.16
Deep Learning Neural Network Acceleration at the Edge - Andrea Gallo
F04402038042
Iirdem design and implementation of finger writing in air by using open cv (c...
The Real Time Drowisness Detection Using Arm 9
IRJET-Implementation of Image Processing using Augmented Reality
Ad

Recently uploaded (20)

PPTX
CYBER-CRIMES AND SECURITY A guide to understanding
PDF
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
PPTX
UNIT-1 - COAL BASED THERMAL POWER PLANTS
PPT
Mechanical Engineering MATERIALS Selection
PPT
introduction to datamining and warehousing
PPTX
web development for engineering and engineering
PDF
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
PDF
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
PPTX
additive manufacturing of ss316l using mig welding
DOCX
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
PPTX
UNIT 4 Total Quality Management .pptx
PPTX
OOP with Java - Java Introduction (Basics)
PDF
R24 SURVEYING LAB MANUAL for civil enggi
PDF
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
PPTX
CH1 Production IntroductoryConcepts.pptx
PPTX
Lecture Notes Electrical Wiring System Components
PPTX
Construction Project Organization Group 2.pptx
PDF
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PDF
PPT on Performance Review to get promotions
PPT
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
CYBER-CRIMES AND SECURITY A guide to understanding
Unit I ESSENTIAL OF DIGITAL MARKETING.pdf
UNIT-1 - COAL BASED THERMAL POWER PLANTS
Mechanical Engineering MATERIALS Selection
introduction to datamining and warehousing
web development for engineering and engineering
Mitigating Risks through Effective Management for Enhancing Organizational Pe...
Evaluating the Democratization of the Turkish Armed Forces from a Normative P...
additive manufacturing of ss316l using mig welding
ASol_English-Language-Literature-Set-1-27-02-2023-converted.docx
UNIT 4 Total Quality Management .pptx
OOP with Java - Java Introduction (Basics)
R24 SURVEYING LAB MANUAL for civil enggi
Mohammad Mahdi Farshadian CV - Prospective PhD Student 2026
CH1 Production IntroductoryConcepts.pptx
Lecture Notes Electrical Wiring System Components
Construction Project Organization Group 2.pptx
Enhancing Cyber Defense Against Zero-Day Attacks using Ensemble Neural Networks
PPT on Performance Review to get promotions
Introduction, IoT Design Methodology, Case Study on IoT System for Weather Mo...
Ad

Marker less augmentedd reality using computer vision

  • 2. CONTENTS ● INTRODUCTION ● LITURATURE SURVEY ● METHODOLOGY ● FLOWCHART ● NETWORK ARCHITECTURE ● SELECTION OF GAME ENGINE ● SELECTION OF AR SDK ● VUFORIA ● ARCore AND ARKit ● ADVANTAGE AND DISADVANTAGES ● CONCLUSION ● REFERENCES 22
  • 3. INTRODUCTION ● User Experience is a very important factor of any software product and thus companies spend a lot of time and money in development stage to make sure that customers get a hassle-free experience. ● In today’s age it is observed that in order to further improve user experience the use of Machine Learning is to be employed. ● We plan on surveying all the possible ways to implement Computer Vision based marker less object tracking and integrate it with Augmented Reality Software Development Kits to create an interface ready for customers to use. ● The main idea is to track hand landmarks using Computer Vision and use them as placeholders for AR watch object. 33
  • 4. LITERATURE SURVEY SL NO PAPER AUTHOR YEAR 1 MediaPipe Hands: On-device Real- time Hand Tracking Fan Zhang, Valentin Bazarevsky, Andrey Vakunov, Andrei Tkachenka, George Sung 18 Jun 2020. 2 Pose Anchor: A Single-stage Hand Keypoint Detection Network Yuan Li, Xinggang Wang, Wenyu Liu and Bin Feng July 2020 3 A Study on MQTT based Environmental Parameters, Monitoring and Alarming System K.J., Reshmaa, Selvin Peter Paul J., and Swetha V 2018 4 A Survey of Frameworks and Game Engines for Serious Game Development Brent Cowan and Bill Kapralos 2014
  • 5. METHODOLOGY “Researchers from Google “present’ on-device a real-time solution predicting a human palm skeleton from a single RGB camera for AR products. Two models comprise the pipeline: •A palm detector responsible for bounding box generation across palm and provides it to, •A hand landmark model, that maps a palm sketch on the full hand. This is implemented through MediaPipe ,ML solutions building platform. 55
  • 7. ● The graph can be divided into 2 subparts, one for hand detection and the other for points of reference mapping. ● A key optimization here is that the hand detection only runs as needed, saving a lot of processing power ● To accomplish this start by extracting the palm location in the present video frame from the generated hand landmarks in the preceding frame, thus preventing the palm detection each frame ● For toughness, the tracker even provides confidence score for captured hand. whenever the confidence drops below the threshold then only the palm detector is reapplied to the upcoming frame. ● “Koller, Hermann and Bowden’s” work shows a novel perspective to learning a classifier which is on the basis of frames on poorly differently marked sequence data by combining a Convolution Neural Net in an Expectation Maximisation algorithm. METHODOLOGY 77
  • 8. ● This permits the C.N.N. to be trained on a big set of trial images even if limited sequence level knowledge is available for the source videos. ● This method is explained in situation related to hand shape detection, it could be of a wider use to any video based recognition task which doesn’t have frame-level labelling ● Instead of using 2 stage architecture pipeline consisting of localizing hand and mapping landmarks, Pose Anchor presents a single effective network architecture for hand landmarks detection. An end-to-end C.N.N. is trained on a newly proposed pose anchor network, which is based on RPN in Faster Region-based C.N.N ● Instead of manually designing hand pose anchors, they are generated using K-means clustering based on OKS ● The main reason for using pose anchors is to mitigate occlusion to some extent by using the prior knowledge of hand pose/structure Experiments conducted on (LSM-HPD) and NZSL were used to show the robustness and feasibility of the architecture. 88
  • 11. Selection of Game Engine: There are many game engines available to make AR applications. To chose one among them many factors must be kept in mind, they are: ● Scripting: The code written to provide the instructions. ● Rendering: The generation of the 3D scene, the factors to be measured are speed and accuracy. ● Animation: the change in render per frame to simulate movement. ● Artificial Intelligence: Ability of the computer to make decisions. ● Physics: Real world based calculations on simulated physical interactions. ● Audio: Audio feedback provided that can be spatially distinguished. ● Networking : Users have the ability to play with others online 1111
  • 12. ●A logical approach would be to go with a game engine which is widely used in the market, so we used 2 Surveys to determine a suitable game engine ●Survey 1: Scan through a database of approximately 200 academic publications for “serious game”, “educational game”, and “simulator”. ●Survey 2: Narrow down these selected engines from Survey 1 only based on “serious gaming” 1212
  • 13. GAME ENGINE AND FRAMEWORK 1313
  • 14. ●By analyzing given information we can clearly conclude that Unity and Unreal are the most beneficial game engines, we chose Unity as our preferred game engine as the only advantage of Unreal over Unity is C++ support, but the programming language to be used is not an restriction in our situation. 14
  • 15. Unity Engine Properties: ● Lighting can be done before hand or at run time, custom shaders can be created using shader graphs and shadder programming thus replicating realistic lighting conditions as well as textures which can be changed based on input to change the simulation environment. ● Unity uses C# as the programming language behind the implementation of any application built using the framework and thus complicated logical simulations and inputs can be generated. 1515
  • 16. The Machine Learning-Agents Toolkit provided by the game engine: ●This is an free to use SDK available to integrate ML models with Unity.The 3 main features in the SDK are ● Sensors, Agents, and Academy ●The agents collect, observe and execute actions. The agent is the component being trained by constantly optimizing it’s policy known as Brain. ●The Academy manages the simulation,it is a singleton and thus used to keep track of all the steps involved in the simulation. ●Singleton is defined as a software design pattern that restricts the instantiation of a class to one "single" instance, any other instances which are created are automatically destroyed. 1616
  • 18. Integration of the model with Unity: The study for Gesture Recognition for non RC Drone have successfully imported Leap Motion SDK and Gesture simple control package and used it for detection. This shows that any kind of trained model can be imported into Unity. The recommended method to import ML model into Unity: ● STEP 1: Save and export the model to ONNX format as this is the format supported by Unity. ● STEP 2: Import Unity ML Agents ● STEP 3: Import the model into Unity. ● STEP 4: Access the camera feed frame by frame. ● STEP 5: Resize the render texture if required. ● STEP 6: Read the Softmax layer output from the model. 18
  • 19. Selection of AR SDK ● SDK has multiple components within the application: identification, tracking, etc ● There are many AR SDKs to chose from like: Vuforia, Metatio, Wikitude, ARToolKit, D’Fusion, ARmedi, ARCore, ARKit. ● There is another study by Anasse HANAFI in which they are focusing on those SDKs which give a platform and support and function with hardware. ● They had tested these SDKs on the basis of licence type, target platform, development platform, tracking type, functionality. ● The most important feature as per our requirement is tracking type and the results are tabulated in Table 1919
  • 20. AR SDKs COMPARISON BASED ON TRACKING 2020
  • 21. Vuforia ●Vuforia is an augmented reality software development kit for mobile devices that enables the creation of augmented reality applications. ●It uses computer vision technology to recognize and track planar images and 3D objects in real time . ●Ms. Geetanjali Bhola and Amogh Bansal used Vuforia for Markerless Tracking in Retail Industry. Their observation is can be referenced for our work to find which will be more suitable for the process in hand . 2121
  • 22. VUFORIA RESULTS BASED ON DISTANCE 2222
  • 23. ● Thus we can see that the results are perfect for our scenario, so it can be considered to use Vuforia , but from Table our requirement is of Motion Tracking thus it would be a more suitable choice would be to go for ARCore or ARKit. 2323
  • 24. ARCore and ARKit: ●ARCore and ARKit are Google and Apple’s respective Augmented reality frameworks for bringing more AR apps to thier platforms . They use the smartphone cameras to add interactive elements to an existing environment. ●ARCore is used to build for Android Devices and ARKit is used to build for iOS devices. Unity offers a wrapper class known as ARFoundation which includes both of these SDKs and an added advantage is that it is not required to import these from an external source. ●Khalid Satori did a study to discover if ARFoundation is a suitable replacement for ARKit or ARCore. Their observations are shown in Table 2424
  • 26. ● As it can be seen from Table that ARFoundation either has all the features or they are under development. Thus to keep the implementation simple, future ready and cover all target platforms it is suitable to use ARFoundation as our SDK. ● ARFoundation can be imported from Unity’s Package Manager which comes along with Unity during installation. 2626
  • 27. ADVANTAGES ● Once the content is placed in a room, it is more flexible than marker-based alternatives. ● Marker less AR significantly increases the average range of motion. ● Marker based alternative relies on the image recognisability whereas maker less doesn’t. 2727
  • 28. DISADVANTAGES ● The augmented reality content may not make sense in certain context ● For better experience it is required that the surface has a texture for computer vision to recognize it. 2828
  • 29. CONCLUSION We had a look at various computer vision techniques for implementing marker less tracking on hands and also determined an appropriate game engine to import that model into. We also compared various Augmented reality SDKs which can be used to properly instantiate the desired objects. We found using ARfoundation to implement the tracking will be much more efficient because of its simplistic nature in implementation , future ready features and tracking and beacuse of its ablity to cover all target platforms . 2929
  • 30. REFERENCES 1. Fan Zhang, Valentin Bazarevsky, Andrey Vakunov, Andrei Tkachenka, George Sung, Chuo-Ling Chang, Matthias Grundmann. “MediaPipe Hands: On-device Real-time Hand Tracking” arXiv:2006.10214v1 [cs.CV] 18 Jun 2020. 2. HasCamillo Lugaresi, Jiuqiang Tang, Hadon Nash, Chris Mc-Clanahan, Esha Uboweja, Michael Hays, Fan Zhang, Chuo-Ling Chang, Ming Guang Yong, Juhyun Lee, Wan-TehChang, Wei Hua, Manfred Georg, and Matthias Grundmann.Mediapipe: A framework for building perception pipelines.volume abs/1906.08172, 2019 3. Oscar Koller, Hermann Ney, and Richard Bowden. “Deep Hand: How to Train a CNN on 1 Million Hand Images When Your Data is Continuous and Weakly Labelled”. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 4. Yuan Li, Xinggang Wang, Wenyu Liu and Bin Feng. “Pose Anchor: A Single-stage Hand Keypoint Detection Network”. IEEE Transactions on Circuits and Systems for Video Technology (July 2020) 3030
  • 31. 5 .S. Ren, K. He, R. Girshick, and J. Sun, “Faster r-cnn: Towards real-time object detection with region proposal networks,” in Advances in neural information processing systems, 2015, pp. 91–99 6. T.-Y. Lin, M. Maire, S. Belongie, J. Hays, P. Perona, D. Ramanan, P. Dollar, and C. L. Zitnick, “Microsoft coco: Common objects in context,” in European conference on computer vision. Springer, 2014, pp. 740–755. 7. Brent Cowan and Bill Kapralos “A Survey of Frameworks and Game Engines for Serious Game Development” in 2014 IEEE 14th International Conference on Advanced Learning Technologies 8. Arthur Juliani,Vincent-Pierre Berges,Ervin Teng,Andrew Cohen,Jonathan Harper,Chris Elion,Chris Goy,Yuan Gao,Hunter Henry,Marwan Mattar,Danny Lange. “Unity: A General Platform for Intelligent Agents” in arxiv.org by Cornell University 9. K.J., Reshmaa, Selvin Peter Paul J., and Swetha V. "A Study on MQTT based Environmental Parameters, Monitoring and Alarming System". Eurasian Journal of Analytical Chemistry 13 no. SP (2018) 3131