This document presents a deep learning approach for static hand gesture recognition using an Intel RealSense D435 depth camera and a modified VGG-16 convolutional neural network. It discusses the system's architecture, image processing methods, and performance metrics based on a newly created dataset of 2,000 images, achieving a high recognition accuracy of 95.5%. The study emphasizes the challenges of hand gesture recognition and the effectiveness of combining depth and color information in real-time processing.
Related topics: