This document describes a system to simulate mouse functions using hand gestures detected by a webcam. The system uses Single Shot Multi-box Detection (SSD) and Non-Maximum Suppression (NMS) algorithms to accurately detect hand positions from images. SSD uses anchor boxes to detect objects within a divided image grid, while NMS eliminates overlapping bounding boxes to identify distinct objects. The system maps detected hand landmarks to mouse cursor positions and can perform functions like dragging and zooming. It aims to enable human-computer interaction without additional hardware by interpreting gestures captured from a webcam.