This document presents a proposed system for people monitoring and mask detection using real-time video analysis, leveraging computer vision technology for applications in crowded areas. The system utilizes convolutional neural networks and MobileNet SSD for effective detection of individuals and mask compliance, triggering alarms when thresholds are exceeded. Key features include head tracking, mask detection, and alarm alerts, aimed at enhancing public safety during high-density gatherings, especially in pandemic conditions.