The document describes a project that aims to predict malicious activity in public surveillance videos. The proposed system uses computer vision and deep learning techniques like HOG features for malicious object recognition, visual-semantic alignments for scene description, super resolution for improved facial recognition, and PCA for eigenface-based facial recognition. The system acquires video, performs malicious object detection, improves frame clarity, and extracts identifying parameters to predict malicious activity. It provides results for each component but also notes limitations like inability to run in real-time and need for a GPU to accelerate parts of the processing.