The document discusses the use of deep learning techniques for semantic analysis and annotation of both conventional and 360° videos, highlighting applications like face detection, recognition, and object tracking. It addresses challenges such as data limitations, class imbalance, and improving algorithms to enhance accuracy and efficiency in video annotation. The work is part of the Hyper360 project, aimed at creating non-interactive versions of 360° videos for better accessibility and archiving.