This document presents a study on predicting interesting key frames in movie trailers using three different deep models (AlexNet, MemNet, and Triplet Loss). The goal is to identify interesting frames through image interestingness, addressing challenges like imbalanced data and proposing multi-task learning. Experimental results indicated variations in accuracy across different models, with post-processing used to determine interestingness labels.