1) The document proposes a method for tracking moving objects in videos captured using a moving camera in complex scenes. It involves video stabilization, key frame extraction, object detection/tracking using Gaussian mixture models and Kalman filters, and object recognition using bag of features.
2) Key frame extraction identifies important frames for processing by computing edge differences between frames and selecting frames above a threshold.
3) Moving objects are detected using background subtraction and Gaussian mixture models, and then tracked across frames using Kalman filters.
4) Object recognition is performed using bag of features, which represents objects as histograms of visual word frequencies to classify objects based on characteristic visual parts.