This paper presents a novel algorithm for automatic video object tracking that utilizes a process of frame subtraction to predict the object's movement by analyzing changing areas in defined regions of interest. The algorithm minimizes a dissimilarity function to locate the object in subsequent frames, demonstrating successful tracking under cluttered conditions through virtual scenarios. The proposed approach simplifies tracking challenges by focusing on pixels directly associated with the object being tracked, which enhances performance in complex environments.