The document presents a new method for multi-object tracking in drone aerial videos called the Gao-Tracker, which integrates a holistic transformer and multiple feature trajectory matching to address challenges such as occlusion and rapid motion. The framework enhances tracking effectiveness and robustness by combining local and global interactions in its design and utilizes a novel trajectory prediction method based on visual features. Experimental results validate its superior performance against existing state-of-the-art methods on benchmark datasets.