This document describes a method called Task-aware Spatial Disentanglement (TSD) for object detection. TSD uses separate branches to process features for the classification and localization tasks in a spatially disentangled manner. For classification, TSD applies pointwise deformations to the feature map. For localization, it applies proposal-wise translations. This allows each task to process features with spatial sensitivities suitable for their goals, improving performance over methods that do not separate spatial processing for different tasks. TSD achieves state-of-the-art object detection accuracy on standard benchmarks.
Related topics: