This document evaluates different color descriptors for object and scene recognition. It finds that descriptors like W-SIFT and rgSIFT, which are invariant to light intensity scale and shifts, generally outperform other descriptors. However, the usefulness of invariance is found to be category-specific, as some objects like cars and tables do not benefit from light intensity invariance. Evaluation on image and video benchmarks shows scale and shift invariant descriptors work best for most but not all categories.
Related topics: