This document proposes a three-step approach for zero-shot image recognition using relational matching, domain adaptation, and calibration. The approach uses relational matching to find structural correspondences between semantic embeddings and features, domain adaptation to adapt unseen semantic embeddings to the test data domain, and calibration to reduce bias towards seen classes. Experimental results on four datasets show improved zero-shot and generalized zero-shot classification performance compared to previous methods, with domain adaptation providing the most benefit. Analysis of hubness and convergence properties are also presented.
Related topics: