This document outlines a tutorial on visual search and understanding. It discusses various techniques for visual representations and indexing, including recent neural network architectures that aim to reduce parameters, memory usage, and spatial redundancy. Specific techniques covered include Multi-Fiber Networks, Double Attention Networks, and Global Reasoning Networks. Global Reasoning Networks are discussed in detail, including how they project features from coordinate space to interaction space, reason over feature interactions using graph convolutions, and project back to coordinate space.