This document discusses using a multi-head attention mechanism to solve vehicle routing problems. It proposes an encoder-decoder model where customer locations are embedded in vectors and passed through an encoder with multi-head attention layers. A decoder then selects the next node in a route using a context vector containing embedding and capacity information, with the goal of minimizing total route distance. The model is trained using a policy gradient method to approximate the gradient and update the encoder and decoder.
Related topics: