The document proposes a replicated Siamese LSTM model for semantic textual similarity (STS) and information retrieval (IR) in an industrial diagnostic ticketing system. The system aims to retrieve relevant solutions from a knowledge base of tickets given a query. However, the text pairs in the system are often asymmetric in length and content. The proposed model addresses this by learning complementary representations of text pairs in a highly structured latent space using a replicated Siamese LSTM architecture and multi-channel Manhattan metric. It aims to capture similarity at both coarse-grained topic and fine-grained semantic levels to better handle asymmetric texts. The model is evaluated on STS and IR tasks for the industrial ticketing system.
Related topics: