This document proposes a new approach called SASUM for approximate subgraph matching in large graphs. Approximate subgraph matching allows missing edges in query matches, which is important for real-world graphs that may be incomplete. SASUM improves upon the basic approach of generating all possible query subgraphs and doing exact matching for each. It exploits the overlapping nature of query subgraphs to reduce the number that require costly exact matching. SASUM uses a lattice framework to identify sharing opportunities between query subgraphs. It generates small "base graphs" that are shared between queries and chooses a minimum set of these to match, from which it can derive matches for all queries. The approach outperforms the state-of-the-art by orders of
Related topics: