The document introduces Deepsim, a deep learning approach to measuring the functional similarity of code. Deepsim encodes a program's control flow and data flow into a semantic matrix, which lets the model learn the patterns shared by functionally similar code even when the syntactic structure differs. In the reported evaluations on large datasets, Deepsim outperforms state-of-the-art methods in recall, precision, and time efficiency.