The document introduces SPADA, a dataset for evaluating monolingual phrase alignment on paraphrases using parse trees. It discusses the limitations of existing n-gram paraphrasing methods and emphasizes the importance of syntactic structures in modeling phrases. Additionally, it outlines the annotation process, statistics, and evaluation metrics, while suggesting future directions for expanding the dataset and covering a broader range of linguistic phenomena.