- The document proposes a framework for plagiarism detection using natural language processing and machine learning. It investigates using machine learning in the proposed framework.
- The experimental setup involves preprocessing text, extracting n-grams, and using classifiers like J48 and Naive Bayes to detect plagiarism. Testing on a corpus achieves up to 94.6% accuracy with J48.
- Future work includes incorporating WordNet for semantic matching and addressing paraphrasing and multi-lingual plagiarism.
Related topics: