This document discusses text mining techniques for document classification. It covers several classification techniques including k-nearest neighbor algorithms, decision trees, naive Bayes classifiers, and probabilistic models. It also discusses evaluating classification performance using metrics like recall, precision and accuracy. Finally, it provides examples of applications for document classification like organizing web pages and filtering emails.
Related topics: