This document discusses a novel approach to text categorization combining an augmented k-nearest neighbor (AKNN) classification and k-medoids clustering to enhance accuracy and reduce computational complexity. The authors address the traditional challenges of text classification, such as high dimensionality and the presence of outlier samples, by proposing a pre-processing method and a clustering technique that strengthens feature selection and categorization methods. Experimental results indicate that the proposed method performs significantly better than traditional k-nearest neighbor classifiers when evaluated on a benchmark text mining dataset.
Related topics: