This paper presents an efficient algorithm for feature subset selection and data reduction in high-dimensional datasets, combining information theory with fuzzy c-means clustering. The proposed approach detects and removes unimportant features while improving classifier performance, as demonstrated through tests on 35 real-world datasets using a Support Vector Machine classifier. The results indicate significant reductions in both dataset size and processing time, enhancing accuracy in decision-making processes.
Related topics: