This document presents an exploratory study comparing the performance of 19 binary machine learning classifiers on classifying code comments from three programming languages. The study aims to evaluate these classifiers, including logistic regression and random forest, in order to optimize and improve code comment categorization, with findings showing that all classifiers outperformed baseline scores. Key results indicate that logistic regression achieved the highest f1-score, while other models like linear SVC and decision tree also showed significant performance improvements.
Related topics: