The document discusses the impact of class rebalancing techniques on defect prediction models, indicating that defect datasets are often imbalanced and affect model performance and interpretation. While rebalancing can improve recall and the F-measure, it may also decrease precision and lead to model interpretation issues. Recommendations suggest using optimized SMOTE for area under the curve (AUC) and under-sampling for recall, while cautioning against applying any techniques for interpretations.
Related topics: