- The document describes a project to predict customer churn for a telecom company using classification algorithms. It analyzes a dataset of 3333 customers to identify variables that contribute to churn and builds models using KNN and C4.5.
- The C4.5 model achieved higher accuracy (94.9%) than KNN (87.1%) on the test data. Key variables for predicting churn were found to be day minutes, customer service calls, and international plan.
- The model can help the telecom company prevent churn by focusing retention efforts on at-risk customers identified through these important variables.