This document presents research on using machine learning to detect academically at-risk students based on their usage data from a learning management system (LMS). The researchers analyzed LMS log data from over 200 students to predict whether their GPA after the first semester would be above or below average. Three machine learning methods - logistic regression, support vector machines, and random forests - were tested. The random forest model achieved the best balance of precision and recall, able to detect around 30% of at-risk students after the first week using only LMS data and 40% by the third week when attendance data was also included. Analyzing the importance of different usage variables over time provided insights into student behaviors that most impacted academic outcomes.
Related topics: