This document discusses applying data mining techniques to analyze active users on Reddit. It defines active users as those who posted or commented in at least 5 subreddits and have at least 5 posts/comments in each subreddit. The preprocessing steps extract over 25,000 active users and their posts from the raw Reddit data. K-means clustering is then used to cluster the active users into 10 groups based on their activities to gain insights into different types of active users on Reddit.