This document discusses using machine learning and data mining techniques to identify unhappy communities based on reader comments to news media articles. The researchers collected and preprocessed New York Times data, then used a classification algorithm to place counties into "contented" or "unhappy" groups based on the sentiment and emotions expressed in the comments over time. Their analysis found that comments from "unhappy" counties expressed more negative sentiments like sadness, fear, anger and disgust compared to "contented" counties. The researchers believe continuing to analyze sentiment and emotions in the comments can help further improve their classifier and provide insights into the root causes of unhappiness.
Related topics: