This document discusses sentiment analysis and the datasets used for detecting negative words in Persian text. It describes several existing English sentiment datasets and a refined Persian polarity corpus, as well as a corpus of exceptions extracted from a Flexicon database, a list that still needs refinement. It then outlines how the negative word detection algorithm works and proposes areas for further development, including building a database of positive affixed words and applying statistical approaches to increase the algorithm's accuracy.