This paper presents a hashtag graph-based topic model (HGTN) for analyzing microblogs, particularly Twitter, to overcome challenges such as sparsity and informality in tweets. By leveraging user-generated hashtags as semi-structured data, HGTN enhances semantic relations between words and improves topic modeling performance over conventional methods. The effectiveness of HGTN is demonstrated through experiments on real-world tweet datasets, indicating its ability to produce coherent topics and address noise issues in tweet clustering and classification.
Related topics: