This document presents a novel annotation scheme for hate speech in online comments. It was developed based on analyzing comments reacting to news on migration and LGBTQ+ issues in Malta. The scheme aims to address challenges in annotating hate speech, as different people may have different thresholds for what constitutes hate speech. It proposes a multi-layer scheme that was tested against a binary hate speech/not hate speech classification and showed higher agreement between annotators. It also introduces the MaNeCo corpus, a large collection of online newspaper comments from Malta over 10 years that the scheme will be applied to.
Related topics: