A survey on automatic detection of hate speech in text

자연어처리 연구실
M2020064
조단비
Published: ACM Journals; ACM Computing Surveys, Vol.51, No.4, 2018

Content
1. Why study Hate Speech automatic detection?
2. What is Hate Speech?
3. What has been done so far in automatic Hate Speech detection?
4. Resources for Gate Speech classification
5. Research challenges and opportunities
#Kookmin_University #Natural_Language_Processing_lab. 1

Introduction
> Describe the motivation for conducting research
- “how hate speech online has been evolving”
- “who are the main targets of it”
> Provide the detailed definitions
> Analyze the previous survey with systematic literature review
- focusing on descriptive statistics about Hate Speech detection
- focusing on algorithms for Hate Speech detection

1. Why study Hate Speech automatic detection?
- European Union Commission directives
- Automatic techniques not available
- Lack of data about hate speech
- Hate speech removal
- Quality of service

> Definition from several sources
> Our definition of Hate Speech
: “jokes also must be marked as hate speech.”
1
2
3
4

> Particular cases and examples of Hate speech
- In Facebook,
hate speech = a verbal attack + the target of the attack from “protected category”

> Hate Speech and
other related concepts
- Hate: 증오
- Cyberbullying: 사이버 괴롭힘
- Discrimination: 차별
- Flaming: 모욕
- Abusive language: 욕설
- Profanity: 욕설
- Toxic language or comment: 악성 댓글
- Extremism: 극단주의 (폭력 조장)
- Radicalization: 급진주의

3. What has been done so far
in automatic Hate Speech detection?
1) Systematic Literature Review
> Method description
> Document Collection and Annotation
- A total of 127 documents (2016.09.01 ~ 2017.05.18)
- “Law and Social Sciences”: 76 / “Computer Science and Engineering”: 51
- Low number of citations

> Keywords in the Document
- Related concepts (cyberbullying, cyber hate, sectarianism, …)
- Machine learning (classification, sentiment analysis, filtering systems, …)
- Social media (internet, social media, social network, …)

> Social Networks & Number of Used Instances

> General or Particular Hate Speech & Algorithms Used

> Type of Approach in the Document
9
1
17

2) Documents focusing on descriptive statistics about Hate Speech detection
- There are descriptive articles
about Racism(인종차별), Sexism(성차별), Prejudice toward refugees(난민에 대한 편견),
Homophobia(동성애 혐오증), and general hate speech(일반적인 증오심)
3) Documents focusing on algorithms for Hate Speech detection
> Dataset used in the papers
> Achieved performances
- metrics: Precision, Recall, F-measure, accuracy, and AUC

4) Text mining approaches in automatic Hate Speech detection
: feature extraction
(1). general features used in text mining
: dictionary, distance metric, Bag-of-words, N-grams, TF-IDF, Part-of-speech, …
(2). The specific hate speech detection features

4. Resources for Hate Speech classification
1). Dataset & open source projects

4. Resources for
Hate Speech classification
https://paperswithcode.com/datasets?
task=hate-speech-detection

4. Resources for
Hate Speech classification
https://github.com/kocohub/
korean-hate-speech

5. Research challenges and opportunities
> challenges
- Lack of expertise
- Difficulty to track all racial and minority insults
- Evolution of language among young population
- Transition of hate speech such as sarcasm
> Opportunity
- Open source platforms or algorithms
- Definition of a main dataset
- Comparative studies
- Multilingual research

Thank You.
19
#Kookmin_University #Natural_Language_Processing_lab.

A survey on automatic detection of hate speech in text

More Related Content

What's hot (14)

More from Danbi Cho (11)

Recently uploaded (20)

A survey on automatic detection of hate speech in text