Detecting Hateful and Offensive Speech in Arabic Social Media Using Transfer Learning

Authors:	Zakaria Boulouard, Mariya Ouaissa, Mariyam Ouaissa, Moez Krichen, Mutiq Almutiq, Karim Gasmi
Language:	English
Year of publication:	2022
Subject:	deep learning; hate speech detection; natural language processing; social media analytics; text mining; Technology; Engineering (General). Civil engineering (General); TA1-2040; Biology (General); QH301-705.5; Physics; QC1-999; Chemistry; QD1-999
Source:	Applied Sciences, Vol 12, Iss 24, p 12823 (2022)
Document type:	article
ISSN:	2076-3417
DOI:	10.3390/app122412823
Description:	The democratization of access to the internet and social media has given every individual an opportunity to openly express his or her ideas and feelings. Unfortunately, this has also created room for extremist, racist, misogynist, and offensive opinions expressed as articles, posts, or comments. While controlling offensive speech in English-, Spanish-, and French-speaking social media communities and websites has reached a mature level, this is much less the case for their counterparts in Arabic-speaking countries. This paper presents a transfer learning solution to detect hateful and offensive speech on Arabic websites and social media platforms. It compares the performance of different BERT-based models trained to classify comments as either abusive or neutral. The training dataset contains comments in standard Arabic as well as four dialects. Their English translations are also used for comparative purposes. The models were evaluated based on five metrics: Accuracy, Precision, Recall, F1-Score, and Confusion Matrix.
Database:	Directory of Open Access Journals
External link:	https://doaj.org/article/26b2e9cc6a72465aab10fd19a32cbbab