ALT at SemEval-2020 Task 12: Arabic and English Offensive Language Identification in Social Media
Autor: | Hamdy Mubarak, Ahmed Abdelali, Sabit Hassan, Younes Samih |
---|---|
Rok vydání: | 2020 |
Předmět: | |
Zdroj: | SemEval@COLING Scopus-Elsevier |
DOI: | 10.18653/v1/2020.semeval-1.249 |
Popis: | This paper describes the systems submitted by the Arabic Language Technology group (ALT) at SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media. We focus on sub-task A (Offensive Language Identification) for two languages: Arabic and English. Our efforts for both languages achieved more than 90% macro-averaged F1-score on the official test set. For Arabic, the best results were obtained by a system combination of Support Vector Machine, Deep Neural Network, and fine-tuned Bidirectional Encoder Representations from Transformers (BERT). For English, the best results were obtained by fine-tuning BERT. |
Databáze: | OpenAIRE |
Externí odkaz: |