EMA at SemEval-2018 Task 1: Emotion Mining for Arabic

Autor: Obeida El Jundi, Alaa Khaddaj, Gilbert Badaro, Alaa Maarouf, Hazem Hajj, Raslan Kain, Wassim El-Hajj
Rok vydání: 2018
Předmět:
Zdroj: SemEval@NAACL-HLT
DOI: 10.18653/v1/s18-1036
Popis: While significant progress has been achieved for Opinion Mining in Arabic (OMA), very limited efforts have been put towards the task of Emotion mining in Arabic. In fact, businesses are interested in learning a fine-grained representation of how users are feeling towards their products or services. In this work, we describe the methods used by the team Emotion Mining in Arabic (EMA), as part of the SemEval-2018 Task 1 for Affect Mining for Arabic tweets. EMA participated in all 5 subtasks. For the five tasks, several preprocessing steps were evaluated and eventually the best system included diacritics removal, elongation adjustment, replacement of emojis by the corresponding Arabic word, character normalization and light stemming. Moreover, several features were evaluated along with different classification and regression techniques. For the 5 subtasks, word embeddings feature turned out to perform best along with Ensemble technique. EMA achieved the 1st place in subtask 5, and 3rd place in subtasks 1 and 3.
Databáze: OpenAIRE