Classifiers for Arabic NLP: survey

Autor: Moustafa Al-Hajj, Marwan Al Omari
Rok vydání: 2020
Předmět:
Zdroj: International Journal of Computational Complexity and Intelligent Algorithms. 1:231
ISSN: 2048-4739
2048-4720
DOI: 10.1504/ijccia.2020.105538
Popis: In this paper, we reviewed most common-used models and classifiers that used for the Arabic language to classify texts into categories, classes, or topics in tasks of opinion mining, sentence categorisation, part of speech tagging, language identification, name entity recognition, authorship attribution, word sense disambiguation, and text classification. Comparisons between classification tasks conducted in terms of models' performances and accuracies. Classification approaches are three types: lexicon-based, machine and deep learning, or hybrid ones. Research sample is 34 articles in the classification domain. Challenges facing the Arabic language discussed with further solutions: 1) solid research training on both approaches: lexicon-based and corpus-based (machine and deep learning); 2) research contribution mainly corpus, approach technique, and free accessibility; 3) fund increase to the research development in the Arab world.
Databáze: OpenAIRE