Classifiers for Arabic NLP: survey
Autor: | Moustafa Al-Hajj, Marwan Al Omari |
---|---|
Rok vydání: | 2020 |
Předmět: |
Language identification
Arabic nlp Computer science business.industry Deep learning Big data Sentiment analysis General Medicine computer.software_genre Lexicon ComputingMethodologies_PATTERNRECOGNITION Classifier (linguistics) ComputingMethodologies_DOCUMENTANDTEXTPROCESSING Artificial intelligence business computer Natural language processing Sentence |
Zdroj: | International Journal of Computational Complexity and Intelligent Algorithms. 1:231 |
ISSN: | 2048-4739 2048-4720 |
DOI: | 10.1504/ijccia.2020.105538 |
Popis: | In this paper, we reviewed most common-used models and classifiers that used for the Arabic language to classify texts into categories, classes, or topics in tasks of opinion mining, sentence categorisation, part of speech tagging, language identification, name entity recognition, authorship attribution, word sense disambiguation, and text classification. Comparisons between classification tasks conducted in terms of models' performances and accuracies. Classification approaches are three types: lexicon-based, machine and deep learning, or hybrid ones. Research sample is 34 articles in the classification domain. Challenges facing the Arabic language discussed with further solutions: 1) solid research training on both approaches: lexicon-based and corpus-based (machine and deep learning); 2) research contribution mainly corpus, approach technique, and free accessibility; 3) fund increase to the research development in the Arab world. |
Databáze: | OpenAIRE |
Externí odkaz: |