Classifiers for Arabic NLP: survey

Autor:	Moustafa Al-Hajj, Marwan Al Omari
Rok vydání:	2020
Předmět:	Language identification Arabic nlp Computer science business.industry Deep learning Big data Sentiment analysis General Medicine computer.software_genre Lexicon ComputingMethodologies_PATTERNRECOGNITION Classifier (linguistics) ComputingMethodologies_DOCUMENTANDTEXTPROCESSING Artificial intelligence business computer Natural language processing Sentence
Zdroj:	International Journal of Computational Complexity and Intelligent Algorithms. 1:231
ISSN:	2048-4739 2048-4720
DOI:	10.1504/ijccia.2020.105538
Popis:	In this paper, we reviewed most common-used models and classifiers that used for the Arabic language to classify texts into categories, classes, or topics in tasks of opinion mining, sentence categorisation, part of speech tagging, language identification, name entity recognition, authorship attribution, word sense disambiguation, and text classification. Comparisons between classification tasks conducted in terms of models' performances and accuracies. Classification approaches are three types: lexicon-based, machine and deep learning, or hybrid ones. Research sample is 34 articles in the classification domain. Challenges facing the Arabic language discussed with further solutions: 1) solid research training on both approaches: lexicon-based and corpus-based (machine and deep learning); 2) research contribution mainly corpus, approach technique, and free accessibility; 3) fund increase to the research development in the Arab world.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::79968f219139d5fdf529f770458866d5 https://doi.org/10.1504/ijccia.2020.105538 Zobrazit plný text záznamu