Arabic text classification using deep feature and bidirectional long-short-term memory

Author:	Azal Minshed Abid
Language:	English
Year of publication:	2022
Subject:	BI-LSTM; Text classification; CNN Arabic corpus; LSA; SVD; General Engineering
DOI:	10.5281/zenodo.7678617
Description:	Due to the increased demand for automatic document organization, text classification is essential on both academic and commercial platforms. The aim of text classification is to automatically assign text documents to one or more predefined categories, which helps to solve a variety of challenges, many of them related to data management. In this paper, we propose a new model for Arabic text classification. The model consists of two main phases. The first phase extracts three sets of features: statistical features, Latent Semantic Analysis (LSA) features, and a combination of both. The second phase feeds each of these feature sets separately to a Bidirectional Long Short-Term Memory (BI-LSTM) network for classification. The performance of the proposed model was evaluated using the CNN Arabic corpus. The experimental results showed solid performance of the proposed model, especially for the combined features, for which the average precision, recall, and F-measure reached 94, 91, and 91.94, respectively.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::97e904a08b9bf2cd69d9ee429b9da7dc
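
The description reports averaged precision, recall, and F-measure but does not say how the averages were formed. The following is a minimal sketch of one common choice, macro averaging over classes; this is an assumption, not the paper's stated procedure. The function names and the toy labels are hypothetical and are not taken from the CNN Arabic corpus.

```python
from collections import Counter


def per_class_scores(y_true, y_pred):
    """Return {label: (precision, recall, f1)} for every label seen."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p, but it was wrong
            fn[t] += 1  # true class t was missed
    scores = {}
    for label in labels:
        predicted = tp[label] + fp[label]
        actual = tp[label] + fn[label]
        precision = tp[label] / predicted if predicted else 0.0
        recall = tp[label] / actual if actual else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        scores[label] = (precision, recall, f1)
    return scores


def macro_average(scores):
    """Unweighted mean of per-class precision, recall, and F1."""
    n = len(scores)
    return tuple(sum(s[i] for s in scores.values()) / n for i in range(3))


# Hypothetical toy labels for illustration only (not data from the paper).
y_true = ["sport", "sport", "sport", "business", "business", "world"]
y_pred = ["sport", "sport", "business", "business", "business", "world"]

scores = per_class_scores(y_true, y_pred)
macro_p, macro_r, macro_f = macro_average(scores)
print(f"precision={macro_p:.4f} recall={macro_r:.4f} f1={macro_f:.4f}")
```

Under macro averaging, the averaged F-measure is the mean of the per-class F-scores, so it need not equal the harmonic mean of the averaged precision and recall.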