Multi-Class Document Classification Based on Deep Neural Network and Word2Vec

Autor:	İlkay YELMEN, Ali GÜNEŞ, Metin ZONTUL, Zafer ASLAN
Jazyk:	angličtina
Rok vydání:	2022
Předmět:	document classification multiclass classification data preprocessing word embedding methods machine learning deep learning Technology Motor vehicles. Aeronautics. Astronautics TL1-4050
Zdroj:	Havacılık ve Uzay Teknolojileri Dergisi, Vol 15, Iss 1, Pp 59-65 (2022)
Druh dokumentu:	article
ISSN:	1304-0448
Popis:	With the increase in unstructured data, the importance of classification of text-based documents has increased. In particular, the classification of news texts and digital documentation provides easy access to the information sought. In this study, a large amount of news textual data was used. After the data set was preprocessed, Bag of Words (BoW), TF-IDF, Word2Vec and Doc2Vec word embedding methods were applied. In the classification phase, Random Forest (RF), Multilayer Perceptron (MLP), Support Vector Machine (SVM) and Deep Neural Network (DNN) algorithms were applied. As a result of the experimental studies, using the Word2Vec method together with the DNN algorithm performed the best result.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/ea27e6709ba54c60b9a4e43bb2d2ccf1 Zobrazit plný text záznamu View record in DOAJ Plný text ve formátu PDF