A Short Text Classification Method Based on Convolutional Neural Network and Semantic Extension

Autor: Haitao Wang, Keke Tian, Zhengjiang Wu, Lei Wang
Jazyk: angličtina
Rok vydání: 2020
Předmět:
Zdroj: International Journal of Computational Intelligence Systems, Vol 14, Iss 1 (2020)
Druh dokumentu: article
ISSN: 1875-6883
DOI: 10.2991/ijcis.d.201207.001
Popis: In order to solve the problem that traditional short text classification methods do not perform well on short text due to the data sparsity and insufficient semantic features, we propose a short text classification method based on convolutional neural network and semantic extension. Firstly, we propose an improved similarity to improve the coverage of the word vector table in the short text preprocessing process. Secondly, we propose a method for semantic expansion of short texts, which adding an attention mechanism to the neural network model to find related words in the short text, and semantic expansion is performed at the sentence level and the related word level of the short text respectively. Finally, the feature extraction of short text is carried out by means of the classical convolutional neural network. The experimental results show that the proposed method is feasible during the classification task of short text, and the classification effectiveness is significantly improved.
Databáze: Directory of Open Access Journals