Popis: |
Text classification aims to understand text content and assign it to predefined categories. Because it enables the interpretation of unstructured text, it is a foundational task in Natural Language Processing (NLP). Despite advances in large language models, text classification with these models still demands substantial computational resources. This paper therefore presents a multiscale interactive attention short text classification model based on BERT, designed to address short text classification under limited resources. Corpora containing news articles, Chinese comments, and English sentiment-labeled text are employed for the classification experiments. The model uses BERT pre-trained word vectors as the embedding layer, feeds them into a multilevel feature extraction network, and extracts further contextual features after feature fusion. Experimental results on the THUCNews corpus, the Today's Headlines news corpus, the SST-2 dataset, and the Touhou 38W dataset show that the method outperforms the existing algorithms reported in the literature.
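A minimal sketch of the architecture described above, assuming a PyTorch implementation with Hugging Face transformers: BERT token embeddings feed parallel convolutional branches with different kernel widths (the multiscale feature extraction), their fused output passes through a BiLSTM-style contextual layer, and a linear head performs classification. The checkpoint name, kernel widths, layer sizes, and fusion scheme are illustrative assumptions, not the paper's exact configuration.

    # Illustrative sketch only: checkpoint, kernel widths, layer sizes, and fusion
    # are assumptions, not the paper's exact configuration.
    import torch
    import torch.nn as nn
    from transformers import BertModel

    class MultiscaleBertClassifier(nn.Module):
        def __init__(self, num_classes, kernel_sizes=(2, 3, 4), channels=128, hidden=128):
            super().__init__()
            self.bert = BertModel.from_pretrained("bert-base-chinese")  # assumed checkpoint
            dim = self.bert.config.hidden_size
            # Multiscale feature extraction: parallel 1-D convolutions over token embeddings.
            self.convs = nn.ModuleList(
                [nn.Conv1d(dim, channels, k, padding=k // 2) for k in kernel_sizes]
            )
            # Contextual feature extraction after fusion (BiLSTM as a stand-in).
            self.bilstm = nn.LSTM(channels * len(kernel_sizes), hidden,
                                  batch_first=True, bidirectional=True)
            self.classifier = nn.Linear(2 * hidden, num_classes)

        def forward(self, input_ids, attention_mask):
            # BERT pre-trained vectors serve as the embedding layer.
            emb = self.bert(input_ids=input_ids,
                            attention_mask=attention_mask).last_hidden_state
            x = emb.transpose(1, 2)                        # (batch, dim, seq_len)
            feats = [torch.relu(conv(x)) for conv in self.convs]
            # Feature fusion: trim branches to a common length and concatenate channels.
            fused = torch.cat([f[:, :, :x.size(2)] for f in feats], dim=1).transpose(1, 2)
            ctx, _ = self.bilstm(fused)                    # contextual features
            pooled = ctx.max(dim=1).values                 # max-pool over the sequence
            return self.classifier(pooled)

In use, tokenized short texts (input_ids and attention_mask from a BERT tokenizer) are passed to the model, which returns class logits; only the lightweight convolutional and recurrent layers sit on top of the frozen or fine-tuned BERT encoder.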