Bert-Based Text Keyword Extraction

Autor: Yimei Liu, Yili Qian, Chaochao Jia
Rok vydání: 2021
Předmět:
Zdroj: Journal of Physics: Conference Series. 1992:042077
ISSN: 1742-6596
1742-6588
DOI: 10.1088/1742-6596/1992/4/042077
Popis: With the explosive growth of network information, in order to obtain the information faster and more accurately, this paper proposes a text keyword extraction method based on Bert. Firstly, the key sentence set is extracted from the background material by Bert model as the information supplement to the text. Then, based on the extended text, TF-IDF, text rank and LDA are combined to extract keywords. The experimental results on real science and technology academic paper data sets show that the performance of the fusion multi type feature combination algorithm is better than that of the traditional single algorithm; and the F value of the algorithm is increased by 1.5% by extracting key sentences from background materials, which further improves the effect of key word extraction.
Databáze: OpenAIRE