Bert-Based Text Keyword Extraction
Autor: | Yimei Liu, Yili Qian, Chaochao Jia |
---|---|
Rok vydání: | 2021 |
Předmět: | |
Zdroj: | Journal of Physics: Conference Series. 1992:042077 |
ISSN: | 1742-6596 1742-6588 |
DOI: | 10.1088/1742-6596/1992/4/042077 |
Popis: | With the explosive growth of network information, in order to obtain the information faster and more accurately, this paper proposes a text keyword extraction method based on Bert. Firstly, the key sentence set is extracted from the background material by Bert model as the information supplement to the text. Then, based on the extended text, TF-IDF, text rank and LDA are combined to extract keywords. The experimental results on real science and technology academic paper data sets show that the performance of the fusion multi type feature combination algorithm is better than that of the traditional single algorithm; and the F value of the algorithm is increased by 1.5% by extracting key sentences from background materials, which further improves the effect of key word extraction. |
Databáze: | OpenAIRE |
Externí odkaz: |