Using BERT and Augmentation in Named Entity Recognition for Cybersecurity Domain

Autor: Tikhomirov, Mikhail, Loukachevitch, N., Sirotina, Anastasiia, Dobrov, Boris
Jazyk: angličtina
Rok vydání: 2020
Předmět:
Zdroj: Natural Language Processing and Information Systems
Popis: The paper presents the results of applying the BERT representation model in the named entity recognition task for the cybersecurity domain in Russian. Several variants of the model were investigated. The best results were obtained using the BERT model, trained on the target collection of information security texts. We also explored a new form of data augmentation for the task of named entity recognition.
Databáze: OpenAIRE