Author: Yiren Chen, Xiaoyu Kou, Jiangang Bai, Yunhai Tong
Language: English
Year of publication: 2021
Source: IEEE Access, Vol 9, Pp 144129-144139 (2021)
Document type: article
ISSN: 2169-3536
DOI: 10.1109/ACCESS.2021.3122273
Description: One of the most popular paradigms for applying large pre-trained NLP models such as BERT is to fine-tune them on a smaller dataset. However, one challenge remains: the fine-tuned model often overfits on smaller datasets. A symptom of this phenomenon is that irrelevant or misleading words in a sentence, which are easy for human beings to understand, can substantially degrade the performance of these fine-tuned BERT models. In this paper, we propose a novel technique called Self-Supervised Attention (SSA) to address this generalization challenge. Specifically, SSA automatically generates weak, token-level attention labels iteratively by probing the fine-tuned model from the previous iteration. We investigate two different ways of integrating SSA into BERT and propose a hybrid approach that combines their benefits. Empirically, on a variety of public datasets, we demonstrate significant performance improvements with our SSA-enhanced BERT model.
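The abstract only outlines the SSA procedure, so the following is a minimal sketch of the weak, token-level label-generation step it describes, under stated assumptions: the fine-tuned classifier is probed by masking one token at a time and marking tokens whose masking noticeably drops the model's confidence in its prediction. The masking-based probing strategy, the threshold tau, the checkpoint name bert-base-uncased, and the helper weak_attention_labels are illustrative assumptions, not details taken from the paper.

```python
# Hypothetical illustration of SSA's weak attention-label generation via
# probing. Assumption (not from the paper): importance is estimated by
# masking each token and measuring the drop in predicted-class confidence.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

MODEL_NAME = "bert-base-uncased"  # stand-in for the fine-tuned checkpoint
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForSequenceClassification.from_pretrained(MODEL_NAME)
model.eval()


@torch.no_grad()
def weak_attention_labels(sentence: str, tau: float = 0.05) -> list[tuple[str, int]]:
    """Probe the fine-tuned model and label each token as important (1) or not (0)."""
    enc = tokenizer(sentence, return_tensors="pt")
    probs = torch.softmax(model(**enc).logits, dim=-1)
    pred = probs.argmax(dim=-1)          # predicted class of the full sentence
    base_conf = probs[0, pred].item()    # confidence before any token is masked

    labels = []
    input_ids = enc["input_ids"][0]
    for i in range(1, input_ids.size(0) - 1):  # skip [CLS] and [SEP]
        masked = enc["input_ids"].clone()
        masked[0, i] = tokenizer.mask_token_id
        masked_probs = torch.softmax(
            model(input_ids=masked, attention_mask=enc["attention_mask"]).logits,
            dim=-1,
        )
        drop = base_conf - masked_probs[0, pred].item()
        token = tokenizer.convert_ids_to_tokens(input_ids[i].item())
        labels.append((token, int(drop > tau)))  # 1 = token the model should attend to
    return labels


print(weak_attention_labels("The movie was surprisingly good."))
```

In the iterative scheme the abstract describes, labels produced this way would supervise attention in the next round of fine-tuning (the paper's two integration variants and their hybrid); that training-loop integration is not shown here.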
Database: Directory of Open Access Journals