Self Supervision for Attention Networks

Authors:	Vinay P. Namboodiri, Ansh Jain, Kasturi G S, Badri N. Patro
Year of publication:	2021
Subject:	Contextual image classification; Computer science; Semantics (computer science); business.industry; Deep learning; 02 engineering and technology; 010501 environmental sciences; 01 natural sciences; Visualization; Task (project management); Salient; 0202 electrical engineering, electronic engineering, information engineering; Question answering; 020201 artificial intelligence & image processing; Artificial intelligence; Language model; business; 0105 earth and related environmental sciences
Source:	WACV
Description:	In recent years, the attention mechanism has become popular and has proven successful in many machine learning applications. However, deep learning models typically do not supervise these attention mechanisms, even though such supervision can improve a model’s performance significantly. In this paper, we tackle this limitation and propose a novel method that improves the attention mechanism by inducing "self-supervision". We devise a technique to generate desirable attention maps for any model that uses an attention module. This is achieved by examining the model’s output for different regions sampled from the input and obtaining the attention probability distributions that enhance the proficiency of the model. The attention distributions thus obtained are then used for supervision. We rely on the fact that attenuating the unimportant parts of the input allows a model to attend to more salient regions, which strengthens prediction accuracy. The quantitative and qualitative results in this paper show that the method improves both the attention mechanism and the model’s accuracy. In addition to Visual Question Answering (VQA), we also report results on image classification and text classification to show that our method generalizes to any vision and language model that uses an attention module.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_________::90a9d47279c64bb42a4600de8c5abb50 https://doi.org/10.1109/wacv48630.2021.00077