The Explainability of Transformers: Current Status and Directions

Autor:	Paolo Fantozzi, Maurizio Naldi
Jazyk:	angličtina
Rok vydání:	2024
Předmět:	explainability transformers visual transformers natural language processing interpretability deep learning Electronic computers. Computer science QA75.5-76.95
Zdroj:	Computers, Vol 13, Iss 4, p 92 (2024)
Druh dokumentu:	article
ISSN:	2073-431X
DOI:	10.3390/computers13040092
Popis:	An increasing demand for model explainability has accompanied the widespread adoption of transformers in various fields of applications. In this paper, we conduct a survey of the existing literature on the explainability of transformers. We provide a taxonomy of methods based on the combination of transformer components that are leveraged to arrive at the explanation. For each method, we describe its mechanism and survey its applications. We find out that attention-based methods, both alone and in conjunction with activation-based and gradient-based methods, are the most employed ones. A growing attention is also devoted to the deployment of visualization techniques to help the explanation process.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/edac296a3c344bb786c906d4622a9acf Zobrazit plný text záznamu View record in DOAJ