Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Jain, Shraddhan"'
Autor:
Ji, Tianchu, Jain, Shraddhan, Ferdman, Michael, Milder, Peter, Schwartz, H. Andrew, Balasubramanian, Niranjan
How much information do NLP tasks really need from a transformer's attention mechanism at application-time (inference)? From recent work, we know that there is sparsity in transformers and that the floating-points within its computation can be discre
Externí odkaz:
http://arxiv.org/abs/2106.01335