Zobrazeno 1 - 4
of 4
pro vyhledávání: '"Negar Goli"'
Publikováno v:
Proceedings of the 49th Annual International Symposium on Computer Architecture.
Autor:
Tor M. Aamodt, Negar Goli
Publikováno v:
CVPR
The success of Convolutional Neural Networks (CNNs) in various applications is accompanied by a significant increase in computation and training time. In this work, we focus on accelerating training by observing that about 90% of gradients are reusab
Autor:
Negar Goli, Amruth Sandhupatla, Christopher Ng, Timothy G. Rogers, Suchita Pati, Deval Shah, Shaylin Cattell, Tor M. Aamodt, Jonathan Lew, Mengchi Zhang, Matthew D. Sinclair
Publikováno v:
ISPASS
Most deep neural networks deployed today are trained using GPUs via high-level frameworks such as TensorFlow and PyTorch. This paper describes changes we made to the GPGPU-Sim simulator to enable it to run PyTorch by running PTX kernels included in N
Publikováno v:
ISPASS
The efficacy of deep learning has resulted in its use in a growing number of applications. The Volta graphics processor unit (GPU) architecture from NVIDIA introduced a specialized functional unit, the “tensor core”, that helps meet the growing d