ERIDANUS: Efficiently Running Inference of DNNs Using Systolic Arrays
Autor: | Sudhakar Yalamanchili, Ramyad Hadidi, Bahar Asgari, Hyesoon Kim |
---|---|
Rok vydání: | 2019 |
Předmět: |
Training set
Artificial neural network Computer science Inference 02 engineering and technology 020202 computer hardware & architecture Hardware and Architecture Linear algebra 0202 electrical engineering electronic engineering information engineering Pruning (decision trees) Electrical and Electronic Engineering Concurrent computation Algorithm Software Sparse matrix |
Zdroj: | IEEE Micro. 39:46-54 |
ISSN: | 1937-4143 0272-1732 |
DOI: | 10.1109/mm.2019.2930057 |
Popis: | Systolic arrays with promising attributes, such as high degree of concurrent computation and high data-reuse rate, are attractive solutions for dense linear algebra. Recently, systolic arrays have been used for accelerating the inference of deep neural networks (DNNs). However, as sparsification mechanisms are applied to DNNs during or after training, DNN inference is usually a sparse problem. Therefore, it cannot fully benefit from the fundamental advantages offered by systolic arrays. To solve this challenge, we propose Eridanus, an approach to structured pruning that produces DNNs compatible with the synchronous and rhythmic flow of data from memory to systolic arrays. |
Databáze: | OpenAIRE |
Externí odkaz: |