ERIDANUS: Efficiently Running Inference of DNNs Using Systolic Arrays

Autor: Sudhakar Yalamanchili, Ramyad Hadidi, Bahar Asgari, Hyesoon Kim
Rok vydání: 2019
Předmět:
Zdroj: IEEE Micro. 39:46-54
ISSN: 1937-4143
0272-1732
DOI: 10.1109/mm.2019.2930057
Popis: Systolic arrays with promising attributes, such as high degree of concurrent computation and high data-reuse rate, are attractive solutions for dense linear algebra. Recently, systolic arrays have been used for accelerating the inference of deep neural networks (DNNs). However, as sparsification mechanisms are applied to DNNs during or after training, DNN inference is usually a sparse problem. Therefore, it cannot fully benefit from the fundamental advantages offered by systolic arrays. To solve this challenge, we propose Eridanus, an approach to structured pruning that produces DNNs compatible with the synchronous and rhythmic flow of data from memory to systolic arrays.
Databáze: OpenAIRE