Addressing Sparsity in Deep Neural Networks

Autor:	Zhang Shijin, Zidong Du, Qi Guo, Lei Zhang, Lan Huiying, Liu Shaoli, Ling Li, Tianshi Chen, Zhou Xuda, Yunji Chen
Rok vydání:	2019
Předmět:	Speedup Artificial neural network Computer science Feature extraction Process (computing) 02 engineering and technology Computer Graphics and Computer-Aided Design 020202 computer hardware & architecture Synapse Reduction (complexity) Computer engineering Asynchronous communication 0202 electrical engineering electronic engineering information engineering Electrical and Electronic Engineering Software Energy (signal processing)
Zdroj:	IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems. 38:1858-1871
ISSN:	1937-4151 0278-0070
Popis:	Neural networks (NNs) have been demonstrated to be useful in a broad range of applications, such as image recognition, automatic translation, and advertisement recommendation. State-of-the-art NNs are known to be both computationally and memory intensive, due to the ever-increasing deep structure, i.e., multiple layers with massive neurons and connections (i.e., synapses). Sparse NNs have emerged as an effective solution to reduce the amount of computation and memory required. Though existing NN accelerators are able to efficiently process dense and regular networks, they cannot benefit from the reduction of synaptic weights. In this paper, we propose a novel accelerator, Cambricon-X, to exploit the sparsity and irregularity of NN models for increased efficiency. The proposed accelerator features a processing element (PE)-based architecture consisting of multiple PEs. An indexing module efficiently selects and transfers needed neurons to connected PEs with reduced bandwidth requirement, while each PE stores irregular and compressed synapses for local computation in an asynchronous fashion. With 16 PEs, our accelerator is able to achieve at most 544 GOP/s in a small form factor (6.38 mm2 and 954 mW at 65 nm). Experimental results over a number of representative sparse networks show that our accelerator achieves, on average, $7.23\times$ speedup and $6.43\times$ energy saving against the state-of-the-art NN accelerator. We further investigate possibilities of leveraging activation sparsity and multi-issue controller, which improve the efficiency of Cambricon-X. To ease the burden of programmers, we also propose a high efficient library-based programming environment for our accelerator.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::5cf9bcfc17932fb4be974af2fafe8983 https://doi.org/10.1109/tcad.2018.2864289 Zobrazit plný text záznamu