A High Throughput Hardware CNN Accelerator Using a Novel Multi-Layer Convolution Processor
Autor: | Sayed Masoud Sayedi, Mohammad Javad Khaleghi, Mohammad Reza Tavakoli |
---|---|
Rok vydání: | 2020 |
Předmět: |
business.industry
Computer science Deep learning 020208 electrical & electronic engineering Memory bandwidth 02 engineering and technology Chip Convolutional neural network 020202 computer hardware & architecture Kernel (image processing) 0202 electrical engineering electronic engineering information engineering System on a chip Artificial intelligence business Field-programmable gate array Multi layer Computer hardware |
Zdroj: | 2020 28th Iranian Conference on Electrical Engineering (ICEE). |
Popis: | Convolutional Neural Network (CNN) is the state-of-the-art deep learning approach used in various computer vision algorithms due to their high accuracy. To ensure programmable flexibility and shorten the development period, FPGA is an appropriate platform to implement CNN models. However, the limited on-chip storage and memory bandwidth are the bottlenecks. In this paper, two different architectures are presented to implement a same model structure. One performs traditional computing, layer by layer, and the other one performs multiple-layer computing in a pipeline structure using a Multi-Layer Convolution Processor (MLCP) accelerator. In the latter one, the required on-chip memory and memory bandwidth are reduced. Implementation results on a Xilinx Zynq XC7Z020 chip under a frequency of 200 MHz shows that the MLCP accelerator achieves 12.9 GOP/s that is 2.6x higher than that of Single-Layer Convolution Processor (SLCP). |
Databáze: | OpenAIRE |
Externí odkaz: |