Implementation of deep neural networks on FPGA-CPU platform using Xilinx SDSOC
Autor: | Rania O. Hassan, Hassan Mostafa |
---|---|
Rok vydání: | 2020 |
Předmět: |
Computational complexity theory
Contextual image classification Computer science business.industry 020208 electrical & electronic engineering 020206 networking & telecommunications 02 engineering and technology Convolutional neural network Surfaces Coatings and Films Acceleration Hardware and Architecture High-level programming language High-level synthesis Embedded system Signal Processing 0202 electrical engineering electronic engineering information engineering Graphics Field-programmable gate array business |
Zdroj: | Analog Integrated Circuits and Signal Processing. 106:399-408 |
ISSN: | 1573-1979 0925-1030 |
DOI: | 10.1007/s10470-020-01638-5 |
Popis: | Deep Convolutional Neural Networks (CNNs) are the state-of-the-art systems for image classification due to their high accuracy but on the other hand their high computational complexity is very costly. The acceleration is the target in this field nowadays for using these systems in real time applications. The Graphics Processing Units is the solution but its high-power consumption prevents its utilization in daily-used equipment moreover the Field Programmable Gate Array (FPGA) has low power consumption and flexible architecture which fits more for CNN implementations. This work discusses this problem and provides a solution that compromises between the speed of the CNN and the power consumption of the FPGA. This solution depends on two main techniques for speeding up: parallelism of layers resources and pipelining inside some layers. On the other hand, we added a new methodology to compromise the area requirements with the speed and design time by implementing CNN using Xilinx SDSOC tool (including processor and FPGA on the same board). Implementing design using HW/SW partitioning will enhance time design based on high level language(C or C++) in Vivado HLS (High Level Synthesis). It also fits for more large designs than using FPGA only and faster in design time. |
Databáze: | OpenAIRE |
Externí odkaz: |