Sparse, Quantized, Full Frame CNN for Low Power Embedded Devices
Autor: | Soyeb Nagori, Manu Mathew, Kumar Desappan, Pramod Kumar Swami |
---|---|
Rok vydání: | 2017 |
Předmět: |
Theoretical computer science
Computer science business.industry Quantization (signal processing) 02 engineering and technology 010501 environmental sciences Fixed point 01 natural sciences Convolutional neural network Quantization (physics) 020204 information systems 0202 electrical engineering electronic engineering information engineering Segmentation Artificial intelligence business Algorithm Digital signal processing 0105 earth and related environmental sciences |
Zdroj: | CVPR Workshops |
DOI: | 10.1109/cvprw.2017.46 |
Popis: | This paper presents methods to reduce the complexity of convolutional neural networks (CNN). These include: (1) A method to quickly and easily sparsify a given network. (2) Fine tune the sparse network to obtain the lost accuracy back (3) Quantize the network to be able to implement it using 8-bit fixed point multiplications efficiently. (4) We then show how an inference engine can be designed to take advantage of the sparsity. These techniques were applied to full frame semantic segmentation and the degradation due to the sparsity and quantization is found to be negligible. We show by analysis that the complexity reduction achieved is significant. Results of implementation on Texas Instruments TDA2x SoC [17] are presented. We have modified Caffe CNN framework to do the sparse, quantized training described in this paper. The source code for the training is made available at https://github.com/tidsp/caffe-jacinto |
Databáze: | OpenAIRE |
Externí odkaz: |