Efficient Hardware Realization of Convolutional Neural Networks using Intra-Kernel Regular Pruning

Autor: Yang, Maurice, Faraj, Mahmoud, Hussein, Assem, Gaudet, Vincent
Rok vydání: 2018
Předmět:
Druh dokumentu: Working Paper
Popis: The recent trend toward increasingly deep convolutional neural networks (CNNs) leads to a higher demand of computational power and memory storage. Consequently, the deployment of CNNs in hardware has become more challenging. In this paper, we propose an Intra-Kernel Regular (IKR) pruning scheme to reduce the size and computational complexity of the CNNs by removing redundant weights at a fine-grained level. Unlike other pruning methods such as Fine-Grained pruning, IKR pruning maintains regular kernel structures that are exploitable in a hardware accelerator. Experimental results demonstrate up to 10x parameter reduction and 7x computational reduction at a cost of less than 1% degradation in accuracy versus the un-pruned case.
Comment: 6 pages, 8 figures, ISMVL 2018
Databáze: arXiv