Receding Neuron Importances for Structured Pruning

Author:	Suteu, Mihai; Guo, Yike
Year of publication:	2022
Subject:	Computer Science - Machine Learning; Computer Science - Computer Vision and Pattern Recognition
Document type:	Working Paper
Description:	Structured pruning efficiently compresses networks by identifying and removing unimportant neurons. While this can be elegantly achieved by applying sparsity-inducing regularisation on BatchNorm parameters, an L1 penalty would shrink all scaling factors rather than just those of superfluous neurons. To tackle this issue, we introduce a simple BatchNorm variation with bounded scaling parameters, based on which we design a novel regularisation term that suppresses only neurons with low importance. Under our method, the weights of unnecessary neurons effectively recede, producing a polarised bimodal distribution of importances. We show that neural networks trained this way can be pruned to a larger extent and with less deterioration. We one-shot prune VGG and ResNet architectures at different ratios on CIFAR and ImageNet datasets. In the case of VGG-style networks, our method significantly outperforms existing approaches, particularly under a severe pruning regime.
Database:	arXiv
External link:	http://arxiv.org/abs/2204.06404