A geometric approach for accelerating neural networks designed for classification problems.

Autor: Saffar M; School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran. mohsen_saffar@ut.ac.ir., Kalhor A; School of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran, Iran., Habibnia A; Department of Economics and the Computational Modeling and Data Analytics, College of Science, Virginia Polytechnic Institute and State University, Blacksburg, USA.
Jazyk: angličtina
Zdroj: Scientific reports [Sci Rep] 2024 Jul 30; Vol. 14 (1), pp. 17590. Date of Electronic Publication: 2024 Jul 30.
DOI: 10.1038/s41598-024-68172-6
Abstrakt: This paper proposes a geometric-based technique for compressing convolutional neural networks to accelerate computations and improve generalization by eliminating non-informative components. The technique utilizes a geometric index called separation index to evaluate the functionality of network elements such as layers and filters. By applying this index along with center-based separation index, a systematic algorithm is proposed that optimally compresses convolutional and fully connected layers. The algorithm excludes layers with low performance, selects the best subset of filters in the filtering layers, and tunes the parameters of fully connected layers using center-based separation index. An illustrative example of classifying CIFAR-10 dataset is presented to explain the algorithm step-by-step. The proposed method achieves impressive pruning results on networks trained by CIFAR-10 and ImageNet datasets, with 87.5%, 77.6%, and 78.8% of VGG16, GoogLeNet, and DenseNet parameters pruned, respectively. Comparisons with state-of-the-art works are provided to demonstrate the effectiveness of the proposed method.
(© 2024. The Author(s).)
Databáze: MEDLINE