Author:
Ercan Engin Kuruoglu, Chun Lin Kuo, Wai Kin Victor Chan
Language:
English
Year of publication:
2023
Subject:

Source:
Franklin Open, Vol 4, Iss , Pp 100037- (2023)
Document type:
article
ISSN:
2773-1863
DOI:
10.1016/j.fraope.2023.100037
Description:
The over-parameterization of neural networks and the local optimality of the backpropagation algorithm have been two major problems in deep learning. The conventional approach to reducing parameter redundancy is to prune branches with small weights; however, this addresses only the redundancy problem and provides no global optimality guarantee. In this paper, we depart from backpropagation and combine the sparse-network topology optimization problem with the network weight optimization problem using a non-convex optimization method, namely Simulated Annealing. This method trains the network while keeping the number of parameters under control. Unlike simply updating network parameters with gradient descent, our method simultaneously optimizes the topology of the sparse network. Backed by Simulated Annealing's asymptotic guarantee of global optimality, the sparse network optimized by our method outperforms one trained by backpropagation alone.
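To illustrate the idea in the description, the following is a minimal sketch of simulated annealing jointly searching over a binary connectivity mask and the weight values of a small network under a fixed parameter budget. Everything here is an assumption for illustration: the toy sine-regression task, the 1-16-1 architecture, the 70/30 mix of weight-jitter and connection-swap moves, and the geometric cooling schedule are not taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy regression task: y = sin(x) on [-3, 3] (illustrative, not from the paper).
X = np.linspace(-3, 3, 64).reshape(-1, 1)
y = np.sin(X)

n_hidden = 16
# Flattened parameter vector for a 1-16-1 MLP (weights + biases).
shapes = [(1, n_hidden), (n_hidden,), (n_hidden, 1), (1,)]
sizes = [int(np.prod(s)) for s in shapes]
n_params = sum(sizes)
budget = n_params // 2          # keep half the parameters active

def unpack(theta):
    parts, i = [], 0
    for s, n in zip(shapes, sizes):
        parts.append(theta[i:i + n].reshape(s))
        i += n
    return parts

def loss(theta, mask):
    W1, b1, W2, b2 = unpack(theta * mask)   # masked = sparse network
    h = np.tanh(X @ W1 + b1)
    return float(np.mean((h @ W2 + b2 - y) ** 2))

# Initial state: random weights, random mask with exactly `budget` ones.
theta = rng.normal(0, 0.5, n_params)
mask = np.zeros(n_params)
mask[rng.choice(n_params, budget, replace=False)] = 1.0

T, cooling = 1.0, 0.999
cur = loss(theta, mask)
for step in range(20000):
    new_theta, new_mask = theta.copy(), mask.copy()
    if rng.random() < 0.7:
        # Weight move: jitter one currently active weight.
        j = rng.choice(np.flatnonzero(new_mask))
        new_theta[j] += rng.normal(0, 0.1)
    else:
        # Topology move: deactivate one connection and activate another,
        # so the active parameter count stays exactly at `budget`.
        on, off = np.flatnonzero(new_mask), np.flatnonzero(new_mask == 0)
        new_mask[rng.choice(on)] = 0.0
        new_mask[rng.choice(off)] = 1.0
    cand = loss(new_theta, new_mask)
    # Metropolis acceptance: always take improvements, occasionally take
    # uphill moves, with probability shrinking as T decreases.
    if cand < cur or rng.random() < np.exp((cur - cand) / T):
        theta, mask, cur = new_theta, new_mask, cand
    T *= cooling

print(f"final MSE: {cur:.4f}, active parameters: {int(mask.sum())}/{n_params}")
```

The connection-swap move keeps the number of nonzero parameters fixed at the budget throughout the search, which is one simple way to realize training under a controlled parameter count, while the Metropolis rule's occasional acceptance of uphill moves is what lets the search escape the local optima that plain gradient descent can get stuck in.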
Database:
Directory of Open Access Journals
External link: