Convergence analyses on sparse feedforward neural networks via group lasso regularization
Authors: Jacek M. Zurada, Qingquan Chang, Jian Wang, Qingling Cai
Year: 2017
Subjects: Mathematical optimization; Information Systems and Management; Optimization problem; Artificial neural network; Numerical & computational mathematics; Engineering and technology; Fixed point; Natural sciences; Regularization (mathematics); Stationary point; Computer Science Applications; Theoretical Computer Science; Artificial Intelligence; Control and Systems Engineering; Convergence (routing); Electrical engineering, electronic engineering and information engineering; Applied mathematics; Feedforward neural network; Artificial intelligence & image processing; Software; Smoothing; Mathematics
Source: Information Sciences 381 (2017): 250-269
ISSN: 0020-0255
DOI: 10.1016/j.ins.2016.11.020
Abstract: In this paper, a new variant of feedforward neural networks is proposed for a class of nonsmooth optimization problems. The penalty term of the proposed networks stems from the group lasso method, which selects hidden-layer variables in a grouped manner. To deal with the non-differentiability of the original penalty term (the ℓ1-ℓ2 norm) and to avoid oscillations, smoothing techniques are used to approximate the objective function. The training samples are assumed to be supplied to the network incrementally, in a fixed order within each training cycle. Under suitable assumptions on the learning rate, the penalization coefficient, and the smoothing parameter, the weak and strong convergence of the training process for the smoothing neural networks are proved: the gradient of the smoothing error function approaches zero, and the weight sequence converges to a fixed point, respectively. It is also shown how the smoothing parameter can be updated during training so as to guarantee convergence to a Clarke stationary point of the original nonsmooth optimization problem. In addition, the original nonsmooth algorithm with the ℓ1-ℓ2 penalty is proved to converge to the same optimum as the corresponding smoothed algorithm. Numerical simulations demonstrate the convergence and effectiveness of the proposed training algorithm.
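The abstract describes cyclic gradient training with a smoothed group lasso penalty whose smoothing parameter shrinks during training. The Python sketch below illustrates that scheme for a single-hidden-layer network; the particular smoothing sqrt(||w||² + μ²), the geometric decay of μ, the tanh activation, and all hyperparameter values are illustrative assumptions, not the exact construction from the paper.

```python
import numpy as np

def smoothed_group_penalty_grad(W, mu):
    # Smoothed group lasso (l1-l2) penalty: each column of W holds the fan-in
    # weights of one hidden node (one group). The nonsmooth group norm
    # ||w_g||_2 is replaced by sqrt(||w_g||_2^2 + mu^2), differentiable
    # everywhere and tending to ||w_g||_2 as mu -> 0.
    # Gradient w.r.t. W[i, j] is W[i, j] / sqrt(sum_i W[i, j]^2 + mu^2).
    return W / np.sqrt(np.sum(W ** 2, axis=0) + mu ** 2)

def train(X, Y, n_hidden=8, eta=0.05, lam=1e-3, mu=1.0, mu_decay=0.99,
          cycles=200, seed=0):
    rng = np.random.default_rng(seed)
    W = rng.normal(scale=0.5, size=(X.shape[1], n_hidden))  # input -> hidden
    v = rng.normal(scale=0.5, size=n_hidden)                # hidden -> output
    for _ in range(cycles):
        for x, y in zip(X, Y):            # samples in a fixed order per cycle
            h = np.tanh(W.T @ x)          # hidden activations
            err = v @ h - y               # scalar output error
            # Gradient of 0.5 * err^2 plus the smoothed penalty term.
            grad_v = err * h
            grad_W = np.outer(x, err * v * (1.0 - h ** 2)) \
                     + lam * smoothed_group_penalty_grad(W, mu)
            v -= eta * grad_v
            W -= eta * grad_W
        mu *= mu_decay                    # drive the smoothing parameter to 0
    return W, v

# Toy usage: learn y = sin(x1) + 0.5 * x2 from 2-D inputs.
if __name__ == "__main__":
    rng = np.random.default_rng(1)
    X = rng.uniform(-1, 1, size=(100, 2))
    Y = np.sin(X[:, 0]) + 0.5 * X[:, 1]
    W, v = train(X, Y)
    print("group norms per hidden node:", np.linalg.norm(W, axis=0))
```

The penalty drives whole columns of the input-to-hidden weight matrix toward zero, so hidden nodes whose group norm vanishes are candidates for removal; decaying μ across cycles mirrors the paper's requirement that the smoothing parameter be updated so the iterates approach a Clarke stationary point of the original nonsmooth objective.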
Database: OpenAIRE