A deterministic gradient-based approach to avoid saddle points

Autor:	L. M. Kreusser, S. J. Osher, B. Wang
Rok vydání:	2022
Předmět:	FOS: Computer and information sciences Computer Science - Machine Learning math.NA Applied Mathematics cs.LG Machine Learning (stat.ML) Numerical Analysis (math.NA) Dynamical Systems (math.DS) stat.ML Machine Learning (cs.LG) Statistics - Machine Learning FOS: Mathematics Mathematics - Numerical Analysis Mathematics - Dynamical Systems cs.NA math.DS
Zdroj:	Kreusser, L M, Osher, S J & Wang, B 2023, ' A Deterministic Gradient-Based Approach to Avoid Saddle Points ', European Journal of Applied Mathematics, vol. 34, no. 4, pp. 738-757 . https://doi.org/10.1017/S0956792522000316
ISSN:	1469-4425 0956-7925
DOI:	10.1017/s0956792522000316
Popis:	Loss functions with a large number of saddle points are one of the major obstacles for training modern machine learning (ML) models efficiently. First-order methods such as gradient descent (GD) are usually the methods of choice for training ML models. However, these methods converge to saddle points for certain choices of initial guesses. In this paper, we propose a modification of the recently proposed Laplacian smoothing gradient descent (LSGD) [Osher et al., arXiv:1806.06317], called modified LSGD (mLSGD), and demonstrate its potential to avoid saddle points without sacrificing the convergence rate. Our analysis is based on the attraction region, formed by all starting points for which the considered numerical scheme converges to a saddle point. We investigate the attraction region’s dimension both analytically and numerically. For a canonical class of quadratic functions, we show that the dimension of the attraction region for mLSGD is $\lfloor (n-1)/2\rfloor$ , and hence it is significantly smaller than that of GD whose dimension is $n-1$ .
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::138ae8b4831bd0df4615e38acd89b9c9 https://doi.org/10.1017/s0956792522000316 Zobrazit plný text záznamu