SALR: Sharpness-aware Learning Rate Scheduler for Improved Generalization

Autor:	Xubo Yue, Maher Nouiehed, Raed Al Kontar
Rok vydání:	2020
Předmět:	FOS: Computer and information sciences Computer Science - Machine Learning Artificial Intelligence Computer Networks and Communications Statistics - Machine Learning Machine Learning (stat.ML) Software Computer Science Applications Machine Learning (cs.LG)
DOI:	10.48550/arxiv.2011.05348
Popis:	In an effort to improve generalization in deep learning and automate the process of learning rate scheduling, we propose SALR: a sharpness-aware learning rate update technique designed to recover flat minimizers. Our method dynamically updates the learning rate of gradient-based optimizers based on the local sharpness of the loss function. This allows optimizers to automatically increase learning rates at sharp valleys to increase the chance of escaping them. We demonstrate the effectiveness of SALR when adopted by various algorithms over a broad range of networks. Our experiments indicate that SALR improves generalization, converges faster, and drives solutions to significantly flatter regions.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5a43a63fbd04e7f404a3955f7379248b Zobrazit plný text záznamu