Gradual Tuning: a better way of Fine Tuning the parameters of a Deep Neural Network

Autor:	Montone, Guglielmo, O'Regan, J. Kevin, Terekhov, Alexander V.
Rok vydání:	2017
Předmět:	Computer Science - Artificial Intelligence Computer Science - Neural and Evolutionary Computing
Druh dokumentu:	Working Paper
Popis:	In this paper we present an alternative strategy for fine-tuning the parameters of a network. We named the technique Gradual Tuning. Once trained on a first task, the network is fine-tuned on a second task by modifying a progressively larger set of the network's parameters. We test Gradual Tuning on different transfer learning tasks, using networks of different sizes trained with different regularization techniques. The result shows that compared to the usual fine tuning, our approach significantly reduces catastrophic forgetting of the initial task, while still retaining comparable if not better performance on the new task.
Databáze:	arXiv
Externí odkaz:	http://arxiv.org/abs/1711.10177 Zobrazit plný text záznamu View this record from Arxiv