How Infinitely Wide Neural Networks Can Benefit from Multi-task Learning -- an Exact Macroscopic Characterization

Autor:	Heiss, Jakob, Teichmann, Josef, Wutte, Hanna
Rok vydání:	2021
Předmět:	Computer Science - Machine Learning Statistics - Machine Learning 68T07 68Q32 I.2
Druh dokumentu:	Working Paper
DOI:	10.3929/ethz-b-000550890
Popis:	In practice, multi-task learning (through learning features shared among tasks) is an essential property of deep neural networks (NNs). While infinite-width limits of NNs can provide good intuition for their generalization behavior, the well-known infinite-width limits of NNs in the literature (e.g., neural tangent kernels) assume specific settings in which wide ReLU-NNs behave like shallow Gaussian Processes with a fixed kernel. Consequently, in such settings, these NNs lose their ability to benefit from multi-task learning in the infinite-width limit. In contrast, we prove that optimizing wide ReLU neural networks with at least one hidden layer using L2-regularization on the parameters promotes multi-task learning due to representation-learning - also in the limiting regime where the network width tends to infinity. We present an exact quantitative characterization of this infinite width limit in an appropriate function space that neatly describes multi-task learning. Comment: 13 pages + appendix
Databáze:	arXiv
Externí odkaz:	http://arxiv.org/abs/2112.15577 Zobrazit plný text záznamu View this record from Arxiv