Finite Sample Identification of Wide Shallow Neural Networks with Biases

Autor:	Fornasier, Massimo, Klock, Timo, Mondelli, Marco, Rauchensteiner, Michael
Rok vydání:	2022
Předmět:	Computer Science - Machine Learning Statistics - Machine Learning 65D15 68T07 90C26
Druh dokumentu:	Working Paper
Popis:	Artificial neural networks are functions depending on a finite number of parameters typically encoded as weights and biases. The identification of the parameters of the network from finite samples of input-output pairs is often referred to as the \emph{teacher-student model}, and this model has represented a popular framework for understanding training and generalization. Even if the problem is NP-complete in the worst case, a rapidly growing literature -- after adding suitable distributional assumptions -- has established finite sample identification of two-layer networks with a number of neurons $m=\mathcal O(D)$, $D$ being the input dimension. For the range $D
Databáze:	arXiv
Externí odkaz:	http://arxiv.org/abs/2211.04589 Zobrazit plný text záznamu View this record from Arxiv