Description: |
Neural networks (NNs) achieve state-of-the-art performance in many problem domains. They can accommodate a vast number of parameters and still generalize well, whereas classic machine learning techniques given the same number of parameters would tend to overfit. To further the understanding of such incongruities, we develop a metric called the expected spanning dimension (ESD), which measures the intrinsic flexibility of an NN. We analyze NNs ranging from small networks, for which the ESD can be computed exactly, to large real-world networks with millions of parameters, for which we demonstrate how the ESD can be approximated numerically and efficiently. The small NNs we study can be understood in detail, and they offer opportunities to analyze their performance from a theoretical perspective. Applying the ESD to large-scale NNs, on the other hand, sheds light on their relative generalization performance and suggests how such NNs may be improved.