Showing 1 - 2 of 2 for search: '"Nicholas Baskerville"'
Author:
Nicholas Baskerville, Diego Granziol
Published in:
Journal of Physics: Complexity. 3:024001
We conjecture that the inherent difference in generalisation between adaptive and non-adaptive gradient methods in deep learning stems from the increased estimation noise in the flattest directions of the true loss surface. We demonstrate that typica