Deep convolutional networks as shallow Gaussian processes

Autor:	Adrià Garriga-Alonso, Aitchison, L., Rasmussen, C. E.
Přispěvatelé:	Rasmussen, Carl [0000-0001-8899-7850], Apollo - University of Cambridge Repository
Rok vydání:	2019
Předmět:	FOS: Computer and information sciences Computer Science - Machine Learning Statistics - Machine Learning Computer Science::Computer Vision and Pattern Recognition Computer Science::Neural and Evolutionary Computation cs.LG Machine Learning (stat.ML) stat.ML Machine Learning (cs.LG)
Zdroj:	Scopus-Elsevier
DOI:	10.17863/cam.42340
Popis:	We show that the output of a (residual) convolutional neural network (CNN) with an appropriate prior over the weights and biases is a Gaussian process (GP) in the limit of infinitely many convolutional filters, extending similar results for dense networks. For a CNN, the equivalent kernel can be computed exactly and, unlike "deep kernels", has very few parameters: only the hyperparameters of the original CNN. Further, we show that this kernel has two properties that allow it to be computed efficiently; the cost of evaluating the kernel for a pair of images is similar to a single forward pass through the original CNN with only one filter per layer. The kernel equivalent to a 32-layer ResNet obtains 0.84% classification error on MNIST, a new record for GPs with a comparable number of parameters.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::e8dad6c0c80a44c07b90a6ac549ee311 Zobrazit plný text záznamu