Inference and Learning for Generative Capsule Models

Author:	Alfredo Nazabal, Nikolaos Tsagkas, Christopher K. I. Williams
Language:	English
Publication year:	2023
Subject:	FOS: Computer and information sciences; Computer Science - Machine Learning; RANSAC; Arts and Humanities (miscellaneous); Cognitive Neuroscience; Computer Vision and Pattern Recognition (cs.CV); permutation matrix; Computer Science - Computer Vision and Pattern Recognition; Capsules; Sinkhorn-Knopp algorithm; Machine Learning (cs.LG); variational inference
Source:	Nazábal, A, Tsagkas, N & Williams, C K I 2023, 'Inference and Learning for Generative Capsule Models', Neural Computation, vol. 35, no. 4, pp. 727-761. https://doi.org/10.1162/neco_a_01564
DOI:	10.1162/neco_a_01564
Description:	Capsule networks (see, e.g., Hinton et al., 2018) aim to encode knowledge of, and reason about, the relationship between an object and its parts. In this paper we specify a generative model for such data and derive a variational algorithm for inferring the transformation of each model object in a scene and the assignments of observed parts to the objects. We derive a learning algorithm for the object models based on variational expectation maximization (Jordan et al., 1999). We also study an alternative inference algorithm based on the RANSAC method of Fischler and Bolles (1981). We apply these inference methods to (i) data generated from multiple geometric objects like squares and triangles ("constellations"), and (ii) data from a parts-based model of faces. Recent work by Kosiorek et al. (2019) has used amortized inference via stacked capsule autoencoders (SCAEs) to tackle this problem; our results show that we significantly outperform them where we can make comparisons (on the constellations data). Comment: 31 pages, 6 figures. This paper extends our previous work (arXiv:2103.06676) by covering the learning of the models as well as inference. Paper accepted for publication in Neural Computation.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1627dfcea9632dd31692e9ac59141237 https://hdl.handle.net/20.500.11820/d547617c-d731-4225-abb9-6b5963368b97