Pix2Point : learning outdoor 3D using sparse point clouds and optimal transport

Autor:	B. Le Saux, Marcela Carvalho, Frédéric Champagnat, Rémy Leroy, Pauline Trouvé-Peloux
Přispěvatelé:	DTIS, ONERA, Université Paris Saclay [Palaiseau], ONERA-Université Paris-Saclay, ESA Centre for Earth Observation (ESRIN), European Space Agency (ESA), UPCITY, Montreuil, France, GREC, christine
Jazyk:	angličtina
Rok vydání:	2021
Předmět:	FOS: Computer and information sciences Minimisation (psychology) 0209 industrial biotechnology ESTIMATION 3D NUAGES DE POINTS [SPI] Engineering Sciences [physics] Machine vision Computer science Computer Vision and Pattern Recognition (cs.CV) Point cloud Computer Science - Computer Vision and Pattern Recognition ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION [MATH] Mathematics [math] 02 engineering and technology [INFO] Computer Science [cs] [PHYS] Physics [physics] Hybrid neural network [SPI]Engineering Sciences [physics] 020901 industrial engineering & automation 0202 electrical engineering electronic engineering information engineering Computer vision [INFO]Computer Science [cs] [MATH]Mathematics [math] Divergence (statistics) [PHYS]Physics [physics] I.2.10 Monocular Artificial neural network I.4.5 business.industry Deep learning APPRENTISSAGE PROFOND 020201 artificial intelligence & image processing Artificial intelligence business
Zdroj:	MVA 2021 MVA 2021, Jul 2021, VIRTUEL, Japan HAL MVA
Popis:	Good quality reconstruction and comprehension of a scene rely on 3D estimation methods. The 3D information was usually obtained from images by stereo-photogrammetry, but deep learning has recently provided us with excellent results for monocular depth estimation. Building up a sufficiently large and rich training dataset to achieve these results requires onerous processing. In this paper, we address the problem of learning outdoor 3D point cloud from monocular data using a sparse ground-truth dataset. We propose Pix2Point, a deep learning-based approach for monocular 3D point cloud prediction, able to deal with complete and challenging outdoor scenes. Our method relies on a 2D-3D hybrid neural network architecture, and a supervised end-to-end minimisation of an optimal transport divergence between point clouds. We show that, when trained on sparse point clouds, our simple promising approach achieves a better coverage of 3D outdoor scenes than efficient monocular depth methods. 5 pages, 2 figures, to be published in 2021 International Conference on Machine Vision Applications
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5bd4d9fff49b92ab9ef7788f2f56fdac https://hal.archives-ouvertes.fr/hal-03365994/document Zobrazit plný text záznamu