Learning Optical Flow with R-CNN for Visual Odometry

Authors:	Baigan Zhao, Xing Hu, Chong Gao, Yingping Huang
Year of publication:	2021
Subject:	Recurrent neural network; Computer science; Motion estimation; Feature (machine learning); Optical flow; Pattern recognition; Sequence learning; Artificial intelligence; Visual odometry; Representation (mathematics); Encoder
Source:	ICRA
DOI:	10.1109/icra48506.2021.9562074
Description:	Addressing the monocular visual odometry problem, this paper presents a novel end-to-end network for estimating camera ego-motion. The network learns the latent space of optical flow (OF) and models sequential dynamics, so that motion estimation is constrained by the relations between sequential images. We compute the OF field of consecutive images and extract the latent OF representation in a self-encoding manner. A recurrent neural network then examines the changes in OF, i.e., it conducts sequential learning. The extracted sequential OF latent representation is used to regress the 6-dimensional pose vector. In particular, we train the encoder separately in an unsupervised manner. In this way, we avoid non-convergence during training of the whole network and obtain a more generalized and effective feature representation. Extensive experiments have been conducted on the KITTI and Malaga datasets, and the results demonstrate that our model outperforms most learning-based VO approaches.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_________::4eda047922ddef04b692ee6a09f67a69 https://doi.org/10.1109/icra48506.2021.9562074
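
The data flow outlined in the description (OF field, latent encoding, recurrent layer, 6-dimensional pose regression) can be illustrated with a minimal sketch. This is not the authors' implementation: the class name FlowPoseSketch, the flattened flow vectors, the layer sizes, the tanh activations, the simple Elman-style recurrent cell, and the random untrained weights are all assumptions made for illustration. The record does not specify the actual encoder or recurrent architecture, nor how the six pose components are ordered.

```python
import math
import random


def linear(weights, bias, x):
    """Compute W @ x + b for a weight matrix stored as a list of rows."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def init_matrix(rows, cols, rng, scale=0.1):
    """Small random weights; a trained model would load learned values."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)]
            for _ in range(rows)]


class FlowPoseSketch:
    """Hypothetical encoder -> recurrent cell -> 6-D pose head (untrained)."""

    def __init__(self, flow_dim, latent_dim, hidden_dim, seed=0):
        rng = random.Random(seed)
        self.hidden_dim = hidden_dim
        # Stand-in for the encoder half of a separately trained autoencoder.
        self.w_enc = init_matrix(latent_dim, flow_dim, rng)
        self.b_enc = [0.0] * latent_dim
        # Elman-style recurrent cell (assumed; the paper's RNN may differ).
        self.w_in = init_matrix(hidden_dim, latent_dim, rng)
        self.w_rec = init_matrix(hidden_dim, hidden_dim, rng)
        self.b_rec = [0.0] * hidden_dim
        # Regression head producing a 6-dimensional pose vector.
        self.w_out = init_matrix(6, hidden_dim, rng)
        self.b_out = [0.0] * 6

    def encode(self, flow):
        """Map one flattened OF field to its latent representation."""
        return [math.tanh(v) for v in linear(self.w_enc, self.b_enc, flow)]

    def forward(self, flow_sequence):
        """Return one 6-D pose estimate per OF field in the sequence."""
        hidden = [0.0] * self.hidden_dim
        no_bias = [0.0] * self.hidden_dim
        poses = []
        for flow in flow_sequence:
            latent = self.encode(flow)
            from_input = linear(self.w_in, self.b_rec, latent)
            from_state = linear(self.w_rec, no_bias, hidden)
            # The hidden state carries information from earlier frames.
            hidden = [math.tanh(a + b) for a, b in zip(from_input, from_state)]
            poses.append(linear(self.w_out, self.b_out, hidden))
        return poses


# Usage: five synthetic flattened OF fields of length 32 (placeholder data).
rng = random.Random(1)
flows = [[rng.uniform(-1.0, 1.0) for _ in range(32)] for _ in range(5)]
model = FlowPoseSketch(flow_dim=32, latent_dim=8, hidden_dim=16)
poses = model.forward(flows)
print(len(poses), len(poses[0]))
```

Because the hidden state is carried across frames, the same OF field can yield a different pose estimate depending on what preceded it, which is the sense in which the estimate is constrained by the relations between sequential images.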