Superb Monocular Depth Estimation Based on Transfer Learning and Surface Normal Guidance

Autor: Qu Xingtian, Zhang Wang, Zhen Chen, Qi Haogang, Zhao Fengshang, Huang Kang, Shouqian Chen
Jazyk: angličtina
Rok vydání: 2020
Předmět:
Computer science
Point cloud
ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION
Multi-task learning
multi-task learning
monocular depth estimation
02 engineering and technology
Simultaneous localization and mapping
transfer learning
lcsh:Chemical technology
01 natural sciences
Biochemistry
Convolutional neural network
Article
Analytical Chemistry
SFM
0202 electrical engineering
electronic engineering
information engineering

Computer vision
lcsh:TP1-1185
Motion planning
Electrical and Electronic Engineering
surface normal estimation
Instrumentation
Monocular
business.industry
Deep learning
supervised deep learning
010401 analytical chemistry
Atomic and Molecular Physics
and Optics

0104 chemical sciences
SLAM
020201 artificial intelligence & image processing
Artificial intelligence
business
Normal
Zdroj: Sensors, Vol 20, Iss 4856, p 4856 (2020)
Sensors
Volume 20
Issue 17
Sensors (Basel, Switzerland)
ISSN: 1424-8220
Popis: Accurately sensing the surrounding 3D scene is indispensable for drones or robots to execute path planning and navigation. In this paper, a novel monocular depth estimation method was proposed that primarily utilizes a lighter-weight Convolutional Neural Network (CNN) structure for coarse depth prediction and then refines the coarse depth images by combining surface normal guidance. Specifically, the coarse depth prediction network is designed as pre-trained encoder&ndash
decoder architecture for describing the 3D structure. When it comes to surface normal estimation, the deep learning network was designed as a two-stream encoder&ndash
decoder structure, which hierarchically merges red-green-blue-depth (RGB-D) images for capturing more accurate geometric boundaries. Relying on fewer network parameters and simpler learning structure, better detailed depth maps are produced than the existing states. Moreover, 3D point cloud maps reconstructed from depth prediction images confirm that our framework can be conveniently adopted as components of a monocular simultaneous localization and mapping (SLAM) paradigm.
Databáze: OpenAIRE