Details preserved unsupervised depth estimation by fusing traditional stereo knowledge from laparoscopic images

Autor: Qingmao Hu, Fucang Jia, Huoling Luo
Rok vydání: 2019
Předmět:
Computer science
Point cloud
Special Issue: Papers from the 13th Workshop on Augmented Environments for Computer Assisted Interventions
heart phantom data
truth depth
Convolutional neural network
medical image processing
principled mask
surgery
unsupervised learning depth estimation approach
Health Information Management
proxy disparity labels
vision-based laparoscope surgical navigation systems
Computer vision
laparoscopic images
Ground truth
unsupervised depth estimation
stereo accuracy
image reconstruction
neighbourhood smoothness term
proxy labels
loss function
lcsh:R855-855.5
unreliable depth measurements
Unsupervised learning
dual encoder-decoder convolutional neural network
traditional stereo method
traditional stereo knowledge
lcsh:Medical technology
ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION
phantoms
Health Informatics
Iterative reconstruction
stereo image processing
unsupervised learning
image motion analysis
rectified stereo images
convolutional neural nets
disparity images
parallax effects
ComputingMethodologies_COMPUTERGRAPHICS
constrain neighbouring pixels
Pixel
business.industry
smooth depth surface
confidence measure
Artificial intelligence
partial nephrectomy da vinci surgery dataset
Estimation methods
Parallax
business
hamlyn centre
Zdroj: Healthcare Technology Letters (2019)
Healthcare Technology Letters
ISSN: 2053-3713
Popis: Depth estimation plays an important role in vision-based laparoscope surgical navigation systems. Most learning-based depth estimation methods require ground truth depth or disparity images for training; however, these data are difficult to obtain in laparoscopy. The authors present an unsupervised learning depth estimation approach by fusing traditional stereo knowledge. The traditional stereo method is used to generate proxy disparity labels, in which unreliable depth measurements are removed via a confidence measure to improve stereo accuracy. The disparity images are generated by training a dual encoder–decoder convolutional neural network from rectified stereo images coupled with proxy labels generated by the traditional stereo method. A principled mask is computed to exclude the pixels, which are not seen in one of views due to parallax effects from the calculation of loss function. Moreover, the neighbourhood smoothness term is employed to constrain neighbouring pixels with similar appearances to generate a smooth depth surface. This approach can make the depth of the projected point cloud closer to the real surgical site and preserve realistic details. The authors demonstrate the performance of the method by training and evaluation with a partial nephrectomy da Vinci surgery dataset and heart phantom data from the Hamlyn Centre.
Databáze: OpenAIRE