EBStereo: edge-based loss function for real-time stereo matching.

Autor: Bi, Weijie, Chen, Ming, Wu, Dongliu, Lu, Shenglian
Předmět:
Zdroj: Visual Computer; Apr2024, Vol. 40 Issue 4, p2975-2986, 12p
Abstrakt: Deep learning-based stereo matching has made significant progress, but it still faces challenges: The disparity prediction error maps of current models show that errors are concentrated primarily on object boundaries. We find that executing the smooth L1 loss function on the entire region during stereo matching model training cannot effectively address the imbalance between edge regions and flat regions, resulting in poor disparity estimates for edge regions. In this paper, a new weighted smooth L1 loss function, which considers the loss function calculation on edge regions and can yield improved accuracy, is proposed. An improved bilateral grid upsampling module is also added to the training model, and a strategy is adopted to balance the computational consumption introduced by the new loss function-weighted item, allowing for real-time inference. Extensive experiments conducted on two datasets, i.e., Scene Flow and KITTI, verify the simplicity and effectiveness of this approach. Under the condition of 33 frames per second (FPS), the endpoint error of the proposed model can be improved to 0.63. In addition, the proposed edge-based loss function can be easily embedded into many existing stereo matching networks, such as GwcNet, AANet, and PSMNet. After embedding the proposed edge-based loss function, the reduction rates of the endpoint errors of the existing models can be improved to 3.5%, 11.6%, and 27.2% for GwcNet, AANet, and PSMNet, respectively. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index