Voxel- and Bird’s-Eye-View-Based Semantic Scene Completion for LiDAR Point Clouds

Authors: Li Liang, Naveed Akhtar, Jordan Vice, Ajmal Mian
Language: English
Year of publication: 2024
Subject:
Source: Remote Sensing, Vol 16, Iss 13, p 2266 (2024)
Document type: article
ISSN: 2072-4292
DOI: 10.3390/rs16132266
Description: Semantic scene completion is a crucial outdoor scene understanding task that has direct implications for technologies like autonomous driving and robotics. It compensates for unavoidable occlusions and partial measurements in LiDAR scans, which may otherwise cause catastrophic failures. Due to the inherent complexity of this task, existing methods generally rely on complex and computationally demanding scene completion models, which limits their practicality in downstream applications. To address this, we propose a novel integrated network that combines the strengths of 3D and 2D semantic scene completion techniques for efficient LiDAR point cloud scene completion. Our network leverages a newly devised lightweight multi-scale convolutional block (MSB) to efficiently aggregate multi-scale features, thereby improving the identification of small and distant objects. It further utilizes a layout-aware semantic block (LSB), developed to grasp the overall layout of the scene to precisely guide the reconstruction and recognition of features. We also develop a feature fusion module (FFM) for effective interaction between the data derived from the two disparate streams in our network, ensuring a robust and cohesive scene completion process. Extensive experiments with the popular SemanticKITTI dataset demonstrate that our method achieves highly competitive performance, with an mIoU of 35.7 and an IoU of 51.4. Notably, the proposed method achieves an mIoU improvement of 2.6% compared to previous methods.
Database: Directory of Open Access Journals