Indoor Scene Recognition in 3D

Autor: Konrad Schindler, Shengyu Huang, Mikhail Usvyatsov
Jazyk: angličtina
Rok vydání: 2020
Předmět:
FOS: Computer and information sciences
Computer Science - Machine Learning
Computer science
Computer Vision and Pattern Recognition (cs.CV)
media_common.quotation_subject
Point cloud
ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION
Computer Science - Computer Vision and Pattern Recognition
02 engineering and technology
010501 environmental sciences
Semantics
computer.software_genre
01 natural sciences
Machine Learning (cs.LG)
Computer Science - Robotics
Voxel
Perception
0202 electrical engineering
electronic engineering
information engineering

Segmentation
Computer vision
0105 earth and related environmental sciences
media_common
ComputingMethodologies_COMPUTERGRAPHICS
business.industry
Object (computer science)
Task analysis
Robot
020201 artificial intelligence & image processing
Artificial intelligence
business
Robotics (cs.RO)
computer
Zdroj: IROS
Popis: Recognising in what type of environment one is located is an important perception task. For instance, for a robot operating in indoors it is helpful to be aware whether it is in a kitchen, a hallway or a bedroom. Existing approaches attempt to classify the scene based on 2D images or 2.5D range images. Here, we study scene recognition from 3D point cloud (or voxel) data, and show that it greatly outperforms methods based on 2D birds-eye views. Moreover, we advocate multi-task learning as a way of improving scene recognition, building on the fact that the scene type is highly correlated with the objects in the scene, and therefore with its semantic segmentation into different object classes. In a series of ablation studies, we show that successful scene recognition is not just the recognition of individual objects unique to some scene type (such as a bathtub), but depends on several different cues, including coarse 3D geometry, colour, and the (implicit) distribution of object categories. Moreover, we demonstrate that surprisingly sparse 3D data is sufficient to classify indoor scenes with good accuracy.
IROS 2020 - Camera Ready
Databáze: OpenAIRE