Popis: |
This paper explores the idea of extracting a dense 3D point cloud corresponding to salient features in a video. The goal is to generate the dense point cloud efficiently, in order to use the information in various other video processing tasks. We present a method that is capable of extracting 3D information of videos with no previous knowledge of the scene, while keeping computational costs low. Our method exploits the movement of the camera while robustly tracking features over time, in order to obtain multiple views of a scene and perform 3D reconstruction. Additionally, our system is able to cope with individually moving people seen in the videos, and can estimate each person's pose and fit a 3D model to it. This 3D model is inserted into the dense point cloud in order to visualize the reconstructed scenes, and does not affect the tracking of the rest of the scene |