Exploiting Temporality for Semi-Supervised Video Segmentation

Authors:	Sibechi, Radu; Booij, Olaf; Baka, Nora; Bloem, Peter
Year of publication:	2019
Subject:	Computer Science - Computer Vision and Pattern Recognition; Computer Science - Machine Learning; Electrical Engineering and Systems Science - Image and Video Processing
Document type:	Working Paper
Description:	In recent years, there has been remarkable progress in supervised image segmentation. Video segmentation is less explored, despite the temporal dimension being highly informative. For example, semantic labels that cannot be accurately detected in the current frame may be inferred by incorporating information from previous frames. However, video segmentation is challenging due to the amount of data that needs to be processed and, more importantly, the cost of obtaining ground truth annotations for each frame. In this paper, we tackle the issue of label scarcity by using consecutive frames of a video, where only one frame is annotated. We propose a deep, end-to-end trainable model which leverages temporal information in order to make use of easy-to-acquire unlabeled data. Our network architecture relies on a novel interconnection of two components: a fully convolutional network to model spatial information, and temporal units that are employed at intermediate levels of the convolutional network in order to propagate information through time. The main contribution of this work is the guidance of the temporal signal through the network. We show that placing a temporal module only between the encoder and decoder (the baseline) is suboptimal. Our extensive experiments on the CityScapes dataset indicate that the resulting model can leverage unlabeled temporal frames and significantly outperform both frame-by-frame image segmentation and the baseline approach. Comment: Accepted as a workshop paper at ICCV 2019.
Database:	arXiv
External link:	http://arxiv.org/abs/1908.11309
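
The description contrasts a baseline that places a temporal module only between the encoder and decoder with a model that also employs temporal units at intermediate levels. The following is a minimal toy sketch of that distinction, not the authors' implementation. Everything in it is an assumption made for illustration: frames are 1-D lists of floats, `TemporalUnit` is a hypothetical exponential moving average standing in for a learned recurrent cell, the encoder and decoder are plain averaging and repetition, and mean squared error stands in for a segmentation loss. The only points it tries to show are where the temporal state sits in the network and that the loss is computed on the single annotated frame.

```python
from typing import List, Optional

Frame = List[float]  # toy 1-D "image"; real inputs would be image tensors


def downsample(x: Frame) -> Frame:
    """Toy encoder stage: average adjacent pairs (halves the resolution)."""
    return [(x[i] + x[i + 1]) / 2.0 for i in range(0, len(x) - 1, 2)]


def upsample(x: Frame) -> Frame:
    """Toy decoder stage: repeat each value twice (doubles the resolution)."""
    return [v for v in x for _ in range(2)]


class TemporalUnit:
    """Hypothetical recurrent unit: an exponential moving average over time.

    Stand-in for a learned recurrent cell; alpha is an assumed constant.
    """

    def __init__(self, alpha: float = 0.5) -> None:
        self.alpha = alpha
        self.state: Optional[Frame] = None

    def step(self, feat: Frame) -> Frame:
        if self.state is None:
            self.state = list(feat)  # first frame initialises the state
        else:
            self.state = [self.alpha * f + (1.0 - self.alpha) * s
                          for f, s in zip(feat, self.state)]
        return self.state


def segment_sequence(frames: List[Frame], multi_level: bool) -> Frame:
    """Run all frames through the toy network; return the last frame's output."""
    skip_unit = TemporalUnit()        # temporal unit at an intermediate level
    bottleneck_unit = TemporalUnit()  # temporal unit between encoder and decoder
    out: Frame = []
    for frame in frames:
        f1 = downsample(frame)         # encoder level 1
        f2 = downsample(f1)            # encoder level 2 (bottleneck)
        h2 = bottleneck_unit.step(f2)  # the baseline uses only this unit
        h1 = skip_unit.step(f1) if multi_level else f1
        up = upsample(h2)              # decoder level 1
        fused = [a + b for a, b in zip(up, h1)]  # skip connection
        out = upsample(fused)          # back to input resolution
    return out


def labeled_frame_loss(pred: Frame, target: Frame) -> float:
    """Mean squared error on the single annotated (last) frame only."""
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(target)


if __name__ == "__main__":
    # Two consecutive frames; only the last one is assumed to carry a label.
    frames = [[1.0] * 8, [0.0] * 8]
    label = [1.0] * 8
    for multi_level in (False, True):
        pred = segment_sequence(frames, multi_level)
        print(multi_level, pred, labeled_frame_loss(pred, label))
```

In this toy setup the earlier, unlabeled frame influences the final prediction through the recurrent state, and with `multi_level=True` it does so at two resolutions rather than one. The numbers it prints carry no meaning beyond showing the data flow.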