Noisy-LSTM: Improving Temporal Awareness for Video Semantic Segmentation

Autor:	Bowen Wang, Liangzhi Li, Yuta Nakashima, Ryo Kawasaki, Hajime Nagahara, Yasushi Yagi
Jazyk:	angličtina
Rok vydání:	2021
Předmět:	Video semantic segmentation noisy training temporal awareness Electrical engineering. Electronics. Nuclear engineering TK1-9971
Zdroj:	IEEE Access, Vol 9, Pp 46810-46820 (2021)
Druh dokumentu:	article
ISSN:	2169-3536
DOI:	10.1109/ACCESS.2021.3067928
Popis:	Semantic video segmentation is a key challenge for various applications. This paper presents a new model named Noisy-LSTM, which is trainable in an end-to-end manner, with convolutional LSTMs (ConvLSTMs) to leverage the temporal coherence in video frames, together with a simple yet effective training strategy that replaces a frame in a given video sequence with noises. Our training strategy spoils the temporal coherence in video frames and thus makes the temporal links in ConvLSTMs unreliable; this may consequently improve the ability of the model to extract features from video frames and serve as a regularizer to avoid overfitting, without requiring extra data annotations or computational costs. Experimental results demonstrate that the proposed model can achieve state-of-the-art performances on both the CityScapes and EndoVis2018 datasets. The code for the proposed method is available at https://github.com/wbw520/NoisyLSTM.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/2ea7ab32bd3342e9b5368c4c026f378a Zobrazit plný text záznamu View record in DOAJ