The Importance of Loss Functions for Increasing the Generalization Abilities of a Deep Learning-Based Next Frame Prediction Model for Traffic Scenes

Autor:	Sandra Aigner, Marco Körner
Jazyk:	angličtina
Rok vydání:	2020
Předmět:	traffic scene prediction video prediction generalization convolutional LSTMs recurrent neural networks machine learning Computer engineering. Computer hardware TK7885-7895
Zdroj:	Machine Learning and Knowledge Extraction, Vol 2, Iss 2, Pp 78-98 (2020)
Druh dokumentu:	article
ISSN:	2504-4990
DOI:	10.3390/make2020006
Popis:	This paper analyzes in detail how different loss functions influence the generalization abilities of a deep learning-based next frame prediction model for traffic scenes. Our prediction model is a convolutional long-short term memory (ConvLSTM) network that generates the pixel values of the next frame after having observed the raw pixel values of a sequence of four past frames. We trained the model with 21 combinations of seven loss terms using the Cityscapes Sequences dataset and an identical hyper-parameter setting. The loss terms range from pixel-error based terms to adversarial terms. To assess the generalization abilities of the resulting models, we generated predictions up to 20 time-steps into the future for four datasets of increasing visual distance to the training dataset—KITTI Tracking, BDD100K, UA-DETRAC, and KIT AIS Vehicles. All predicted frames were evaluated quantitatively with both traditional pixel-based evaluation metrics, that is, mean squared error (MSE), peak signal-to-noise ratio (PSNR), and structural similarity index (SSIM), and recent, more advanced, feature-based evaluation metrics, that is, Fréchet inception distance (FID), and learned perceptual image patch similarity (LPIPS). The results show that solely by choosing a different combination of losses, we can boost the prediction performance on new datasets by up to 55%, and by up to 50% for long-term predictions.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/58b3690794c44895a6e8cc51afa25d08 Zobrazit plný text záznamu View record in DOAJ