AUGMENTING DATA USING GAUSSIAN MIXTURE EMBEDDING FOR IMPROVING LAND COVER SEGMENTATION

Autor:	Oliveira, D. A. B.
Rok vydání:	2020
Předmět:	lcsh:Applied optics. Photonics lcsh:T lcsh:TA1-2040 lcsh:TA1501-1820 lcsh:Engineering (General). Civil engineering (General) lcsh:Technology
Zdroj:	ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Vol IV-3-W2-2020, Pp 71-76 (2020)
ISSN:	2194-9050
DOI:	10.5194/isprs-annals-iv-3-w2-2020-71-2020
Popis:	The use of convolutional neural networks improved greatly data synthesis in the last years and have been widely used for data augmentation in scenarios where very imbalanced data is observed, such as land cover segmentation. Balancing the proportion of classes for training segmentation models can be very challenging considering that samples where all classes are reasonably represented might constitute a small portion of a training set, and techniques for augmenting this small amount of data such as rotation, scaling and translation might be not sufficient for efficient training. In this context, this paper proposes a methodology to perform data augmentation from few samples to improve the performance of CNN-based land cover semantic segmentation. First, we estimate the latent data representation of selected training samples by means of a mixture of Gaussians, using an encoder-decoder CNN. Then, we change the latent embedding used to generate the mixture parameters, at random and in training time, to generate new mixture models slightly different from the original. Finally, we compute the displacement maps between the original and the modified mixture models, and use them to elastically deform the original images, creating new realistic samples out of the original ones. Our disentangled approach allows the spatial modification of displacement maps to preserve objects where deformation is undesired, like buildings and cars, where geometry is highly discriminant. With this simple pipeline, we managed to augment samples in training time, and improve the overall performance of two basal semantic segmentation CNN architectures for land cover semantic segmentation.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7aba66ef42e83a260a2604f8ef0cff8b https://doi.org/10.5194/isprs-annals-iv-3-w2-2020-71-2020 Zobrazit plný text záznamu