An investigation of the robustness of distance measure-based supervised labelling of segmented remote sensing images

Autor: Kiærbech, Åshild
Přispěvatelé: Doulgeris, Anthony
Jazyk: angličtina
Rok vydání: 2019
Předmět:
Image segmentation
Maximum Likelihood
VDP::Mathematics and natural science: 400::Mathematics: 410::Statistics: 412
Automatic labelling
Sea ice
VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Statistikk: 412
Remote sensing
Classification
VDP::Mathematics and natural science: 400::Mathematics: 410::Analysis: 411
Gaussian Mixture Model
ComputingMethodologies_PATTERNRECOGNITION
VDP::Mathematics and natural science: 400::Information and communication science: 420::Simulation
visualization
signal processing
image processing: 429

Sentinel-1
VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Analyse: 411
Satellite images
FYS-3941
Expectation Maximization
VDP::Matematikk og Naturvitenskap: 400::Informasjons- og kommunikasjonsvitenskap: 420::Simulering
visualisering
signalbehandling
bildeanalyse: 429

SAR
Popis: Unsupervised clustering methods on remote sensing images have shown good results. However, this type of machine learning needs additional labelling to be an end-to-end classification in the same manner as traditional supervised classification. The automation of the labelling needs further exploration. We want to investigate the robustness of a supervised automatic labelling scheme by comparing a segmentation with additional automatic labelling against a supervised classification method. Using synthetic aperture radar (SAR) satellite images of sea ice from Sentinel-1, an automatic Expectation Maximization method with a Gaussian mixture model is used for the segmentation, taking into consideration the incidence angle variation within a SAR image. The additional labelling is a likelihood majority vote related to the Mahalanobis distance measure. The Bayesian Maximum Likelihood (ML) is used as the fully supervised reference method. The experiments of comparison are done using various amounts of training data and different percentages of mislabelling in the training data set. The classification results are compared both visually and using classification accuracy. As training data size increases, the accuracy of the ML method tends to decay faster than for the segment-then-label approach, particularly when sample sizes per class are less than a hundred. As more contamination is introduced, the decay is not distinct, probably due to the large within-class variations in the training set. Based on the results, the ML method generally gets a higher overall classification accuracy, but there are weak tendencies for the segment-then-label method to be more robust to decreasing training data size and more mislabelling.
Databáze: OpenAIRE