An investigation of the robustness of distance measure-based supervised labelling of segmented remote sensing images

Author:	Kiærbech, Åshild
Contributors:	Doulgeris, Anthony
Language:	English
Year of publication:	2019
Subject:	Image segmentation; Maximum Likelihood; VDP::Mathematics and natural science: 400::Mathematics: 410::Statistics: 412; Automatic labelling; Sea ice; VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Statistikk: 412; Remote sensing; Classification; VDP::Mathematics and natural science: 400::Mathematics: 410::Analysis: 411; Gaussian Mixture Model; ComputingMethodologies_PATTERNRECOGNITION; VDP::Mathematics and natural science: 400::Information and communication science: 420::Simulation visualization signal processing image processing: 429; Sentinel-1; VDP::Matematikk og Naturvitenskap: 400::Matematikk: 410::Analyse: 411; Satellite images; FYS-3941; Expectation Maximization; VDP::Matematikk og Naturvitenskap: 400::Informasjons- og kommunikasjonsvitenskap: 420::Simulering visualisering signalbehandling bildeanalyse: 429; SAR
Description:	Unsupervised clustering methods have shown good results on remote sensing images. However, clustering yields unlabelled segments, so an additional labelling step is needed before it becomes an end-to-end classification comparable to traditional supervised classification. The automation of this labelling step needs further exploration. We investigate the robustness of a supervised automatic labelling scheme by comparing segmentation followed by automatic labelling against a fully supervised classification method. Using synthetic aperture radar (SAR) satellite images of sea ice from Sentinel-1, the segmentation is performed by an automatic Expectation Maximization method with a Gaussian mixture model that takes into account the incidence angle variation within a SAR image. The additional labelling is a likelihood majority vote related to the Mahalanobis distance measure. The Bayesian Maximum Likelihood (ML) classifier is used as the fully supervised reference method. The two approaches are compared using various amounts of training data and different percentages of mislabelling in the training data set. The classification results are compared both visually and by classification accuracy. As the training data size decreases, the accuracy of the ML method tends to decay faster than that of the segment-then-label approach, particularly when there are fewer than a hundred samples per class. As more contamination is introduced, the decay is not distinct, probably due to the large within-class variations in the training set. Based on the results, the ML method generally achieves a higher overall classification accuracy, but there are weak tendencies for the segment-then-label method to be more robust to decreasing training data size and increased mislabelling.
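The labelling step mentioned in the description can be illustrated with a small sketch. This is only one possible reading of a "majority vote related to the Mahalanobis distance", not the thesis's actual implementation: it assumes the segmentation has already produced a mean and covariance per segment, lets each labelled training sample vote for the segment it is closest to in Mahalanobis distance, and gives each segment the majority class of its votes. The function names, the toy data, and the class names "water" and "ice" are all hypothetical.

```python
import numpy as np
from collections import Counter


def mahalanobis_sq(x, mean, cov):
    """Squared Mahalanobis distance from each row of x to a Gaussian (mean, cov)."""
    diff = x - mean
    # Solve cov @ z = diff.T instead of forming the inverse explicitly.
    solved = np.linalg.solve(cov, diff.T).T
    return np.einsum("ij,ij->i", diff, solved)


def label_segments(seg_means, seg_covs, train_x, train_y):
    """Assign each segment the majority class of the training samples nearest to it.

    Hypothetical helper: an assumed voting rule, not the rule used in the thesis.
    """
    # distances[i, k]: squared distance from training sample i to segment k
    distances = np.stack(
        [mahalanobis_sq(train_x, m, c) for m, c in zip(seg_means, seg_covs)],
        axis=1,
    )
    nearest = distances.argmin(axis=1)
    labels = {}
    for k in range(len(seg_means)):
        votes = Counter(train_y[nearest == k].tolist())
        # A segment that attracts no training samples stays unlabelled (None).
        labels[k] = votes.most_common(1)[0][0] if votes else None
    return labels


# Toy data (made up): two well-separated segments, one mislabelled sample in each.
seg_means = np.array([[0.0, 0.0], [10.0, 10.0]])
seg_covs = np.array([np.eye(2), np.eye(2)])
train_x = np.array(
    [[0.1, -0.2], [0.3, 0.1], [-0.4, 0.2], [9.8, 10.1], [10.2, 9.9], [9.9, 10.3]]
)
train_y = np.array(["water", "water", "ice", "ice", "ice", "water"])

labels = label_segments(seg_means, seg_covs, train_x, train_y)
print(labels)  # {0: 'water', 1: 'ice'}
```

In this toy case each segment receives one mislabelled vote out of three, and the majority vote still recovers the intended class, which is the kind of robustness to mislabelling the description refers to.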
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::aa5296a43a7efd443e5bbffcdd1968ea https://hdl.handle.net/10037/15761