Leveraging Metadata in Representation Learning With Georeferenced Seafloor Imagery
Autor: | Miquel Massot-Campos, Stefan B. Williams, Takaki Yamada, Oscar Pizarro, Adam Prügel-Bennett, Blair Thornton |
---|---|
Rok vydání: | 2021 |
Předmět: |
Control and Optimization
Contextual image classification business.industry Computer science Mechanical Engineering Biomedical Engineering Iterative reconstruction 01 natural sciences Class (biology) Computer Science Applications Visualization 010309 optics Human-Computer Interaction Metadata Artificial Intelligence Control and Systems Engineering 0103 physical sciences Leverage (statistics) Computer vision Computer Vision and Pattern Recognition Artificial intelligence Motion planning business Feature learning |
Zdroj: | Leveraging Metadata in Representation Learning With Georeferenced Seafloor Imagery |
ISSN: | 2377-3774 |
DOI: | 10.1109/lra.2021.3101881 |
Popis: | Camera equipped Autonomous Underwater Vehicles (AUVs) are now routinely used in seafloor surveys. Obtaining effective representations from the images they collect can enable perception-aware robotic exploration such as information-gain-guided path planning and target-driven visual navigation. This letter develops a novel self-supervised representation learning method for seafloor images collected by AUVs. The method allows deep-learning convolutional autoencoders to leverage multiple sources of metadata to regularise their learning, prioritising features observed in images that can be correlated with patterns in their metadata. The impact of the proposed regularisation is examined on a dataset consisting of more than 30 k colour seafloor images gathered by an AUV off the coast of Tasmania. The metadata used to regularise learning in this dataset consists of the horizontal location and depth of the observed seafloor. The results show that including metadata in self-supervised representation learning can increase image classification accuracy by up to 15% and never degrades learning performance. We show how effective representation learning can be applied to achieve class balanced representative image identification for summarised understanding of imbalanced class distributions in an unsupervised way. |
Databáze: | OpenAIRE |
Externí odkaz: |