Multi-task deep learning for image segmentation using recursive approximation tasks
Autor: | Nicolas Papadakis, Rihuan Ke, Carola-Bibiane Schönlieb, Aurélie Bugeau, Mark Kirkland, Peter Schuetz |
---|---|
Přispěvatelé: | Department of Applied Mathematics and Theoretical Physics (DAMTP), University of Cambridge [UK] (CAM), Laboratoire Bordelais de Recherche en Informatique (LaBRI), Université de Bordeaux (UB)-École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB)-Centre National de la Recherche Scientifique (CNRS), Institut de Mathématiques de Bordeaux (IMB), Université Bordeaux Segalen - Bordeaux 2-Université Sciences et Technologies - Bordeaux 1 (UB)-Université de Bordeaux (UB)-Institut Polytechnique de Bordeaux (Bordeaux INP)-Centre National de la Recherche Scientifique (CNRS), European Project: 777826,NoMADS(2018), Université de Bordeaux (UB)-Centre National de la Recherche Scientifique (CNRS)-École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB), Université Bordeaux Segalen - Bordeaux 2-Université Sciences et Technologies - Bordeaux 1-Université de Bordeaux (UB)-Institut Polytechnique de Bordeaux (Bordeaux INP)-Centre National de la Recherche Scientifique (CNRS) |
Jazyk: | angličtina |
Rok vydání: | 2020 |
Předmět: |
FOS: Computer and information sciences
Computer Science - Machine Learning Computer science Computer Vision and Pattern Recognition (cs.CV) ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION Computer Science - Computer Vision and Pattern Recognition 02 engineering and technology Task (project management) Machine Learning (cs.LG) [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing 0202 electrical engineering electronic engineering information engineering FOS: Electrical engineering electronic engineering information engineering Segmentation Sequence business.industry Deep learning Image and Video Processing (eess.IV) Pattern recognition Image segmentation Electrical Engineering and Systems Science - Image and Video Processing Object (computer science) Computer Graphics and Computer-Aided Design Constraint (information theory) Task analysis 020201 artificial intelligence & image processing Artificial intelligence business Software |
Zdroj: | IEEE Transactions on Image Processing IEEE Transactions on Image Processing, 2021, 30, pp.3555-3567. ⟨10.1109/TIP.2021.3062726⟩ IEEE Transactions on Image Processing, Institute of Electrical and Electronics Engineers, 2021, 30, pp.3555-3567. ⟨10.1109/TIP.2021.3062726⟩ HAL |
ISSN: | 1057-7149 |
DOI: | 10.1109/TIP.2021.3062726⟩ |
Popis: | International audience; Fully supervised deep neural networks for segmentation usually require a massive amount of pixel-level labels which are manually expensive to create. In this work, we develop a multi-task learning method to relax this constraint. We regard the segmentation problem as a sequence of approximation subproblems that are recursively defined and in increasing levels of approximation accuracy. The subproblems are handled by a framework that consists of 1) a segmentation task that learns from pixel-level ground truth segmentation masks of a small fraction of the images, 2) a recursive approximation task that conducts partial object regions learning and data-driven mask evolution starting from partial masks of each object instance, and 3) other problem oriented auxiliary tasks that are trained with sparse annotations and promote the learning of dedicated features. Most training images are only labeled by (rough) partial masks, which do not contain exact object boundaries, rather than by their full segmentation masks. During the training phase, the approximation task learns the statistics of these partial masks, and the partial regions are recursively increased towards object boundaries aided by the learned information from the segmentation task in a fully data-driven fashion. The network is trained on an extremely small amount of precisely segmented images and a large set of coarse labels. Annotations can thus be obtained in a cheap way. We demonstrate the efficiency of our approach in three applications with microscopy images and ultrasound images. |
Databáze: | OpenAIRE |
Externí odkaz: |