Cross-modal Transfer Learning Based on an Improved CycleGAN Model for Accurate Kidney Segmentation in Ultrasound Images.

Autor: Guo S; School of Computer Science and Technology, Nanjing Tech University, Nanjing, China., Chen H; School of Computer Science and Technology, Nanjing Tech University, Nanjing, China., Sheng X; School of Computer Science and Technology, Nanjing Tech University, Nanjing, China., Xiong Y; Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA., Wu M; School of Computer Science and Technology, Nanjing Tech University, Nanjing, China; Carbon Medical Device Ltd, Shenzhen, China., Fischer K; Department of Surgery, Division of Pediatric Urology, The Children's Hospital of Philadelphia, Philadelphia, PA, USA; Center for Pediatric Clinical Effectiveness, The Children's Hospital of Philadelphia, Philadelphia, PA, USA., Tasian GE; Department of Surgery, Division of Pediatric Urology, The Children's Hospital of Philadelphia, Philadelphia, PA, USA; Center for Pediatric Clinical Effectiveness, The Children's Hospital of Philadelphia, Philadelphia, PA, USA; Department of Biostatistics, Epidemiology, and Informatics, The University of Pennsylvania, Philadelphia, PA, USA., Fan Y; Department of Radiology, Perelman School of Medicine, The University of Pennsylvania, Philadelphia, PA, USA., Yin S; School of Computer Science and Technology, Nanjing Tech University, Nanjing, China. Electronic address: yinshi2021@njtech.edu.cn.
Jazyk: angličtina
Zdroj: Ultrasound in medicine & biology [Ultrasound Med Biol] 2024 Nov; Vol. 50 (11), pp. 1638-1645. Date of Electronic Publication: 2024 Aug 24.
DOI: 10.1016/j.ultrasmedbio.2024.06.009
Abstrakt: Objective: Deep-learning algorithms have been widely applied in the field of automatic kidney ultrasound (US) image segmentation. However, obtaining a large number of accurate kidney labels clinically is very difficult and time-consuming. To solve this problem, we have proposed an efficient cross-modal transfer learning method to improve the performance of the segmentation network on a limited labeled kidney US dataset.
Methods: We aim to implement an improved image-to-image translation network called Seg-CycleGAN to generate accurate annotated kidney US data from labeled abdomen computed tomography images. The Seg-CycleGAN framework primarily consists of two structures: (i) a standard CycleGAN network to visually simulate kidney US from a publicly available labeled abdomen computed tomography dataset; (ii) and a segmentation network to ensure accurate kidney anatomical structures in US images. Based on the large number of simulated kidney US images and small number of real annotated kidney US images, we then aimed to employ a fine-tuning strategy to obtain better segmentation results.
Results: To validate the effectiveness of the proposed method, we tested this method on both normal and abnormal kidney US images. The experimental results showed that the proposed method achieved a segmentation accuracy of 0.8548 in dice similarity coefficient on all testing datasets and 0.7622 on the abnormal testing dataset.
Conclusions: Compared with existing data augmentation and transfer learning methods, the proposed method improved the accuracy and generalization of the kidney US image segmentation network on a limited number of training datasets. It therefore has the potential to significantly reduce annotation costs in clinical settings.
Competing Interests: Conflict of interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
(Copyright © 2024 World Federation for Ultrasound in Medicine & Biology. Published by Elsevier Inc. All rights reserved.)
Databáze: MEDLINE