Generalisation capabilities of machine-learning algorithms for the detection of the subthalamic nucleus in micro-electrode recordings.
Authors: | Martin T; Laboratoire Traitement du Signal et de l'Image (LTSI, INSERM UMR 1099), Université de Rennes, Rennes, France., Jannin P; Laboratoire Traitement du Signal et de l'Image (LTSI, INSERM UMR 1099), Université de Rennes, Rennes, France., Baxter JSH; Laboratoire Traitement du Signal et de l'Image (LTSI, INSERM UMR 1099), Université de Rennes, Rennes, France. john.baxter@univ-rennes.fr. |
---|---|
Language: | English |
Source: | International journal of computer assisted radiology and surgery [Int J Comput Assist Radiol Surg] 2024 Dec; Vol. 19 (12), pp. 2445-2451. Date of Electronic Publication: 2024 Jul 01. |
DOI: | 10.1007/s11548-024-03202-2 |
Abstract: | Purpose: Micro-electrode recordings (MERs) are a key intra-operative modality used during deep brain stimulation (DBS) electrode implantation, allowing a trained neurophysiologist to infer the anatomy in which the electrode is placed. As DBS targets are small, such inference is necessary to confirm that the electrode is correctly positioned. Recently, machine learning techniques have been used to augment the neurophysiologist's capability. The goal of this paper is to investigate the generalisability of these methods with respect to different clinical centres and training paradigms. Methods: Five deep learning algorithms for binary classification of MER signals have been implemented. Three databases from two different clinical centres have also been collected, differing in size, acquisition hardware, and annotation protocol. Each algorithm has initially been trained on the largest database, then either directly tested or fine-tuned on the smaller databases in order to estimate its generalisability. As a reference, each has also been trained from scratch on the smaller databases in order to estimate the effect of the differing database sizes and annotation systems. Results: Each network shows significantly reduced performance (on the order of a 6.5% to 16.0% reduction in balanced accuracy) when applied out-of-distribution. This reduction can be ameliorated by fine-tuning the network on the new database through transfer learning. Even for these small databases, however, retraining from scratch appears to offer performance equivalent to fine-tuning with transfer learning, although at the expense of significantly longer training times. Conclusion: Generalisability is an important criterion for the success of machine learning algorithms in the clinic. We have demonstrated that a variety of recent machine learning algorithms for MER classification are negatively affected by domain shift, but that this can be quickly ameliorated through simple transfer learning procedures that can be readily performed for new centres. Competing Interests: Declarations. Conflict of interest: The authors have no conflicts of interest to declare. Ethics approval: The London database was approved by the Research Ethics Board at Western University, Canada (REB # 109045). Both Rennes databases were approved by the Rennes University Hospital Centre ethics committee (Ethical authorisation declaration n2205295). (© 2024. CARS.) |
Database: | MEDLINE |
External link: |
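
The abstract reports its results as reductions in balanced accuracy but, being an abstract, does not spell out the metric. As a minimal sketch, assuming the standard definition for binary classification (the mean of sensitivity and specificity), it could be computed as follows. The label convention (1 for recordings inside the subthalamic nucleus, 0 for outside) and the example values are illustrative assumptions, not data from the paper.

```python
def balanced_accuracy(y_true, y_pred):
    """Balanced accuracy for binary labels: mean of sensitivity and specificity.

    Assumption (not stated in the record): 1 = inside the STN, 0 = outside.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives

    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in y_true")

    sensitivity = tp / (tp + fn)  # recall on the positive class
    specificity = tn / (tn + fp)  # recall on the negative class
    return 0.5 * (sensitivity + specificity)


# Made-up labels for illustration only (not from the paper's databases).
example_true = [1, 1, 1, 1, 0, 0]
example_pred = [1, 1, 1, 0, 0, 1]
example_score = balanced_accuracy(example_true, example_pred)
```

Unlike plain accuracy, this metric weights both classes equally, which matters when one class dominates the recordings along an electrode trajectory.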