Toward interpretability of machine learning methods for the classification of patients with major depressive disorder based on functional network measures.

Authors: Andreev AV; Baltic Center for Neurotechnology and Artificial Intelligence, Immanuel Kant Baltic Federal University, 14, A. Nevskogo str., Kaliningrad 236016, Russia., Kurkin SA; Baltic Center for Neurotechnology and Artificial Intelligence, Immanuel Kant Baltic Federal University, 14, A. Nevskogo str., Kaliningrad 236016, Russia., Stoyanov D; Department of Psychiatry and Medical Psychology, Research Institute, Medical University Plovdiv, 15A Vassil Aprilov Blvd., Plovdiv 4002, Bulgaria., Badarin AA; Baltic Center for Neurotechnology and Artificial Intelligence, Immanuel Kant Baltic Federal University, 14, A. Nevskogo str., Kaliningrad 236016, Russia., Paunova R; Department of Psychiatry and Medical Psychology, Research Institute, Medical University Plovdiv, 15A Vassil Aprilov Blvd., Plovdiv 4002, Bulgaria., Hramov AE; Baltic Center for Neurotechnology and Artificial Intelligence, Immanuel Kant Baltic Federal University, 14, A. Nevskogo str., Kaliningrad 236016, Russia.
Language: English
Source: Chaos (Woodbury, N.Y.) [Chaos] 2023 Jun 01; Vol. 33 (6).
DOI: 10.1063/5.0155567
Abstract: We address the interpretability of machine learning algorithms in the context of discriminating between patients with major depressive disorder (MDD) and healthy controls using functional networks derived from resting-state functional magnetic resonance imaging data. We applied linear discriminant analysis (LDA) to the data from 35 MDD patients and 50 healthy controls to discriminate between the two groups, using global measures of the functional networks as features. We proposed a combined approach to feature selection based on statistical methods and a wrapper-type algorithm. This approach revealed that the groups are indistinguishable in the univariate feature space but become distinguishable in a three-dimensional feature space formed by the identified most important features: mean node strength, clustering coefficient, and the number of edges. LDA achieves the highest accuracy when considering the network with all connections or only the strongest ones. Our approach allowed us to analyze the separability of classes in the multidimensional feature space, which is critical for interpreting the results of machine learning models. We demonstrated that the parametric planes of the control and MDD groups rotate in the feature space as the thresholding parameter increases and that their intersection grows as the threshold approaches 0.45, for which classification accuracy is minimal. Overall, the combined approach to feature selection provides an effective and interpretable scenario for discriminating between MDD patients and healthy controls using measures of functional connectivity networks. This approach can be applied to other machine learning tasks to achieve high accuracy while ensuring the interpretability of the results.
(© 2023 Author(s). Published under an exclusive license by AIP Publishing.)
Database: MEDLINE
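
Note: The abstract names three global network measures (mean node strength, clustering coefficient, and number of edges) computed on thresholded functional networks. The following is a minimal Python sketch of how such measures might be obtained from one connectivity matrix. It is not the authors' implementation. The function name `global_measures`, the rule of dropping edges whose weight is below the threshold, the use of the unweighted local clustering coefficient averaged over nodes, and the toy matrix are all assumptions made here for illustration.

```python
from itertools import combinations


def global_measures(weights, threshold):
    """Return (mean node strength, mean clustering coefficient, edge count).

    Assumptions (not from the source): `weights` is a symmetric matrix of
    connectivity values, edges with weight below `threshold` are dropped,
    and clustering is the unweighted local coefficient averaged over nodes.
    """
    n = len(weights)
    # An edge survives if it is off-diagonal and its weight reaches the threshold.
    neighbors = [
        {j for j in range(n) if j != i and weights[i][j] >= threshold}
        for i in range(n)
    ]

    # Node strength: sum of the weights of a node's surviving edges.
    strengths = [sum(weights[i][j] for j in neighbors[i]) for i in range(n)]
    mean_strength = sum(strengths) / n

    # Each undirected edge appears in two neighbor sets, so halve the total.
    n_edges = sum(len(nb) for nb in neighbors) // 2

    # Local clustering: fraction of a node's neighbor pairs that are linked.
    local = []
    for i in range(n):
        k = len(neighbors[i])
        if k < 2:
            local.append(0.0)
            continue
        links = sum(
            1 for a, b in combinations(sorted(neighbors[i]), 2)
            if b in neighbors[a]
        )
        local.append(2.0 * links / (k * (k - 1)))
    clustering = sum(local) / n

    return mean_strength, clustering, n_edges


# Toy 4-node connectivity matrix (made-up values, for illustration only).
toy = [
    [0.0, 0.8, 0.6, 0.1],
    [0.8, 0.0, 0.7, 0.2],
    [0.6, 0.7, 0.0, 0.5],
    [0.1, 0.2, 0.5, 0.0],
]
features = global_measures(toy, threshold=0.45)
```

Under these assumptions, one such three-element feature vector per subject and per threshold value could then be passed to a linear discriminant classifier, which is the kind of feature space the abstract describes.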