A machine-learning framework for robust and reliable prediction of short- and long-term treatment response in initially antipsychotic-naïve schizophrenia patients based on multimodal neuropsychiatric data
Autor: | Egill Rostrup, Martin W Skjerbæk, Birgitte Fagerlund, Christos Pantelis, Martin Christian Axelsen, Jonathan Foldager, Merete Osler, Louise Baruël Johansen, Mette Ødegaard Nielsen, Søren Christensen, Lars Arvastson, Lars Kai Hansen, Bjørn H Ebdrup, Bob Oranje, Jayachandra Mitta Raghava, Birte Glenthøj, Bruce J Kinon, Nikolaj Bak, Karen Sando Ambrosen |
---|---|
Rok vydání: | 2020 |
Předmět: |
Computer science
Sample (statistics) Overfitting Predictive markers Machine learning computer.software_genre Article lcsh:RC321-571 Machine Learning 03 medical and health sciences Cellular and Molecular Neuroscience 0302 clinical medicine Text mining Replication (statistics) medicine Humans lcsh:Neurosciences. Biological psychiatry. Neuropsychiatry Biological Psychiatry business.industry Reproducibility of Results Diagnostic markers Cognition medicine.disease Magnetic Resonance Imaging 030227 psychiatry Psychiatry and Mental health Sample size determination Schizophrenia Schizophrenic Psychology Artificial intelligence business computer 030217 neurology & neurosurgery Antipsychotic Agents Psychopathology |
Zdroj: | Translational Psychiatry, Vol 10, Iss 1, Pp 1-13 (2020) Translational Psychiatry Ambrosen, K S, Skjerbæk, M W, Foldager, J, Axelsen, M C, Bak, N, Arvastson, L, Christensen, S R, Johansen, L B, Raghava, J M, Oranje, B, Rostrup, E, Nielsen, M, Osler, M, Fagerlund, B, Pantelis, C, Kinon, B J, Glenthøj, B Y, Hansen, L K & Ebdrup, B H 2020, ' A machine-learning framework for robust and reliable prediction of short-and long-term treatment response in initially antipsychotic-naïve schizophrenia patients based on multimodal neuropsychiatric data ', Translational Psychiatry, vol. 10, no. 1, 276 . https://doi.org/10.1038/s41398-020-00962-8 Ambrosen, K S, Skjerbæk, M W, Foldager, J, Axelsen, M C, Bak, N, Arvastson, L, Christensen, S R, Johansen, L B, Raghava, J M, Oranje, B, Rostrup, E, Nielsen, M, Osler, M, Fagerlund, B, Pantelis, C, Kinon, B J, Glenthøj, B Y, Hansen, L K & Ebdrup, B H 2020, ' A machine-learning framework for robust and reliable prediction of short-and long-term treatment response in initially antipsychotic-naïve schizophrenia patients based on multimodal neuropsychiatric data ', Translational Psychiatry, vol. 10, no. 1, 276, pp. S34–S35 . https://doi.org/10.1038/s41398-020-00962-8 |
ISSN: | 2158-3188 |
DOI: | 10.1038/s41398-020-00962-8 |
Popis: | The reproducibility of machine-learning analyses in computational psychiatry is a growing concern. In a multimodal neuropsychiatric dataset of antipsychotic-naïve, first-episode schizophrenia patients, we discuss a workflow aimed at reducing bias and overfitting by invoking simulated data in the design process and analysis in two independent machine-learning approaches, one based on a single algorithm and the other incorporating an ensemble of algorithms. We aimed to (1) classify patients from controls to establish the framework, (2) predict short- and long-term treatment response, and (3) validate the methodological framework. We included 138 antipsychotic-naïve, first-episode schizophrenia patients with data on psychopathology, cognition, electrophysiology, and structural magnetic resonance imaging (MRI). Perinatal data and long-term outcome measures were obtained from Danish registers. Short-term treatment response was defined as change in Positive And Negative Syndrome Score (PANSS) after the initial antipsychotic treatment period. Baseline diagnostic classification algorithms also included data from 151 matched controls. Both approaches significantly classified patients from healthy controls with a balanced accuracy of 63.8% and 64.2%, respectively. Post-hoc analyses showed that the classification primarily was driven by the cognitive data. Neither approach predicted short- nor long-term treatment response. Validation of the framework showed that choice of algorithm and parameter settings in the real data was successfully guided by results from the simulated data. In conclusion, this novel approach holds promise as an important step to minimize bias and obtain reliable results with modest sample sizes when independent replication samples are not available. |
Databáze: | OpenAIRE |
Externí odkaz: |