Comparing the performance of statistical, machine learning, and deep learning algorithms to predict time-to-event: A simulation study for conversion to mild cognitive impairment.

Authors: Billichová M; Faculty of Informatics and Information Technologies, Slovak University of Technology in Bratislava, Bratislava, Slovakia., Coan LJ; School of Computer Science and Mathematics, Liverpool John Moores University, Liverpool, United Kingdom., Czanner S; Faculty of Informatics and Information Technologies, Slovak University of Technology in Bratislava, Bratislava, Slovakia.; School of Computer Science and Mathematics, Liverpool John Moores University, Liverpool, United Kingdom., Kováčová M; Faculty of Informatics and Information Technologies, Slovak University of Technology in Bratislava, Bratislava, Slovakia., Sharifian F; School of Computer Science and Mathematics, Liverpool John Moores University, Liverpool, United Kingdom., Czanner G; Faculty of Informatics and Information Technologies, Slovak University of Technology in Bratislava, Bratislava, Slovakia.; School of Computer Science and Mathematics, Liverpool John Moores University, Liverpool, United Kingdom.
Language: English
Source: PloS one [PLoS One] 2024 Jan 22; Vol. 19 (1), pp. e0297190. Date of Electronic Publication: 2024 Jan 22 (Print Publication: 2024).
DOI: 10.1371/journal.pone.0297190
Abstract: Mild Cognitive Impairment (MCI) is a condition characterized by a decline in cognitive abilities, specifically in memory, language, and attention, that is beyond what is expected due to normal aging. Detection of MCI is crucial for providing appropriate interventions and slowing down the progression of dementia. There are several automated predictive algorithms for prediction using time-to-event data, but it is not clear which is best to predict the time to conversion to MCI. It is also unclear whether algorithms with fewer training weights are less accurate. We compared three algorithms, ordered from fewer to more training weights: a statistical predictive model (Cox proportional hazards model, CoxPH), a machine learning model (Random Survival Forest, RSF), and a deep learning model (DeepSurv). To compare the algorithms under different scenarios, we created a simulated dataset based on the Alzheimer NACC dataset. We found that the CoxPH model was among the best-performing models in all simulated scenarios. With a larger sample size (n = 6,000), the deep learning algorithm (DeepSurv) exhibited comparable accuracy (73.1%) to the CoxPH model (73%). In the past, ignoring heterogeneity in the CoxPH model led to the conclusion that deep learning methods are superior. We found that when using the CoxPH model with heterogeneity, its accuracy is comparable to that of DeepSurv and RSF. Furthermore, when unobserved heterogeneity is present, such as features missing from the training data, all three models showed a similar drop in accuracy. This simulation study suggests that in some applications an algorithm with a smaller number of training weights is not disadvantaged in terms of accuracy. Since algorithms with fewer weights are inherently easier to explain, this study can help artificial intelligence research develop a principled approach to comparing statistical, machine learning, and deep learning algorithms for time-to-event predictions.
Competing Interests: The authors have declared that no competing interests exist.
(Copyright: © 2024 Billichová et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
Database: MEDLINE