Machine Learning Approach to Predicting COVID-19 Disease Severity Based on Clinical Blood Test Data: Statistical Analysis and Model Development

Autor: Jiang, Wei, Kriventsov, Stan, Aktar, Sakifa, Ahamad, Martuza, Rashed-Al-Mahfuz, Azad, AKM, Uddin, Shahadat, Kamal, AHM, Alyami, Salem A, Lin, Ping-I, Islam, Mohammed Shariful, Quinn, Julian MW, Eapen, Valsamma, Moni, Mohammad Ali
Rok vydání: 2021
Předmět:
Zdroj: JMIR Medical Informatics, Vol 9, Iss 4, p e25884 (2021)
JMIR Medical Informatics
ISSN: 2291-9694
Popis: Background Accurate prediction of the disease severity of patients with COVID-19 would greatly improve care delivery and resource allocation and thereby reduce mortality risks, especially in less developed countries. Many patient-related factors, such as pre-existing comorbidities, affect disease severity and can be used to aid this prediction. Objective Because rapid automated profiling of peripheral blood samples is widely available, we aimed to investigate how data from the peripheral blood of patients with COVID-19 can be used to predict clinical outcomes. Methods We investigated clinical data sets of patients with COVID-19 with known outcomes by combining statistical comparison and correlation methods with machine learning algorithms; the latter included decision tree, random forest, variants of gradient boosting machine, support vector machine, k-nearest neighbor, and deep learning methods. Results Our work revealed that several clinical parameters that are measurable in blood samples are factors that can discriminate between healthy people and COVID-19–positive patients, and we showed the value of these parameters in predicting later severity of COVID-19 symptoms. We developed a number of analytical methods that showed accuracy and precision scores >90% for disease severity prediction. Conclusions We developed methodologies to analyze routine patient clinical data that enable more accurate prediction of COVID-19 patient outcomes. With this approach, data from standard hospital laboratory analyses of patient blood could be used to identify patients with COVID-19 who are at high risk of mortality, thus enabling optimization of hospital facilities for COVID-19 treatment.
Databáze: OpenAIRE