Comparison of machine learning and the regression-based EHMRG model for predicting early mortality in acute heart failure

Autor:	David E. Austin, Douglas S. Lee, Chloe X. Wang, Shihao Ma, Xuesong Wang, Joan Porter, Bo Wang
Rok vydání:	2022
Předmět:	Heart Failure Machine Learning Logistic Models Models Statistical Humans Cardiology and Cardiovascular Medicine Algorithms
Zdroj:	International Journal of Cardiology. 365:78-84
ISSN:	0167-5273
DOI:	10.1016/j.ijcard.2022.07.035
Popis:	Although risk stratification of patients with acute decompensated heart failure (HF) is important, it is unknown whether machine learning (ML) or conventional statistical models are optimal. We developed ML algorithms to predict 7-day and 30-day mortality in patients with acute HF and compared these with an existing logistic regression model at the same timepoints.Patients presenting to one of 86 hospitals, who were either admitted to hospital or discharged home directly from the emergency department, were randomly selected using stratified random sampling. ML approaches, including neural networks, random forest, XGBoost, and the Lasso, were compared with a validated logistic regression model for discrimination and calibration.Among 12,608 patients in our analysis, lasso regression (c-statistic 0.774; 95% CI, 0.743, 0.806) performed better than other ML models for 7-day mortality but did not outperform the baseline logistic regression model (0.794; 95% CI, 0.789, 0.800). For 30-day mortality, XGBoost performed better than other ML models (c-statistic 0.759; 95% CI; 0.740, 0.779), but was not significantly better than logistic regression (c-statistic 0.755; 95% CI, 0.750, 0.762). Logistic regression demonstrated better calibration at 7 days (calibration-in-the-large 0.017; 95% CI, -0.657, 0.692, and calibration slope 0.954; 95% CI, 0.769, 1.139), and at 30 days (-0.026; 95% CI, -0.374, 0.322, and 0.964; 95% CI, 0.831, 1.098), and best Brier scores, compared to ML approaches.Logistic regression was comparable to ML in discrimination, but was superior to ML algorithms in calibration overall. ML algorithms for prognosis should routinely report calibration metrics in addition to discrimination.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::27a92cb1162ed7aae66242592f77599b https://doi.org/10.1016/j.ijcard.2022.07.035 Zobrazit plný text záznamu Full Text from ScienceDirect