Regression and Random Forest Machine Learning Have Limited Performance in Predicting Bowel Preparation in Veteran Population

Autor:	Stacy B. Menees, Amy M. Cohn, Sameer D. Saini, Rachel Lipson, Alex N. Kokaly, Andrew J. Read, Karmel S. Shehadeh, Jacob E. Kurlander, Akbar K. Waljee
Rok vydání:	2021
Předmět:	education.field_of_study medicine.diagnostic_test Receiver operating characteristic Physiology business.industry Population Gastroenterology Colonoscopy Retrospective cohort study Machine learning computer.software_genre Logistic regression 03 medical and health sciences 0302 clinical medicine Brier score 030220 oncology & carcinogenesis Cohort medicine 030211 gastroenterology & hepatology Artificial intelligence education business computer Predictive modelling
Zdroj:	Digestive Diseases and Sciences. 67:2827-2841
ISSN:	1573-2568 0163-2116
DOI:	10.1007/s10620-021-07113-z
Popis:	Inadequate bowel preparation undermines the quality of colonoscopy, but patients likely to be affected are difficult to identify beforehand. This study aimed to develop, validate, and compare prediction models for bowel preparation inadequacy using conventional logistic regression (LR) and random forest machine learning (RFML). We created a retrospective cohort of patients who underwent outpatient colonoscopy at a single VA medical center between January 2012 and October 2015. Candidate predictor variables were chosen after a literature review. We extracted all available predictor variables from the electronic medical record, and bowel preparation from the endoscopy database. The data were split into 70% training and 30% validation sets. Multivariable LR and RFML were used to predict preparation inadequacy as a dichotomous outcome. The cohort included 6,885 Veterans, of whom 964 (14%) had inadequate preparation. Using LR, the area under the receiver operating characteristic curve (AUC) for the validation cohort was 0.66 (95% CI 0.62, 0.69) and the Brier score, in which a lower score indicates better performance, was 0.11. Using RFML, the AUC for the validation cohort was 0.61 (95% CI 0.58, 0.65) and the Brier score was 0.12. LR and RFML had similar performance in predicting bowel preparation, which was modest and likely insufficient for use in practice. Future research is needed to identify additional predictor variables and to test other machine learning algorithms. At present, endoscopy units should focus on universal strategies to enhance preparation adequacy.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::4456209511adee2d39436d875fa20b9b https://doi.org/10.1007/s10620-021-07113-z Zobrazit plný text záznamu Full text from SpringerLink