Regression and Random Forest Machine Learning Have Limited Performance in Predicting Bowel Preparation in Veteran Population

Authors: Stacy B. Menees, Amy M. Cohn, Sameer D. Saini, Rachel Lipson, Alex N. Kokaly, Andrew J. Read, Karmel S. Shehadeh, Jacob E. Kurlander, Akbar K. Waljee
Publication year: 2021
Subject:
Source: Digestive Diseases and Sciences. 67:2827-2841
ISSN: 1573-2568, 0163-2116
DOI: 10.1007/s10620-021-07113-z
Description: Inadequate bowel preparation undermines the quality of colonoscopy, but patients likely to be affected are difficult to identify beforehand. This study aimed to develop, validate, and compare prediction models for bowel preparation inadequacy using conventional logistic regression (LR) and random forest machine learning (RFML). We created a retrospective cohort of patients who underwent outpatient colonoscopy at a single VA medical center between January 2012 and October 2015. Candidate predictor variables were chosen after a literature review. We extracted all available predictor variables from the electronic medical record, and bowel preparation from the endoscopy database. The data were split into 70% training and 30% validation sets. Multivariable LR and RFML were used to predict preparation inadequacy as a dichotomous outcome. The cohort included 6,885 Veterans, of whom 964 (14%) had inadequate preparation. Using LR, the area under the receiver operating characteristic curve (AUC) for the validation cohort was 0.66 (95% CI 0.62, 0.69) and the Brier score, in which a lower score indicates better performance, was 0.11. Using RFML, the AUC for the validation cohort was 0.61 (95% CI 0.58, 0.65) and the Brier score was 0.12. LR and RFML had similar performance in predicting bowel preparation, which was modest and likely insufficient for use in practice. Future research is needed to identify additional predictor variables and to test other machine learning algorithms. At present, endoscopy units should focus on universal strategies to enhance preparation adequacy.
Database: OpenAIRE
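
The description reports two validation metrics, AUC and the Brier score, for a dichotomous outcome. The sketch below shows how each can be computed from predicted probabilities and observed 0/1 outcomes. It is an illustration only and not the authors' code: the function names and the toy data are hypothetical, and the record does not say what software the study used. The `baseline_brier` helper is an added reference point that assumes a model predicting the same probability for every patient.

```python
def brier_score(y_true, y_prob):
    """Mean squared difference between predicted probability and 0/1 outcome.

    Lower is better; 0 means every prediction was certain and correct.
    """
    if not y_true or len(y_true) != len(y_prob):
        raise ValueError("inputs must be non-empty and of equal length")
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def auc(y_true, y_prob):
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen positive case receives a
    higher predicted probability than a randomly chosen negative case, with
    ties counted as one half. Quadratic in sample size, fine for a sketch.
    """
    pos = [p for y, p in zip(y_true, y_prob) if y == 1]
    neg = [p for y, p in zip(y_true, y_prob) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


def baseline_brier(prevalence):
    """Brier score of a constant prediction equal to the outcome prevalence.

    Assumption for illustration: every patient gets the same predicted
    probability, so the score reduces to prevalence * (1 - prevalence).
    """
    return prevalence * (1.0 - prevalence)


# Hypothetical toy data: 1 = inadequate preparation, 0 = adequate.
y_true = [1, 0, 0, 1, 0]
y_prob = [0.8, 0.2, 0.6, 0.4, 0.1]

print("Brier score:", brier_score(y_true, y_prob))
print("AUC:", auc(y_true, y_prob))
print("Constant-prediction Brier at 14% prevalence:", baseline_brier(0.14))
```

The two metrics answer different questions: AUC measures how well the model ranks patients, while the Brier score also penalizes predicted probabilities that sit far from the observed outcomes. If the validation set had a prevalence close to the cohort's 14% (an assumption, since the record gives only the overall figure), the constant-prediction value from `baseline_brier` offers one rough yardstick for reading the reported Brier scores.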