Popis: |
BACKGROUND:Body composition and its changes affect cancer patient outcomes. Its determination requires specific and expensive devices. We designed a study to evaluate machine learning approaches to predict fat and skeletal muscle mass using daily practice clinical variables.METHODS:We designed a cross-sectional study in advanced gastrointestinal cancer patients. Response variables were skeletal muscle mass and body fat mass, measured by bioimpedance analysis. Predictors were laboratory and anthropometric variables. Imputation methods were applied. Six approaches were analyzed: (1) multicollinearity analysis, best subset selection (BSS) and multiple linear regression; (2) multicollinearity, BSS and generalized additive models (GAM); (3) multicollinearity, lasso to perform variable selection and GAM; (4) ridge regression; (5) lasso regression; (6) random forest. Model selection was performed evaluating the Mean Squared Error calculated by leave-one-out cross-validation.RESULTS:We included 101 patients under chemotherapy treatment. For skeletal muscle mass, the best approach was the combination of multicollinearity analysis followed by BSS and GAM using smoothing splines with 6 variables (albumin, Hb, height, weight, sex, lymphocytes). The adjusted R2 was 0.895. The best approach for fat mass was multicollinearity analysis, variable selection by lasso, and GAM using smoothing splines with 3 variables (waist-hip ratio, weight, sex). The adjusted R2 was 0.917.CONCLUSION:We developed the first accurate predictive models for body composition in cancer patients applying daily practice clinical variables. This study shows that machine learning is a useful tool to apply in body composition. This is a starting point to evaluate these approaches in research and clinical practice. |