Population median imputation was noninferior to complex approaches for imputing missing values in cardiovascular prediction models in clinical practice.

Autor: Berkelmans GFN; Department of Vascular Medicine, University Medical Center Utrecht, the Netherlands., Read SH; Usher Institute, University of Edinburgh, Edinburgh, Scotland, UK and on behalf of the Scottish Diabetes Research Network epidemiology group; Women's College Research Institute, Canada., Gudbjörnsdottir S; Swedish National Diabetes Register, Center of Registers in Region, Gothenburg, Sweden., Wild SH; Usher Institute, University of Edinburgh, Edinburgh, Scotland, UK and on behalf of the Scottish Diabetes Research Network epidemiology group., Franzen S; Swedish National Diabetes Register, Center of Registers in Region, Gothenburg, Sweden., van der Graaf Y; Julius Center for Health Sciences and Primary Care, University Medical Center Utrecht, the Netherlands., Eliasson B; Swedish National Diabetes Register, Center of Registers in Region, Gothenburg, Sweden; Department of Molecular and Clinical Medicine, Institute of Medicine, University of Gothenburg, Gothenburg, Sweden., Visseren FLJ; Department of Vascular Medicine, University Medical Center Utrecht, the Netherlands. Electronic address: f.l.j.visseren@umcutrecht.nl., Paynter NP; Harvard Medical School, Brigham & Women's Hospital, Boston, USA., Dorresteijn JAN; Department of Vascular Medicine, University Medical Center Utrecht, the Netherlands.
Jazyk: angličtina
Zdroj: Journal of clinical epidemiology [J Clin Epidemiol] 2022 May; Vol. 145, pp. 70-80. Date of Electronic Publication: 2022 Jan 21.
DOI: 10.1016/j.jclinepi.2022.01.011
Abstrakt: Objectives: To compare the validity and robustness of five methods for handling missing characteristics when using cardiovascular disease risk prediction models for individual patients in a real-world clinical setting.
Study Design and Setting: The performance of the missing data methods was assessed using data from the Swedish National Diabetes Registry (n = 419,533) with external validation using the Scottish Care Information - diabetes database (n = 226,953). Five methods for handling missing data were compared. Two methods using submodels for each combination of available data, two imputation methods: conditional imputation and median imputation, and one alternative modeling method, called the naïve approach, based on hazard ratios and populations statistics of known risk factors only. The validity was compared using calibration plots and c-statistics.
Results: C-statistics were similar across methods in both development and validation data sets, that is, 0.82 (95% CI 0.82-0.83) in the Swedish National Diabetes Registry and 0.74 (95% CI 0.74-0.75) in Scottish Care Information-diabetes database. Differences were only observed after random introduction of missing data in the most important predictor variable (i.e., age).
Conclusion: Validity and robustness of median imputation was not dissimilar to more complex methods for handling missing values, provided that the most important predictor variables, such as age, are not missing.
(Copyright © 2022 Elsevier Inc. All rights reserved.)
Databáze: MEDLINE