Development of a diagnostic test based on multiple continuous biomarkers with an imperfect reference test
Autor: | Tomasz Burzykowski, Laura Ponto, Ging-Yuek Hsiung, Gregory Jicha, Adrian Preda, Leandro Garcia Barrado |
---|---|
Rok vydání: | 2015 |
Předmět: |
0301 basic medicine
Statistics and Probability Epidemiology Computer science Bayesian probability 01 natural sciences Article Correlation 010104 statistics & probability 03 medical and health sciences Discriminative model Alzheimer Disease Humans Computer Simulation 0101 mathematics Bayes estimator Models Statistical Diagnostic Tests Routine business.industry Bayes Theorem Ranging Pattern recognition Mixture model Data science Logistic Models 030104 developmental biology ROC Curve Sample size determination Area Under Curve Artificial intelligence Imperfect business Biomarkers |
Zdroj: | Statistics in Medicine. 35:595-608 |
ISSN: | 0277-6715 |
DOI: | 10.1002/sim.6733 |
Popis: | Ignoring the fact that the reference test used to establish the discriminative properties of a combination of diagnostic biomarkers is imperfect can lead to a biased estimate of the diagnostic accuracy of the combination. In this paper, we propose a Bayesian latent-class mixture model to select a combination of biomarkers that maximizes the area under the ROC curve (AUC), while taking into account the imperfect nature of the reference test. In particular, a method for specification of the prior for the mixture component parameters is developed that allows controlling the amount of prior information provided for the AUC. The properties of the model are evaluated by using a simulation study and an application to real data from Alzheimer's disease research. In the simulation study, 100 data sets are simulated for sample sizes ranging from 100 to 600 observations, with a varying correlation between biomarkers. The inclusion of an informative as well as a flat prior for the diagnostic accuracy of the reference test is investigated. In the real-data application, the proposed model was compared with the generally used logistic-regression model that ignores the imperfectness of the reference test. Conditional on the selected sample size and prior distributions, the simulation study results indicate satisfactory performance of the model-based estimates. In particular, the obtained average estimates for all parameters are close to the true values. For the real-data application, AUC estimates for the proposed model are substantially higher than those from the 'traditional' logistic-regression model. |
Databáze: | OpenAIRE |
Externí odkaz: |