Development of a diagnostic test based on multiple continuous biomarkers with an imperfect reference test

Autor:	Tomasz Burzykowski, Laura Ponto, Ging-Yuek Hsiung, Gregory Jicha, Adrian Preda, Leandro Garcia Barrado
Rok vydání:	2015
Předmět:	0301 basic medicine Statistics and Probability Epidemiology Computer science Bayesian probability 01 natural sciences Article Correlation 010104 statistics & probability 03 medical and health sciences Discriminative model Alzheimer Disease Humans Computer Simulation 0101 mathematics Bayes estimator Models Statistical Diagnostic Tests Routine business.industry Bayes Theorem Ranging Pattern recognition Mixture model Data science Logistic Models 030104 developmental biology ROC Curve Sample size determination Area Under Curve Artificial intelligence Imperfect business Biomarkers
Zdroj:	Statistics in Medicine. 35:595-608
ISSN:	0277-6715
DOI:	10.1002/sim.6733
Popis:	Ignoring the fact that the reference test used to establish the discriminative properties of a combination of diagnostic biomarkers is imperfect can lead to a biased estimate of the diagnostic accuracy of the combination. In this paper, we propose a Bayesian latent-class mixture model to select a combination of biomarkers that maximizes the area under the ROC curve (AUC), while taking into account the imperfect nature of the reference test. In particular, a method for specification of the prior for the mixture component parameters is developed that allows controlling the amount of prior information provided for the AUC. The properties of the model are evaluated by using a simulation study and an application to real data from Alzheimer's disease research. In the simulation study, 100 data sets are simulated for sample sizes ranging from 100 to 600 observations, with a varying correlation between biomarkers. The inclusion of an informative as well as a flat prior for the diagnostic accuracy of the reference test is investigated. In the real-data application, the proposed model was compared with the generally used logistic-regression model that ignores the imperfectness of the reference test. Conditional on the selected sample size and prior distributions, the simulation study results indicate satisfactory performance of the model-based estimates. In particular, the obtained average estimates for all parameters are close to the true values. For the real-data application, AUC estimates for the proposed model are substantially higher than those from the 'traditional' logistic-regression model.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::a682fa2f686a609241e27dddec1962fc https://doi.org/10.1002/sim.6733 Zobrazit plný text záznamu Plný text