Two-stage receiver operating-characteristic curve estimator for cohort studies
Autor: | Norberto Octavio Corral-Blanco, Pablo Martínez-Camblor, Susana Díaz-Coto |
---|---|
Rok vydání: | 2020 |
Předmět: |
Statistics and Probability
Receiver operating characteristic Diagnostic Tests Routine Monte Carlo method Asymptotic distribution Binary number Estimator Probability and statistics General Medicine 01 natural sciences Cohort Studies 010104 statistics & probability 03 medical and health sciences 0302 clinical medicine Distribution function ROC Curve Area Under Curve Humans 030212 general & internal medicine Stage (hydrology) 0101 mathematics Statistics Probability and Uncertainty Monte Carlo Method Algorithm |
Zdroj: | The International Journal of Biostatistics. 17:117-137 |
ISSN: | 1557-4679 2194-573X |
DOI: | 10.1515/ijb-2019-0097 |
Popis: | The receiver operating-characteristic (ROC) curve is a graphical statistical tool routinely used for studying the classification accuracy in both, diagnostic and prognosis problems. Given the different nature of these situations, ROC curve estimation has been separately considered for binary (diagnostic) and time-to-event (prognosis) outcomes, even for data coming from the same study design. In this work, the authors propose a two-stage ROC curve estimator which allows to link both contexts through a general prediction model (first-stage) and the empirical cumulative estimator of the distribution function (second-stage) of the considered test (marker) on the total population. The so-called two-stage Mixed-Subject (sMS) approach proves its behavior on both, large-samples (theoretically) and finite-samples (via Monte Carlo simulations). Besides, a useful asymptotic distribution for the concomitant area under the curve is also computed. Results show the ability of the proposed estimator to fit non-standard situations by considering flexible predictive models. Two real-world examples, one with binary and one with time-dependent outcomes, help us to a better understanding of the proposed methodology on usual practical circumstances. The R code used for the practical implementation of the proposed methodology and its documentation is provided as supplementary material. |
Databáze: | OpenAIRE |
Externí odkaz: |