Combining Viral Genetics and Statistical Modeling to Improve HIV-1 Time-of-Infection Estimation towards Enhanced Vaccine Efficacy Assessment
Autor: | David Matten, Frederick A. Matsen, Sorachai Nitayaphan, Colin Anthony, Christopher Warth, Gordon Botha, Melissa-Rose Abrahams, Peter B. Gilbert, Merlin L. Robb, Hannah Kibuuka, Craig A. Magaret, Leigh Anne Eller, Carolyn Williamson, Sodsai Tovanabutra, Lindsay N. Carpp, Paul T. Edlefsen, Simon A. Travers, Yuanyuan Zhang, Jan P.L. Labuschagne, Morgane Rolland, Denis Chopera, Ted Holzman, Roux-Cil Ferreira, Erika Rudnicki, Fred Sawe, Raabya Rossenkhan, Yunda Huang, Nonkululeko. Ndabambi, Murray Logan |
---|---|
Rok vydání: | 2019 |
Předmět: |
0301 basic medicine
Time Factors sequence analysis Mean squared error lcsh:QR1-502 HIV Infections Poisson distribution Article lcsh:Microbiology Cross-validation Evolution Molecular 03 medical and health sciences symbols.namesake 0302 clinical medicine Virology founder multiplicity Linear regression Statistics Humans 030212 general & internal medicine infection time vaccine efficacy assessment Phylogeny Mathematics AIDS Vaccines Models Statistical Genetic Variation Estimator Statistical model Viral Load Vaccine efficacy 3. Good health HIV-1 primary infection acute and early HIV-1 infection 030104 developmental biology Infectious Diseases Mutation leave-one-out-cross-validation (LOOCV) HIV-1 symbols Viral load |
Zdroj: | Viruses, Vol 11, Iss 7, p 607 (2019) Viruses Volume 11 Issue 7 |
ISSN: | 1999-4915 |
Popis: | Knowledge of the time of HIV-1 infection and the multiplicity of viruses that establish HIV-1 infection is crucial for the in-depth analysis of clinical prevention efficacy trial outcomes. Better estimation methods would improve the ability to characterize immunological and genetic sequence correlates of efficacy within preventive efficacy trials of HIV-1 vaccines and monoclonal antibodies. We developed new methods for infection timing and multiplicity estimation using maximum likelihood estimators that shift and scale (calibrate) estimates by fitting true infection times and founder virus multiplicities to a linear regression model with independent variables defined by data on HIV-1 sequences, viral load, diagnostics, and sequence alignment statistics. Using Poisson models of measured mutation counts and phylogenetic trees, we analyzed longitudinal HIV-1 sequence data together with diagnostic and viral load data from the RV217 and CAPRISA 002 acute HIV-1 infection cohort studies. We used leave-one-out cross validation to evaluate the prediction error of these calibrated estimators versus that of existing estimators and found that both infection time and founder multiplicity can be estimated with improved accuracy and precision by calibration. Calibration considerably improved all estimators of time since HIV-1 infection, in terms of reducing bias to near zero and reducing root mean squared error (RMSE) to 5–10 days for sequences collected 1–2 months after infection. The calibration of multiplicity assessments yielded strong improvements with accurate predictions (ROC-AUC above 0.85) in all cases. These results have not yet been validated on external data, and the best-fitting models are likely to be less robust than simpler models to variation in sequencing conditions. For all evaluated models, these results demonstrate the value of calibration for improved estimation of founder multiplicity and of time since HIV-1 infection. |
Databáze: | OpenAIRE |
Externí odkaz: | |
Nepřihlášeným uživatelům se plný text nezobrazuje | K zobrazení výsledku je třeba se přihlásit. |