Autor: |
Ortiz AT; Department of Infectious Diseases, Imperial College London, London, W2 1NY., Kendall M; Department of Statistics, University of Warwick, Coventry, CV4 7AL., Storey N; Department of Microbiology, Great Ormond Street Hospital, London WC1N 3JH., Hatcher J; Department of Microbiology, Great Ormond Street Hospital, London WC1N 3JH., Dunn H; Department of Microbiology, Great Ormond Street Hospital, London WC1N 3JH., Roy S; Department of Infection, Immunity and Inflammation, Institute of Child Health, UCL, London WC1N 1EH., Williams R; UCL Genomics, Institute of Child Health, UCL, London WC1N 1EH., Williams C; UCL Genomics, Institute of Child Health, UCL, London WC1N 1EH., Goldstein RA; UCL Genomics, Institute of Child Health, UCL, London WC1N 1EH., Didelot X; Department of Statistics, University of Warwick, Coventry, CV4 7AL., Harris K; Department of Microbiology, Great Ormond Street Hospital, London WC1N 3JH.; Department of Virology, East South East London Pathology Partnership, Royal London Hospital, Barts Health NHS Trust, London E12ES., Breuer J; Department of Infection, Immunity and Inflammation, Institute of Child Health, UCL, London WC1N 1EH., Grandjean L; Department of Infection, Immunity and Inflammation, Institute of Child Health, UCL, London WC1N 1EH. |
Abstrakt: |
Accurate inference of who infected whom in an infectious disease outbreak is critical for the delivery of effective infection prevention and control. The increased resolution of pathogen whole-genome sequencing has significantly improved our ability to infer transmission events. Despite this, transmission inference often remains limited by the lack of genomic variation between the source case and infected contacts. Although within-host genetic diversity is common among a wide variety of pathogens, conventional whole-genome sequencing phylogenetic approaches to reconstruct outbreaks exclusively use consensus sequences, which consider only the most prevalent nucleotide at each position and therefore fail to capture low frequency variation within samples. We hypothesized that including within-sample variation in a phylogenetic model would help to identify who infected whom in instances in which this was previously impossible. Using whole-genome sequences from SARS-CoV-2 multi-institutional outbreaks as an example, we show how within-sample diversity is stable among repeated serial samples from the same host, is transmitted between those cases with known epidemiological links, and how this improves phylogenetic inference and our understanding of who infected whom. Our technique is applicable to other infectious diseases and has immediate clinical utility in infection prevention and control. |