Identification and characterization of Coronaviridae genomes from Vietnamese bats and rats based on conserved protein domains
Autor: | Phan, MVT, Tue, NT, Pham, HA, Baker, S, Kellam, P, Cotten, M, Bach, TK, Berto, A, Boni, MF, Bryant, JE, Bui, DP, Campbell, JI, Carrique-Mas, J, Dang, MH, Dang, TH, Dang, TO, Day, JN, Dinh, VT, Van Doorn, HR, Duong, AH, Farrar, JJ, Hau, TTT, Ho, DTN, Hoang, BL, Hoang, VD, Huynh, TKT, Lam, CC, Le, MH, Le, TP, Le, XL, Luu, TTH, Ly, VC, Mai, TPL, Nadjm, B, Ngo, TB, Ngo, TH, Nguyen, CT, Nguyen, DT, Nguyen, D, Nguyen, KC, Nguyen, NA, Nguyen, NV, Nguyen, QH, Nguyen, TD, Nguyen, TM, Nguyen, TB, Nguyen, THT, Nguyen, TKC, Nguyen, TLN, Nguyen, TLH, Nguyen, TNL, Nguyen, TND, Nguyen, TN, Nguyen, TSC, Nguyen, TYC, Nguyen, TT, Nguyen, TV, Nguyen, VC, Nguyen, VH, Nguyen, VK, Nguyen, VMH, Nguyen, V, Nguyen, VT, Nguyen, VVC, Nguyen, VX, Pham, HM, Pham, TMK, Pham, TTT, Pham, VL, Pham, VM, Phan, VBB, Rabaa, MA, Rahman, M, Thompson, C, Thwaites, G, Ta, TDN, Tran, DHN, Tran, HMC, Tran, KT, Tran, MP, Tran, TKH, Tran, TND, Tran, TTT, Tran, TTM, Tran, TN, Tran, TH, Trinh, QT, Vo, BH, Vo, NT, Vo, QC, Voong, VP, Vu, TLH, Vu, TTH, Wertheim, H, Bogaardt, C, Chase-Topping, M, Ivens, A, Lu, L, Dung, N, Rambaut, A, Simmonds, P, Woolhouse, M, Munnink, BO, Deijs, M, Van der Hoek, L, Jebbink, MF, Farsani, SMJ, Dodd, K, Euren, J, Lucas, A, Ortiz, N, Pennacchio, L, Rubin, E, Saylors, KE, Tran, MH, Wolfe, ND |
---|---|
Přispěvatelé: | Wellcome Trust, Phan, My VT [0000-0002-6905-8513], Cotten, Matthew [0000-0002-3361-3351], Apollo - University of Cambridge Repository, Virology, Radiation Oncology |
Jazyk: | angličtina |
Rok vydání: | 2018 |
Předmět: |
0301 basic medicine
profile Hidden Markov model DATABASE Middle East respiratory syndrome coronavirus viruses protein domains DIVERSITY Computational biology medicine.disease_cause Microbiology Genome Alphacoronavirus Deep sequencing 03 medical and health sciences Virology medicine RODENTS Coronaviridae ALGORITHM virus classification PFAM Virus classification Coronavirus Science & Technology biology biology.organism_classification EVOLUTION 3. Good health ALIGNMENT machine learning 030104 developmental biology DISCOVERY VIRUS CROSS-SPECIES TRANSMISSION Life Sciences & Biomedicine random forest Betacoronavirus Research Article |
Zdroj: | Virus Evolution Virus Evolution, 4(2):vey035. Oxford University Press |
ISSN: | 2057-1577 |
Popis: | The Coronaviridae family of viruses encompasses a group of pathogens with a zoonotic potential as observed from previous outbreaks of the severe acute respiratory syndrome coronavirus and Middle East respiratory syndrome coronavirus. Accordingly, it seems important to identify and document the coronaviruses in animal reservoirs, many of which are uncharacterized and potentially missed by more standard diagnostic assays. A combination of sensitive deep sequencing technology and computational algorithms is essential for virus surveillance, especially for characterizing novel- or distantly related virus strains. Here, we explore the use of profile Hidden Markov Model-defined Pfam protein domains (Pfam domains) encoded by new sequences as a Coronaviridae sequence classification tool. The encoded domains are used first in a triage to identify potential Coronaviridae sequences and then processed using a Random Forest method to classify the sequences to the Coronaviridae genus level. The application of this algorithm on Coronaviridae genomes assembled from agnostic deep sequencing data from surveillance of bats and rats in Dong Thap province (Vietnam) identified thirty-four Alphacoronavirus and eleven Betacoronavirus genomes. This collection of bat and rat coronaviruses genomes provided essential information on the local diversity of coronaviruses and substantially expanded the number of coronavirus full genomes available from bat and rats and may facilitate further molecular studies on this group of viruses. |
Databáze: | OpenAIRE |
Externí odkaz: |