Hybrid de novo Whole-Genome Assembly and Annotation of SARS-CoV-2 Virus from Nosocomial Infection in Pahang, Malaysia

Autor: U.A. Zainulabid, N. Kamarudin, R. Mohd Rus, S.N. Soffian, A.S. Mat Yassim, S.H. How, H.F. Ahmad
Jazyk: angličtina
Rok vydání: 2022
Předmět:
Zdroj: International Journal of Infectious Diseases, Vol 116, Iss , Pp S19- (2022)
Druh dokumentu: article
ISSN: 1201-9712
DOI: 10.1016/j.ijid.2021.12.046
Popis: Purpose: SARS-CoV-2 virus is highly contagious and spreads easily that it is difficult to detect in elucidating the pattern of infection for contact tracing purposes. The advent of high-throughput sequencing techniques has improved diagnosis in detection the viral lineages especially involving sporadic infection. The combination of short- and long-reads greatly improved the assembly of the SARS-CoV-2 genome and marked as a new approach to correct erroneous frame-shifts from single sequencing effort. Methods & Materials: The samples were recovered from nasopharyngeal and oropharyngeal swab specimens of symptomatic health-care worker. The viral RNA was extracted for RT-qPCR and constructed for genomic library according to the ARTIC nCoV-2019 protocol. We performed whole-genome sequencing using hybrid approach combining both short and long-read sequencing approaches, respectively. The raw reads were reconstructed using a combination of bioinformatic tools for trimming, assembly and annotation. Finally, the consensus sequence was mapped with Geneious mapper using default parameters. Results: The hybrid approach resulted in a 29,782bp complete whole-genome with GC content of 38%. The product was deposited to GISAID as hCov-19/Malaysia/IIUM316/2020, and was identified to be originated from B.6 lineage of clade O. Besides, we observed several mutational points such as M153I (spike), P13L (N), T1198K (NSP3), L37F (NSP6), and A97V (NSP12), which may representing the major contributor to early pandemic transmission in Malaysia. Conclusion: The present study highlights the utility of whole-genome sequencing as a diagnostic tool of evaluating sporadic pattern of infection that can help to provide information regarding viruses relatedness, mutational rate, geographical spread and host adaptation. High-quality genome data can be used to assist in epidemiological investigation particularly when combined with other types of data.
Databáze: Directory of Open Access Journals