Overcoming Data Bottlenecks in Genomic Pathogen Surveillance

Autor: Pilar Donado-Godoy, Elmer M Herrera, Monica Abrudan, John Stelling, Khalil Abudahab, Jolaade J Ajiboye, Marietta L Lagrada, D Sravani, Iruka N. Okeke, Celia C. Carlos, Alejandra Arevalo, Varun Shamanna, Melissa L Masim, Polle Krystle V Macaranas, Carolin Vegvari, Nicole E. Wheeler, Erkison Ewomazino Odih, Anderson O Oaikhena, Sonia Sia, Geetha Nagaraj, Mihir Kekre, Erik C D Osma Castro, Agnettah M Olorosa, Ayorinde O Afolayan, Sophia David, Harry Harste, June M Gayeta, Vandana Govindan, Silvia Argimón, Steffimole Rose, Kundur N Ravishankar, Anthony Underwood, K L Ravikumar, Ali Molloy, Johan Fabian Bernal, Maria Fernanda Valencia, Akshata Prabhu, David M. Aanensen, M R Shincy, Ben Taylor, Dawn Muddyman
Rok vydání: 2021
Předmět:
Zdroj: Clinical Infectious Diseases: An Official Publication of the Infectious Diseases Society of America
ISSN: 1537-6591
1058-4838
Popis: Performing whole genome sequencing (WGS) for the surveillance of antimicrobial resistance offers the ability to determine not only the antimicrobials to which rates of resistance are increasing, but also the evolutionary mechanisms and transmission routes responsible for the increase at local, national, and global scales. To derive WGS-based outputs, a series of processes are required, beginning with sample and metadata collection, followed by nucleic acid extraction, library preparation, sequencing, and analysis. Throughout this pathway there are many data-related operations required (informatics) combined with more biologically focused procedures (bioinformatics). For a laboratory aiming to implement pathogen genomics, the informatics and bioinformatics activities can be a barrier to starting on the journey; for a laboratory that has already started, these activities may become overwhelming. Here we describe these data bottlenecks and how they have been addressed in laboratories in India, Colombia, Nigeria, and the Philippines, as part of the National Institute for Health Research Global Health Research Unit on Genomic Surveillance of Antimicrobial Resistance. The approaches taken include the use of reproducible data parsing pipelines and genome sequence analysis workflows, using technologies such as Data-flo, the Nextflow workflow manager, and containerization of software dependencies. By overcoming barriers to WGS implementation in countries where genome sampling for some species may be underrepresented, a body of evidence can be built to determine the concordance of antimicrobial sensitivity testing and genome-derived resistance, and novel high-risk clones and unknown mechanisms of resistance can be discovered.
Databáze: OpenAIRE