A genomic data resource for predicting antimicrobial resistance from laboratory-derived antimicrobial susceptibility phenotypes
Autor: | Ronald W. Kenyon, Veronika Vonstein, Hyunseung Yoo, Thomas Brettin, Alice R. Wattam, Chunhong Mao, Robert Olson, Emily M. Dietrich, Margo VanOeffelen, Andrew S. Warren, James J. Davis, Derya Aytan-Aktug, Marcus Nguyen, Rick Stevens, Dustin Machi, Gordon D. Pusch, Maulik Shukla |
---|---|
Rok vydání: | 2021 |
Předmět: |
Bacteria
Computer science Genomic data Antimicrobial susceptibility Computational Biology Drug Resistance Microbial Computational biology Bacterial genome size Genomics Microbial Sensitivity Tests Review Genome Data resources Metadata Machine Learning Antibiotic resistance Resource (project management) Phenotype Artificial Intelligence Databases Genetic Humans Laboratories Molecular Biology Genome Bacterial Information Systems |
Zdroj: | Brief Bioinform |
ISSN: | 1477-4054 |
Popis: | Antimicrobial resistance (AMR) is a major global health threat that affects millions of people each year. Funding agencies worldwide and the global research community have expended considerable capital and effort tracking the evolution and spread of AMR by isolating and sequencing bacterial strains and performing antimicrobial susceptibility testing (AST). For the last several years, we have been capturing these efforts by curating data from the literature and data resources and building a set of assembled bacterial genome sequences that are paired with laboratory-derived AST data. This collection currently contains AST data for over 67 000 genomes encompassing approximately 40 genera and over 100 species. In this paper, we describe the characteristics of this collection, highlighting areas where sampling is comparatively deep or shallow, and showing areas where attention is needed from the research community to improve sampling and tracking efforts. In addition to using the data to track the evolution and spread of AMR, it also serves as a useful starting point for building machine learning models for predicting AMR phenotypes. We demonstrate this by describing two machine learning models that are built from the entire dataset to show where the predictive power is comparatively high or low. This AMR metadata collection is freely available and maintained on the Bacterial and Viral Bioinformatics Center (BV-BRC) FTP site ftp://ftp.bvbrc.org/RELEASE_NOTES/PATRIC_genomes_AMR.txt. |
Databáze: | OpenAIRE |
Externí odkaz: | |
Nepřihlášeným uživatelům se plný text nezobrazuje | K zobrazení výsledku je třeba se přihlásit. |