16Stimator: statistical estimation of ribosomal gene copy numbers from draft genome assemblies
Authors: | Jack A. Gilbert, Joy Bergelson, M. Madlen Vetter, Matthew Perisin |
---|---|
Year of publication: | 2015 |
Subject: |
0301 basic medicine
Staphylococcus aureus; Sequence analysis; Short Communication; Arabidopsis; Gene Dosage; Computational biology; Biology; Microbiology; Genome; Gene dosage; 03 medical and health sciences; Phylogenetics; RNA Ribosomal 16S; Escherichia coli; Bacteroides; Gene; Phylogeny; Ecology Evolution Behavior and Systematics; Genetics; Bacteria; Phylogenetic tree; Computational Biology; Reproducibility of Results; Sequence Analysis DNA; Ribosomal RNA; Amplicon; Plant Leaves; 030104 developmental biology; Pseudomonas aeruginosa; Genome Bacterial |
Source: | The ISME Journal. 10:1020-1024 |
ISSN: | 1751-7370 1751-7362 |
DOI: | 10.1038/ismej.2015.161 |
Description: | The 16S rRNA gene (16S) is an accepted marker of bacterial taxonomic diversity, even though differences in copy number obscure the relationship between amplicon and organismal abundances. Ancestral state reconstruction methods can predict 16S copy numbers through comparisons with closely related reference genomes; however, the database of closed genomes is limited. Here, we extend the reference database of 16S copy numbers to de novo assembled draft genomes by developing 16Stimator, a method to estimate 16S copy numbers when these repetitive regions collapse during assembly. Using a read depth approach, we estimate 16S copy numbers for 12 endophytic isolates from Arabidopsis thaliana and confirm estimates by qPCR. We further apply this approach to draft genomes deposited in NCBI and demonstrate accurate copy number estimation regardless of sequencing platform, with an overall median deviation of 14%. The expanded database of isolates with 16S copy number estimates increases the power of phylogenetic correction methods for determining organismal abundances from 16S amplicon surveys. |
Database: | OpenAIRE |
External link: |
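
The read depth idea mentioned in the description can be illustrated with a minimal sketch. It assumes that a collapsed 16S region accumulates reads in proportion to its true copy number, so the ratio of its depth to the depth of single-copy regions approximates the copy number. The function names, the use of medians, and the toy depth values are assumptions made for illustration; they are not taken from 16Stimator itself, whose actual procedure may differ.

```python
from statistics import median


def estimate_copy_number(depth_16s, depth_single_copy):
    """Estimate 16S copy number as a ratio of median read depths.

    depth_16s: per-base read depths over the collapsed 16S region.
    depth_single_copy: per-base read depths over regions assumed to be
        present once per genome (hypothetical input for this sketch).
    """
    if not depth_16s or not depth_single_copy:
        raise ValueError("both depth lists must be non-empty")
    background = median(depth_single_copy)
    if background <= 0:
        raise ValueError("single-copy background depth must be positive")
    return median(depth_16s) / background


def percent_deviation(estimate, reference):
    """Absolute deviation of an estimate from a reference value, in percent."""
    if reference == 0:
        raise ValueError("reference copy number must be non-zero")
    return abs(estimate - reference) / abs(reference) * 100.0


if __name__ == "__main__":
    # Toy depths invented for illustration, not data from the paper.
    toy_16s = [350, 360, 340, 355, 345]
    toy_single_copy = [48, 50, 52, 49, 51]
    print(estimate_copy_number(toy_16s, toy_single_copy))
```

Medians are used here only because they are less sensitive to local coverage spikes than means; a real pipeline would likely also need to account for coverage biases that this sketch ignores.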