Development of a Dual-Index Sequencing Strategy and Curation Pipeline for Analyzing Amplicon Sequence Data on the MiSeq Illumina Sequencing Platform
Autor: | Patrick D. Schloss, James J. Kozich, Sarah K. Highlander, Sarah L. Westcott, Nancy N. Baxter |
---|---|
Rok vydání: | 2013 |
Předmět: |
Genetics
Ecology Contig 2 base encoding Computational Biology High-Throughput Nucleotide Sequencing mothur Computational biology Biology Amplicon 16S ribosomal RNA Biota Applied Microbiology and Biotechnology Feces Mice Metagenomics Quality Score Methods Animals Humans Metagenome Soil Microbiology Illumina dye sequencing Food Science Biotechnology |
Zdroj: | Applied and Environmental Microbiology. 79:5112-5120 |
ISSN: | 1098-5336 0099-2240 |
Popis: | Rapid advances in sequencing technology have changed the experimental landscape of microbial ecology. In the last 10 years, the field has moved from sequencing hundreds of 16S rRNA gene fragments per study using clone libraries to the sequencing of millions of fragments per study using next-generation sequencing technologies from 454 and Illumina. As these technologies advance, it is critical to assess the strengths, weaknesses, and overall suitability of these platforms for the interrogation of microbial communities. Here, we present an improved method for sequencing variable regions within the 16S rRNA gene using Illumina's MiSeq platform, which is currently capable of producing paired 250-nucleotide reads. We evaluated three overlapping regions of the 16S rRNA gene that vary in length (i.e., V34, V4, and V45) by resequencing a mock community and natural samples from human feces, mouse feces, and soil. By titrating the concentration of 16S rRNA gene amplicons applied to the flow cell and using a quality score-based approach to correct discrepancies between reads used to construct contigs, we were able to reduce error rates by as much as two orders of magnitude. Finally, we reprocessed samples from a previous study to demonstrate that large numbers of samples could be multiplexed and sequenced in parallel with shotgun metagenomes. These analyses demonstrate that our approach can provide data that are at least as good as that generated by the 454 platform while providing considerably higher sequencing coverage for a fraction of the cost. |
Databáze: | OpenAIRE |
Externí odkaz: |