Transcriptome analysis based on next-generation sequencing of non-model plants producing specialized metabolites of biotechnological interest
Autor: | Dae-Kyun Ro, Philipp Zerbe, Christoph Wilhelm Sensen, Darwin W. Reed, Sayaka Masada-Atsumi, Mei Xiao, Romit Chakrabarty, Yansheng Zhang, Yeon-bok Kim, Vincenzo De Luca, Carla J.S. Barber, Isabel Desgagné-Penix, Jake Stout, Peter J. Facchini, Jonathan E. Page, Xue Chen, Vincent J. J. Martin, Eun-Jeong Lee, Tegan M. Haslam, Ye Zhang, Joerg Bohlmann, Patrick S. Covello, Enwu Liu, Gillian MacNevin |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2013 |
Předmět: |
InterPro
Bioinformatics Sequence assembly Bioengineering RNA-Seq Computational biology Biology Applied Microbiology and Biotechnology DNA sequencing Roche-454 pyrosequencing Databases Genetic RefSeq Data Mining KEGG Transcriptomics Phylogeny Genetics Gene Expression Profiling Computational Biology High-Throughput Nucleotide Sequencing Molecular Sequence Annotation Enzyme Commission number General Medicine Plants Plant specialized metabolites Illumina GA sequencing RNA-seq Transcriptome Sequence Alignment Sequence Analysis Algorithms Metabolic Networks and Pathways Biotechnology |
Popis: | Plants produce a vast array of specialized metabolites, many of which are used as pharmaceuticals, flavors, fragrances, and other high-value fine chemicals. However, most of these compounds occur in non-model plants for which genomic sequence information is not yet available. The production of a large amount of nucleotide sequence data using next-generation technologies is now relatively fast and cost-effective, especially when using the latest Roche-454 and Illumina sequencers with enhanced base-calling accuracy. To investigate specialized metabolite biosynthesis in non-model plants we have established a data-mining framework, employing next-generation sequencing and computational algorithms, to construct and analyze the transcriptomes of 75 non-model plants that produce compounds of interest for biotechnological applications. After sequence assembly an extensive annotation approach was applied to assign functional information to over 800,000 putative transcripts. The annotation is based on direct searches against public databases, including RefSeq and InterPro. Gene Ontology (GO), Enzyme Commission (EC) annotations and associated Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway maps are also collected. As a proof-of-concept, the selection of biosynthetic gene candidates associated with six specialized metabolic pathways is described. A web-based BLAST server has been established to allow public access to assembled transcriptome databases for all 75 plant species of the PhytoMetaSyn Project (www.phytometasyn.ca). |
Databáze: | OpenAIRE |
Externí odkaz: |