Full-length transcript sequencing accelerates the transcriptome research of Gymnocypris namensis, an iconic fish of the Tibetan Plateau
Author: | Yuejing Yang, Zhe Li, Xinghua Zhou, Liu Haiping, Mengbin Xiang, Shijun Xiao, Qinglu Li, Guangjun Lv, Chaowei Zhou, Bingjie Hu, Wenping He, Jie Zhang, Mingrui Zhou, Zeng Benhe, Hui Luo, Tingsen Jing, Hua Ye |
---|---|
Language: | English |
Year of publication: | 2020 |
Subject: |
0301 basic medicine;
Fish Proteins; Conservation of Natural Resources; Sequence analysis; media_common.quotation_subject; Cyprinidae; lcsh:Medicine; Biology; Tibet; Adaptability; Article; Transcriptome; 03 medical and health sciences; Open Reading Frames; 0302 clinical medicine; Animals; Selection Genetic; lcsh:Science; Gene; media_common; Genetic diversity; Multidisciplinary; Natural selection; Sequence Analysis RNA; Gene Expression Profiling; lcsh:R; RNA sequencing; Molecular Sequence Annotation; biology.organism_classification; Single Molecule Imaging; 030104 developmental biology; Gene Expression Regulation; Evolutionary biology; Microsatellite; RNA Long Noncoding; lcsh:Q; Gene expression; Schizothorax; 030217 neurology & neurosurgery; Microsatellite Repeats |
Source: | Scientific Reports, Vol 10, Iss 1, Pp 1-11 (2020) |
ISSN: | 2045-2322 |
DOI: | 10.1038/s41598-020-66582-w |
Description: | Gymnocypris namensis, the only commercial fish in Namtso Lake in Tibet, China, is rated as a near-threatened species in the Red List of China’s Vertebrates. As one of the highest-altitude schizothorax fish in China, G. namensis is strongly adapted to the harsh plateau environment. Although it is an indigenous, economically important fish of high research value, its biological characteristics, genetic diversity, and plateau adaptability remain unclear. Here, we used Pacific Biosciences single-molecule real-time long-read sequencing technology to generate full-length transcripts of G. namensis. Sequence clustering and error correction with Illumina short reads yielded 319,044 polished isoforms. After redundant reads were removed, 125,396 non-redundant isoforms were obtained. Among all transcripts, 103,286 were annotated against public databases. Natural selection has acted on 42 genes in G. namensis, which were enriched in the functions of mismatch repair and glutathione metabolism. In total, 89,736 open reading frames, 95,947 microsatellites, and 21,360 long non-coding RNAs were identified across all transcripts. This is the first transcriptome study of G. namensis using PacBio Iso-Seq. The acquisition of full-length transcript isoforms might accelerate transcriptome research on G. namensis and provide a basis for further research. |
Database: | OpenAIRE |