DTA-SiST: de novo transcriptome assembly by using simplified suffix trees

Autor:	Jin Zhao, Haodi Feng, Daming Zhu, Chi Zhang, Ying Xu
Jazyk:	angličtina
Rok vydání:	2019
Předmět:	Alternative splicing RNA-seq De novo assembly Computer applications to medicine. Medical informatics R858-859.7 Biology (General) QH301-705.5
Zdroj:	BMC Bioinformatics, Vol 20, Iss S25, Pp 1-12 (2019)
Druh dokumentu:	article
ISSN:	1471-2105
DOI:	10.1186/s12859-019-3272-9
Popis:	Abstract Background Alternative splicing allows the pre-mRNAs of a gene to be spliced into various mRNAs, which greatly increases the diversity of proteins. High-throughput sequencing of mRNAs has revolutionized our ability for transcripts reconstruction. However, the massive size of short reads makes de novo transcripts assembly an algorithmic challenge. Results We develop a novel radical framework, called DTA-SiST, for de novo transcriptome assembly based on suffix trees. DTA-SiST first extends contigs by reads that have the longest overlaps with the contigs’ terminuses. These reads can be found in linear time of the lengths of the reads through a well-designed suffix tree structure. Then, DTA-SiST constructs splicing graphs based on contigs for each gene locus. Finally, DTA-SiST proposes two strategies to extract transcript-representing paths: a depth-first enumeration strategy and a hybrid strategy based on length and coverage. We implemented the above two strategies and compared them with the state-of-the-art de novo assemblers on both simulated and real datasets. Experimental results showed that the depth-first enumeration strategy performs always better with recall and also better with precision for smaller datasets while the hybrid strategy leads with precision for big datasets. Conclusions DTA-SiST performs more competitive than the other compared de novo assemblers especially with precision measure, due to the read-based contig extension strategy and the elegant transcripts extraction rules.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/541501b5666c4545a629dd97b147adde Zobrazit plný text záznamu View record in DOAJ Plný text ve formátu PDF Plný text ve formátu HTML
Nepřihlášeným uživatelům se plný text nezobrazuje	K zobrazení výsledku je třeba se přihlásit.