Telescope: Characterization of the retrotranscriptome by accurate estimation of transposable element expression

Autor: Gustavo Reyes-Terán, Aarón Lecanda-Sánchez, Lubbertus C. F. Mulder, Luis P. Iñiguez, Matthew L. Bendall, Christopher E. Ormsby, Mario A. Ostrowski, Keith A. Crandall, R.B. Jones, Miguel de Mulder, Marcos Pérez-Losada, Douglas F. Nixon
Jazyk: angličtina
Rok vydání: 2018
Předmět:
0301 basic medicine
Subfamily
Molecular biology
Astronomy
Astronomical Sciences
law.invention
Database and Informatics Methods
Sequencing techniques
0302 clinical medicine
law
Mobile Genetic Elements
Biology (General)
0303 health sciences
Ecology
Software Engineering
RNA sequencing
Genome project
Genomics
Computational Theory and Mathematics
Optical Equipment
Organ Specificity
Modeling and Simulation
030220 oncology & carcinogenesis
Physical Sciences
Engineering and Technology
Sequence Analysis
Transcriptome Analysis
Astronomical Instruments
Research Article
Transposable element
Computer and Information Sciences
Sequence analysis
QH301-705.5
Bioinformatics
Cytological Techniques
Bayesian probability
Equipment
Computational biology
Research and Analysis Methods
ENCODE
Cell Line
Telescope
Cellular and Molecular Neuroscience
03 medical and health sciences
Genetic Elements
Fragment (logic)
Genetics
Humans
Ecology
Evolution
Behavior and Systematics

030304 developmental biology
Sequence Analysis
RNA

Software Tools
Gene Expression Profiling
Endogenous Retroviruses
Transposable Elements
Computational Biology
Biology and Life Sciences
Statistical model
Genome Analysis
Expression (mathematics)
030104 developmental biology
Molecular biology techniques
Genetic Loci
DNA Transposable Elements
Transcriptome
Sequence Alignment
030217 neurology & neurosurgery
Software
Telescopes
Zdroj: PLoS Computational Biology
PLoS Computational Biology, Vol 15, Iss 9, p e1006453 (2019)
DOI: 10.1101/398172
Popis: Characterization of Human Endogenous Retrovirus (HERV) expression within the transcriptomic landscape using RNA-seq is complicated by uncertainty in fragment assignment because of sequence similarity. We present Telescope, a computational software tool that provides accurate estimation of transposable element expression (retrotranscriptome) resolved to specific genomic locations. Telescope directly addresses uncertainty in fragment assignment by reassigning ambiguously mapped fragments to the most probable source transcript as determined within a Bayesian statistical model. We demonstrate the utility of our approach through single locus analysis of HERV expression in 13 ENCODE cell types. When examined at this resolution, we find that the magnitude and breadth of the retrotranscriptome can be vastly different among cell types. Furthermore, our approach is robust to differences in sequencing technology and demonstrates that the retrotranscriptome has potential to be used for cell type identification. We compared our tool with other approaches for quantifying transposable element (TE) expression, and found that Telescope has the greatest resolution, as it estimates expression at specific TE insertions rather than at the TE subfamily level. Telescope performs highly accurate quantification of the retrotranscriptomic landscape in RNA-seq experiments, revealing a differential complexity in the transposable element biology of complex systems not previously observed. Telescope is available at https://github.com/mlbendall/telescope.
Author summary Almost half of the human genome is composed of transposable elements (TEs), but their contribution to the transcriptome, their cell-type specific expression patterns, and their role in disease remains poorly understood. Recent studies have found many elements to be actively expressed and involved in key cellular processes. For example, human endogenous retroviruses (HERVs) are reported to be involved in human embryonic stem cell differentiation. Discovering which exact HERVs are differentially expressed in RNA-seq data would be a major advance in understanding such processes. However, because HERVs have a high level of sequence similarity it is hard to identify which exact HERV is differentially expressed. To solve this problem, we developed a computer program which addressed uncertainty in fragment assignment by reassigning ambiguously mapped fragments to the most probable source transcript as determined within a Bayesian statistical model. We call this program, “Telescope”. We then used Telescope to identify HERV expression in 13 well-studied cell types from the ENCODE consortium and found that different cell types could be characterized by enrichment for different HERV families, and for locus specific expression. We also showed that Telescope performed better than other methods currently used to determine TE expression. The use of this computational tool to examine new and existing RNA-seq data sets may lead to new understanding of the roles of TEs in health and disease.
Databáze: OpenAIRE