RepeatFiller newly identifies megabases of aligning repetitive sequences and improves annotations of conserved non-exonic elements
Autor: | Ekaterina Osipova, Michael Hiller, Nikolai Hecker |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2019 |
Předmět: |
Transposable element
genome alignments Genome evolution Source code Computer science media_common.quotation_subject transposons Repetitive Sequences Health Informatics Computational biology Genome Evolution Molecular 03 medical and health sciences Annotation 0302 clinical medicine Technical Note Animals conserved non-exonic elements 030304 developmental biology media_common Mammals 0303 health sciences food and beverages Computer Science Applications DNA Transposable Elements Sequence Alignment 030217 neurology & neurosurgery Software |
Zdroj: | GigaScience |
DOI: | 10.1101/696922 |
Popis: | Transposons and other repetitive sequences make up a large part of complex genomes. Repetitive sequences can be co-opted into a variety of functions and thus provide a source for evolutionary novelty. However, comprehensively detecting ancestral repeats that align between species is difficult since considering all repeat-overlapping seeds in alignment methods that rely on the seed-and-extend heuristic results in prohibitively high runtimes. Here, we show that ignoring repeat-overlapping alignment seeds when aligning entire genomes misses numerous alignments between repetitive elements. We present a tool – RepeatFiller – that improves genome alignments by incorporating previously-undetected local alignments between repetitive sequences. By applying RepeatFiller to genome alignments between human and 20 other representative mammals, we uncover between 22 and 84 megabases of previously-undetected alignments that mostly overlap transposable elements. We further show that the increased alignment coverage improves the annotation of conserved non-exonic elements, both by discovering numerous novel transposon-derived elements that evolve under constraint and by removing thousands of elements that are not under constraint in placental mammals. In conclusion, RepeatFiller contributes to comprehensively aligning repetitive genomic regions, which facilitates studying transposon co-option and genome evolution.Source codehttps://github.com/hillerlab/GenomeAlignmentTools |
Databáze: | OpenAIRE |
Externí odkaz: |