ASGART: fast and parallel genome scale segmental duplications mapping.

Autor: Delehelle F; UMR5505 - CNRS-Université de Toulouse, Institut de Recherche en Informatique de Toulouse (IRIT), Toulouse, France.; UMR 5288 - AMIS Université Paul Sabatier/CNRS, Faculté de Médecine Purpan, Toulouse, France., Cussat-Blanc S; UMR5505 - CNRS-Université de Toulouse, Institut de Recherche en Informatique de Toulouse (IRIT), Toulouse, France., Alliot JM; UMR5505 - CNRS-Université de Toulouse, Institut de Recherche en Informatique de Toulouse (IRIT), Toulouse, France., Luga H; UMR5505 - CNRS-Université de Toulouse, Institut de Recherche en Informatique de Toulouse (IRIT), Toulouse, France., Balaresque P; UMR 5288 - AMIS Université Paul Sabatier/CNRS, Faculté de Médecine Purpan, Toulouse, France.
Jazyk: angličtina
Zdroj: Bioinformatics (Oxford, England) [Bioinformatics] 2018 Aug 15; Vol. 34 (16), pp. 2708-2714.
DOI: 10.1093/bioinformatics/bty172
Abstrakt: Motivation: Segmental Duplications (SDs) are DNA fragments longer than 1 kbp, distributed within and between chromosomes and sharing more than 90% identity. Although they hold a significant role in genomic fluidity and adaptability, many key questions about their intrinsic characteristics and mutability remain unsolved due to the persistent difficulty of sequencing highly duplicated genomic regions. The recent development of long and linked-read NGS technologies will increase the need to search for SDs in genomes newly sequenced with these technics. The main limitation of SD analysis will soon be the availability of efficient detection software, to retrieve and compare SD genomic component between species or lineages.
Results: In this paper, we present the open-source ASGART, 'A Segmental duplications Gathering And Refining Tool', developed to search for segmental duplications (SDs) in any assembled sequence. We have tested and benchmarked ASGART on five models organisms. Our results demonstrate ASGART's ability to extract SDs from any genome-wide sequence, regardless of genomic size or organizational complexity and quicker than any other software available.
Availability and Implementation: The online version of ASGART is available at http://asgart.irit.fr. The source code of ASGART is available both on the ASGART website and at https://github.com/delehef/asgart.
Supplementary Information: Supplementary data are available at Bioinformatics online.
Databáze: MEDLINE