chainCleaner improves genome alignment specificity and sensitivity.

Autor: Suarez HG; Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.; Max Planck Institute for the Physics of Complex Systems, Dresden, Germany., Langer BE; Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.; Max Planck Institute for the Physics of Complex Systems, Dresden, Germany., Ladde P; Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.; Max Planck Institute for the Physics of Complex Systems, Dresden, Germany., Hiller M; Max Planck Institute of Molecular Cell Biology and Genetics, Dresden, Germany.; Max Planck Institute for the Physics of Complex Systems, Dresden, Germany.
Jazyk: angličtina
Zdroj: Bioinformatics (Oxford, England) [Bioinformatics] 2017 Jun 01; Vol. 33 (11), pp. 1596-1603.
DOI: 10.1093/bioinformatics/btx024
Abstrakt: Motivation: Accurate alignments between entire genomes are crucial for comparative genomics. However, computing sensitive and accurate genome alignments is a challenging problem, complicated by genomic rearrangements.
Results: Here we present a fast approach, called chainCleaner, that improves the specificity in genome alignments by accurately detecting and removing local alignments that obscure the evolutionary history of genomic rearrangements. Systematic tests on alignments between the human and other vertebrate genomes show that chainCleaner (i) improves the alignment of numerous orthologous genes, (ii) exposes alignments between exons of orthologous genes that were masked before by alignments to pseudogenes, and (iii) recovers hundreds of kilobases in local alignments that otherwise would fall below a minimum score threshold. Our approach has broad applicability to improve the sensitivity and specificity of genome alignments.
Availability and Implementation: http://bds.mpi-cbg.de/hillerlab/chainCleaner/ or https://github.com/ucscGenomeBrowser/kent.
Contact: hiller@mpi-cbg.de.
Supplementary Information: Supplementary data are available at Bioinformatics online.
(© The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com)
Databáze: MEDLINE