Mabs, a suite of tools for gene-informed genome assembly.

Autor: Schelkunov MI; Institute for Information Transmission Problems, Moscow, Russia. shelkmike@gmail.com.
Jazyk: angličtina
Zdroj: BMC bioinformatics [BMC Bioinformatics] 2023 Oct 04; Vol. 24 (1), pp. 377. Date of Electronic Publication: 2023 Oct 04.
DOI: 10.1186/s12859-023-05499-3
Abstrakt: Background: Despite constantly improving genome sequencing methods, error-free eukaryotic genome assembly has not yet been achieved. Among other kinds of problems of eukaryotic genome assembly are so-called "haplotypic duplications", which may manifest themselves as cases of alleles being mistakenly assembled as paralogues. Haplotypic duplications are dangerous because they create illusions of gene family expansions and, thus, may lead scientists to incorrect conclusions about genome evolution and functioning.
Results: Here, I present Mabs, a suite of tools that serve as parameter optimizers of the popular genome assemblers Hifiasm and Flye. By optimizing the parameters of Hifiasm and Flye, Mabs tries to create genome assemblies with the genes assembled as accurately as possible. Tests on 6 eukaryotic genomes showed that in 6 out of 6 cases, Mabs created assemblies with more accurately assembled genes than those generated by Hifiasm and Flye when they were run with default parameters. When assemblies of Mabs, Hifiasm and Flye were postprocessed by a popular tool for haplotypic duplication removal, Purge_dups, genes were better assembled by Mabs in 5 out of 6 cases.
Conclusions: Mabs is useful for making high-quality genome assemblies. It is available at https://github.com/shelkmike/Mabs.
(© 2023. BioMed Central Ltd., part of Springer Nature.)
Databáze: MEDLINE
Nepřihlášeným uživatelům se plný text nezobrazuje