Achieving large and distant ancestral genome inference by using an improved discrete quantum-behaved particle swarm optimization algorithm
Autor: | Wanliang Wang, Jijun Tang, Gaofeng Pan, Zhaojuan Zhang, Ruofan Xia, Jiandong Wang |
---|---|
Rok vydání: | 2020 |
Předmět: |
Ancestral genome inference
Computer science 0206 medical engineering Inference DCJ sorting 02 engineering and technology lcsh:Computer applications to medicine. Medical informatics Biochemistry Genome Genome rearrangement Quantum-behaved particle swarm optimization 03 medical and health sciences Structural Biology Discrete optimization Molecular Biology lcsh:QH301-705.5 030304 developmental biology Gene Rearrangement 0303 health sciences Phylogenetic tree Applied Mathematics Sorting Particle swarm optimization Computer Science Applications lcsh:Biology (General) Adjacency list lcsh:R858-859.7 DNA microarray Heuristics Algorithm 020602 bioinformatics Algorithms Research Article Genome arrangement |
Zdroj: | BMC Bioinformatics BMC Bioinformatics, Vol 21, Iss 1, Pp 1-30 (2020) |
ISSN: | 1471-2105 |
Popis: | Background Reconstructing ancestral genomes is one of the central problems presented in genome rearrangement analysis since finding the most likely true ancestor is of significant importance in phylogenetic reconstruction. Large scale genome rearrangements can provide essential insights into evolutionary processes. However, when the genomes are large and distant, classical median solvers have failed to adequately address these challenges due to the exponential increase of the search space. Consequently, solving ancestral genome inference problems constitutes a task of paramount importance that continues to challenge the current methods used in this area, whose difficulty is further increased by the ongoing rapid accumulation of whole-genome data. Results In response to these challenges, we provide two contributions for ancestral genome inference. First, an improved discrete quantum-behaved particle swarm optimization algorithm (IDQPSO) by averaging two of the fitness values is proposed to address the discrete search space. Second, we incorporate DCJ sorting into the IDQPSO (IDQPSO-Median). In comparison with the other methods, when the genomes are large and distant, IDQPSO-Median has the lowest median score, the highest adjacency accuracy, and the closest distance to the true ancestor. In addition, we have integrated our IDQPSO-Median approach with the GRAPPA framework. Our experiments show that this new phylogenetic method is very accurate and effective by using IDQPSO-Median. Conclusions Our experimental results demonstrate the advantages of IDQPSO-Median approach over the other methods when the genomes are large and distant. When our experimental results are evaluated in a comprehensive manner, it is clear that the IDQPSO-Median approach we propose achieves better scalability compared to existing algorithms. Moreover, our experimental results by using simulated and real datasets confirm that the IDQPSO-Median, when integrated with the GRAPPA framework, outperforms other heuristics in terms of accuracy, while also continuing to infer phylogenies that were equivalent or close to the true trees within 5 days of computation, which is far beyond the difficulty level that can be handled by GRAPPA. |
Databáze: | OpenAIRE |
Externí odkaz: | |
Nepřihlášeným uživatelům se plný text nezobrazuje | K zobrazení výsledku je třeba se přihlásit. |