Direct approaches to exploit many-core architecture in bioinformatics
Autor: | Pilar Hernández, Gabriel Dorado, Sergio Gálvez, David Díaz, Juan Antonio Caballero, Francisco José Esteban |
---|---|
Přispěvatelé: | Ministerio de Ciencia e Innovación (España), Junta de Andalucía, Universidad de Córdoba (España) |
Rok vydání: | 2013 |
Předmět: |
Source code
Computer Networks and Communications TILE64 Computer science business.industry media_common.quotation_subject Computer programming Message Passing Interface Parallel computing Bioinformatics Porting law.invention Microprocessor Computer architecture Hardware and Architecture law Personal computer business Implementation Software media_common |
Zdroj: | Digital.CSIC. Repositorio Institucional del CSIC instname |
Popis: | Current trends in computer programming look for solutions in the challenging task of porting and optimizing existing algorithms to many-core architectures with tens of Central Processing Units (CPUs). Yet, the lack of standardized general-purpose parallel programming and porting methodologies represents the main bottleneck on these developments. We have focused on bioinformatics applied to genomics in general and the so-called >Next-Generation> Sequencing (NGS) in particular, in order to study the viability and cost of porting and optimizing well known algorithms to a many-core architecture. Three different methods are tackled in order to implement existing algorithms in Tile64, corresponding to a microprocessor containing 64 CPUs, each of them being capable of executing an independent Linux operating system. Three different approaches have been explored: (i) implementation of the Needleman-Wunsch/Smith-Waterman pairwise aligner from scratch; (ii) direct translation of the Message Passing Interface (MPI) C++ ABySS assembly algorithm with changes on the communication layer; and (iii) migration of the ClustalW tool, parallelizing only the most time-consuming stage. The performance-gain/development-cost tradeoffs indicate that the Tile64 microprocessor has the potential to increase the performance of bioinformatics in an unprecedented way for a standalone Personal Computer (PC). Yet, the effective exploitation of these parallel implementations requires a detailed understanding of the peculiar many-core characteristics when migrating previous non-parallel source codes. © 010 Elsevier B.V. All rights reserved. This work was supported by “Ministerio de Ciencia e Innovación” (MICINN grants AGL2010-17316, BIO2009-07443 and BIO2011-15237); “Consejería de Agricultura y Pesca” of “Junta de Andalucía” (041/C/2007, 75/C/2009 & 56/C/2010); “Grupo PAI” (AGR-248); and “Universidad de Córdoba” (“Ayuda a Grupos”), Spain. |
Databáze: | OpenAIRE |
Externí odkaz: |