A distributed computation of Interpro Pfam, PROSITE and ProDom for protein annotation.

Autor: Ribeiro Ede O; Departamento de Ciência da Computação, Universidade de Brasília, Brasília, DF, Brazil. edward@cic.unb.br, Zerlotini GG, Lopes IR, Ribeiro VB, Melo AC, Walter ME, Costa MM
Jazyk: angličtina
Zdroj: Genetics and molecular research : GMR [Genet Mol Res] 2005 Sep 30; Vol. 4 (3), pp. 590-8. Date of Electronic Publication: 2005 Sep 30.
Abstrakt: Interpro is a widely used tool for protein annotation in genome sequencing projects, demanding a large amount of computation and representing a huge time-consuming step. We present a strategy to execute programs using databases Pfam, PROSITE and ProDom of Interpro in a distributed environment using a Java-based messaging system. We developed a two-layer scheduling architecture of the distributed infrastructure. Then, we made experiments and analyzed the results. Our distributed system gave much better results than Interpro Pfam, PROSITE and ProDom running in a centralized platform. This approach seems to be appropriate and promising for highly demanding computational tools used for biological applications.
Databáze: MEDLINE