Parallel creation of non-redundant gene indices from partial mRNA transcripts

Autor: Thomas L. Casavant, Steve Davis, Chad A. Roberts, M. Bento Soares, Nishank Trivedi, Val C. Sheffield, Kevin Pedretti, Jared M. Bischof, Natalie L. Robinson, Todd E. Scheetz, Terry A. Braun
Rok vydání: 2002
Předmět:
Zdroj: Future Generation Computer Systems. 18:863-870
ISSN: 0167-739X
Popis: This paper describes the UIcluster software tool, which partitions expressed sequence tag (EST) sequences and other genetic sequences into "clusters" based on sequence similarity. Ideally, each cluster will contain sequences that all represent the same gene. UIcluster has been developed over the course of 4 years to solve this problem efficiently and accurately for large data sets consisting of tens or hundreds of thousands of EST sequences. The latest version of the application has been parallelized using the MPI standard. Both the computation and memory requirements of the program can be distributed among multiple (possibly distributed) UNIX processes.
Databáze: OpenAIRE