Recommending pathway genes using a compendium of clustering solutions.

Autor: Ng DM; Department of Biomolecular Engineering, University of California, Santa Cruz, CA 95064, USA., Woehrmann MH, Stuart JM
Jazyk: angličtina
Zdroj: Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing [Pac Symp Biocomput] 2007, pp. 379-90.
Abstrakt: A common approach for identifying pathways from gene expression data is to cluster the genes without using prior information about a pathway, which often identifies only the dominant coexpression groups. Recommender systems are well-suited for using the known genes of a pathway to identify the appropriate experiments for predicting new members. However, existing systems, such as the GeneRecommender, ignore how genes naturally group together within specific experiments. We present a collaborative filtering approach which uses the pattern of how genes cluster together in different experiments to recommend new genes in a pathway. Clusters are first identified within a single experiment series. Informative clusters, in which the user-supplied query genes appear together, are identified. New genes that cluster with the known genes, in a significant fraction of the informative clusters, are recommended. We implemented a prototype of our system and measured its performance on hundreds of pathways. We find that our method performs as well as an established approach while significantly increasing the speed and scalability of searching large datasets. [Supplemental material is available online at sysbio.soe.ucsc.edu/cluegene/psb07.]
Databáze: MEDLINE