Multi-Cluster Text Mining on the Grid using the D-Grid UNICORE environment

Autor: Kumpf, K., Mevissen, T., Wäldrich, O., Ziegler, W., Ginzel, S., Weuffel, T.
Rok vydání: 2007
Popis: Text mining is inherently more computation-intensive than information retrieval on pre-structured data and requires transfer and filtering of huge amounts of data. Grid environments provide a suitable infrastructure for accomplishing these tasks. We present the mapping and implementation of a standard text mining (TM) workflow for analysis of biomedical text data from PubMed to a D-Grid UNICORE environment with multiple PC-clusters. We discuss the gain in applicability, the open issues of our solution and possible future enhancements.
Databáze: OpenAIRE